In-depth feature selection and ranking for automated detection of mobile malware

Alejandro Guerra-Manzanares; Sven Nõmm; Hayretdin Bahsi

doi:10.5220/0007349602740283

In-depth feature selection and ranking for automated detection of mobile malware

Alejandro Guerra-Manzanares, Sven Nõmm, Hayretdin Bahsi

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

12 Citations (Scopus)

Abstract

New malware detection techniques are highly needed due to the increasing threat posed by mobile malware. Machine learning techniques have provided promising results in this problem domain. However, feature selection, which is an essential instrument to overcome the curse of dimensionality, presenting higher interpretable results and optimizing the utilization of computational resources, requires more attention in order to induce better learning models for mobile malware detection. In this paper, in order to find out the minimum feature set that provides higher accuracy and analyze the discriminatory powers of different features, we employed feature selection and ranking methods to datasets characterized by system calls and permissions. These features were extracted from malware application samples belonging to two different time-frames (2010-2012 and 2017-2018) and benign applications. We demonstrated that selected feature sets with small sizes, in both feature categories, are able to provide high accuracy results. However, we identified a decline in the discriminatory power of the selected features in both categories when the dataset is induced by the recent malware samples instead of old ones, indicating a concept drift. Although we plan to model the concept drift in our future studies, the feature selection results presented in this study give a valuable insight regarding the change occurred in the best discriminating features during the evolvement of mobile malware over time.

Original language	English
Title of host publication	ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy
Editors	Paolo Mori, Steven Furnell, Olivier Camp
Publisher	SciTePress
Pages	274-283
Number of pages	10
ISBN (Electronic)	9789897583599
DOIs	https://doi.org/10.5220/0007349602740283
Publication status	Published - 2019
Externally published	Yes
Event	5th International Conference on Information Systems Security and Privacy, ICISSP 2019 - Prague, Czech Republic Duration: 23 Feb 2019 → 25 Feb 2019

Publication series

Name	ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy

Conference

Conference	5th International Conference on Information Systems Security and Privacy, ICISSP 2019
Country/Territory	Czech Republic
City	Prague
Period	23/02/19 → 25/02/19

Keywords

Feature Selection
Machine Learning
Mobile Malware

ASJC Scopus subject areas

Computer Networks and Communications
Computer Science Applications
Information Systems
Safety, Risk, Reliability and Quality

Access to Document

10.5220/0007349602740283

Cite this

Guerra-Manzanares, A., Nõmm, S., & Bahsi, H. (2019). In-depth feature selection and ranking for automated detection of mobile malware. In P. Mori, S. Furnell, & O. Camp (Eds.), ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy (pp. 274-283). (ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy). SciTePress. https://doi.org/10.5220/0007349602740283

Guerra-Manzanares, Alejandro ; Nõmm, Sven ; Bahsi, Hayretdin. / In-depth feature selection and ranking for automated detection of mobile malware. ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy. editor / Paolo Mori ; Steven Furnell ; Olivier Camp. SciTePress, 2019. pp. 274-283 (ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy).

@inproceedings{73fa7e540ddb477182c4b38ddc4c3584,

title = "In-depth feature selection and ranking for automated detection of mobile malware",

abstract = "New malware detection techniques are highly needed due to the increasing threat posed by mobile malware. Machine learning techniques have provided promising results in this problem domain. However, feature selection, which is an essential instrument to overcome the curse of dimensionality, presenting higher interpretable results and optimizing the utilization of computational resources, requires more attention in order to induce better learning models for mobile malware detection. In this paper, in order to find out the minimum feature set that provides higher accuracy and analyze the discriminatory powers of different features, we employed feature selection and ranking methods to datasets characterized by system calls and permissions. These features were extracted from malware application samples belonging to two different time-frames (2010-2012 and 2017-2018) and benign applications. We demonstrated that selected feature sets with small sizes, in both feature categories, are able to provide high accuracy results. However, we identified a decline in the discriminatory power of the selected features in both categories when the dataset is induced by the recent malware samples instead of old ones, indicating a concept drift. Although we plan to model the concept drift in our future studies, the feature selection results presented in this study give a valuable insight regarding the change occurred in the best discriminating features during the evolvement of mobile malware over time.",

keywords = "Feature Selection, Machine Learning, Mobile Malware",

author = "Alejandro Guerra-Manzanares and Sven N{\~o}mm and Hayretdin Bahsi",

note = "Publisher Copyright: {\textcopyright} 2019 by SCITEPRESS - Science and Technology Publications, Lda.; 5th International Conference on Information Systems Security and Privacy, ICISSP 2019 ; Conference date: 23-02-2019 Through 25-02-2019",

year = "2019",

doi = "10.5220/0007349602740283",

language = "English",

series = "ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy",

publisher = "SciTePress",

pages = "274--283",

editor = "Paolo Mori and Steven Furnell and Olivier Camp",

booktitle = "ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy",

}

Guerra-Manzanares, A, Nõmm, S & Bahsi, H 2019, In-depth feature selection and ranking for automated detection of mobile malware. in P Mori, S Furnell & O Camp (eds), ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy. ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy, SciTePress, pp. 274-283, 5th International Conference on Information Systems Security and Privacy, ICISSP 2019, Prague, Czech Republic, 23/02/19. https://doi.org/10.5220/0007349602740283

In-depth feature selection and ranking for automated detection of mobile malware. / Guerra-Manzanares, Alejandro; Nõmm, Sven; Bahsi, Hayretdin.
ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy. ed. / Paolo Mori; Steven Furnell; Olivier Camp. SciTePress, 2019. p. 274-283 (ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - In-depth feature selection and ranking for automated detection of mobile malware

AU - Guerra-Manzanares, Alejandro

AU - Nõmm, Sven

AU - Bahsi, Hayretdin

PY - 2019

Y1 - 2019

N2 - New malware detection techniques are highly needed due to the increasing threat posed by mobile malware. Machine learning techniques have provided promising results in this problem domain. However, feature selection, which is an essential instrument to overcome the curse of dimensionality, presenting higher interpretable results and optimizing the utilization of computational resources, requires more attention in order to induce better learning models for mobile malware detection. In this paper, in order to find out the minimum feature set that provides higher accuracy and analyze the discriminatory powers of different features, we employed feature selection and ranking methods to datasets characterized by system calls and permissions. These features were extracted from malware application samples belonging to two different time-frames (2010-2012 and 2017-2018) and benign applications. We demonstrated that selected feature sets with small sizes, in both feature categories, are able to provide high accuracy results. However, we identified a decline in the discriminatory power of the selected features in both categories when the dataset is induced by the recent malware samples instead of old ones, indicating a concept drift. Although we plan to model the concept drift in our future studies, the feature selection results presented in this study give a valuable insight regarding the change occurred in the best discriminating features during the evolvement of mobile malware over time.

AB - New malware detection techniques are highly needed due to the increasing threat posed by mobile malware. Machine learning techniques have provided promising results in this problem domain. However, feature selection, which is an essential instrument to overcome the curse of dimensionality, presenting higher interpretable results and optimizing the utilization of computational resources, requires more attention in order to induce better learning models for mobile malware detection. In this paper, in order to find out the minimum feature set that provides higher accuracy and analyze the discriminatory powers of different features, we employed feature selection and ranking methods to datasets characterized by system calls and permissions. These features were extracted from malware application samples belonging to two different time-frames (2010-2012 and 2017-2018) and benign applications. We demonstrated that selected feature sets with small sizes, in both feature categories, are able to provide high accuracy results. However, we identified a decline in the discriminatory power of the selected features in both categories when the dataset is induced by the recent malware samples instead of old ones, indicating a concept drift. Although we plan to model the concept drift in our future studies, the feature selection results presented in this study give a valuable insight regarding the change occurred in the best discriminating features during the evolvement of mobile malware over time.

KW - Feature Selection

KW - Machine Learning

KW - Mobile Malware

UR - http://www.scopus.com/inward/record.url?scp=85064668886&partnerID=8YFLogxK

U2 - 10.5220/0007349602740283

DO - 10.5220/0007349602740283

M3 - Conference contribution

AN - SCOPUS:85064668886

T3 - ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy

SP - 274

EP - 283

BT - ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy

A2 - Mori, Paolo

A2 - Furnell, Steven

A2 - Camp, Olivier

PB - SciTePress

T2 - 5th International Conference on Information Systems Security and Privacy, ICISSP 2019

Y2 - 23 February 2019 through 25 February 2019

ER -

Guerra-Manzanares A, Nõmm S, Bahsi H. In-depth feature selection and ranking for automated detection of mobile malware. In Mori P, Furnell S, Camp O, editors, ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy. SciTePress. 2019. p. 274-283. (ICISSP 2019 - Proceedings of the 5th International Conference on Information Systems Security and Privacy). doi: 10.5220/0007349602740283

In-depth feature selection and ranking for automated detection of mobile malware

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this