Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection

Sven Nomm; Alejandro Guerra-Manzanares; Hayretdin Bahsi

doi:10.1109/ICMLA.2019.00193

Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection

Sven Nomm, Alejandro Guerra-Manzanares, Hayretdin Bahsi

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

19 Citations (Scopus)

Abstract

The analysis of the interplay between the feature selection and the post-hoc local interpretation steps in a machine learning workflow followed for IoT botnet detection constitutes the research scope of the present paper. While the application of machine learning-based techniques has become a trend in cyber security, the main focus has been almost on detection accuracy. However, providing the relevant explanation for a detection decision is a vital requirement in a tiered incident handling processes of the contemporary security operations centers. Moreover, the design of intrusion detection systems in IoT networks has to take the limitations of the computational resources into consideration. Therefore, resource limitations in addition to human element of incident handling necessitate considering feature selection and interpretability at the same time in machine learning workflows. In this paper, first, we analyzed the selection of features and its implication on the data accuracy. Second, we investigated the impact of feature selection on the explanations generated at the post-hoc interpretation phase. We utilized a filter method, Fisher's Score and Local Interpretable Model-Agnostic Explanation (LIME) at feature selection and post-hoc interpretation phases, respectively. To evaluate the quality of explanations, we proposed a metric that reflects the need of the security analysts. It is demonstrated that the application of both steps for the particular case of IoT botnet detection may result in highly accurate and interpretable learning models induced by fewer features. Our metric enables us to evaluate the detection accuracy and interpretability in an integrated way.

Original language	English
Title of host publication	Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019
Editors	M. Arif Wani, Taghi M. Khoshgoftaar, Dingding Wang, Huanjing Wang, Naeem Seliya
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1162-1169
Number of pages	8
ISBN (Electronic)	9781728145495
DOIs	https://doi.org/10.1109/ICMLA.2019.00193
Publication status	Published - Dec 2019
Externally published	Yes
Event	18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019 - Boca Raton, United States Duration: 16 Dec 2019 → 19 Dec 2019

Publication series

Name	Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019

Conference

Conference	18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019
Country/Territory	United States
City	Boca Raton
Period	16/12/19 → 19/12/19

Keywords

Botnet detection
Interpretation
Machine learning

ASJC Scopus subject areas

Strategy and Management
Artificial Intelligence
Computer Science Applications
Decision Sciences (miscellaneous)
Signal Processing
Media Technology

Access to Document

10.1109/ICMLA.2019.00193

Cite this

Nomm, S., Guerra-Manzanares, A., & Bahsi, H. (2019). Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection. In M. A. Wani, T. M. Khoshgoftaar, D. Wang, H. Wang, & N. Seliya (Eds.), Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019 (pp. 1162-1169). Article 8999281 (Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICMLA.2019.00193

Nomm, Sven ; Guerra-Manzanares, Alejandro ; Bahsi, Hayretdin. / Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection. Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019. editor / M. Arif Wani ; Taghi M. Khoshgoftaar ; Dingding Wang ; Huanjing Wang ; Naeem Seliya. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 1162-1169 (Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019).

@inproceedings{f9e24cc9bd9046ce86d41aa96c9e4f95,

title = "Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection",

abstract = "The analysis of the interplay between the feature selection and the post-hoc local interpretation steps in a machine learning workflow followed for IoT botnet detection constitutes the research scope of the present paper. While the application of machine learning-based techniques has become a trend in cyber security, the main focus has been almost on detection accuracy. However, providing the relevant explanation for a detection decision is a vital requirement in a tiered incident handling processes of the contemporary security operations centers. Moreover, the design of intrusion detection systems in IoT networks has to take the limitations of the computational resources into consideration. Therefore, resource limitations in addition to human element of incident handling necessitate considering feature selection and interpretability at the same time in machine learning workflows. In this paper, first, we analyzed the selection of features and its implication on the data accuracy. Second, we investigated the impact of feature selection on the explanations generated at the post-hoc interpretation phase. We utilized a filter method, Fisher's Score and Local Interpretable Model-Agnostic Explanation (LIME) at feature selection and post-hoc interpretation phases, respectively. To evaluate the quality of explanations, we proposed a metric that reflects the need of the security analysts. It is demonstrated that the application of both steps for the particular case of IoT botnet detection may result in highly accurate and interpretable learning models induced by fewer features. Our metric enables us to evaluate the detection accuracy and interpretability in an integrated way.",

keywords = "Botnet detection, Interpretation, Machine learning",

author = "Sven Nomm and Alejandro Guerra-Manzanares and Hayretdin Bahsi",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019 ; Conference date: 16-12-2019 Through 19-12-2019",

year = "2019",

month = dec,

doi = "10.1109/ICMLA.2019.00193",

language = "English",

series = "Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1162--1169",

editor = "Wani, {M. Arif} and Khoshgoftaar, {Taghi M.} and Dingding Wang and Huanjing Wang and Naeem Seliya",

booktitle = "Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019",

address = "United States",

}

Nomm, S, Guerra-Manzanares, A & Bahsi, H 2019, Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection. in MA Wani, TM Khoshgoftaar, D Wang, H Wang & N Seliya (eds), Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019., 8999281, Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019, Institute of Electrical and Electronics Engineers Inc., pp. 1162-1169, 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019, Boca Raton, United States, 16/12/19. https://doi.org/10.1109/ICMLA.2019.00193

Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection. / Nomm, Sven; Guerra-Manzanares, Alejandro; Bahsi, Hayretdin.
Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019. ed. / M. Arif Wani; Taghi M. Khoshgoftaar; Dingding Wang; Huanjing Wang; Naeem Seliya. Institute of Electrical and Electronics Engineers Inc., 2019. p. 1162-1169 8999281 (Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection

AU - Nomm, Sven

AU - Guerra-Manzanares, Alejandro

AU - Bahsi, Hayretdin

PY - 2019/12

Y1 - 2019/12

N2 - The analysis of the interplay between the feature selection and the post-hoc local interpretation steps in a machine learning workflow followed for IoT botnet detection constitutes the research scope of the present paper. While the application of machine learning-based techniques has become a trend in cyber security, the main focus has been almost on detection accuracy. However, providing the relevant explanation for a detection decision is a vital requirement in a tiered incident handling processes of the contemporary security operations centers. Moreover, the design of intrusion detection systems in IoT networks has to take the limitations of the computational resources into consideration. Therefore, resource limitations in addition to human element of incident handling necessitate considering feature selection and interpretability at the same time in machine learning workflows. In this paper, first, we analyzed the selection of features and its implication on the data accuracy. Second, we investigated the impact of feature selection on the explanations generated at the post-hoc interpretation phase. We utilized a filter method, Fisher's Score and Local Interpretable Model-Agnostic Explanation (LIME) at feature selection and post-hoc interpretation phases, respectively. To evaluate the quality of explanations, we proposed a metric that reflects the need of the security analysts. It is demonstrated that the application of both steps for the particular case of IoT botnet detection may result in highly accurate and interpretable learning models induced by fewer features. Our metric enables us to evaluate the detection accuracy and interpretability in an integrated way.

AB - The analysis of the interplay between the feature selection and the post-hoc local interpretation steps in a machine learning workflow followed for IoT botnet detection constitutes the research scope of the present paper. While the application of machine learning-based techniques has become a trend in cyber security, the main focus has been almost on detection accuracy. However, providing the relevant explanation for a detection decision is a vital requirement in a tiered incident handling processes of the contemporary security operations centers. Moreover, the design of intrusion detection systems in IoT networks has to take the limitations of the computational resources into consideration. Therefore, resource limitations in addition to human element of incident handling necessitate considering feature selection and interpretability at the same time in machine learning workflows. In this paper, first, we analyzed the selection of features and its implication on the data accuracy. Second, we investigated the impact of feature selection on the explanations generated at the post-hoc interpretation phase. We utilized a filter method, Fisher's Score and Local Interpretable Model-Agnostic Explanation (LIME) at feature selection and post-hoc interpretation phases, respectively. To evaluate the quality of explanations, we proposed a metric that reflects the need of the security analysts. It is demonstrated that the application of both steps for the particular case of IoT botnet detection may result in highly accurate and interpretable learning models induced by fewer features. Our metric enables us to evaluate the detection accuracy and interpretability in an integrated way.

KW - Botnet detection

KW - Interpretation

KW - Machine learning

UR - http://www.scopus.com/inward/record.url?scp=85080919082&partnerID=8YFLogxK

U2 - 10.1109/ICMLA.2019.00193

DO - 10.1109/ICMLA.2019.00193

M3 - Conference contribution

AN - SCOPUS:85080919082

T3 - Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019

SP - 1162

EP - 1169

BT - Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019

A2 - Wani, M. Arif

A2 - Khoshgoftaar, Taghi M.

A2 - Wang, Dingding

A2 - Wang, Huanjing

A2 - Seliya, Naeem

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019

Y2 - 16 December 2019 through 19 December 2019

ER -

Nomm S, Guerra-Manzanares A, Bahsi H. Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection. In Wani MA, Khoshgoftaar TM, Wang D, Wang H, Seliya N, editors, Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 1162-1169. 8999281. (Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019). doi: 10.1109/ICMLA.2019.00193

Towards the integration of a post-hoc interpretation step into the machine learning workflow for IoT botnet detection

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this