On the application of active learning for efficient and effective IoT botnet detection

Alejandro Guerra-Manzanares; Hayretdin Bahsi

doi:10.1016/j.future.2022.10.024

Research output: Journal Publication › Article › peer-review

11 Citations (Scopus)

Abstract

The active learning approach for machine learning can greatly benefit those environments where a wealth of unlabeled data is available, and the labeling cost of the data can be restrictive. In this regard, security operations centers (SOCs) can take advantage of the human expertise available to improve machine learning-based detection models using the active learning approach. In the context of SOC operations and IoT botnet detection, our study provides a thorough benchmarking of the application of different active learning approaches within the framework of pool-based sampling. The selection of the optimal query instance for learning is evaluated using uncertainty sampling, ranked batch-mode sampling, and query by committee strategies. Our results show that the active learning approach can help to generate better detection models using all the active learning query strategies tested in our benchmarking setup. Leveraging the human–machine interaction can produce high-performance models in the context of IoT botnet detection using significantly less data than the passive approaches traditionally used for the generation of machine learning-based detection systems. Additionally, the impact of wrong-labeled data in the active learning implementation is explored.
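As an illustration of the query step named in the abstract, the following is a minimal sketch of two of the listed strategies, uncertainty sampling and query by committee, applied to an unlabeled pool. It is not the authors' implementation: the function names, the least-confidence score, the vote-entropy score, and the toy data are all assumptions chosen for illustration, and ranked batch-mode sampling is not covered.

```python
import math
from collections import Counter


def least_confidence(probs):
    """Uncertainty score: 1 minus the probability of the most likely class."""
    return 1.0 - max(probs)


def uncertainty_query(pool_probs, batch_size=1):
    """Return indices of the pool instances the model is least sure about.

    pool_probs holds one class-probability list per unlabeled instance
    (hypothetical model output, assumed for this sketch).
    """
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: least_confidence(pool_probs[i]),
        reverse=True,
    )
    return ranked[:batch_size]


def vote_entropy(votes):
    """Disagreement score: entropy of the labels voted by committee members."""
    total = len(votes)
    return -sum((c / total) * math.log(c / total) for c in Counter(votes).values())


def committee_query(pool_votes, batch_size=1):
    """Return indices of the pool instances the committee disagrees on most."""
    ranked = sorted(
        range(len(pool_votes)),
        key=lambda i: vote_entropy(pool_votes[i]),
        reverse=True,
    )
    return ranked[:batch_size]


# Toy pool of three unlabeled flows (values invented for illustration).
pool_probs = [[0.95, 0.05], [0.55, 0.45], [0.20, 0.80]]
pool_votes = [
    ["benign", "benign", "benign"],
    ["benign", "botnet", "botnet"],
    ["botnet", "botnet", "botnet"],
]

print(uncertainty_query(pool_probs))  # index to send to the human annotator
print(committee_query(pool_votes))
```

In a pool-based setup, the selected indices would be handed to the human expert (for example, a SOC analyst) for labeling, the labeled instances moved from the pool to the training set, and the model retrained before the next query.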

Original language	English
Pages (from-to)	40-53
Number of pages	14
Journal	Future Generation Computer Systems
Volume	141
DOIs	https://doi.org/10.1016/j.future.2022.10.024
Publication status	Published - Apr 2023
Externally published	Yes

Keywords

Active learning
Botnet detection
Internet of things
Intrusion detection
IoT
IoT botnet
Machine learning
Query learning

ASJC Scopus subject areas

Software
Hardware and Architecture
Computer Networks and Communications

Access to Document

10.1016/j.future.2022.10.024

Cite this

@article{d790c424777d4b59b958fbaff120eb42,
  title = "On the application of active learning for efficient and effective IoT botnet detection",
  abstract = "The active learning approach for machine learning can greatly benefit those environments where a wealth of unlabeled data is available, and the labeling cost of the data can be restrictive. In this regard, security operations centers (SOCs) can take advantage of the human expertise available to improve machine learning-based detection models using the active learning approach. In the context of SOC operations and IoT botnet detection, our study provides a thorough benchmarking of the application of different active learning approaches within the framework of pool-based sampling. The selection of the optimal query instance for learning is evaluated using uncertainty sampling, ranked batch-mode sampling, and query by committee strategies. Our results show that the active learning approach can help to generate better detection models using all the active learning query strategies tested in our benchmarking setup. Leveraging the human–machine interaction can produce high-performance models in the context of IoT botnet detection using significantly less data than the passive approaches traditionally used for the generation of machine learning-based detection systems. Additionally, the impact of wrong-labeled data in the active learning implementation is explored.",
  keywords = "Active learning, Botnet detection, Internet of things, Intrusion detection, IoT, IoT botnet, Machine learning, Query learning",
  author = "Alejandro Guerra-Manzanares and Hayretdin Bahsi",
  note = "Publisher Copyright: {\textcopyright} 2022 Elsevier B.V.",
  year = "2023",
  month = apr,
  doi = "10.1016/j.future.2022.10.024",
  language = "English",
  volume = "141",
  pages = "40--53",
  journal = "Future Generation Computer Systems",
  issn = "0167-739X",
  publisher = "Elsevier B.V.",
}

TY  - JOUR
T1  - On the application of active learning for efficient and effective IoT botnet detection
AU  - Guerra-Manzanares, Alejandro
AU  - Bahsi, Hayretdin
PY  - 2023/4
Y1  - 2023/4
N2  - The active learning approach for machine learning can greatly benefit those environments where a wealth of unlabeled data is available, and the labeling cost of the data can be restrictive. In this regard, security operations centers (SOCs) can take advantage of the human expertise available to improve machine learning-based detection models using the active learning approach. In the context of SOC operations and IoT botnet detection, our study provides a thorough benchmarking of the application of different active learning approaches within the framework of pool-based sampling. The selection of the optimal query instance for learning is evaluated using uncertainty sampling, ranked batch-mode sampling, and query by committee strategies. Our results show that the active learning approach can help to generate better detection models using all the active learning query strategies tested in our benchmarking setup. Leveraging the human–machine interaction can produce high-performance models in the context of IoT botnet detection using significantly less data than the passive approaches traditionally used for the generation of machine learning-based detection systems. Additionally, the impact of wrong-labeled data in the active learning implementation is explored.
AB  - The active learning approach for machine learning can greatly benefit those environments where a wealth of unlabeled data is available, and the labeling cost of the data can be restrictive. In this regard, security operations centers (SOCs) can take advantage of the human expertise available to improve machine learning-based detection models using the active learning approach. In the context of SOC operations and IoT botnet detection, our study provides a thorough benchmarking of the application of different active learning approaches within the framework of pool-based sampling. The selection of the optimal query instance for learning is evaluated using uncertainty sampling, ranked batch-mode sampling, and query by committee strategies. Our results show that the active learning approach can help to generate better detection models using all the active learning query strategies tested in our benchmarking setup. Leveraging the human–machine interaction can produce high-performance models in the context of IoT botnet detection using significantly less data than the passive approaches traditionally used for the generation of machine learning-based detection systems. Additionally, the impact of wrong-labeled data in the active learning implementation is explored.
KW  - Active learning
KW  - Botnet detection
KW  - Internet of things
KW  - Intrusion detection
KW  - IoT
KW  - IoT botnet
KW  - Machine learning
KW  - Query learning
UR  - http://www.scopus.com/inward/record.url?scp=85142760129&partnerID=8YFLogxK
U2  - 10.1016/j.future.2022.10.024
DO  - 10.1016/j.future.2022.10.024
M3  - Article
AN  - SCOPUS:85142760129
SN  - 0167-739X
VL  - 141
SP  - 40
EP  - 53
JO  - Future Generation Computer Systems
JF  - Future Generation Computer Systems
ER  - 
