Abstract
A Network Intrusion Detection System (NIDS) is a network monitoring technology for identifying cyber attacks, botnet command and control traffic, and other unwanted network activity. Unfortunately, organizational NIDS solutions can often generate tens or hundreds of thousands of alerts on a daily basis, with a significant part of them having low importance or being false positives. Therefore, high priority alerts become hard to spot, which overloads security analysts and complicates their work. The current paper addresses this problem and introduces a machine learning framework for classifying NIDS alerts with the help of stream clustering and supervised learning. We propose a stream-clustering-guided method for creating labeled NIDS alert data sets. The small data sets created using this method can be used for training high-performance supervised NIDS alert classifiers. This significantly reduces the human labeling effort and eases the application of supervised machine learning for NIDS alert classification. The proposed machine learning framework was evaluated on NIDS alerts collected over 2 months from the network of a large academic organization. The experimental results indicate that combining stream clustering and supervised learning into a NIDS alert classification framework significantly decreases the number of false positives, and thus reduces the workload of human security analysts. The framework also features low CPU time and memory consumption and can thus be run on commodity hardware. In conclusion, the proposed framework provides a cost-effective means of integrating machine learning into Security Operation Centers (SOCs). This enables the identification of critical NIDS alerts using high-performance classifiers, thereby assisting in the automation of alert handling tasks for SOC personnel. To address the lack of public data sets in the problem domain and foster further research, we publicly share the large labeled NIDS alert data set used in our experimental setup.
Original language | English |
---|---|
Pages (from-to) | 231-244 |
Number of pages | 14 |
Journal | Future Generation Computer Systems |
Volume | 155 |
DOIs | |
Publication status | Published - Jun 2024 |
Externally published | Yes |
Keywords
- Data labeling
- Data set generation
- High-priority NIDS alert
- IDS
- Intrusion detection
- Network Intrusion Detection System
- Network security
- NIDS
- NIDS alert
- NIDS alert classification
- Security Operations Center
- Small training data set
- SOC
- Stream clustering
- Supervised learning
- Workload reduction
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Networks and Communications