Stream clustering guided supervised learning for classifying NIDS alerts

Research output: Journal PublicationArticlepeer-review

1 Citation (Scopus)

Abstract

A Network Intrusion Detection System (NIDS) is a network monitoring technology for identifying cyber attacks, botnet command and control traffic, and other unwanted network activity. Unfortunately, organizational NIDS solutions can often generate tens or hundreds of thousands of alerts on a daily basis, with a significant part of them having low importance or being false positives. Therefore, high priority alerts become hard to spot, which overloads security analysts and complicates their work. The current paper addresses this problem and introduces a machine learning framework for classifying NIDS alerts with the help of stream clustering and supervised learning. We propose a stream-clustering-guided method for creating labeled NIDS alert data sets. The small data sets created using this method can be used for training high-performance supervised NIDS alert classifiers. This significantly reduces the human labeling effort and eases the application of supervised machine learning for NIDS alert classification. The proposed machine learning framework was evaluated on NIDS alerts collected over 2 months from the network of a large academic organization. The experimental results indicate that combining stream clustering and supervised learning into a NIDS alert classification framework significantly decreases the number of false positives, and thus reduces the workload of human security analysts. The framework also features low CPU time and memory consumption and can thus be run on commodity hardware. In conclusion, the proposed framework provides a cost-effective means of integrating machine learning into Security Operation Centers (SOCs). This enables the identification of critical NIDS alerts using high-performance classifiers, thereby assisting in the automation of alert handling tasks for SOC personnel. To address the lack of public data sets in the problem domain and foster further research, we publicly share the large labeled NIDS alert data set used in our experimental setup.

Original languageEnglish
Pages (from-to)231-244
Number of pages14
JournalFuture Generation Computer Systems
Volume155
DOIs
Publication statusPublished - Jun 2024
Externally publishedYes

Keywords

  • Data labeling
  • Data set generation
  • High-priority NIDS alert
  • IDS
  • Intrusion detection
  • Network Intrusion Detection System
  • Network security
  • NIDS
  • NIDS alert
  • NIDS alert classification
  • Security Operations Center
  • Small training data set
  • SOC
  • Stream clustering
  • Supervised learning
  • Workload reduction

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Stream clustering guided supervised learning for classifying NIDS alerts'. Together they form a unique fingerprint.

Cite this