Deeply fused flow and topology features for botnet detection based on a pretrained GCN

  • Xiaoyuan Meng*
  • , Bo Lang*
  • , Yuhao Yan
  • , Yanxi Liu
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The characteristics of botnets are mainly reflected in their network behaviors and the intercommunication relationships among their bots. The existing botnet detection methods typically use only one kind of feature, i.e., flow features or topological features; each feature type overlooks the other type of features and affects the resulting model performance. In this paper, for the first time, we propose a botnet detection model that uses a graph convolutional network (GCN) to deeply fuse flow features and topological features. We construct communication graphs from network traffic and represent node attributes with flow features. The extreme sample imbalance phenomenon exhibited by the existing public traffic datasets makes training a GCN model impractical. To address this problem, we propose a pretrained GCN framework that utilizes a public balanced artificial communication graph dataset to pretrain the GCN model, and the feature output obtained from the last hidden layer of the GCN model containing the flow and topology information is input into the Extra Tree classification model. Furthermore, our model can effectively detect command-and-control (C2) and peer-to-peer (P2P) botnets by simply adjusting the number of layers in the GCN. The experimental results obtained on public datasets demonstrate that our approach outperforms the current state-of-the-art botnet detection models. In addition, our model also performs well in real-world botnet detection scenarios.

Original languageEnglish
Article number108084
JournalComputer Communications
Volume233
DOIs
StatePublished - 1 Mar 2025

Keywords

  • Botnet detection
  • Feature fusion
  • Flow feature
  • Graph convolutional network
  • Topology feature

Fingerprint

Dive into the research topics of 'Deeply fused flow and topology features for botnet detection based on a pretrained GCN'. Together they form a unique fingerprint.

Cite this