Skip to main navigation Skip to search Skip to main content

Imbalanced Encrypted Traffic Classification Scheme Using Random Forest

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Encrypted traffic classification techniques can identify different types of traffic for network security. The existing schemes seldom consider the imbalanced distribution of the traffic. In this paper, we propose an encrypted traffic classification scheme using random forest for imbalanced learning. Firstly, the weighted information gain is used to select the features which are beneficial to the minority class to filter redundant features. Then the hybrid sampling method is used to balance the number of the majority class and the minority class. Finally, the random forest is constructed for encrypted traffic classification. Experimental results show that the feature selection method can filter the redundant features of the traffic and the hybrid sampling method can effectively improve the classification ability. Compared with K-nearest neighbor and C4.5 decision tree algorithm, the proposed scheme can classify imbalanced encrypted traffic more effectively.

Original languageEnglish
Title of host publicationProceedings - IEEE Congress on Cybermatics
Subtitle of host publication2020 IEEE International Conferences on Internet of Things, iThings 2020, IEEE Green Computing and Communications, GreenCom 2020, IEEE Cyber, Physical and Social Computing, CPSCom 2020 and IEEE Smart Data, SmartData 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages837-842
Number of pages6
ISBN (Electronic)9781728176475
DOIs
StatePublished - Nov 2020
Event2020 IEEE Congress on Cybermatics: 13th IEEE International Conferences on Internet of Things, iThings 2020, 16th IEEE International Conference on Green Computing and Communications, GreenCom 2020, 13th IEEE International Conference on Cyber, Physical and Social Computing, CPSCom 2020 and 6th IEEE International Conference on Smart Data, SmartData 2020 - Rhodes Island, Greece
Duration: 2 Nov 20206 Nov 2020

Publication series

NameProceedings - IEEE Congress on Cybermatics: 2020 IEEE International Conferences on Internet of Things, iThings 2020, IEEE Green Computing and Communications, GreenCom 2020, IEEE Cyber, Physical and Social Computing, CPSCom 2020 and IEEE Smart Data, SmartData 2020

Conference

Conference2020 IEEE Congress on Cybermatics: 13th IEEE International Conferences on Internet of Things, iThings 2020, 16th IEEE International Conference on Green Computing and Communications, GreenCom 2020, 13th IEEE International Conference on Cyber, Physical and Social Computing, CPSCom 2020 and 6th IEEE International Conference on Smart Data, SmartData 2020
Country/TerritoryGreece
CityRhodes Island
Period2/11/206/11/20

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Encrypted traffic classification
  • Feature selection
  • Hybrid sampling
  • Imbalanced learning
  • Random forest

Fingerprint

Dive into the research topics of 'Imbalanced Encrypted Traffic Classification Scheme Using Random Forest'. Together they form a unique fingerprint.

Cite this