首页 | 本学科首页   官方微博 | 高级检索  
   检索      

基于深度森林的网络流量分类方法
引用本文:戴瑾,王天宇,王少尉.基于深度森林的网络流量分类方法[J].国防科技大学学报,2020,42(4):30-34.
作者姓名:戴瑾  王天宇  王少尉
作者单位:南京大学 电子科学与工程学院,南京大学 电子科学与工程学院,南京大学 电子科学与工程学院
基金项目:国家自然科学基金资助项目(61801208,61671233, 61931023, U1936202)
摘    要:随着网络应用的迅猛发展,流量分类在网络资源分配、流量调度和网络安全等诸多研究领域受到广泛关注。现有的机器学习流量分类方法对流量数据特征的选取和分布要求苛刻,导致在实际应用中的复杂流量场景下分类精确度和稳定度难以提高。为了解决样本特征属性的复杂性给分类性能带来的不利影响,引入了基于深度森林的流量分类方法。该算法通过级联森林和多粒度扫描机制,能够在样本数量规模和特征属性选取规模有限的情况下,有效地提高流量整体分类性能。通过网络流量公开数据集Moore对支持向量机、随机森林和深度森林机器学习算法进行训练和测试,结果表明基于深度森林的网络流量分类器的分类准确率能够达到96. 36%,性能优于其他机器学习模型。

关 键 词:特征选取  gcForest  机器学习  网络流量分类
收稿时间:2019/12/25 0:00:00
修稿时间:2020/4/3 0:00:00

Network traffic classification method based on deep forest
DAI Jin,WANG Tianyu,WANG Shaowei.Network traffic classification method based on deep forest[J].Journal of National University of Defense Technology,2020,42(4):30-34.
Authors:DAI Jin  WANG Tianyu  WANG Shaowei
Institution:School of Information Science and Engineering, Jinling College, Nanjing University, Nanjing 210089, China;School of Electronic Science and Engineering, Nanjing University, Nanjing 210023, China;School of Electronic Science and Engineering, Nanjing University, Nanjing 210023, China;National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
Abstract:With the rapid development of network applications, Internet traffic classification has a profound impact on the research fields of network resource allocation, traffic scheduling and network security. The traditional flow analysis method based on machine learning has strict requirements for theSfeatureSselection and distribution of network flows, which it difficult to accurately and stably classify the complex and changeable flow data in practical application. In order to solve the adverse impact of the complexity of sample features on the traffic classification, we propose a new classification method based on deep forest, which utilizes the cascade forest of decision trees and multi-grained scanning mechanisms to aim to improve classification performance in the case of limited scale of samples and features. We train and test the classification model by using Moore, which are flow data sets in public domain. The experiment results show that the classification accuracy using deep forest model is 96.36%, which outperforms the other machine learning models.
Keywords:feature selection  gcForest  machine learning  network traffic classification  
本文献已被 CNKI 等数据库收录!
点击此处可从《国防科技大学学报》浏览原始摘要信息
点击此处可从《国防科技大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号