Internet traffic classification using bayesian analysis techniques
Top Cited Papers
- 6 June 2005
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMETRICS Performance Evaluation Review
- Vol. 33 (1) , 50-60
- https://doi.org/10.1145/1071690.1064220
Abstract
Accurate traffic classification is of fundamental importance to numerous other network activities, from security monitoring to accounting, and from Quality of Service to providing operators with useful forecasts for long-term provisioning. We apply a Naïve Bayes estimator to categorize traffic by application. Uniquely, our work capitalizes on hand-classified network data, using it as input to a supervised Naïve Bayes estimator. In this paper we illustrate the high level of accuracy achievable with the \Naive Bayes estimator. We further illustrate the improved accuracy of refined variants of this estimator.Our results indicate that with the simplest of Naïve Bayes estimator we are able to achieve about 65% accuracy on per-flow classification and with two powerful refinements we can improve this value to better than 95%; this is a vast improvement over traditional techniques that achieve 50--70%. While our technique uses training data, with categories derived from packet-content, all of our training and testing was done using header-derived discriminators. We emphasize this as a powerful aspect of our approach: using samples of well-known traffic to allow the categorization of traffic using commonly available information alone.Keywords
This publication has 11 references indexed in Scilit:
- Toward the Accurate Identification of Network ApplicationsPublished by Springer Nature ,2005
- Class-of-service mapping for QoSPublished by Association for Computing Machinery (ACM) ,2004
- Transport layer identification of P2P trafficPublished by Association for Computing Machinery (ACM) ,2004
- Flow classification by histogramsPublished by Association for Computing Machinery (ACM) ,2004
- Flow Clustering Using Machine Learning TechniquesPublished by Springer Nature ,2004
- An analysis of Internet chat systemsPublished by Association for Computing Machinery (ACM) ,2003
- Ensembles of Classifiers for Morphological Galaxy ClassificationThe Astrophysical Journal, 2001
- Wide area traffic: the failure of Poisson modelingIEEE/ACM Transactions on Networking, 1995
- Entropy of ATM traffic streams: a tool for estimating QoS parametersIEEE Journal on Selected Areas in Communications, 1995
- Empirically derived analytic models of wide-area TCP connectionsIEEE/ACM Transactions on Networking, 1994