A survey of techniques for internet traffic classification using machine learning
Top Cited Papers
- 1 January 2008
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Communications Surveys & Tutorials
- Vol. 10 (4) , 56-76
- https://doi.org/10.1109/surv.2008.080406
Abstract
The research community has begun looking for IP traffic classification techniques that do not rely on `well known' TCP or UDP port numbers, or interpreting the contents of packet payloads. New work is emerging on the use of statistical traffic characteristics to assist in the identification and classification process. This survey paper looks at emerging research into the application of Machine Learning (ML) techniques to IP traffic classification - an inter-disciplinary blend of IP networking and data mining techniques. We provide context and motivation for the application of ML techniques to IP traffic classification, and review 18 significant works that cover the dominant period from 2004 to early 2007. These works are categorized and reviewed according to their choice of ML strategies and primary contributions to the literature. We also discuss a number of key requirements for the employment of ML-based traffic classifiers in operational IP networks, and qualitatively critique the extent to which the reviewed works meet these requirements. Open issues and challenges in the field are also discussed.Keywords
This publication has 39 references indexed in Scilit:
- Traffic classification through simple statistical fingerprintingACM SIGCOMM Computer Communication Review, 2007
- A preliminary performance comparison of five machine learning algorithms for practical IP traffic flow classificationACM SIGCOMM Computer Communication Review, 2006
- Traffic classification on the flyACM SIGCOMM Computer Communication Review, 2006
- Survey of Clustering AlgorithmsIEEE Transactions on Neural Networks, 2005
- Shape-based recognition of wiry objectsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Benchmarking attribute selection techniques for discrete class data miningIEEE Transactions on Knowledge and Data Engineering, 2003
- Clustering validity checking methodsACM SIGMOD Record, 2002
- Wrappers for feature subset selectionArtificial Intelligence, 1997
- Empirically derived analytic models of wide-area TCP connectionsIEEE/ACM Transactions on Networking, 1994
- Objective Criteria for the Evaluation of Clustering MethodsJournal of the American Statistical Association, 1971