GTVS: Boosting the Collection of Application Traffic Ground Truth

Canini, Marco; Li, Wei; Moore, Andrew W.; Bolla, Raffaele

doi:10.1007/978-3-642-01645-5_7

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCCN, volume 5537)

Abstract

Research in traffic classification, network monitoring, and application-oriented analysis cannot proceed without real traffic traces labeled with actual application information. However, hand-labeled traces are extremely valuable yet scarce in the traffic monitoring and analysis community, owing to both privacy concerns and technical difficulties. Traces that include payload are hardly ever released, while the impossibility of obtaining certain ground-truth application information from payload-free data has severely constrained the value of anonymized public traces.

The usual way of obtaining the ground truth is fragile and inefficient, and its results are not directly comparable from one work to another. This paper proposes a methodology and details the design of a technical framework that significantly boosts the efficiency of compiling application traffic ground truth. Further, a case study on a 30-minute real data trace is presented. In contrast with past work, this is an easy, hands-on tool suite dedicated to saving users' time and labor, and it is freely available to the public.

This work was supported by the Engineering and Physical Sciences Research Council through grant GR/T10510/02 (http://www.cl.cam.ac.uk/research/srg/netos/brasil/).

Author information

Authors and Affiliations

DIST, University of Genoa, Italy
Marco Canini & Raffaele Bolla
Computer Laboratory, University of Cambridge, UK
Wei Li & Andrew W. Moore

Editor information

Editors and Affiliations

Dept. of Computer Science, F.O.R.T.H., Institute of Computer Science, Vassilika Vouton, P.O. Box 1385, 711 10, Heraklion, Greece, and University of Crete, P.O. Box 2208, 714 09, Heraklion, Crete, Greece
Maria Papadopouli
LAAS – CNRS, 7 Avenue du Colonel Roche, 31077, Toulouse, cedex 4, France
Philippe Owezarski
Dept. of Electrical Engineering, Mathematics and Computer Science, Design and Analysis of Communication Systems Group, University of Twente, P.O. Box 217, 7500 AE, Enschede, The Netherlands
Aiko Pras

About this paper

Cite this paper

Canini, M., Li, W., Moore, A.W., Bolla, R. (2009). GTVS: Boosting the Collection of Application Traffic Ground Truth. In: Papadopouli, M., Owezarski, P., Pras, A. (eds) Traffic Monitoring and Analysis. TMA 2009. Lecture Notes in Computer Science, vol 5537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01645-5_7

DOI: https://doi.org/10.1007/978-3-642-01645-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01644-8
Online ISBN: 978-3-642-01645-5
eBook Packages: Computer Science, Computer Science (R0)
