SBAD: Sequence Based Attack Detection via Sequence Comparison

Mao, Ching-Hao; Pao, Hsing-Kuo; Faloutsos, Christos; Lee, Hahn-Ming

doi:10.1007/978-3-642-19896-0_7

Ching-Hao Mao²⁴,
Hsing-Kuo Pao²⁴,
Christos Faloutsos²⁵ &
…
Hahn-Ming Lee^24,26

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6549))

Included in the following conference series:

International Workshop on Privacy and Security Issues in Data Mining and Machine Learning

1208 Accesses
4 Citations

Abstract

Given a stream of time-stamped events, like alerts in a network monitoring setting, how can we isolate a sequence of alerts that form a network attack? We propose a Sequence Based Attack Detection (SBAD) method, which makes the following contributions: (a) it automatically identifies groups of alerts that are frequent; (b) it summarizes them into a suspicious sequence of activity, representing them with graph structures; and (c) it suggests a novel graph-based dissimilarity measure. As a whole, SBAD is able to group suspicious alerts, visualize them, and spot anomalies at the sequence level. The evaluations from three datasets—two benchmark datasets (DARPA 1999, PKDD 2007) and a private dataset Acer 2007 gathered from a Security Operation Center in Taiwan—support our approach. The method performs well even without the help of the IP and payload information. No need for privacy information as the input makes the method easy to plug into existing system such as an intrusion detector. To talk about efficiency, the proposed method can deal with large-scale problems, such as processing 300K alerts within 20 mins on a regular PC.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imielinski, T., Swami, A.: Database mining: a performance perspective. IEEE Trans. on Knowledge and Data Engineering 5(6), 914–925 (1993)
Article Google Scholar
Ding, B., Lo, D., Han, J., Khoo, S.-C.: Efficient mining of closed repetitive gapped subsequences from a sequence database. In: ICDE 2009 (March 2009)
Google Scholar
Exbrayat, M.: ECML/PKDD challenge: analyzing web traffic a boundaries signature approach. In: PKDD 2007, pp. 17–29 (2007)
Google Scholar
Faloutsos, C., Lin, K.-I.D.: Fastmap: A fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets. In: ACM SIGMOD, May 23-25, pp. 163–174 (1995)
Google Scholar
Ji, X., Bailey, J., Dong, G.: Mining minimal distinguishing subsequence patterns with gap constraints. In: ICDM 2005 (2005)
Google Scholar
Ke, Y., Cheng, J., Yu, J.X.: Top-k correlative graph mining. In: SDM, pp. 1038–1049 (2009)
Google Scholar
Keogh, E., Lonardi, S., Ratanamahatana, C.A.: Towards parameter-free data mining. In: KDD 2004, pp. 206–215 (2004)
Google Scholar
Kohavi, R., Provost, F.: Glossary of terms. Editorial for the Special Issue on Applications of Machine Learning and the Knowledge Discovery Process 30, 271–274 (1998)
Google Scholar
Lane, T., Brodley, C.E.: An empirical study of two approaches to sequence learning for anomaly detection. Machine Learning 51(1), 73–107 (2004)
Article MATH Google Scholar
Law, M.H.C., Zhang, N., Jain, A.K.: Nonlinear manifold learning for data stream. In: Jonker, W., Petković, M. (eds.) SDM 2004. LNCS, vol. 3178. Springer, Heidelberg (2004)
Google Scholar
Lee, Y.-J., Mangasarian, O.L.: SSVM: A smooth support vector machine for classification. Comput. Optim. Appl. 20(1), 5–22 (2001)
Article MathSciNet MATH Google Scholar
Li, M., Badger, J.H., Chen, X., Kwong, S., Kearney, P., Zhang, H.: An information-based sequence distance and its application to whole mitochondrialgenome phylogeny. Bioinformatics 17(2), 149–154 (2001)
Article Google Scholar
Li, M., Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications, 2nd edn. Springer, New York (1997)
Book MATH Google Scholar
Mannila, H., Toivonen, H., Verkamo, A.I.: Discovering frequent episodes in sequences. In: Fayyad, U.M., Uthurusamy, R. (eds.) KDD 1995 (1995)
Google Scholar
Ning, P., Cui, Y., Reeves, D., Xu, D.: Techniques and tools for analyzing intrusion alerts. ACM Trans. Inf. Sys. Secur. 7(2), 274–318 (2004)
Article Google Scholar
Pao, H.-K., Case, J.: Computing entropy for ortholog detection. In: International Conference on Computational Intelligence, pp. 89–92 (2004)
Google Scholar
Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)
Article Google Scholar
Zhou, J., Heckman, M., Reynolds, B., Carlson, A., Bishop, M.: Modeling network intrusion detection alerts for correlation. ACM Trans. Inf. Sys. Secur. 10(1), 1–31 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science & Information Engineering, National Taiwan University of Science & Technology, Taipei, Taiwan
Ching-Hao Mao, Hsing-Kuo Pao & Hahn-Ming Lee
Dept. of Computer Science, Carnegie Mellon University, Pittsburgh, USA
Christos Faloutsos
Institute of Information Science, Academia Sinica, Taipei, Taiwan
Hahn-Ming Lee

Authors

Ching-Hao Mao
View author publications
You can also search for this author in PubMed Google Scholar
Hsing-Kuo Pao
View author publications
You can also search for this author in PubMed Google Scholar
Christos Faloutsos
View author publications
You can also search for this author in PubMed Google Scholar
Hahn-Ming Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Johann Wolfgang Goethe University, Ruth-Moufang-Str. 1, 60438, Frankfurt am Main, Germany
Christos Dimitrakakis
Information Analytics Lab, IBM Research – Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland
Aris Gkoulalas-Divanis
Ecole Polytechnice Fédérale de Lausanne, I&C - ISC - LASEC, Bâtiment INF, Station 14, CH-1015, Lausanne, Switzerland
Aikaterini Mitrokotsa
Department of Computer and Communication Engineering, University of Thessaly, Glavani 37 & 28TH, GR 38221, Octovriou, Volos, Greece
Vassilios S. Verykios
Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli, 34956, Tuzla, Istanbul, Turkey
Yücel Saygin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mao, CH., Pao, HK., Faloutsos, C., Lee, HM. (2011). SBAD: Sequence Based Attack Detection via Sequence Comparison. In: Dimitrakakis, C., Gkoulalas-Divanis, A., Mitrokotsa, A., Verykios, V.S., Saygin, Y. (eds) Privacy and Security Issues in Data Mining and Machine Learning. PSDML 2010. Lecture Notes in Computer Science(), vol 6549. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19896-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-19896-0_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19895-3
Online ISBN: 978-3-642-19896-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics