Skip to main content

OrderSpan: Mining Closed Partially Ordered Patterns

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8207))

Abstract

Due to the complexity of the task, partially ordered pattern mining of sequential data has not been subject to much study, despite its usefulness. This paper investigates this data mining challenge by describing OrderSpan, a new algorithm that extracts such patterns from sequential databases and overcomes some of the drawbacks of existing methods. Our work consists in providing a simple and flexible framework to directly mine complex sequences of itemsets, by combining well-known properties on prefixes and suffixes. Experiments were performed on different real datasets to show the benefit of partially ordered patterns.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Geng, L., Hamilton, H.J.: Interestingness measures for data mining: A survey. ACM Computing Survey (2006)

    Google Scholar 

  2. Cheng, H., Yan, X., Han, J., Hsu, C.: Discriminative frequent pattern analysis for effective classification. In: Conference on Data Engineering, ICDE (2007)

    Google Scholar 

  3. Cheng, H., Yan, X., Han, J., Yu, P.S.: Direct discriminative pattern mining for effective classification. In: Conference on Data Engineering, ICDE (2008)

    Google Scholar 

  4. Wang, M., Shang, X.-q., Li, Z.-h.: Sequential pattern mining for protein function prediction. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds.) ADMA 2008. LNCS (LNAI), vol. 5139, pp. 652–658. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  5. Agrawal, R., Srikant, R.: Mining sequential patterns. In: International Conference on Data Engineering, ICDE (1995)

    Google Scholar 

  6. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Int. Conference on Very Large Data Bases, VLDB (1994)

    Google Scholar 

  7. Ren, J., Wang, L., Dong, J., Hu, C., Wang, K.: A novel sequential pattern mining algorithm for the feature discovery of software fault. In: Int. Conference on Computational Intelligence and Software Engineering, CiSE (2009)

    Google Scholar 

  8. Sallaberry, A., Pecheur, N., Bringay, S., Roche, M., Teisseire, M.: Sequential patterns mining and gene sequence visualization to discover novelty from microarray data. Journal of Biomedical Informatics (2011)

    Google Scholar 

  9. George, A., Binu, D.: Drl-prefixspan: A novel pattern growth algorithm for discovering downturn, revision and launch (drl) sequential patterns. Central European Journal of Computer Science (2012)

    Google Scholar 

  10. Wang, J., Han, J.: Bide: efficient mining of frequent closed sequences. In: International Conference on Data Engineering, ICDE (2004)

    Google Scholar 

  11. Yan, X., Han, J., Afshar, R.: CloSpan: Mining Closed Sequential Patterns in Large Datasets. In: SIAM International Conference on Data Mining, SDM (2003)

    Google Scholar 

  12. Mannila, H., Toivonen, H., Verkamo, A.I.: Discovery of Frequent Episodes in Event Sequences. In: Data Mining and Knowledge Discovery (1997)

    Google Scholar 

  13. Tatti, N., Cule, B.: Mining closed strict episodes. In: Data Mining and Knowledge Discovery (2012)

    Google Scholar 

  14. Zhou, W., Liu, H., Cheng, H.: Mining closed episodes from event sequences efficiently. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010, Part I. LNCS, vol. 6118, pp. 310–318. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  15. Pei, J., Wang, H., Liu, J., Wang, K., Wang, J., Yu, P.S.: Discovering frequent closed partial orders from strings. IEEE Transactions on Knowledge and Data Engineering (2006)

    Google Scholar 

  16. Casas-Garriga, G.: Summarizing sequential data with closed partial orders. In: SIAM International Conference on Data Mining, SDM (2005)

    Google Scholar 

  17. Pei, J., Han, J., Mortazavi-Asl, B., Wang, J., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: Mining sequential patterns by pattern-growth: The prefixspan approach. IEEE Transactions on Knowledge and Data Engineering (2004)

    Google Scholar 

  18. Fabregue, M., Bringay, S., Poncelet, P., Teisseire, M., Orsetti, B.: Mining microarray data to predict the histological grade of a Breast Cancer. Journal of Biomedical Informatics (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fabrègue, M., Braud, A., Bringay, S., Le Ber, F., Teisseire, M. (2013). OrderSpan: Mining Closed Partially Ordered Patterns. In: Tucker, A., Höppner, F., Siebes, A., Swift, S. (eds) Advances in Intelligent Data Analysis XII. IDA 2013. Lecture Notes in Computer Science, vol 8207. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41398-8_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41398-8_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41397-1

  • Online ISBN: 978-3-642-41398-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics