Advertisement

Mining Sequential Patterns: A Context-Aware Approach

  • Julien Rabatel
  • Sandra Bringay
  • Pascal Poncelet
Chapter
Part of the Studies in Computational Intelligence book series (SCI, volume 471)

Abstract

Traditional sequential patterns do not take into account contextual information associated with sequential data. For instance, when studying purchases of customers in a shop, a sequential pattern could be “frequently, customers buy products A and B at the same time, and then buy product C”. Such a pattern does not consider the age, the gender or the socio-professional category of customers. However, by taking into account contextual information, a decision expert can adapt his/her strategy according to the type of customers. In this paper, we focus on the analysis of a given context (e.g., a category of customers) by extracting context-dependent sequential patterns within this context. For instance, given the context corresponding to young customers, we propose to mine patterns of the form “buying products A and B then product C is a general behavior in this population” or “buying products B and D is frequent for young customers only”. We formally define such context-dependent sequential patterns and highlight relevant properties that lead to an efficient extraction algorithm. We conduct our experimental evaluation on real-world data and demonstrate performance issues.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [Agrawal et al., 1993]
    Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. SIGMOD Rec. 22(2) (1993)Google Scholar
  2. [Agrawal and Srikant, 1995]
    Agrawal, R., Srikant, R.: Mining sequential patterns. In: Yu, P.S., Chen, A.S.P. (eds.) Eleventh International Conference on Data Engineering. IEEE Computer Society Press (1995)Google Scholar
  3. [Dong and Li, 1999]
    Dong, G., Li, J.: Efficient mining of emerging patterns: discovering trends and differences. In: KDD 1999: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York (1999)Google Scholar
  4. [Dong et al., 1999]
    Dong, G., Zhang, X., Wong, L., Li, J.: CAEP: Classification by Aggregating Emerging Patterns. In: Arikawa, S., Nakata, I. (eds.) DS 1999. LNCS (LNAI), vol. 1721, pp. 30–42. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  5. [Hilderman et al., 1998]
    Hilderman, R.J., Carter, C.L., Hamilton, H.J., Cercone, N.: Mining Market Basket Data Using Share Measures and Characterized Itemsets. In: Wu, X., Kotagiri, R., Korb, K.B. (eds.) PAKDD 1998. LNCS, vol. 1394, Springer, Heidelberg (1998)CrossRefGoogle Scholar
  6. [Jindal and Liu, 2008]
    Jindal, N., Liu, B.: Opinion spam and analysis. In: Proceedings of the International Conference on Web Search and Web Data Mining. ACM (2008)Google Scholar
  7. [Kuramochi and Karypis, 2001]
    Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: Proceedings IEEE International Conference on Data Mining, ICDM 2001, pp. 313–320. IEEE (2001)Google Scholar
  8. [Li et al., 2001]
    Li, J., Dong, G., Ramamohanarao, K.: Making use of the most expressive jumping emerging patterns for classification. Knowledge and Information Systems 3(2), 131–145 (2001)CrossRefGoogle Scholar
  9. [Mannila et al., 1997]
    Mannila, H., Toivonen, H., Inkeri Verkamo, A.: Discovery of frequent episodes in event sequences. Data Mining and Knowledge Discovery 1(3), 259–289 (1997)CrossRefGoogle Scholar
  10. [Pei et al., 2004]
    Pei, J., Han, J., Mortazavi-Asl, B., Wang, J., Pinto, H., Chen, Q., Dayal, U., Hsu, M.: Mining sequential patterns by pattern-growth: the PrefixSpan approach. IEEE Transactions on Knowledge and Data Engineering 16(11) (2004)Google Scholar
  11. [Pinto et al., 2001]
    Pinto, H., Han, J., Pei, J., Wang, K., Chen, Q., Dayal, U.: Multi-dimensional sequential pattern mining. In: Proceedings of the Tenth International Conference on Information and Knowledge Management. ACM (2001)Google Scholar
  12. [Plantevit et al., 2005]
    Plantevit, M., Choong, Y.W., Laurent, A., Laurent, D., Teisseire, M.: M2SP: Mining Sequential Patterns Among Several Dimensions. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 205–216. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  13. [Porter, 1980]
    Porter, M.: An algorithm for suffix stripping. Program: Electronic Library & Information Systems 40(3), 211–218 (1980)Google Scholar
  14. [Rabatel et al., 2010]
    Rabatel, J., Bringay, S., Poncelet, P.: Contextual Sequential Pattern Mining. In: 2010 IEEE International Conference on Data Mining Workshops, pp. 981–988. IEEE (2010)Google Scholar
  15. [Schmid, 1994]
    Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of International Conference on New Methods in Language Processing, vol. 12. Citeseer (1994)Google Scholar
  16. [Ziembiński, 2007]
    Ziembiński, R.: Algorithms for context based sequential pattern mining. Fundamenta Informaticae 76(4), 495–510 (2007)zbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Julien Rabatel
    • 1
    • 2
  • Sandra Bringay
    • 2
    • 3
  • Pascal Poncelet
    • 2
  1. 1.Tecnalia, Cap OmegaMontpellier Cedex 2France
  2. 2.LIRMM (CNRS UMR 5506), Univ. Montpellier 2Montpellier Cedex 5France
  3. 3.Dpt. MIAPUniv. Montpellier 3Montpellier Cedex 5France

Personalised recommendations