Abstract
Market Basket Analysis often involves applying the de facto association rule mining method on massive sales transaction data. In this paper, we argue that association rule mining is not always the most suitable method for analysing big market-basket data. This is because the data matrix to be used for association rule mining is usually large and sparse, resulting in sluggish generation of many trivial rules with little insight. To address this problem, we summarise a real-world sales transaction data set into time series format. We then use time series clustering to discover commonly purchased items that are useful for pricing or formulating cross-selling strategies. We show that this approach uses a data set that is substantially smaller than the data to be used for association analysis. In addition, it reveals significant patterns and insights that are otherwise hard to uncover when using association analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. Proceedings of the International Conference on Very Large Data Bases. pp. 487–499. (1994)
Baralis, E., Cagliero, L., Cerquitelli, T., Garza, P.: Generalized association rule mining with constraints. Information Sciences. Vol. (194). pp. 68-84. (2012)
Basel, A.M., Amer F.A., and Mohammed Z. Z.: A new sampling technique for association rule mining. Journal of Information Science. Vol. 35. pp. 358–376. (2009)
Blattberg, R.C., Kim, B-D., Neslin, S.A.: Database Marketing, Analyzing and Managing Customers. Series: International Series in Quantitative Marketing. Vol. 18. (2008)
Chen, Y.L., Tang, K., Shen, R.J., Hu, Y.H.: Market basket analysis in a multiple store environment, Decision Support Systems. Vol. 40(2). pp. 339–354. (2005)
Creighton, C., Hanash S.: Mining gene expression databases for association rules. Bioinformatics. Vol. 19 (1), pp.79–86. (2003)
Cunningham, S.J., Frank, E.: Market basket analysis of library circulation data. Proceedings of 6th International Conference on Neural Information Processing. pp.825–830. (1999)
Gutierrez, N.: Demystifying Market Basket Analysis. DM Review Special Report. (2006)
LuÃs C.: A scalable algorithm for the market basket analysis. Journal of Retailing and Consumer Services. Vol. 14(6). pp. 400–407. (2007)
MacQueen, J. B.: Some Methods for classification and Analysis of Multivariate Observations. Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability 1. University of California Press. pp. 281–297. (1967)
Mafruz, Z.A., David, T., Kate, S.: Redundant association rules reduction techniques. International Journal Business Intelligent Data Mining. Vol. 2 (1). pp. 29–63. (2007)
Matteo, A. R., Eli, A.U.: Efficient Discovery of Association Rules and Frequent Itemsets through Sampling with Tight Performance Guarantees. Proceedings of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2012. pp. 25–41. (2012)
Stéphane, L., Olivier, T., Elie, P.: Association rule interestingness: measure and statistical validation. Quality measures in data mining. Springer. (2006)
Tan, S.C.: Simplifying and improving swarm-based clustering. In Proceedings of IEEE Congress on Evolutionary Computation. pp. 1–8. (2012)
Tan, S.C., Ting, K.M., Teng, S.W.: A general stochastic clustering method for automatic cluster discovery. Pattern Recognition. Vol. 44 (10). pp. 2786–2799. (2011)
Xiaozhe, W., Kate A.S., Rob, H., Damminda, A.:A Scalable Method for Time Series Clustering. Technical Report. Monash University. (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media Singapore
About this paper
Cite this paper
Tan, S.C., Lau, J.P.S. (2014). Time Series Clustering: A Superior Alternative for Market Basket Analysis. In: Herawan, T., Deris, M., Abawajy, J. (eds) Proceedings of the First International Conference on Advanced Data and Information Engineering (DaEng-2013). Lecture Notes in Electrical Engineering, vol 285. Springer, Singapore. https://doi.org/10.1007/978-981-4585-18-7_28
Download citation
DOI: https://doi.org/10.1007/978-981-4585-18-7_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-4585-17-0
Online ISBN: 978-981-4585-18-7
eBook Packages: EngineeringEngineering (R0)