Abstract
High-utility itemset mining (HUIM) currently plays an important role in a wide range of applications and data mining community. Several algorithms, methods and data structures have been proposed to improve efficiency of mining for such itemsets. Besides, HUIM is extended in several aspects including the regarding of “regularity or irregularity of occurrence” on high utility itemsets. This leads to the emerging of high-utility regular itemsets mining (HURIM) and high-utility irregular itemsets mining (HUIIM) which can help to observe occurrence behavior of high utility itemsets. Based on HURIM, the regularity threshold is applied to measure interestingness of itemsets and to prune search space. However, on HUIIM, the threshold cannot help to prune uninteresting itemsets causing this task consumes high computational cost. Thus, in this paper, we here present a single-pass algorithm, called HUIIM (High-Utility Irregular Itemset Miner), for efficiently mining high-utility irregular itemsets. The new-modified utility list structure (NUL) is applied for maintaining occurrence information simultaneously with utility value of an itemset and also for fast calculation of total utility of the itemset. Moreover, a new pruning technique is designed and applied to improve computational performance of HUIIM. Experimental studies were conducted on synthetic and real datasets to show efficiency of HUIIM (with and without the new pruning technique) in the terms of computational time and memory usage.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: VLDB, pp. 487–499 (1994)
Ahmed, C.F., Tanbeer, S.K., Jeong, B.S., Lee, Y.K.: Efficient tree structures for high utility pattern mining in incremental databases. IEEE Trans. Knowl. Data Eng. 21(12), 1708–1721 (2009)
Ahmed, C.F., Tanbeer, S.K., Jeong, B.: Mining high utility web access sequences in dynamic web log data. In: Proceeding of the International Conference on Software Engineering Artificial Intelligence Networking and Parallel/Distributed Computing. IEEE, London, UK, June 2010, pp. 76–81 (2010)
Ahmed, C.F., Tanbeer, S.K., Jeong, B.S., Lee, Y.-K.: HUC-prune: an efficient candidate pruning technique to mine high utility patterns. Appl. Intell. 34(2), 181–198 (2011)
Amphawan, K., Surarerks, A.: Pushing regularity constraint on high utility itemsets mining. In: 2015 2nd International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA), pp. 1–6 (2015)
Amphawan, K., Lenca, P.: Mining Top-k frequent/regular patterns based on user-given trade-off between frequency and regularity, pp. 1–12 (2013)
Amphawan, K., Lenca, P.: Mining top-k frequent-regular closed patterns. Expert Syst. Appl. 42(21), 7882–7894 (2015)
Amphawan, K., Lenca, P., Surarerks, A.: Mining top-k periodic-frequent patterns without support threshold. In: Proceedings of the 3rd International Conference on Advances in Information Technology, vol. 55, pp. 18–29 (2009)
Amphawan, K., Lenca, P., Jitpattanakul, A., Surarerks, A.: Mining high utility itemsets with regular occurrence. J. ICT Res. Appl. 10(2), 153–176 (2016)
Chan, R., Yang, Q., Shen, Y.-D.: Mining high utility itemsets. In: Third IEEE International Conference on Data Mining, 2003. ICDM 2003, pp. 19–26 (2003)
Chang, J.H., Lee, W.S.: Finding recent frequent itemsets adaptively over online data streams. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 487–492. ACM (2003)
Dam, T.-L.: PHM: mining periodic high-utility itemsets. In: Advances in Data Mining. Applications and Theoretical Aspects: 16th Industrial Conference, ICDM 2016, New York, NY, USA, 13–17 July 2016. Proceedings, vol. 9728, p. 64. Springer (2016)
Dinh, T., Huynh, V.-N., Le, B.: Mining periodic high utility sequential patterns. In: Asian Conference on Intelligent Information and Database Systems, pp. 545–555. Springer (2017)
Dong, G., Li, J.: Efficient mining of emerging patterns: discovering trends and differences. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 43–52. ACM (1999)
Duong, Q.-H., Fournier-Viger, P., Ramampiaro, H., Nørvåg, K., Dam, T.-L.: Efficient high utility itemset mining using buffered utility-lists. Appl. Intell. (2017)
Eisariyodom, S., Amphawan, K.: Discovering interesting itemsets based on change in regularity of occurrence. In: 2017 9th International Conference on Knowledge and Smart Technology (KST), pp. 138–143. IEEE (2017)
Fournier-Viger, P., Zida, S.: FOSHU: faster on-shelf high utility itemset mining – with or without negative unit profit. In: Proceedings of the 30th Annual ACM Symposium on Applied Computing, SAC’15, pp. 857–864 (2015)
Fournier-Viger, P., Wu, C.-W., Zida, S., Tseng, V.S.: FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning, pp. 83–92 (2014)
Gouda, K., Zaki, M.J.: Efficiently mining maximal frequent itemsets. In: Proceedings IEEE International Conference on Data Mining, 2001. ICDM 2001, pp. 163–170. IEEE (2001)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, SIGMOD’00, pp. 1–12 (2000)
Klangwisan, K., Amphawan, K.: Mining weighted-frequent-regular itemsets from transactional database. In: 2017 9th International Conference on Knowledge and Smart Technology (KST), pp. 66–71. IEEE (2017)
Krishnamoorthy, S.: Pruning strategies for mining high utility itemsets. Expert Syst. Appl. 42(5), 2371–2381 (2015)
Krishnamoorthy, S.: Efficiently mining high utility itemsets with negative unit profits. Knowl.-Based Syst. (2017)
Lan, G.-C., Hong, T.-P., Tseng, V.S.: Discovery of high utility itemsets from on-shelf time periods of products. Expert Syst. Appl. 38(5), 5851–5857 (2011)
Laoviboon, S., Amphawan, K.: Mining high-utility itemsets with irregular occurrence. In: 2017 9th International Conference on Knowledge and Smart Technology (KST), pp. 89–94. IEEE (2017)
Li, H.-F., Lee, S.-Y.: Mining frequent itemsets over data streams using efficient window sliding techniques. Expert Syst. Appl. 36(2), 1466–1477 (2009)
Li, H.-F., Huang, H.-Y., Chen, Y.-C., Liu, Y.-J., Lee, S.-Y.: Fast and memory efficient mining of high utility itemsets in data streams. In: Eighth IEEE International Conference on Data Mining, 2008. ICDM’08, pp. 881–886. IEEE (2008)
Lin, C.-W., Hong, T.-P., Lan, G.-C., Wong, J.-W., Lin, W.-Y.: Efficient updating of discovered high-utility itemsets for transaction deletion in dynamic databases. Adv. Eng. Inform. 29(1), 16–27 (2015)
Lin, J.C.-W., Fournier-Viger, P., Gan, W.: FHN: an efficient algorithm for mining high-utility itemsets with negative unit profits. Knowl.-Based Syst. 111, 283–298 (2016)
Lin, J.C.-W., Zhang, J., Fournier-Viger, P., Hong, T.-P., Chen, C.-M., Su, J.-H.: Efficient mining of short periodic high-utility itemsets. In: 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 003083–003088. IEEE (2016)
Lin, J.C.-W., Ren, S., Fournier-Viger, P., Hong, T.-P., Su, J.-H., Vo, B.: A fast algorithm for mining high average-utility itemsets. Appl. Intell. 47(2), 331–346 (2017)
Lin, J.C.-W., Zhang, J., Fournier-Viger, P., Hong, T.-P., Zhang, J.: A two-phase approach to mine short-period high-utility itemsets in transactional databases. Adv. Eng. Inform. 33, 29–43 (2017)
Liu, M., Qu, J.: Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 55–64 (2012)
Liu, Y., Cheng, C., Tseng, V.S.: Mining differential top-k co-expression pattern from time course comparative gens expression datasets. In: Proceeding of the International Conference on Communication, Computing, and Security. CRC Press, Gurgaon, India, September 2016, p. 230 (2013)
Liu, Y., Liao, W.-K., Choudhary, A.: A two-phase algorithm for fast discovery of high utility itemsets. Adv. Knowl. Discov. Data Min. 3518, 689–695 (2005)
Mai, T., Vo, B., Nguyen, L.T.T.: A lattice-based approach for mining high utility association rules. Inf. Sci. 399, 81–97 (2017)
Pasquier, N., Bastide, Y., Taouil, R., Lakhal, L.: Discovering frequent closed itemsets for association rules. In: International Conference on Database Theory, pp. 398–416. Springer (1999)
Pei, J., Han, J., Mao, R., et al.: Closet: an efficient algorithm for mining frequent closed itemsets. In: ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, vol. 4, pp. 21–30 (2000)
Podpecan, V., Lavrac, N., Kononenko, I.: A fast algorithm for mining utility-frequent itemsets. In: Constraint-Based Mining and Learning, p. 9 (2007)
Ryang, H., Yun, U.: Top-k high utility pattern mining with effective threshold raising strategies. Knowl-Based Syst. 76, 109–126 (2015)
Shie, B.-E., Hsiao, H.-F., Tseng, V., Philip, Y.: Mining high utility mobile sequential patterns in mobile commerce environments. In: Database Systems for Advanced Applications, pp. 224–238. Springer (2011)
Shie, B.-E., Hsiao, H.-F., Tseng, V.S.: Efficient algorithms for discovering high utility user behavior patterns in mobile commerce environments. In: Knowledge and Information Systems, pp. 1–25 (2013)
Tanbeer, S., Ahmed, C., Jeong, B.-S.: Mining regular patterns in data streams. In: Database Systems for Advanced Applications, pp. 399–413. Springer (2010)
Tanbeer, S.K., Ahmed, C.F., Jeong, B.-S., Lee, Y.K.: Discovering periodic-frequent patterns in transactional databases. In: Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, pp. 242–253 (2009)
Tanbeer, S.K., Ahmed, C.F., Jeong, B.-S.: Mining regular patterns in incremental transactional databases. In: 2010 12th International Asia-Pacific Web Conference (APWEB), pp. 375–377. IEEE (2010)
Tao, F., Murtagh, F., Farid, M.: Weighted association rule mining using weighted support and significance framework. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 661–666. ACM (2003)
Thilagu, M., Nadatajan, R.: Efficiently mining of effective web traversal pattern with average utility. In: Proceeding of the International Conference on Communication, Computing, and Security. CRC Press, Gurgaon, India, September 2016, pp. 444–451 (2016)
Tseng, V.S., Shie, B.E., Wu, C.W., Yu, P.S.: Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans. Knowl. Data Eng. 25(8), 1772–1786 (2013)
Tseng, V.S., Wu, C.W., Fournier-Viger, P., Yu, P.S.: Efficient algorithms for mining top-k high utility itemsets. IEEE Trans. Knowl. Data Eng. 28(1), 54–67 (2016)
Tseng, V.S., Shie, B.E., Wu, C.W., Yu, P.S.: Up-growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 18–27 (2010)
Yun, U., Kim, D.: Mining of high average-utility itemsets using novel list structure and pruning strategy. Future Gener. Comput. Syst. 68, 346–360 (2017)
Acknowledgements
This work was financially supported by the Research Grant of Burapha University through National Research Council of Thailand (Grant no. 15/2561).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Laoviboon, S., Amphawan, K. (2019). Mining High-Utility Irregular Itemsets. In: Fournier-Viger, P., Lin, JW., Nkambou, R., Vo, B., Tseng, V. (eds) High-Utility Pattern Mining. Studies in Big Data, vol 51. Springer, Cham. https://doi.org/10.1007/978-3-030-04921-8_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-04921-8_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04920-1
Online ISBN: 978-3-030-04921-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)