A Condensed Representation of Itemsets for Analyzing Their Evolution over Time
Driven by the need to understand change within domains there is emerging research on methods which aim at analyzing how patterns and in particular itemsets evolve over time. In practice, however, these methods suffer from the problem that many of the observed changes in itemsets are temporally redundant in the sense that they are the side-effect of changes in other itemsets, hence making the identification of the fundamental changes difficult. As a solution we propose temporally closed itemsets, a novel approach for a condensed representation of itemsets which is based on removing temporal redundancies. We investigate how our approach relates to the well-known concept of closed itemsets if the latter would be directly generalized to account for the temporal dimension. Our experiments support the theoretical results by showing that the set of temporally closed itemsets is significantly smaller than the set of closed itemsets.
KeywordsAssociation Rule Frequent Itemsets Condensed Representation Temporal Redundancy Support History
- 2.Agrawal, R., Psaila, G.: Active data mining. In: Fayyad, U.M., Uthurusamy, R. (eds.) Proceedings of the 1st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Montreal, Quebec, Canada, pp. 3–8. AAAI Press, Menlo Park (1995)Google Scholar
- 3.Chakrabarti, S., Sarawagi, S., Dom, B.: Mining surprising patterns using temporal description length. In: Proceedings of the 24th International Conference on Very Large Databases, pp. 606–617. Morgan Kaufmann Publishers Inc., San Francisco (1998)Google Scholar
- 4.Liu, B., Ma, Y., Lee, R.: Analyzing the interestingness of association rules from the temporal dimension. In: Proceedings of the IEEE International Conference on Data Mining, pp. 377–384. IEEE Computer Society Press, Los Alamitos (2001)Google Scholar
- 8.Liu, B., Hsu, W., Ma, Y.: Discovering the set of fundamental rule changes. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 335–340 (2001)Google Scholar
- 14.Ruggles, S., Sobek, M., Alexander, T., Fitch, C.A., Goeken, R., Hall, P.K., King, M., Ronnander, C.: Integrated public use microdata series: Version 4.0, machine-readable database. Minnesota population center, Minneapolis (producer and distributor) (2008)Google Scholar