Building the Data Warehouse of Frequent Itemsets in the DWFIST Approach

Monteiro, Rodrigo Salvador; Zimbrão, Geraldo; Schwarz, Holger; Mitschang, Bernhard; de Souza, Jano Moreira

doi:10.1007/11425274_31

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3488)

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

Abstract

Some data mining tasks produce such large volumes of results that they create a new knowledge management problem. Frequent itemset mining falls into this category. Several approaches have been proposed to handle or avoid this problem, but all of them have limitations. In particular, most of them need the original data during the analysis phase, which is not feasible for data streams. The DWFIST (Data Warehouse of Frequent ItemSets Tactics) approach aims to provide a powerful environment for the analysis of itemsets and derived patterns, such as association rules, without accessing the original data during the analysis phase. The approach is based on a Data Warehouse of Frequent Itemsets, which provides frequent itemsets in a flexible and efficient way as well as a standardized logical view upon which analytical tools can be developed. This paper presents how such a data warehouse can be built.
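
To illustrate the idea of deriving patterns without accessing the original data, the following minimal Python sketch computes association rules purely from stored frequent-itemset counts. It is not the DWFIST schema or the authors' implementation: the function name `derive_rules`, the dictionary layout, and the sample counts are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def derive_rules(itemset_counts, min_confidence):
    """Derive rules X -> Y using only stored itemset counts (no raw transactions).

    itemset_counts: dict mapping frozenset of items -> support count (assumed layout).
    Returns a list of (antecedent, consequent, confidence) tuples.
    """
    rules = []
    for itemset, count in itemset_counts.items():
        if len(itemset) < 2:
            continue  # a rule needs a non-empty antecedent and consequent
        for size in range(1, len(itemset)):
            for items in combinations(sorted(itemset), size):
                antecedent = frozenset(items)
                antecedent_count = itemset_counts.get(antecedent)
                if antecedent_count is None:
                    continue  # subset not stored, so confidence is unknown
                confidence = count / antecedent_count
                if confidence >= min_confidence:
                    rules.append((antecedent, itemset - antecedent, confidence))
    return rules


# Hypothetical stored counts, standing in for rows of an itemset warehouse.
stored = {
    frozenset({"bread"}): 6,
    frozenset({"milk"}): 5,
    frozenset({"bread", "milk"}): 4,
}
rules = derive_rules(stored, min_confidence=0.75)
for antecedent, consequent, confidence in rules:
    print(sorted(antecedent), "->", sorted(consequent), round(confidence, 2))
```

With these sample counts, only the rule from milk to bread passes the 0.75 threshold, since its confidence is 4/5 while the reverse rule has confidence 4/6. The point of the sketch is that the confidence of a rule depends only on the counts of the itemset and its antecedent, which is why a store of frequent itemsets can support rule analysis on its own.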

Author information

Authors and Affiliations

Computer Science Department, Graduate School of Engineering, Federal University of Rio de Janeiro, PO Box 68511, 21945-970, Rio de Janeiro, Brazil
Rodrigo Salvador Monteiro, Geraldo Zimbrão & Jano Moreira de Souza
Institute f. Parallel & Distributed Systems, University of Stuttgart, Universitaetsstr. 38, 70569, Stuttgart
Rodrigo Salvador Monteiro, Holger Schwarz & Bernhard Mitschang
Computer Science Department, Institute of Mathematics, UFRJ, Brazil
Geraldo Zimbrão & Jano Moreira de Souza

Editor information

Editors and Affiliations

LIRIS - UFR d’Informatique, Université Claude Bernard Lyon 1, 43, boulevard du 11 novembre 1918, 69622, Villeurbanne, France
Mohand-Said Hacid
Department of Computer Science, State University of New York, 12222, Albany, NY, USA
Neil V. Murray
Department of Computer Science, University of North Carolina, 28223, Charlotte, NC, USA
Zbigniew W. Raś
Shimane University, 89-1 Enya-cho Izumo, 6938501, Shimane, Japan
Shusaku Tsumoto

About this paper

Cite this paper

Monteiro, R.S., Zimbrão, G., Schwarz, H., Mitschang, B., de Souza, J.M. (2005). Building the Data Warehouse of Frequent Itemsets in the DWFIST Approach. In: Hacid, M.-S., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds) Foundations of Intelligent Systems. ISMIS 2005. Lecture Notes in Computer Science, vol 3488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11425274_31

DOI: https://doi.org/10.1007/11425274_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25878-0
Online ISBN: 978-3-540-31949-8
eBook Packages: Computer Science, Computer Science (R0)
