Finding Unexpected Patterns in Data

Padmanabhan, Balaji; Tuzhilin, Alexander

doi:10.1007/978-3-7908-1791-1_10

Finding Unexpected Patterns in Data

Balaji Padmanabhan⁵ &
Alexander Tuzhilin⁶

Chapter

279 Accesses
2 Citations

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 95))

Abstract

Many pattern discovery methods in the KDD literature have the drawbacks of (1) discovering too many obvious or irrelevant patterns and (2) not using prior knowledge systematically. In this chapter we present an approach that addresses these drawbacks. In particular we present an approach to characterizing the unexpectedness of patterns based on prior background knowledge in the form of beliefs. Based on this characterization of unexpectedness we present an algorithm, ZoomUR, for discovering unexpected patterns in data.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imielinski, T. and Swami, A., 1993. Mining Association Rules Between Sets of Items in Large Databases. In Proc. of the ACM SIGMOD Conference on Management of Data, pp. 207–216.
Google Scholar
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H. and Verkamo,A.I., 1995. Fast Discovery of Association Rules. In Fayyad, U.M., PiatetskyShapiro, G., Smyth, P., and Uthurusamy, R. eds., Advances in Knowledge Discovery and Data Mining. AAAI Press.
Google Scholar
AT97] Adomavicius, G., and Tuzhilin, A., 1997. Discovery of Actionable Patterns in Databases: The Action Hierarchy Approach. In Proc. of the Third Intl. Conference on Knowledge Discovery and Data Mining (KDD 97).
Google Scholar
Buchanan, B.G. and E.A. Feigenbaum. DENDRAL and METADENDRAL: Their Applications Dimensions. Artificial Intelligence, 11:5 — 24, 1978.
Google Scholar
Brin, S., Motwani, R., Ullman, J.D., and Tsur, S., 1997. Dynamic Itemset Counting and Implication Rules for Market Basket Data. Procs. ACM SIGMOD Int. Conf. on Mgmt. of Data, pp. 255–264.
Google Scholar
Dhar, V., and Tuzhilin, A., 1993. Abstract-Driven Pattern Discovery in Databases. IEEE Transactions on Knowledge and Data Engineering, v.5, no. 6 December 1993.
Google Scholar
Forbes Magazine, Sep. 8, 1997. Believe in yourself, believe in the merchandise, pp.118–124.
Google Scholar
Frawley, W.J., Piatetsky-Shapiro, G. and Matheus, C.J., 1991. Knowledge Discovery in Databases: An Overview. In Piatetsky-Shapiro, G. and Frawley, W.J. eds., Know. Disc. in Databases. AAAI/MIT Press, 1991.
Google Scholar
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., 1996. From Data Mining to Knowledge Discovery: An Overview. In Fayyad, U.M.,Piatetsky-Shapiro, G., Smyth, P., and Uthurusamy, R. eds., Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press.
Google Scholar
Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H. and Verkamo, A.I., 1994. Finding Interesting Rules from Large Sets of Discovered Association Rules. In Proc. of the Third International Conference on Information and Knowledge Management, pp. 401–407.
Google Scholar
D.B. Lenat and J.S. Brown. Why AM and EURISKO appear to work. Artificial Intelligence, 23 (3): 269–294. 1984.
Article Google Scholar
Y. Lee, B.G. Buchanan, and J.M. Aronis. Knowledge-based Learning in Exploratory Science: Learning Rules to Predict Rodent Carcinogenicity. Machine Learning, 30: 217–240. 1998.
Article Google Scholar
D.B. Lenat. AM: Discovery in Mathematics as Heuristic Search. In R. Davis and D. Lenat, editors. Knowledge-Based Systems in Artificial Intelligence. McGraw-Hill. 1983.
Google Scholar
Liu, B. and Hsu, W., 1996. Post-Analysis of Learned Rules. In Proc. of the Thirteenth National Conf. on Artificial Intelligence (AAAI ‘86), pp. 828–834.
Google Scholar
Liu, B., Hsu, W. and Chen, S, 1997. Using General Impressions to Analyze Discovered Classification Rules. In Proc. of the Third Intl. Conf. on Knowledge Discovery and Data Mining (KDD 97 ), pp. 31–36.
Google Scholar
Mitchell, T. The need for biases in learning generalizations. Technical Report CBM-TR-117, Dept. of Computer Science, Rutgers University, 1980.
Google Scholar
Michalski, R.S. and Kaufman, K.A. Data Mining and Knowledge Discovery: A Review of Issues and a Multistrategy Approach. Technical Report P97–3 MLI 97–2, Machine Learning and Inference Laboratory, George Mason University, 1997.
Google Scholar
Padmanabhan, B, 1999. Discovering Unexpected Patterns in Data Mining Applications. Doctoral dissertation, Department of Information Systems, Stern School of Business, New York University.
Google Scholar
Pazzani, M. and Kibler, D. “The Utility of Knowledge in Inductive Learning.” Machine Learning, 9 (1): 57–94, 1992.
Google Scholar
Piatetsky-Shapiro, G. and Matheus, C.J., 1994. The Interestingness of Deviations. In Proc. of AAAI-94 Workshop on Know. Discovery in Databases, pp. 25–36.
Google Scholar
Padmanabhan, B. and Tuzhilin, A., 1997. On the Discovery of Unexpected Rules in Data Mining Applications. In Procs. of the Workshop on Information Technology and Systems (WITS ‘87), pp. 81–90.
Google Scholar
Stedman, C., 1997. Data Mining for Fool’s Gold. Computerworld, Vol. 31,No. 48, Dec. 1997.
Google Scholar
Shrager, J. and P. Langley. Computational Models of Scientific Discovery and Theory Formation. San Mateo, CA: Morgan Kaufmann, 1990.
Google Scholar
Silberschatz, A. and Tuzhilin, A., 1995. On Subjective Measures of Interestingness in Knowledge Discovery. In Proc. of the First International Conference on Knowledge Discovery and Data Mining, pp. 275–281.
Google Scholar
Silberschatz, A. and Tuzhilin, A., 1996. What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Trans. on Know. and Data Engineering. Spec. Issue on Data Mining, v. 5, no. 6, pp. 970–974.
Article Google Scholar
Silberschatz, A. and Tuzhilin, A., 1996. A Belief-Driven Discovery Framework Based on Data Monitoring and Triggering. Working Paper #IS-96–26, Dept. of Information Systems, Stern School of Business, NYU.
Google Scholar
Suzuki, E., 1997. Autonomous Discovery of Reliable Exception Rules. In Proc. of the Third International Conference on Knowledge Discovery and Data Mining, pp. 259–262.
Google Scholar
Srikant, R., Vu, Q. and Agrawal, R. Mining Association Rules with Item Constraints. In Proc. of the Third International Conference on Knowledge Discovery and Data Mining (KDD 97), pp. 67–73.
Google Scholar
Zytkow, J., J. Zhu, and A. Hussam. Automated Discovery in Chemistry Laboratory. Proceedings of the Eighth National Conference on Artificial Intelligence. pp 889–894, 1990.
Google Scholar

Download references

Author information

Authors and Affiliations

Operations and Information Management Department, The Wharton School, University of Pennsylvania, USA
Balaji Padmanabhan
Information Systems Department, Stern School of Business, New York University, USA
Alexander Tuzhilin

Authors

Balaji Padmanabhan
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Tuzhilin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Computer Science, San Jose State University The Metropolitan University of Silicon Valley, One Washington Square, 95192-0103, San Jose, CA, USA
Tsau Young Lin
Department of Computer Science, University of Regina, S4S 0A2, Regina, Saskatchewan, Canada
Yiyu Y. Yao
Computer Science Division and Electronics Research Laboratory Department of Electrical and Electronics, University of California Berkeley Initiative in Soft Computing (BISC), 94720-1776, Berkeley, CA, USA
Lotfi A. Zadeh

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Padmanabhan, B., Tuzhilin, A. (2002). Finding Unexpected Patterns in Data. In: Lin, T.Y., Yao, Y.Y., Zadeh, L.A. (eds) Data Mining, Rough Sets and Granular Computing. Studies in Fuzziness and Soft Computing, vol 95. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1791-1_10

Download citation

DOI: https://doi.org/10.1007/978-3-7908-1791-1_10
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2508-4
Online ISBN: 978-3-7908-1791-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics