Skip to main content

Choosing the Right Lens: Finding What is Interesting in Data Mining

  • Chapter
Quality Measures in Data Mining

Part of the book series: Studies in Computational Intelligence ((SCI,volume 43))

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aczel J. and Daroczy, Z. On Measures of Information and Their Characterizations. Academic Press, New York, 1975.

    MATH  Google Scholar 

  2. Agrawal, R. and Srikant, R. Fast algorithms for mining association rules. In Proceedings of the 20th International Conference on Very Large Data Bases, pages 487-499, Santiago, Chile, 1994.

    Google Scholar 

  3. Baeza-Yates, R. and Ribeiro-Neto, B. Modern Information Retrieval. Addison Wesley, Boston, 1999.

    Google Scholar 

  4. Barnett, V., and Lewis, T. Outliers in Statistical Data. John Wiley and Sons, New York, 1994.

    MATH  Google Scholar 

  5. Bastide, Y., Pasquier, N., Taouil, R., Stumme, G., and Lakhal, L. Mining minimal non-redundant association rules using frequent closed itemsets. In Proceedings of the First International Conference on Computational Logic, pages 972-986, London, UK, 2000.

    Google Scholar 

  6. Bay, S.D. and Pazzani, M.J. Detecting change in categorical data: mining contrast sets. In Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining (KDD99), pages 302-306, San Diego, USA, 1999.

    Google Scholar 

  7. Bayardo, R.J. and Agrawal R. Mining the most interesting rules. In Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining (KDD99), pages 145-154, San Diego, USA, 1999.

    Google Scholar 

  8. Carvalho, D.R. and Freitas, A.A. A genetic algorithm-based solution for the problem of small disjuncts. In Proceedings of the Fourth European Conference on Principles of Data Mining and Knowledge Discovery (PKDD2000), pages 345-352, Lyon, France, 2000.

    Google Scholar 

  9. Chan, R., Yang, Q., and Shen, Y. Mining high utility itemsets. In Proceedings of the Third IEEE International Conference on Data Mining (ICDM03), pages 19-26, Melbourne, FL, 2003.

    Google Scholar 

  10. Dong G. and Li, J. Interestingness of discovered association rules in terms of neighborhood-based unexpectedness. In Proceedings of Second Pacific Asia Conference on Knowledge Discovery in Databases (PAKDD98), pages 72-86, Melbourne, 1998.

    Google Scholar 

  11. Duda, R.O., Hart, P.E., and Stork, D.G. Pattern Classification. Wiley-Interscience, 2001.

    Google Scholar 

  12. Encaoua, D. and Jacquemin, A. Indices de concentration et pouvoir de monopole. Revue Economique, 29(3):514-537, 1978.

    Article  Google Scholar 

  13. Forsyth, R.S., Clarke, D.D., and Wright, R.L. Overfitting revisited: an information-theoretic approach to simplifying discrimination trees. Journal of Experimental and Theoretical Artificial Intelligence, 6:289-302, 1994.

    Article  Google Scholar 

  14. Freitas, A.A. On objective measures of rule surprisingness. In Proceedings of the Second European Symposium on Principles of Data Mining and Knowledge Discovery (PKDD1998), pages 1-9, Nantes, France, 1998.

    Google Scholar 

  15. Gras, R., Couturier, R., Blanchard, J., Briand, H., Kuntz, P., and Peter, P. Quelques critres pour une mesure de qualit des rgles d’association. Revue des Nouvelles Technologies de l’Information, pages 3-31, 2004.

    Google Scholar 

  16. Gray, B. and Orlowska, M.E. Ccaiia: clustering categorical attributes into interesting association rules. In Proceedings of Second Pacific Asia Conference on Knowledge Discovery in Databases (PAKDD98), pages 132-143, Melbourne, 1998.

    Google Scholar 

  17. Hamilton, H.J., Geng, L., Findlater, L., and Randall, D.J. Spatio-temporal data mining with expected distribution domain generalization graphs. In Proceedings 10th Symposium on Temporal Representation and Reasoning/International Conference on Temporal Logic (TIME-ICTL 2003), pages 181-191, Cairns, Australia, 2003.

    Google Scholar 

  18. Hastie, T., Tibshirani, R., and Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York, 2001.

    MATH  Google Scholar 

  19. Hilderman, R.J. and Hamilton, H.J. Knowledge Discovery and Measures of Interest. Kluwer Academic Publishers, 2001. 22 Liqiang Geng and Howard J. Hamilton

    Google Scholar 

  20. Jaroszewicz, S. and Simovici, D.A. A general measure of rule interestingness. In Proceedings of the Fifth European Conference on Principles of Data Mining and Knowledge Discovery (PKDD2001), pages 253-265, Freiburg, Germany, 2001.

    Google Scholar 

  21. Klosgen, W. Explora: A multipattern and multistrategy discovery assistant. In Advances in Knowledge Discovery and Data Mining (Fayyad et al. eds), pages 249-271, California, 1996. AAAI Press/MIT Press.

    Google Scholar 

  22. Knorr, E.M., Ng, R.T., and Tucakov, V. Distance based outliers: Algorithms and applications. International Journal on Very Large Data Bases, 8:237-253, 2000.

    Article  Google Scholar 

  23. Lavrac, N., Flach, P., and Zupan, B. Rule evaluation measures: A unifying view. In Proceedings of the Ninth International Workshop on Inductive Logic Programming (ILP’99)), (Dzeroski and Flach eds), pages 174-185, Bled, Slovenia, 1999. Springer-Verlag.

    Google Scholar 

  24. Lenca P., Meyer P., Vaillant B., Lallich S. A. Multicriteria decision aid for im selection. Technical Report Technical Report LUSSI-TR-2004-01-EN, LUSSI Department, GET/ENST Bretagne, France, 2004.

    Google Scholar 

  25. Li, G. and Hamilton, H.J. Basic association rules. In Proceedings of 2004 SIAM International Conference on Data Mining (SDM04), pages 166-177, Orlando, USA, 2004.

    Google Scholar 

  26. Ling C., Chen, T., Yang Q., and Chen J. Mining optimal actions for profitable crm. In Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM2002), pages 767-770, Maebashi City, Japan, 2002.

    Google Scholar 

  27. Liu, B., Hsu, W., and Chen, S. Using general impressions to analyze discovered classification rules. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD97), pages 31-36, Newport Beach, California, USA, 1997.

    Google Scholar 

  28. Liu, B., Hsu, W., Mun, L., and Lee, H. Finding interesting patterns using user expectations. IEEE Transactions on Knowledge and Data Engineering, 11(6):817-832, 1999.

    Article  Google Scholar 

  29. Lu, S., Hu, H., and Li, F. Mining weighted association rules. Intelligent Data Analysis, 5(3):211-225, 2001.

    MATH  Google Scholar 

  30. Mitchell, T.M. Machine Learning. McGraw-Hill, 1997.

    Google Scholar 

  31. Ohsaki, M., Kitaguchi, S., Okamoto, K., Yokoi, H., and Yamaguchi, T. Evaluation of rule ims with a clinical dataset on hepatitis. In Proceedings of the Eighth European Conference on Principles of Data Mining and Knowledge Discovery (PKDD2004), pages 362-373, Pisa, Italy, 2004.

    Google Scholar 

  32. Padmanabhan, B. and Tuzhilin A. A belief-driven method for discovering unexpected patterns. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD 98), pages 94-100, New York City, 1998.

    Google Scholar 

  33. Padmanabhan, B. and Tuzhilin A. Small is beautiful: Discovering the minimal set of unexpected patterns. In Proceedings of Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining (KDD 2000), pages 54-63, Boston, USA, 2000.

    Google Scholar 

  34. Piatetsky-Shapiro, G. Discovery, analysis and presentation of strong rules. In Knowledge Discovery in Databases (Piatetsky-Shapiro and Frawley eds), pages 229-248, MIT Press, Cambridge, MA, 1991.

    Google Scholar 

  35. Piatetsky-Shapiro, G. and Matheus, C. The interestingness of deviations. In Proceedings of KDD Workshop 1994 (KDD 94), pages 77-87, Seattle, USA, 1994.

    Google Scholar 

  36. Sahar, S. Interestingness via what is not interesting. In Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining, pages 332-336, San Diego, USA, 1999.

    Google Scholar 

  37. Sarawagi, S. Explaining differences in multidimensional aggregates. In Proceedings of the 25th International Conference on Very Large Data Bases (VLDB), pages 42-53, Edinburgh, Scotland, 1999.

    Google Scholar 

  38. Sarawagi, S., Agrawal, R., and Megiddo, N. Discovery driven exploration of olap data cubes. In Proceedinds of the Sixth International Conference of Extending Database Technology (EDBT’98), pages 168-182, Valencia, Spain, 1998.

    Google Scholar 

  39. Shen, Y.D., Zhang, Z., and Yang, Q. Objective-oriented utility-based association mining. In Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM02), pages 426-433, Maebashi City, Japan, 2002.

    Google Scholar 

  40. Silberschatz, A. and Tuzhilin, A. On subjective measures of interestingness in knowledge discovery. In First International Conference on Knowledge Discovery and Data Mining, pages 275-281, Montreal, Canada, 1995.

    Google Scholar 

  41. Silberschatz, A. and Tuzhilin, A. What makes patterns interesting in knowledge discovery systems. IEEE Transactions on Knowledge and Data Engineering, 8(6):970-974, 1996.

    Article  Google Scholar 

  42. Tan, P and Kumar, V. Ims for association patterns: A perspective. Technical Report Technical Report 00-036, Department of Computer Science, University of Minnesota, 2000.

    Google Scholar 

  43. Tan, P., Kumar, V., and Srivastava, J. Selecting the right im for association patterns. In Proceedings of Proceedings of the Eighth International Conference on Knowledge Discovery and Data Mining (KDD02), pages 32-41, Edmonton, Canada, 2002.

    Google Scholar 

  44. Vaillant, B., Lenca, P., and Lallich, S. A clustering of ims. In Proceedings of the Seventh International Conference on Discovery Science, (DS’2004), pages 290-297, Padova, Italy, 2004.

    Google Scholar 

  45. Vitanyi, P.M.B. and Li, M. Minimum description length induction, bayesianism, and kolmogorov complexity. IEEE Transactions on Information Theory, 46(2):446-464, 2000.

    Article  MATH  MathSciNet  Google Scholar 

  46. Wang, K., Zhou, S. and Han, J. Profit mining: from patterns to actions. In Proceedings of the Eighth Conference on Extending Database Technology (EDBT 2002), pages 70-87, Prague, 2002.

    Google Scholar 

  47. Webb, G.I., and Brain, D. Generality is predictive of prediction accuracy. In Proceedings of the 2002 Pacific Rim Knowledge Acquisition Workshop (PKAW 2002), pages 117-130, Tokyo, Japan, 2002.

    Google Scholar 

  48. Yao, H., Hamilton, H.J., and Butz, C.J. A foundational approach for mining itemset utilities from databases. In Proceedings of SIAM International Conference on Data Mining, pages 482-486, Orlando, FL, 2004.

    Google Scholar 

  49. Yao, Y.Y. and Zhong, N. An analysis of quantitative measures associated with rules. In Proceedings of the Third Pacific-Asia Conference on Knowledge Discovery and Data Mining, pages 479-488, Beijing, China, 1999.

    Google Scholar 

  50. Yao, Y.Y., Chen, Y., and Yang, X.D. A measurement-theoretic foundation of rule interestingness evaluation. In Foundations and New Directions in Data Mining, (Lin et al. eds), pages 221-227, Melbourne, Florida, 2003.

    Google Scholar 

  51. Zhong, N., Yao, Y.Y., and Ohshima, M. Peculiarity oriented multidatabase mining. IEEE Transactions on Knowledge and Data Engineering, 15(4):952-960,2003.

    Article  Google Scholar 

  52. Zighed, D., Auray, J.P., and Duru, G. Sipina: Methode et logiciel. In Editions Lacassagne, Lyon, France, 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Geng, L., Hamilton, H.J. (2007). Choosing the Right Lens: Finding What is Interesting in Data Mining. In: Guillet, F.J., Hamilton, H.J. (eds) Quality Measures in Data Mining. Studies in Computational Intelligence, vol 43. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44918-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-44918-8_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44911-9

  • Online ISBN: 978-3-540-44918-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics