Skip to main content

Contrast Sets

  • Chapter
  • First Online:

Abstract

Contrast set mining is one of the most important tasks in the supervised descriptive pattern mining field. It aims at finding patterns whose frequencies differ significantly among sets of data under contrast. This chapter introduces therefore the contrast set mining problem as well as similarities and differences with regard to related techniques. Then, the problem is formally described and the most widely used metrics in this task are mathematically defined. In this chapter, the most important approaches in this field are analysed, including STUCCO and CIGAR among others. Finally, some additional proposals in the field are also described.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. C.C. Aggarwal, J. Han, Frequent Pattern Mining (Springer International Publishing, Berlin, 2014)

    Book  Google Scholar 

  2. R. Agrawal, T. Imielinski, A.N. Swami, Mining association rules between sets of items in large databases, in Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, SIGMOD Conference ’93, pp. 207–216, Washington, DC (1993)

    Google Scholar 

  3. Z. Al Aghbari, I.N. Junejo, Discovering contrast sets for efficient classification of big data, in Proceedings of the 2nd International Conference on Open and Big Data (OBD 2016), pp. 45–51, Vienna, August 2016

    Google Scholar 

  4. J. Bailey, T. Manoukian, K. Ramamohanarao, Fast algorithms for mining emerging patterns, in Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery, PKDD ’02, pp. 39–50, London (2002)

    Google Scholar 

  5. S.D. Bay, M.J. Pazzani, Detecting change in categorical data: mining contrast sets, in Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-99), pp. 302–306, New York (1999)

    Google Scholar 

  6. S.D. Bay, M.J. Pazzani, Detecting group differences: mining contrast sets. Data Min. Knowl. Disc. 5(3), 213–246 (2001)

    Article  Google Scholar 

  7. G. Dong, J. Bailey (eds.), Contrast Data Mining: Concepts, Algorithms, and Applications (CRC Press, Boca Raton, 2013)

    Google Scholar 

  8. H. Fan, K. Ramamohanarao, Fast discovery and the generalization of strong jumping emerging patterns for building compact and accurate classifiers. IEEE Trans. Knowl. Data Eng. 18(6), 721–737 (2006)

    Article  Google Scholar 

  9. J. Han, J. Pei, Y. Yin, R. Mao, Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min. Knowl. Disc. 8, 53–87 (2004)

    Article  MathSciNet  Google Scholar 

  10. F. Herrera, C.J. Carmona, P. González, M.J. del Jesus, An overview on subgroup discovery: foundations and applications. Knowl. Inf. Syst. 29(3), 495–525 (2011)

    Article  Google Scholar 

  11. R.J. Hilderman, T. Peckham, A statistically sound alternative approach to mining contrast sets, in Proceedings of the 4th Australasian Data Mining Conference (AusDM), pp. 157–172, Sydney, December 2005

    Google Scholar 

  12. N. Lavrac, P. Kralj, D. Gamberger, A. Krstacic, Supporting factors to improve the explanatory potential of contrast set mining: analyzing brain ischaemia data, in Proceedings of the 11th Mediterranean Conference on Medical and Biological Engineering and Computing (MEDICON-07), pp. 157–161, Ljubljana, June 2007

    Google Scholar 

  13. P.K. Novak, N. Lavrač, D. Gamberger, A. Krstačic, CSM-SD: methodology for contrast set mining through subgroup discovery. J. Biomed. Inform. 42(1), 113–122 (2009)

    Article  Google Scholar 

  14. P.K. Novak, N. Lavrač, G.I. Webb, Supervised descriptive rule discovery: a unifying survey of contrast set, emerging pattern and subgroup mining. J. Mach. Learn. Res. 10, 377–403 (2009)

    MATH  Google Scholar 

  15. M. Simeon, R.J. Hilderman, Exploratory quantitative contrast set mining: A discretization approach, in Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (IEEE ICTAI’07), pp. 124–131, Patras, October 2007

    Google Scholar 

  16. K.K.W. Siu, S.M. Butler, T. Beveridge, J.E. Gillam, C.J. Hall, A.H. Kaye, R.A. Lewis, K. Mannan, G. McLoughlin, S. Pearson, A.R. Round, E. Schultke, G.I. Webb, S.J. Wilkinson, Identifying markers of pathology in SAXS data of malignant tissues of the brain. Nucl. Inst. Methods Phys. Res. A 548, 140–146 (2005)

    Article  Google Scholar 

  17. S. Ventura, J.M. Luna, Pattern Mining with Evolutionary Algorithms (Springer International Publishing, Berlin, 2016)

    Book  Google Scholar 

  18. G.I. Webb, S.M. Butler, D.A. Newlands, On detecting differences between groups, in Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 256–265, Washington, DC, August 24 (2003)

    Google Scholar 

  19. T.T. Wong, K.L. Tseng, Mining negative contrast sets from data with discrete attributes. Expert Syst. Appl. 29(2), 401–407 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Ventura, S., Luna, J.M. (2018). Contrast Sets. In: Supervised Descriptive Pattern Mining. Springer, Cham. https://doi.org/10.1007/978-3-319-98140-6_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-98140-6_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-98139-0

  • Online ISBN: 978-3-319-98140-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics