Identifying the Components

  • Matthijs van Leeuwen
  • Jilles Vreeken
  • Arno Siebes
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5781)

Abstract

Most, if not all, databases are mixtures of samples from different distributions. In many cases, however, nothing is known about the source components of these mixtures. Therefore, many methods that induce models regard a database as sampled from a single data distribution. Models that do take into account that databases actually are sampled from mixtures of distributions are often superior to those that do not, independent of whether this is modelled explicitly or implicitly.

References

  1. van Leeuwen, M., Vreeken, J., Siebes, A.: Identifying the Components. Data Mining and Knowledge Discovery (2009). doi: 10.1007/s10618-009-0137-2

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Matthijs van Leeuwen (1)
  • Jilles Vreeken (1)
  • Arno Siebes (1)
  1. Department of Computer Science, Universiteit Utrecht, Netherlands
