Skip to main content

Less Is More

  • Chapter

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 453))

Abstract

As computer and database technologies rapidly advance, human beings rely more and more on computers to accumulate data, process data, and make use of data. Machine learning, knowledge discovery, and data mining are some intelligent tools that help mankind accomplish those tasks. Researchers and practitioners realize that in order to use these tools effectively, an important part is pre-processing in which data is processed before it is presented to any learning, discovering, or visualizing algorithm. In many discovery applications (for example, marketing data analysis), a key operation is to find subsets of the population that behave enough alike to be worthy of focused analysis (Brackman and Anand, 1996). Although many learning methods attempt to select, extract, or construct features, both theoretical analyses and experimental studies indicate that many algorithms scale poorly in domains with large numbers of irrelevant and/or redundant features (Langley, 1996). All the evidence suggests the need for additional methods to overcome the difficulties.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Aha, D.W. (1998). Feature Weighting for Lazy Learning Algorithms. This volume, pages 13–32.

    Google Scholar 

  • Baldoni, M., Baroglio, C., Cavagnino, D., and Saitta L. (1998). Towards Automatic Fractal Feature Extraction for Image Recognition. This volume, pages 357–373.

    Google Scholar 

  • Barbara, D., DuMouchel, W., Faloutsos, C., Haas, P., Hellerstein, J., foannidis, Y., Jagadish, H., Johnson, T., Ng, R., Poosala, V., Ross, K., and Sevcik, K. (1997). The New Jersey data reduction report. Bulletin of the Technical Committee on Data Engineering, 20(4).

    Google Scholar 

  • Bloedorn, E. and Michalski, R.S. (1998). Data-driven Constructive Induction: Methodology and Applications. This volume, pages 51–68.

    Google Scholar 

  • Brachman, R.J., and Anand, T. (1996). The Process of Knowledge Discovery in Databases: A Human-centered Approach. In U.M. Fayyad, G. PiatetskyShapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining, pages 37–57. AAAI Press/The MIT Press.

    Google Scholar 

  • Dash, M. and Liu, H. (1997). Feature Selection Methods for Classifications. Intelligent Data Analysis: An International Journal, 1(3).

    Google Scholar 

  • Donoho, S. and Rendell, L. (1998). Feature Construction Using Fragmentary Knowledge. This volume, pages 273–288.

    Google Scholar 

  • Gama, J. and Brazdil, P. (1998). Constructive Induction on Continuous Spaces. This volume, pages 289–303.

    Google Scholar 

  • Hu, Y. (1998). Constructive Induction: Covering Attribute Spectrum. This volume, pages 257–272.

    Google Scholar 

  • Kohavi, R. and John, G.H. (1998). The Wrapper Approach. This volume, pages 33–50.

    Google Scholar 

  • Blum, A.L. and Langley, P. (1997). Selection of Relevant Features and Examples in Machine Learning. Artificial Intelligence, 97:246–271.

    Article  MathSciNet  Google Scholar 

  • Langley, P. (1996). Elements of Machine Learning. Morgan Kaufmann Publishers, Inc.

    Google Scholar 

  • Lavrač, N., Gamberger, D., and Turney, P. (1998). A Relevancy Filter for Constructive Induction. This volume, pages 137–154.

    Google Scholar 

  • Liu, H. and Motoda, H. (1998). Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers.

    Google Scholar 

  • Mallet, Y., de Vel, O., and Coomans, D. (1998). Integrated Feature Extraction Using Adaptive Wavelets. This volume, pages 175–189.

    Google Scholar 

  • Matheus, C. (1991). The Need for Constructive Induction. In L.A. Birnbaum and Collins G.C., editors, Machine Learning - Proceedings of the Eighth International Workshop, pages 173–177, June.

    Google Scholar 

  • Mephu Nguifo, E. and Njiwoua, P. (1998). Using Lattice-based Framework as a Tool for Feature Extraction. This volume, pages 205–218.

    Google Scholar 

  • Merz, C. and Murphy, P. (1996). UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html. Irvine, CA: University of California, Department of Information and Computer Science.

    Google Scholar 

  • Pazzani, M.J. (1998). Constructive Induction of Cartesian Product Attributes. This volume, pages 341–354.

    Google Scholar 

  • Perrin, P. and Petry, P. (1998). Lexical Contextual Relations for the Unsupervised Discovery of Texts Features. This volume, pages 157–173.

    Google Scholar 

  • Pudil, P. and Novovičová, J. (1998). Novel Methods for Subset Selection with Respect to Problem Knowledge. This volume, pages 101–116.

    Google Scholar 

  • Seabra Lopes, L. and Camarinha-Matos, L.M. (1998). Feature Transformation Strategies for a Robot Learning Problem. This volume, pages 375–391.

    Google Scholar 

  • Setiono, R. and Liu, H. (1998). Feature Extraction via Neural Networks. This volume, pages 191–204.

    Google Scholar 

  • Terano, T. and Ishino, Y. (1998). Interactive Genetic Algorithm Based Feature Selection and Its Application to Marketing Data Analysis. This volume, pages 393–406.

    Google Scholar 

  • Thornton, C.J. (1992). Techniques of Computational Learning: an Introduction. Chapman and Hall.

    Google Scholar 

  • Utgoff, P.E. and Precup, D. (1998). Constructive Function Approximation. This volume, pages 219–235.

    Google Scholar 

  • Vafaie, H. and De Jong, K. (1998). Evolutionary Feature Space Transformation. This volume, pages 307–323.

    Google Scholar 

  • Wang, H., Bell, D., and Murtagh, F. (1998). Relevance Approach to Feature Subset Selection. This volume, pages 85–99.

    Google Scholar 

  • Wang, K. and Sundaresh, S. (1998). Selecting Features by Vertical Compactness of Data. This volume, pages 71–84.

    Google Scholar 

  • Washio, T and Motoda, H. (1997). Discovering Admissible Models of Complex Systems Based on Scale-Types and Identity Constraints. In Proc. of the 15th International Joint Conference on Artificial Intelligence,pages 810–817.

    Google Scholar 

  • Wnek, J. and Michalski, R.S. (1994). Hypothesis-driven Constructive Induction in AQ17-HCI: A Method and Experiments. Machine Learning, 14:139–168.

    Article  MATH  Google Scholar 

  • Wyse, N., Dubes, R., and Jain, A.K. (1980). A Critical Evaluation of Intrinsic Dimensionality Algorithms. In Gelsema, E.S. and Kanal, L.N., editors, Pattern Recognition in Practice, pages 415–425. Morgan Kaufmann Publishers, Inc.

    Google Scholar 

  • Yang, J. and Honavar, V. (1998). Feature Subset Selection Using A Genetic Algorithm. This volume, pages 117–136.

    Google Scholar 

  • Zheng, Z. (1998) A Comparison of Constructing Different Types of New Feature for Decision Tree Learning. This volume, pages 239–255.

    Google Scholar 

  • Zupan, B., Bohanec, M, Demsar, J., and Bratko, I. (1998). Feature Transformation by Function Decomposition. This volume, pages 325–340.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer Science+Business Media New York

About this chapter

Cite this chapter

Liu, H., Motoda, H. (1998). Less Is More. In: Liu, H., Motoda, H. (eds) Feature Extraction, Construction and Selection. The Springer International Series in Engineering and Computer Science, vol 453. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-5725-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-5725-8_1

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-7622-4

  • Online ISBN: 978-1-4615-5725-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics