Less Is More

Liu, Huan; Motoda, Hiroshi

doi:10.1007/978-1-4615-5725-8_1

Part of the book series: The Springer International Series in Engineering and Computer Science (SECS, volume 453)

Abstract

As computer and database technologies advance rapidly, people rely more and more on computers to accumulate, process, and make use of data. Machine learning, knowledge discovery, and data mining are among the intelligent tools that help accomplish these tasks. Researchers and practitioners realize that using these tools effectively depends on pre-processing, in which data is prepared before it is presented to any learning, discovery, or visualization algorithm. In many discovery applications (for example, marketing data analysis), a key operation is to find subsets of the population that behave enough alike to be worthy of focused analysis (Brachman and Anand, 1996). Although many learning methods attempt to select, extract, or construct features, both theoretical analyses and experimental studies indicate that many algorithms scale poorly in domains with large numbers of irrelevant and/or redundant features (Langley, 1996). This evidence suggests the need for additional methods to overcome these difficulties.

References

Aha, D.W. (1998). Feature Weighting for Lazy Learning Algorithms. This volume, pages 13–32.
Baldoni, M., Baroglio, C., Cavagnino, D., and Saitta, L. (1998). Towards Automatic Fractal Feature Extraction for Image Recognition. This volume, pages 357–373.
Barbara, D., DuMouchel, W., Faloutsos, C., Haas, P., Hellerstein, J., Ioannidis, Y., Jagadish, H., Johnson, T., Ng, R., Poosala, V., Ross, K., and Sevcik, K. (1997). The New Jersey data reduction report. Bulletin of the Technical Committee on Data Engineering, 20(4).
Bloedorn, E. and Michalski, R.S. (1998). Data-driven Constructive Induction: Methodology and Applications. This volume, pages 51–68.
Brachman, R.J. and Anand, T. (1996). The Process of Knowledge Discovery in Databases: A Human-centered Approach. In U.M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining, pages 37–57. AAAI Press/The MIT Press.
Dash, M. and Liu, H. (1997). Feature Selection Methods for Classifications. Intelligent Data Analysis: An International Journal, 1(3).
Donoho, S. and Rendell, L. (1998). Feature Construction Using Fragmentary Knowledge. This volume, pages 273–288.
Gama, J. and Brazdil, P. (1998). Constructive Induction on Continuous Spaces. This volume, pages 289–303.
Hu, Y. (1998). Constructive Induction: Covering Attribute Spectrum. This volume, pages 257–272.
Kohavi, R. and John, G.H. (1998). The Wrapper Approach. This volume, pages 33–50.
Blum, A.L. and Langley, P. (1997). Selection of Relevant Features and Examples in Machine Learning. Artificial Intelligence, 97:246–271.
Langley, P. (1996). Elements of Machine Learning. Morgan Kaufmann Publishers, Inc.
Lavrač, N., Gamberger, D., and Turney, P. (1998). A Relevancy Filter for Constructive Induction. This volume, pages 137–154.
Liu, H. and Motoda, H. (1998). Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers.
Mallet, Y., de Vel, O., and Coomans, D. (1998). Integrated Feature Extraction Using Adaptive Wavelets. This volume, pages 175–189.
Matheus, C. (1991). The Need for Constructive Induction. In L.A. Birnbaum and G.C. Collins, editors, Machine Learning - Proceedings of the Eighth International Workshop, pages 173–177, June.
Mephu Nguifo, E. and Njiwoua, P. (1998). Using Lattice-based Framework as a Tool for Feature Extraction. This volume, pages 205–218.
Merz, C. and Murphy, P. (1996). UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html. Irvine, CA: University of California, Department of Information and Computer Science.
Pazzani, M.J. (1998). Constructive Induction of Cartesian Product Attributes. This volume, pages 341–354.
Perrin, P. and Petry, P. (1998). Lexical Contextual Relations for the Unsupervised Discovery of Texts Features. This volume, pages 157–173.
Pudil, P. and Novovičová, J. (1998). Novel Methods for Subset Selection with Respect to Problem Knowledge. This volume, pages 101–116.
Seabra Lopes, L. and Camarinha-Matos, L.M. (1998). Feature Transformation Strategies for a Robot Learning Problem. This volume, pages 375–391.
Setiono, R. and Liu, H. (1998). Feature Extraction via Neural Networks. This volume, pages 191–204.
Terano, T. and Ishino, Y. (1998). Interactive Genetic Algorithm Based Feature Selection and Its Application to Marketing Data Analysis. This volume, pages 393–406.
Thornton, C.J. (1992). Techniques of Computational Learning: an Introduction. Chapman and Hall.
Utgoff, P.E. and Precup, D. (1998). Constructive Function Approximation. This volume, pages 219–235.
Vafaie, H. and De Jong, K. (1998). Evolutionary Feature Space Transformation. This volume, pages 307–323.
Wang, H., Bell, D., and Murtagh, F. (1998). Relevance Approach to Feature Subset Selection. This volume, pages 85–99.
Wang, K. and Sundaresh, S. (1998). Selecting Features by Vertical Compactness of Data. This volume, pages 71–84.
Washio, T. and Motoda, H. (1997). Discovering Admissible Models of Complex Systems Based on Scale-Types and Identity Constraints. In Proc. of the 15th International Joint Conference on Artificial Intelligence, pages 810–817.
Wnek, J. and Michalski, R.S. (1994). Hypothesis-driven Constructive Induction in AQ17-HCI: A Method and Experiments. Machine Learning, 14:139–168.
Wyse, N., Dubes, R., and Jain, A.K. (1980). A Critical Evaluation of Intrinsic Dimensionality Algorithms. In Gelsema, E.S. and Kanal, L.N., editors, Pattern Recognition in Practice, pages 415–425. Morgan Kaufmann Publishers, Inc.
Yang, J. and Honavar, V. (1998). Feature Subset Selection Using A Genetic Algorithm. This volume, pages 117–136.
Zheng, Z. (1998). A Comparison of Constructing Different Types of New Feature for Decision Tree Learning. This volume, pages 239–255.
Zupan, B., Bohanec, M., Demsar, J., and Bratko, I. (1998). Feature Transformation by Function Decomposition. This volume, pages 325–340.

Author information

Authors and Affiliations

Department of Information Systems & Computer Science, National University of Singapore, Singapore, 119260
Huan Liu
Institute of Scientific & Industrial Research, Osaka University, Ibaraki, Osaka, 567-0047, Japan
Hiroshi Motoda

Editor information

Editors and Affiliations

National University of Singapore, Singapore
Huan Liu
Osaka University, Osaka, Japan
Hiroshi Motoda

About this chapter

Cite this chapter

Liu, H., Motoda, H. (1998). Less Is More. In: Liu, H., Motoda, H. (eds) Feature Extraction, Construction and Selection. The Springer International Series in Engineering and Computer Science, vol 453. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-5725-8_1

DOI: https://doi.org/10.1007/978-1-4615-5725-8_1
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7622-4
Online ISBN: 978-1-4615-5725-8
eBook Packages: Springer Book Archive
