Skip to main content

An Optimization Approach for Feature Selection in an Electric Billing Database

  • Conference paper
Book cover Knowledge-Based Intelligent Information and Engineering Systems (KES 2005)

Abstract

Feature or attribute selection is a crucial activity when knowledge discovery is applied to very large databases. Its main objective is to eliminate irrelevant or redundant attributes to obtain a computationally tractable problem, without affecting the classification quality. In this article a novel optimization approach is evaluated. This method uses concave programming to minimize the number of attributes to input to the mining algorithm and also, to minimize the classification error. This technique is evaluated using a billing data base from the national electric utility in Mexico. The results are compared against those obtained by traditional techniques. From this experimentation, several improvements to the optimization approach are suggested.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pyle, D.: Data preparation for data mining. Morgan Kaufmann, Sn Fco (1999)

    Google Scholar 

  2. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. Journal of machine learning research 3, 1157–1182 (2003)

    Article  MATH  Google Scholar 

  3. Bradley, P., Mangasarian, O.: Feature selection via concave minimization and support vector machines. In: Machine learning, 15th int. conference ICML 1998, SnFco., Calif., pp. 82–90 (1998)

    Google Scholar 

  4. Mitra, S., et al.: Data mining in soft computing framework: a survey. IEEE Trans. on neural networks 13(1), 3–14 (2002)

    Article  Google Scholar 

  5. Kohavi, R., John, G.: Wrappers for feature subset selection. Artificial Intelligence Journal, Special issue on relevance, 273–324 (1997)

    Google Scholar 

  6. Quinlan, J.: Unknown attribute values in ID3. In: Int. conf. Machine learning, pp. 164–168 (1989)

    Google Scholar 

  7. Piramuthu, S.: Evaluating feature selection methods for learning in data mining applications. In: Proc. 31st annual Hawaii Int. conf. on system sciences, pp. 294–301 (1998)

    Google Scholar 

  8. Molina, L., Belanche, L., Nebot, A.: Feature selection algorithms, a survey and experimental evaluation. In: IEEE Int. conf. on data mining, Maebashi City Japan, pp. 306–313 (2002)

    Google Scholar 

  9. Domingos, P.: When and how to subsample. SIGKDD 3(2), 74–75 (2001)

    Article  Google Scholar 

  10. General Algebraic Modeling System, GAMS IDE, Rev. 135 (2003)

    Google Scholar 

  11. Stoppiglia, H., Dreyfus, G., et al.: Ranking a random feature for variable and feature selection. Journal of machine learning research 3, 1399–1414 (2003)

    Article  MATH  Google Scholar 

  12. http://www.cs.waikato.ac.nz/ml/weka (2004)

  13. http://www.ia.uned.es/~lvira/ (2004)

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mejía-Lavalle, M., Rodríguez, G., Arroyo, G. (2005). An Optimization Approach for Feature Selection in an Electric Billing Database. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science(), vol 3684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11554028_9

Download citation

  • DOI: https://doi.org/10.1007/11554028_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28897-8

  • Online ISBN: 978-3-540-31997-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics