Skip to main content

Applying Feature Selection Methods to Improve the Predictive Model of a Direct Marketing Problem

  • Conference paper
Software Engineering and Computer Systems (ICSECS 2011)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 179))

Included in the following conference series:

Abstract

The ability to forecast job advertisement demands is vital to enhance the customer retention rate for recruitment companies. On top of that, it is uneconomical to cold call every individual on a regular basis for companies with a large pool of customers. This paper presents a novel approach in predicting the re-ordering demand of a potential group of SMEs customers in a large online recruitment company. Two feature selection techniques, namely Correlation-based Feature Selection (CFS) and Subset Consistency (SC) Feature Selection, were applied to predictive models in this study. The predictive models were compared with other similar models in the absence of feature selections. Results of various experiments show that those models using feature selections generally outperform those without feature selections. The results support the authors’ hypothesis that the predictive model can perform better and further ahead than similar methods that exclude feature selection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kim, Y.S., Street, W.N.: An Intelligent System for Customer Targeting: A Data Mining Approach. Decision Support Systems 37, 215–228 (2004)

    Article  Google Scholar 

  2. Bella, A., Ferri, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Joint Cutoff Probabilistic Estimation Using Simulation: A Mailing Campaign Application. In: Yin, H., Tino, P., Corchado, E., Byrne, W., Yao, X. (eds.) IDEAL 2007. LNCS, vol. 4881, pp. 609–619. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  3. Cox, L.A.: Data Mining and Causal Modeling of Customer Behaviors. Telecommunication Systems 21, 349–381 (2002)

    Article  Google Scholar 

  4. Hung, S.Y., Yen, C., Wang, H.Y.: Applying Data Mining to Telecom Churn Management. Expert Systems with Applications 31, 515–524 (2006)

    Article  Google Scholar 

  5. Au, W.H., Chan, K.C.C.: Mining Fuzzy Association Rules in A Bank-Account Database. IEEE Transactions on Fuzzy Systems 11, 238–248 (2003)

    Article  Google Scholar 

  6. Kumar, D.A., Ravi, V.: Predicting Credit Card Customer Churn in Banks Using Data Mining. Int. J. Data Anal. Tech. Strateg. 1, 4–28 (2008)

    Article  Google Scholar 

  7. Ling, C.X., Li, C.: Data Mining for Direct Marketing: Problems and Solutions. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 73–79 (1998)

    Google Scholar 

  8. Blum, A.L., Langley, P.: Selection of Relevant Features and Examples in Machine Learning. Artificial Intelligence 97, 245–271 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  9. Kohavi, R., John, G.H.: Wrappers for Feature Subset Selection. Artificial Intelligence 97, 273–324 (1997)

    Article  MATH  Google Scholar 

  10. Jain, A.K., Duin, R.P.W., Mao, J.: Statistical Pattern Recognition: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 4–37 (2000)

    Article  Google Scholar 

  11. Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)

    MATH  Google Scholar 

  12. Elashoff, J.D., Elashoff, R.M., Goldman, G.E.: On the Choice of Variables in Classification Problems with Dichotomous Variables. Biometrika 54, 668–670 (1967)

    Article  MathSciNet  Google Scholar 

  13. Toussaint, G.: Note on Optimal Selection of Independent Binary-Valued Features for Pattern Recognition (Corresp.). IEEE Transactions on Information Theory 17, 618 (1971)

    Google Scholar 

  14. Tirelli, T., Pessani, D.: Importance of Feature Selection in Decision-Tree and Artificial-Neural-Network Ecological Applications. Alburnus Alburnus Alborella: A Practical Example. Ecological Informatics (in press, corrected proof, Available online November 30, 2010)

    Google Scholar 

  15. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  16. Holte, R.C.: Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Machine Learning 11, 63–91 (1993)

    Article  MATH  Google Scholar 

  17. Liu, H., Motoda, H.: Computational Methods of Feature Selection. Chapman & Hall/CRC, Boca Raton (2007)

    MATH  Google Scholar 

  18. Chen, K., Liu, H.: Towards An Evolutionary Algorithm: Comparison of Two Feature Selection Algorithms. In: Proceedings of the Congress on Evolutionary Computation, vol. 2, pp. 1309–1313 (1999)

    Google Scholar 

  19. Liu, H., Setiono, R.: A Probabilistic Approach to Feature Selection - A Filter Solution. In: Proceedings of the Thirteenth International Conference on Machine Learning, pp. 319–327 (1996)

    Google Scholar 

  20. Hall, M.: Correlation-Based Feature Selection for Discrete and Numeric Class Machine Learning. In: Langley, P. (ed.) Proceedings of the Seventeenth International Conference on Machine Learning, pp. 359–366 (2000)

    Google Scholar 

  21. Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., Wirth, R.: CRISP-DM 1.0: Step-by-Step Data Mining Guide. CRISP-DM consortium (2000)

    Google Scholar 

  22. Ngai, E.W.T., Xiu, L., Chau, D.C.K.: Application of Data Mining Techniques in Customer Relationship Management: A Literature Review and Classification. Expert Systems with Applications 36, 2592–2602 (2009)

    Article  Google Scholar 

  23. Provost, F., Fawcett, T.: Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions. In: Heckerman, D., Mannila, H., Pregibon, D., Uthurusamy, R. (eds.) Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pp. 43–48. AAAI Press, California (1997)

    Google Scholar 

  24. Huang, J., Ling, C.X.: Using AUC and Accuracy in Evaluating Learning Algorithms. IEEE Transactions on Knowledge and Data Engineering 17, 299–310 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tan, DW., Sim, YW., Yeoh, W. (2011). Applying Feature Selection Methods to Improve the Predictive Model of a Direct Marketing Problem. In: Mohamad Zain, J., Wan Mohd, W.M.b., El-Qawasmeh, E. (eds) Software Engineering and Computer Systems. ICSECS 2011. Communications in Computer and Information Science, vol 179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22170-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22170-5_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22169-9

  • Online ISBN: 978-3-642-22170-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics