The Performance of One Dimensional Naïve Bayes Classifier for Feature Selection in Predicting Prospective Car Insurance Buyers

  • Conference paper
Data Mining and Big Data (DMBD 2019)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1071)

Abstract

Car insurance is one of the products sold by insurance companies, and cold calling is one of the techniques used to offer it. Cold calling often demoralizes sellers because they face many rejections. This problem can be reduced by first classifying prospective buyers into customers who are likely to buy insurance and customers who are not. The available data contain many features that could support the classification, but not all of them improve classification accuracy. Feature selection, a machine learning technique, reduces the dimensionality of the data and can improve classification accuracy. In this paper, we examine the One-Dimensional Naïve Bayes Classifier (1-DBC) as a feature selection method applied to two classifiers, Support Vector Machine and Logistic Regression. Our simulations show that both classifiers can use fewer features and still produce comparable accuracies in classifying prospective car insurance buyers.
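The feature selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not describe the procedure, so everything here is an assumption: each feature is scored by the validation accuracy of a one-dimensional naïve Bayes classifier trained on that feature alone, a Gaussian likelihood is used, and the top-scoring features are kept. The function names (`fit_1d_gaussian_nb`, `score_features`, `select_top_k`) and the synthetic data are hypothetical and are not taken from the paper or its dataset.

```python
import math
import random


def fit_1d_gaussian_nb(values, labels):
    """Fit a Gaussian naive Bayes model on a single feature.

    Returns {class: (prior, mean, variance)}.
    """
    params = {}
    n = len(values)
    for cls in set(labels):
        xs = [v for v, y in zip(values, labels) if y == cls]
        mean = sum(xs) / len(xs)
        # Small constant added so the variance is never exactly zero.
        var = sum((x - mean) ** 2 for x in xs) / len(xs) + 1e-9
        params[cls] = (len(xs) / n, mean, var)
    return params


def predict_1d(params, x):
    """Return the class with the highest log-posterior for one value."""
    def log_post(cls):
        prior, mean, var = params[cls]
        return (math.log(prior)
                - 0.5 * math.log(2 * math.pi * var)
                - (x - mean) ** 2 / (2 * var))
    return max(params, key=log_post)


def score_features(X_train, y_train, X_val, y_val):
    """Score each feature by the validation accuracy of its own 1-D classifier."""
    scores = []
    for j in range(len(X_train[0])):
        params = fit_1d_gaussian_nb([row[j] for row in X_train], y_train)
        hits = sum(predict_1d(params, row[j]) == y
                   for row, y in zip(X_val, y_val))
        scores.append(hits / len(y_val))
    return scores


def select_top_k(scores, k):
    """Indices of the k highest-scoring features, best first."""
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]


# Synthetic toy data (an assumption, not the paper's dataset):
# feature 0 depends on the label, features 1 and 2 are pure noise.
rng = random.Random(0)
y = [i % 2 for i in range(400)]
X = [[rng.gauss(3.0 * label, 1.0), rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
     for label in y]

X_train, y_train = X[:300], y[:300]
X_val, y_val = X[300:], y[300:]

scores = score_features(X_train, y_train, X_val, y_val)
selected = select_top_k(scores, k=1)

# Reduced training matrix containing only the selected columns.
X_train_reduced = [[row[j] for j in selected] for row in X_train]
```

Under these assumptions, `X_train_reduced` would be the input to a downstream classifier such as a Support Vector Machine or Logistic Regression, and the number of retained features `k` would be tuned by comparing downstream accuracy.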

Notes

  1. https://www.statista.com/statistics/281134/number-of-vehicles-in-use-worldwide/

  2. http://www.who.int/mediacentre/factsheets/fs358/en/

  3. https://www.kaggle.com/emmaren/cold-calls-data-mining-and-modelselection/data

  4. https://www.analyticsindiamag.com/5-ways-handle-missing-values-machine-learning-datasets

  5. https://www.analyticsvidhya.com/blog/2015/11/easy-methods-deal-categorical-variables-predictive-modeling/

Acknowledgment

This work was supported by Universitas Indonesia under the PITTA 2019 grant. Any opinions, findings, and conclusions or recommendations are those of the authors and do not necessarily reflect those of the sponsor.

Author information

Corresponding authors

Correspondence to Dilla Fadlillah Salma, Hendri Murfi or Devvi Sarwinda.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Salma, D.F., Murfi, H., Sarwinda, D. (2019). The Performance of One Dimensional Naïve Bayes Classifier for Feature Selection in Predicting Prospective Car Insurance Buyers. In: Tan, Y., Shi, Y. (eds) Data Mining and Big Data. DMBD 2019. Communications in Computer and Information Science, vol 1071. Springer, Singapore. https://doi.org/10.1007/978-981-32-9563-6_13

  • DOI: https://doi.org/10.1007/978-981-32-9563-6_13

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-32-9562-9

  • Online ISBN: 978-981-32-9563-6

  • eBook Packages: Computer Science, Computer Science (R0)
