Skip to main content

Importance of Missing Value Estimation in Feature Selection for Crime Analysis

  • Conference paper
  • First Online:
Intelligent Communication and Computational Technologies

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 19))

  • 1201 Accesses

Abstract

Missing values are most likely to be present in voluminous datasets that often lead to poor performance of the decision-making system. The present work carries out an experiment with a crime dataset that deals with the existence of missing values in it. The proposed methodology depicts a graph-based approach for selecting important features relevant to crime after estimating the missing values with the help of a multiple regression model. The method selects some features with missing values as important features. The selected features subsequently undergo some classification techniques that help in determining the importance of missing value estimation without discarding the feature for crime analysis. The proposed method is compared with existing feature selection algorithms and it promises a better classification accuracy, which shows the importance of the method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Acua, E., Rodriguez, C.: The treatment of missing values and its effect in the classifier accuracy. In: Classification, Clustering and Data Mining Applications. (July 2004) 639–647

    Google Scholar 

  2. Kalkan, H.: Online feature selection and classification with incomplete data. Volume 22. (2014) 1625–1636

    Google Scholar 

  3. Lou, Q., Obradovic, Z.: Margin-based feature selection in incomplete data. In: Proceedings of the Twenty-Sixth Association for the Advancement of Artificial Intelligence. (July 2012) 1040–1046

    Google Scholar 

  4. Meesad, P., Hengpraprohm, K.: Combination of knn-based feature selection and knn-based missing-value imputation of microarray data. In: The 3rd Intetnational Conference on Innovative Computing Information and Control (ICICIC’08). (August 2008)

    Google Scholar 

  5. Sun, Y., Braga-Neto, U., Dougherty, E.R.: Impact of missing value imputation on classification for dna microarray gene expression data model-based study. EURASIP Journal on Bioinformatics and Systems Biology (2009) 1–17

    Google Scholar 

  6. Doquire, G., Verleysen, M.: Feature selection with missing data using mutual information estimators. Neurocomputing 90 (August 2012) 3–11

    Google Scholar 

  7. Doquire, G., Verleysen, M.: Mutual information for feature selection with missing data. In: European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. (April 2011) 269–274

    Google Scholar 

  8. Saar-Tsechansky, M., Provost, F.: Handling missing values when applying classification models. Journal of Machine Learning Research 8 (July 2011) 1625–1657

    Google Scholar 

  9. Kraus, E.J., Dougherty, E.R.: Segmentation-free morphological character recognition. Proc. SPIE 2181 (1994) 14–23

    Google Scholar 

  10. Yao, C.S.C., Shen, L., Yu, X.: Improving classification accuracy using missing data filling algorithms for the criminal dataset. International Journal of Hybrid Information Technology 9(4) (2016) 367–374

    Google Scholar 

  11. Sun, C., Yao, C., Li, X., Yu, X.: Detecting crime types using classification algorithms. Biotechnology-An Indian Journal 10(24) (2014) 15452–15457

    Google Scholar 

  12. N.Poolsawad, C.Kambhampati, Cleland, J.G.F.: Feature selection approaches with missing values handling for data mining - a case study of heart failure dataset. Volume 5. (2011) 671–680

    Google Scholar 

  13. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2005)

    Google Scholar 

  14. Dash, M., Liu, H.: Consistency-based search in feature selection. Artificial Intelligence 151(12) (2003) 155–176

    Google Scholar 

  15. Hall, M.A.: Correlation-based feature selection for machine learning. Technical report, Waikato University, Department of Computer Science (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Soubhik Rakshit .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte. Ltd.

About this paper

Cite this paper

Rakshit, S., Das, P., Das, A.K. (2018). Importance of Missing Value Estimation in Feature Selection for Crime Analysis. In: Hu, YC., Tiwari, S., Mishra, K., Trivedi, M. (eds) Intelligent Communication and Computational Technologies. Lecture Notes in Networks and Systems, vol 19. Springer, Singapore. https://doi.org/10.1007/978-981-10-5523-2_10

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-5523-2_10

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-5522-5

  • Online ISBN: 978-981-10-5523-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics