Importance of Missing Value Estimation in Feature Selection for Crime Analysis

Rakshit, Soubhik; Das, Priyanka; Das, Asit Kumar

doi:10.1007/978-981-10-5523-2_10

Soubhik Rakshit⁶,
Priyanka Das⁶ &
Asit Kumar Das⁶

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 19))

1201 Accesses

Abstract

Missing values are most likely to be present in voluminous datasets that often lead to poor performance of the decision-making system. The present work carries out an experiment with a crime dataset that deals with the existence of missing values in it. The proposed methodology depicts a graph-based approach for selecting important features relevant to crime after estimating the missing values with the help of a multiple regression model. The method selects some features with missing values as important features. The selected features subsequently undergo some classification techniques that help in determining the importance of missing value estimation without discarding the feature for crime analysis. The proposed method is compared with existing feature selection algorithms and it promises a better classification accuracy, which shows the importance of the method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Acua, E., Rodriguez, C.: The treatment of missing values and its effect in the classifier accuracy. In: Classification, Clustering and Data Mining Applications. (July 2004) 639–647
Google Scholar
Kalkan, H.: Online feature selection and classification with incomplete data. Volume 22. (2014) 1625–1636
Google Scholar
Lou, Q., Obradovic, Z.: Margin-based feature selection in incomplete data. In: Proceedings of the Twenty-Sixth Association for the Advancement of Artificial Intelligence. (July 2012) 1040–1046
Google Scholar
Meesad, P., Hengpraprohm, K.: Combination of knn-based feature selection and knn-based missing-value imputation of microarray data. In: The 3rd Intetnational Conference on Innovative Computing Information and Control (ICICIC’08). (August 2008)
Google Scholar
Sun, Y., Braga-Neto, U., Dougherty, E.R.: Impact of missing value imputation on classification for dna microarray gene expression data model-based study. EURASIP Journal on Bioinformatics and Systems Biology (2009) 1–17
Google Scholar
Doquire, G., Verleysen, M.: Feature selection with missing data using mutual information estimators. Neurocomputing 90 (August 2012) 3–11
Google Scholar
Doquire, G., Verleysen, M.: Mutual information for feature selection with missing data. In: European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. (April 2011) 269–274
Google Scholar
Saar-Tsechansky, M., Provost, F.: Handling missing values when applying classification models. Journal of Machine Learning Research 8 (July 2011) 1625–1657
Google Scholar
Kraus, E.J., Dougherty, E.R.: Segmentation-free morphological character recognition. Proc. SPIE 2181 (1994) 14–23
Google Scholar
Yao, C.S.C., Shen, L., Yu, X.: Improving classification accuracy using missing data filling algorithms for the criminal dataset. International Journal of Hybrid Information Technology 9(4) (2016) 367–374
Google Scholar
Sun, C., Yao, C., Li, X., Yu, X.: Detecting crime types using classification algorithms. Biotechnology-An Indian Journal 10(24) (2014) 15452–15457
Google Scholar
N.Poolsawad, C.Kambhampati, Cleland, J.G.F.: Feature selection approaches with missing values handling for data mining - a case study of heart failure dataset. Volume 5. (2011) 671–680
Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2005)
Google Scholar
Dash, M., Liu, H.: Consistency-based search in feature selection. Artificial Intelligence 151(12) (2003) 155–176
Google Scholar
Hall, M.A.: Correlation-based feature selection for machine learning. Technical report, Waikato University, Department of Computer Science (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, India
Soubhik Rakshit, Priyanka Das & Asit Kumar Das

Authors

Soubhik Rakshit
View author publications
You can also search for this author in PubMed Google Scholar
Priyanka Das
View author publications
You can also search for this author in PubMed Google Scholar
Asit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Soubhik Rakshit .

Editor information

Editors and Affiliations

Department of Computer Science and Information Management, Providence University, Taichung City, Taiwan
Yu-Chen Hu
CSED, ABES Engineering College, Ghaziabad, Uttar Pradesh, India
Shailesh Tiwari
Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology Allahabad, Allahabad, Uttar Pradesh, India
Krishn K. Mishra
Department of Computer Science and Engineering, ABES Engineering College, Ghaziabad, India
Munesh C. Trivedi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rakshit, S., Das, P., Das, A.K. (2018). Importance of Missing Value Estimation in Feature Selection for Crime Analysis. In: Hu, YC., Tiwari, S., Mishra, K., Trivedi, M. (eds) Intelligent Communication and Computational Technologies. Lecture Notes in Networks and Systems, vol 19. Springer, Singapore. https://doi.org/10.1007/978-981-10-5523-2_10

Download citation

DOI: https://doi.org/10.1007/978-981-10-5523-2_10
Published: 24 October 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-5522-5
Online ISBN: 978-981-10-5523-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics