Abstract
Predictive analytics can forecast trends and determines statistical probabilities and to act upon fraud and security threats for big data applications such as business trading, fraud detection, crime investigation, banking, insurance, enterprise security, government, healthcare, e-commerce, and telecommunications. Predictive analytics as a service (PAaaS) framework is proposed in our earlier work. One solution based upon ensemble model that uses Gaussian process with varying hyper-parameters is also given in our earlier works. Test results proved that the third hyper-parameter values yielded a good result with less error rate and more variance which is reliable for a predictive model. This paper presents solution based upon ensemble model that uses best out of prediction algorithms such as artificial neural networks (ANN), auto-regression algorithm (ARX) and Gaussian process (GP). Feature engineering methods such as recursive feature elimination that uses random forest algorithm is used for attribute selection. Performance measures NRMSE and COD are used to analyze the model. Test results proved that neural networks performed well when compared to regression and Gaussian process.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Buytendijk, F., Trepanier, L.: Predictive Analytics: Bringing the Tools to the Data. An Oracle White Paper (2010)
http://cio.economictimes.indiatimes.com/news/business-analytics/income-tax-department-to-use-analytics-to-look-for-discrepancies-in-bank-accounts/55849827. Accessed 10 Dec 2016
Packard, Dr.T.: SACHS Literature Review: Predictive Analytics in Human Services. Sandiego State University School of Social Work (2016)
https://www.vertexinc.com/blog/tax-matters/learning-lab-2-big-data-and-predictive-analytics. Accessed 14 May 2017
Bahnsen, A.C., Aouada, D., Stojanovic, A., Ottersten, B.: Feature engineering strategies for credit card fraud detection. Expert Syst. Appl., 134–142 (2016)
LeCun, Y.A., Bottou, L., Orr, G.B., Miller, K.-R.: Neural Netowrks: Tricks of the trade: Efficient BackProp, 2nd edn. LNCS, vol. 7700 p. 16. Springer (2012)
Howard, A.: https://www.oreilly.com/ideas/predictive-data-analytics-big-data-nyc. 26 June 2012
Gharehchopogh, F.S., Bonab, T.H., Khaze, S.R.: A linear regression approach to prediction of stock market trading volume: a case study. IJMVSC 4(3) (2013)
Predictive Analytics as a service for IoT. http://www.opengardensblog.futuretext.com/archives/2014/10/predictive-analytics-as-a-service-for-iot.html. Accessed 21 Oct 2014
Khaidem, L.,Saha, S., Roy Dey, S.: Predicting the direction of stock market prices using random forest. Appl. Math. Finan. (2016)
Kishore Babu, S., Vasavi, S., Nagarjuna, K.: Framework for Predictive Analytics as a Service Using Ensemble Model. IEEE IACC 2017 (in press)
Hashimzade, N., Myles, G.D., Rablen, M.D.: Predictive Analytics and the Targeting of Audits (2014). https://tarc.exeter.ac.uk/media/universityofexeter/businessschool/documents/centres/tarc/publications/discussionpapers/Predictive_28-10-14.pdf
Myles, G.D., Hashimzade, N., Page, F., Rablen, M.: Targeting Audits Using Predictive Analytics (2013). https://tarc.exeter.ac.uk/media/universityofexeter/businessschool/documents/centres/tarc/publications/Targeting_Audits_Using_Predictive_Analytics.pdf
Serrano, A.M.R., da Costa, J.P.C.L., Cardonha, C.H., Fernandes, A.A., de Sousa, R.T.: Neural Network Predictor for Fraud Detection: A Study Case for the Federal Patrimony Department, pp. 61–66 (2012). icofcs.org
Patidar, R., Sharma, L.: Credit Card fraud detection using neural network. Int. J. Soft Comput. Eng. (IJSCE) 1(NCAI2011), 32–38 (2011). ISSN: 2231-2307
Anupriya, K., Kanimozhi, C.: Predicting eshopping data using deep learning middle-East. J. Sci. Res. 24(S1), 250–256 (2016)
Team, A.V.C., Kaushik, S., Jaju, S., Gupta, A.: A Complete Tutorial on Tree Based Modeling from Scratch (in R & Python), Analytics Vidhya, 31 Jan 2017
Lecun, Y., Bottou, L., Robert Mullar, K.: Efficient BackProp. Image Processing Research Department AT & T Labs - Research
Karsoliya, S.: Approximating number of hidden layer neurons in multiple hidden layer BPNN architecture. Int. J. Eng. Trends Technol. 3(6) (2012)
course1.winona.edu/cmalone/stat360/…/Handout18.docx. Accessed March 2017
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kishore Babu, S., Vasavi, S. (2019). Predictive Analytic as a Service on Tax Evasion Using Feature Engineering Strategies. In: Satapathy, S., Bhateja, V., Das, S. (eds) Smart Intelligent Computing and Applications . Smart Innovation, Systems and Technologies, vol 104. Springer, Singapore. https://doi.org/10.1007/978-981-13-1921-1_39
Download citation
DOI: https://doi.org/10.1007/978-981-13-1921-1_39
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1920-4
Online ISBN: 978-981-13-1921-1
eBook Packages: EngineeringEngineering (R0)