Abstract
Privacy-preserving data mining is of paramount importance in many areas. In this paper, we employ a Particle Swarm Optimization (PSO) trained Auto Associative Neural Network (PSOAANN) to preserve the privacy of input feature values. The privacy-preserved input features are fed to the Dynamic Evolving Neuro Fuzzy Inference System (DENFIS) and to the Classification and Regression Tree (CART) separately for rule extraction. We also propose a new feature selection method using PSOAANN. Thus, in this study, PSOAANN accomplishes privacy preservation as well as feature selection. The performance of the hybrid is tested using 10-fold cross-validation on five regression datasets, viz. Auto MPG, Body Fat, Boston Housing, Forest Fires and Pollution. The study demonstrates the effectiveness of the proposed approach in generating accurate regression rules with and without feature selection. A t-test at the 1% level of significance is performed to see whether the difference between the results obtained with and without feature selection is statistically significant. For PSOAANN + CART, the difference between the results with and without feature selection is statistically insignificant on four datasets. For PSOAANN + DENFIS, the difference is statistically significant on three datasets. Hence, from the t-test it is concluded that the proposed feature selection method yielded better or comparable results.
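As a rough illustration of the idea behind PSOAANN, the sketch below trains a small auto-associative network (output layer the same size as the input layer) with a basic global-best PSO, then uses the network outputs as the transformed feature values that would be passed on to a rule extractor such as CART or DENFIS. The abstract does not give the architecture, activation functions, PSO parameters or data scaling, so everything specific here is an assumption made for illustration: the single sigmoid hidden layer without bias terms, the inertia and acceleration constants, the weight bound, the features scaled to [0, 1], and the function names `aann_forward`, `reconstruction_error` and `train_psoaann`.

```python
import math
import random


def aann_forward(weights, x, n_hidden):
    """Auto-associative net: n inputs -> n_hidden sigmoid units -> n sigmoid outputs."""
    n = len(x)
    w1 = weights[: n * n_hidden]   # input-to-hidden weights (no biases: an assumption)
    w2 = weights[n * n_hidden:]    # hidden-to-output weights
    hidden = [
        1.0 / (1.0 + math.exp(-sum(w1[j * n + i] * x[i] for i in range(n))))
        for j in range(n_hidden)
    ]
    return [
        1.0 / (1.0 + math.exp(-sum(w2[k * n_hidden + j] * hidden[j]
                                   for j in range(n_hidden))))
        for k in range(n)
    ]


def reconstruction_error(weights, data, n_hidden):
    """Mean squared reconstruction error per record; this is the PSO fitness."""
    total = 0.0
    for x in data:
        y = aann_forward(weights, x, n_hidden)
        total += sum((a - b) ** 2 for a, b in zip(x, y))
    return total / len(data)


def train_psoaann(data, n_hidden, n_particles=20, n_iter=50, seed=0):
    """Global-best PSO over the flattened weight vector (parameters are assumed)."""
    rng = random.Random(seed)
    n = len(data[0])
    dim = 2 * n * n_hidden
    w, c1, c2, bound = 0.7, 1.5, 1.5, 5.0  # inertia, acceleration terms, weight bound
    pos = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_err = [reconstruction_error(p, data, n_hidden) for p in pos]
    g = min(range(n_particles), key=pbest_err.__getitem__)
    gbest, gbest_err = pbest[g][:], pbest_err[g]
    history = [gbest_err]
    for _ in range(n_iter):
        for p in range(n_particles):
            for d in range(dim):
                vel[p][d] = (w * vel[p][d]
                             + c1 * rng.random() * (pbest[p][d] - pos[p][d])
                             + c2 * rng.random() * (gbest[d] - pos[p][d]))
                # Keep weights bounded so the sigmoid arguments stay small.
                pos[p][d] = max(-bound, min(bound, pos[p][d] + vel[p][d]))
            err = reconstruction_error(pos[p], data, n_hidden)
            if err < pbest_err[p]:
                pbest[p], pbest_err[p] = pos[p][:], err
                if err < gbest_err:
                    gbest, gbest_err = pos[p][:], err
        history.append(gbest_err)
    return gbest, history


# Toy data: 30 records with 3 features already scaled to [0, 1] (synthetic).
data_rng = random.Random(1)
data = [[data_rng.random() for _ in range(3)] for _ in range(30)]

weights, history = train_psoaann(data, n_hidden=2)
# The network outputs stand in for the original feature values downstream.
masked = [aann_forward(weights, x, 2) for x in data]
```

Because the hidden layer is narrower than the input layer, the reconstruction is lossy, so `masked` approximates the original records without reproducing them exactly. How much distortion is acceptable, and how the network is used for feature selection, are matters for the full paper and are not modelled here.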
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Naveen, N., Ravi, V., Raghavendra Rao, C. (2011). Forecasting Using Rules Extracted from Privacy Preservation Neural Network. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2011. Lecture Notes in Computer Science, vol 7080. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25725-4_22
DOI: https://doi.org/10.1007/978-3-642-25725-4_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25724-7
Online ISBN: 978-3-642-25725-4
eBook Packages: Computer Science (R0)