Abstract
Data cleaning perform in the Data Preprocessing and Mining. The clean data work of web server logs irrelevant items and useless data can not completely removed and Overlapped data causes difficulty during retrieving data from datasource. Previous paper had given 30% performance of datasource. So We have Implemented Smart Two-level clustering method to get pattern data for mining. This paper presents WebLogCleaner can filter out much irrelevant, inconsistent data based on the common of their URLs and it is going to improving 8% of the data quality, performance, Accuracy and efficiency of any Datasource.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rathod, D., Khanna, S.: A comparison of k-means clustering and smart two level k-means clustering algorithm. IJSART (2017)
Rathod, D., Khanna, S.: A survey on different efficient clustering techniques used in web mining. IJSRD 4(8), 51–53 (2016)
Mengar, K., Rathod, D.: Ant based data reduction in web usage mining using k-means clustering algorithm. IJSDR (2016)
Rathod, D., Khanna, S.: Improved two level k-means clustering algorithm to generate user pattern clustering. IJSART (2016)
Rathod, D., Khanna, S.: Implemented two level k-means clustering algorithm to improve quality in user pattern mining. IJSART (2016)
Rathod, D., Khanna, S.: Improve quality in user pattern mining approach using two level k-means clustering methodology. IJSRD 4(1), 351–353 (2016)
Jeba, J.M.P., Bhuvaneswari, M.S., Muneeswaran, K.: Extracting usage patterns from web server log. IEEE (2016)
Sisodia, D.S., Verma, S.: Web usage pattern analysis through web logs: a review. IEEE (2016)
Dhanalakshmi, P., Ramani, K., Eswara Reddy, B.: The research of preprocessing and pattern discovery techniques on web log files. IEEE (2016)
Mehrotra, S., Kohli, S.: Comparative analysis of k-means with other clustering algorithms to improve search result. IEEE (2015)
Shaa, H., Liub, T., Qinb, P., Sunb, Y., Liub, Q.: EPLogCleaner: improving data quality of enterprise proxy logs for efficient web usage mining. Proc. Comput. Sci. 17, 812–818 (2013). Information Technology and Quantitative Management, ITQM 2013
Hussain, T., Asghar, S., Masood, N.: Web usage mining: a survey on preprocessing of web log file. In: Proceedings of the 2010 International Conference on Information and Emerging Technologies (ICIET), pp. 1–6. IEEE (2010)
Tyagi, N., Solanki, A., Tyagi, S.: An algorithmic approach to data preprocessing in web usage mining. Int. J. Inf. Technol. Knowl. Manag. 2(2), 279–283 (2010)
Zheng, L., Gui, H., Li, F.: Optimized data preprocessing technology for web log mining. In: International Conference on Computer Design and Applications (ICCDA 2010) (2010)
Munk, M., Kapustaa, J., Šveca, P.: Data preprocessing evaluation for web log mining: reconstruction of activities of a web visitor. Proc. Comput. Sci. 1, 2273–2280 (2012). International Conference on Computational Science, ICCS 2011
Nithya, P., Sumathi, P.: Novel pre-processing technique for web log mining by removing global noise and web robots. In: National Conference on Computing and Communication Systems (NCCCS). IEEE (2012)
Sujatha, V., Punithavalli: Improved user navigation pattern prediction technique from web log data. Proc. Eng. 30, 92 (2012). International Conference on Communication Technology and System Design 2011
Aye, T.T.: Web log cleaning for mining of web usage patterns. IEEE (2011)
Lee, C.-H., Lo, Y., Fu, Y.-H.: A novel prediction model based on hierarchical characteristic of web site. Expert Syst. Appl. 38, 3422–3430 (2011)
Losarwar, V., Joshi, M.: Data preprocessing in web usage mining. In: International Conference on Artificial Intelligence and Embedded Systems (ICAIES 2012), 15–16 July (2012)
Agarwal, R., Arya, K.V., Shekhar, S. Kumar, R.: An efficient weighted algorithm for web information retrieval system. IEEE (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Rathod, D., Khanna, S., Singh, M. (2018). Smart Two Level K-Means Algorithm to Generate Dynamic User Pattern Cluster. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems (ICTIS 2017) - Volume 2. ICTIS 2017. Smart Innovation, Systems and Technologies, vol 84. Springer, Cham. https://doi.org/10.1007/978-3-319-63645-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-63645-0_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63644-3
Online ISBN: 978-3-319-63645-0
eBook Packages: EngineeringEngineering (R0)