Skip to main content

Smart Two Level K-Means Algorithm to Generate Dynamic User Pattern Cluster

  • Conference paper
  • First Online:
Information and Communication Technology for Intelligent Systems (ICTIS 2017) - Volume 2 ( ICTIS 2017)

Abstract

Data cleaning perform in the Data Preprocessing and Mining. The clean data work of web server logs irrelevant items and useless data can not completely removed and Overlapped data causes difficulty during retrieving data from datasource. Previous paper had given 30% performance of datasource. So We have Implemented Smart Two-level clustering method to get pattern data for mining. This paper presents WebLogCleaner can filter out much irrelevant, inconsistent data based on the common of their URLs and it is going to improving 8% of the data quality, performance, Accuracy and efficiency of any Datasource.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Rathod, D., Khanna, S.: A comparison of k-means clustering and smart two level k-means clustering algorithm. IJSART (2017)

    Google Scholar 

  2. Rathod, D., Khanna, S.: A survey on different efficient clustering techniques used in web mining. IJSRD 4(8), 51–53 (2016)

    Google Scholar 

  3. Mengar, K., Rathod, D.: Ant based data reduction in web usage mining using k-means clustering algorithm. IJSDR (2016)

    Google Scholar 

  4. Rathod, D., Khanna, S.: Improved two level k-means clustering algorithm to generate user pattern clustering. IJSART (2016)

    Google Scholar 

  5. Rathod, D., Khanna, S.: Implemented two level k-means clustering algorithm to improve quality in user pattern mining. IJSART (2016)

    Google Scholar 

  6. Rathod, D., Khanna, S.: Improve quality in user pattern mining approach using two level k-means clustering methodology. IJSRD 4(1), 351–353 (2016)

    Google Scholar 

  7. Jeba, J.M.P., Bhuvaneswari, M.S., Muneeswaran, K.: Extracting usage patterns from web server log. IEEE (2016)

    Google Scholar 

  8. Sisodia, D.S., Verma, S.: Web usage pattern analysis through web logs: a review. IEEE (2016)

    Google Scholar 

  9. Dhanalakshmi, P., Ramani, K., Eswara Reddy, B.: The research of preprocessing and pattern discovery techniques on web log files. IEEE (2016)

    Google Scholar 

  10. Mehrotra, S., Kohli, S.: Comparative analysis of k-means with other clustering algorithms to improve search result. IEEE (2015)

    Google Scholar 

  11. Shaa, H., Liub, T., Qinb, P., Sunb, Y., Liub, Q.: EPLogCleaner: improving data quality of enterprise proxy logs for efficient web usage mining. Proc. Comput. Sci. 17, 812–818 (2013). Information Technology and Quantitative Management, ITQM 2013

    Article  Google Scholar 

  12. Hussain, T., Asghar, S., Masood, N.: Web usage mining: a survey on preprocessing of web log file. In: Proceedings of the 2010 International Conference on Information and Emerging Technologies (ICIET), pp. 1–6. IEEE (2010)

    Google Scholar 

  13. Tyagi, N., Solanki, A., Tyagi, S.: An algorithmic approach to data preprocessing in web usage mining. Int. J. Inf. Technol. Knowl. Manag. 2(2), 279–283 (2010)

    Google Scholar 

  14. Zheng, L., Gui, H., Li, F.: Optimized data preprocessing technology for web log mining. In: International Conference on Computer Design and Applications (ICCDA 2010) (2010)

    Google Scholar 

  15. Munk, M., Kapustaa, J., Šveca, P.: Data preprocessing evaluation for web log mining: reconstruction of activities of a web visitor. Proc. Comput. Sci. 1, 2273–2280 (2012). International Conference on Computational Science, ICCS 2011

    Article  Google Scholar 

  16. Nithya, P., Sumathi, P.: Novel pre-processing technique for web log mining by removing global noise and web robots. In: National Conference on Computing and Communication Systems (NCCCS). IEEE (2012)

    Google Scholar 

  17. Sujatha, V., Punithavalli: Improved user navigation pattern prediction technique from web log data. Proc. Eng. 30, 92 (2012). International Conference on Communication Technology and System Design 2011

    Google Scholar 

  18. Aye, T.T.: Web log cleaning for mining of web usage patterns. IEEE (2011)

    Google Scholar 

  19. Lee, C.-H., Lo, Y., Fu, Y.-H.: A novel prediction model based on hierarchical characteristic of web site. Expert Syst. Appl. 38, 3422–3430 (2011)

    Article  Google Scholar 

  20. Losarwar, V., Joshi, M.: Data preprocessing in web usage mining. In: International Conference on Artificial Intelligence and Embedded Systems (ICAIES 2012), 15–16 July (2012)

    Google Scholar 

  21. Agarwal, R., Arya, K.V., Shekhar, S. Kumar, R.: An efficient weighted algorithm for web information retrieval system. IEEE (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dushyantsinh Rathod .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Rathod, D., Khanna, S., Singh, M. (2018). Smart Two Level K-Means Algorithm to Generate Dynamic User Pattern Cluster. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems (ICTIS 2017) - Volume 2. ICTIS 2017. Smart Innovation, Systems and Technologies, vol 84. Springer, Cham. https://doi.org/10.1007/978-3-319-63645-0_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63645-0_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63644-3

  • Online ISBN: 978-3-319-63645-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics