Skip to main content

A Comparative Analysis of the Different Data Mining Tools by Using Supervised Learning Algorithms

  • Conference paper
  • First Online:
Proceedings of the Eighth International Conference on Soft Computing and Pattern Recognition (SoCPaR 2016) (SoCPaR 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 614))

Included in the following conference series:

  • 1313 Accesses

Abstract

These days a lot of raw data is generated from various common sources. This large amount of data, which would appear useless at first glance, is very important for companies and researchers as could provide a lot of helpful information. The data could be mined to get useful knowledge that could be used to make fruitful decisions. A lot of online tools and proprietary toolkits are available to the users and it becomes all the more cumbersome for them to know which is the best tool among these for the supervised learning algorithm and datasets they are applying. In order to aid this process, the paper progresses in this direction by doing a comparison of various data mining tools on the basis of their classification finesse. The various tools used in the paper are weka, knime and tanagra. Rigorous work on this has given the result that the performance of the tools is affected by the kind of datasets used and the way in which the supervised learning is done.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Baker, R.S., Yacef, K.: The state of educational data mining in 2009: a review and future visions. JEDM-J. Educ. Data Min. 1(1), 3–17 (2009)

    Google Scholar 

  2. Bengio, Y., Grandvalet, Y.: No unbiased estimator of the variance of k-fold cross-validation. J. Mach. Learn. Res. 5, 1089–1105 (2004)

    MathSciNet  MATH  Google Scholar 

  3. bin Othman, M.F., Yau, T.M.S.: Comparison of different classification techniques using WEKA for breast cancer. In: 3rd Kuala Lumpur International Conference on Biomedical Engineering 2006, pp. 520–523. Springer, Berlin, Heidelberg (2007)

    Google Scholar 

  4. Chauhan, N., Gautam, N.: Parametric comparison of data mining tools (2015)

    Google Scholar 

  5. David, S.K., Saeb, A.T., Al Rubeaan, K.: Comparative analysis of data mining tools and classification techniques using weka in medical bioinformatics. Comput. Eng. Intell. Syst. 4(13), 28–38 (2013)

    Google Scholar 

  6. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)

    Article  Google Scholar 

  7. Iyer, A., Jeyalatha, S., Sumbaly, R.: Diagnosis of diabetes using classification mining techniques.arXiv preprint arXiv:1502.03774 (2015)

  8. Jain, D.: A comparison of data mining tools using the implementation of C4.5 Algorithm. Int. J. Sci. Res. 3(8), 33–37 (2014)

    Google Scholar 

  9. Patil, P.H., Thube, S., Ratnaparkhi, B., Rajeswari, K.: Analysis of different data mining tools using classification, clustering and association rule mining. Int. J. Comput. Appl. 93(8), 35–39 (2014)

    Google Scholar 

  10. Solanki, H.: Comparative study of data mining tools and analysis with unified data mining theory. Int. J. Comput. Appl. 75(16) (2013)

    Google Scholar 

  11. Tolan, G.M., Soliman, O.S.: An experimental study of classification algorithms for terrorism prediction (2015)

    Google Scholar 

  12. Vaithiyanathan, V., Rajeswari, K., Tajane, K., Pitale, R.: Comparison of different classification techniques using different datasets. Int. J. Adv. Eng. Technol. 6(2), 764 (2013)

    Google Scholar 

  13. Wimmer, H., Powell, L.M.: A comparison of open source tools for data science. J. Inf. Syst. Appl. Res. 9(2), 4 (2016)

    Google Scholar 

  14. WEKA, the University of Waikato. http://www.cs.waikato.ac.nz/ml/weka/

  15. Tanagra – a Free Data Mining Software for Teaching and Research. http://eric.univ-lyon2.fr/~ricco/tanagra/en/tanagra.html

  16. KNIME (Konstanz Information Miner). http://www.knime.org/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Akarsh Goyal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Goyal, A., Khandelwal, I., Anand, R., Srivastava, A., Swarnalatha, P. (2018). A Comparative Analysis of the Different Data Mining Tools by Using Supervised Learning Algorithms. In: Abraham, A., Cherukuri, A., Madureira, A., Muda, A. (eds) Proceedings of the Eighth International Conference on Soft Computing and Pattern Recognition (SoCPaR 2016). SoCPaR 2016. Advances in Intelligent Systems and Computing, vol 614. Springer, Cham. https://doi.org/10.1007/978-3-319-60618-7_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-60618-7_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-60617-0

  • Online ISBN: 978-3-319-60618-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics