Skip to main content

Part of the book series: Advances in Soft Computing ((AINSC,volume 50))

Summary

Current applications from industry, science, and business are storing huge amount of data everyday. This data most of the time comes from distributed sources and are usually analysed for the organizations to discover knowledge and recognize patterns by means of Data Mining (DM) techniques. This analysis usually requires to put all information together in a big centralized datasets. Analysing this huge dataset could be very expensive in terms of time and memory consuming. For reducing this cost some Distributed Data Mining (DDM) architectures have been developed in recently years. This paper presents an approach to building a distributed ID3 classifier which takes only metadata from distributed datasets avoiding the total access to the original data. This approach reduces the computing time nedeed to build the classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Witten, H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann Publishers, San Francisco (2005)

    MATH  Google Scholar 

  2. Talia, D., Trunfio, P., Verta, O.: Weka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 309–320. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  3. Khoussainov, R., Zuo, X., Kushmerick, N.: Grid-enabled Weka: A Toolkit for Machine Learning on the Grid. ERCIM 59, 47–48 (2004)

    Google Scholar 

  4. Shaikh Ali, A., Rana, O.F., Taylor, I.J.: Web Services Composition for Distributed Data Mining. In: International Conference Workshop on Parallel Processing, pp. 11–18. IEEE, Los Alamitos (2005)

    Google Scholar 

  5. Perez, M.S., Sanchez, A., Herrero, P., Robles, V.: Adapting the Weka Data Mining Toolkit to a Grid based environment. In: Szczepaniak, P.S., Kacprzyk, J., Niewiadomski, A. (eds.) AWIC 2005. LNCS (LNAI), vol. 3528, pp. 492–497. Springer, Heidelberg (2005)

    Google Scholar 

  6. Quinlan, J.R.: Induction of Decision Trees, Machine Learning, Hingham, MA, USA, vol. 1(1), pp. 81–106. Kluwer Academic Publishers, Dordrecht (1986)

    Google Scholar 

  7. Ross Quinlan, J.: C4.5: programs for machine learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  8. McQueen, J.: Some methods for classification and analysis of multivariations. In: Proc. 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–2297 (1967)

    Google Scholar 

  9. Mierswa, I., Wurst, M., Klinkenberg, R., Scholz, M., Euler, T.: YALE: Rapid Prototyping for Complex Data Mining Tasks. In: 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)

    Google Scholar 

  10. University of Illinois and Data Mining Research Group and DAIS Research Laboratory, IlliMine 1.1.0, http://illimine.cs.uiuc.edu/

  11. Statistics Department of the University of Auckland, R Project 2.6.1, http://www.r-project.org/

  12. Williams, G.: Rattle 2.2.74, http://rattle.togaware.com/

  13. Artificial Intelligence Unit of University of Dortmund, Yale 4.0, http://rapid-i.com/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Juan M. Corchado Sara Rodríguez James Llinas José M. Molina

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jasso-Luna, O., Sosa-Sosa, V., Lopez-Arevalo, I. (2009). An Approach to Building a Distributed ID3 Classifier. In: Corchado, J.M., Rodríguez, S., Llinas, J., Molina, J.M. (eds) International Symposium on Distributed Computing and Artificial Intelligence 2008 (DCAI 2008). Advances in Soft Computing, vol 50. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85863-8_45

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85863-8_45

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85862-1

  • Online ISBN: 978-3-540-85863-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics