Abstract
Decision trees are well known, widely used algorithm for building efficient classifiers.We propose the modification of the Parallel Hoeffding Tree algorithm that could deal with large streaming data. The proposed method were evaluated on the basis of computer experiment which were carried on few real datasets. The algorithm uses parallel approach and the Hoeffding inequality for better performance with large streaming data. The paper present the analysis of Hoeffding tree and its issues.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ben-Haim, Y., Tom-Tov, E.: A streaming parallel decision tree algorithm. J. Mach. Learn. Res. 11, 849–872 (2010)
Domingos, P., Hulten, G.: Mining high-speed data streams. In: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 71–80 (2000)
Hoeffding, W.: Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 13–30 (1963)
Ikonomovska, E., Gama, J., Zenko, B., Dzeroski, S.: Speeding-up hoeffding-based regression trees with options. In: ICML, pp. 537–544. Omnipress (2011)
Jin, R., Agrawal, G.: Communication and memory efficient parallel decision tree construction. In: The 3rd SIAM International Conference on Data Mining (2003)
Jin, R., Agrawal, G.: Efficient decision tree construction on streaming data. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 571–576 (2003)
Kacprzak, T., Walkowiak, K., Wozniak, M.: Optimization of overlay distributed computing systems for multiple classifier system - heuristic approach. Logic Journal of the IGPL 20(4), 677–688 (2012)
Kufrin, R.: Decision trees on parallel processors. In: Parallel Processing for Artificial Intelligence 3. Elsevier Science, pp. 279–306. Elsevier (1995)
Mitchell, T.M.: Machine Learning, 1st edn. McGraw-Hill, Inc., New York (1997)
Pfahringer, B., Holmes, G., Kirkby, R.: New options for hoeffding trees. In: Orgun, M.A., Thornton, J. (eds.) AI 2007. LNCS (LNAI), vol. 4830, pp. 90–99. Springer, Heidelberg (2007)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. (1986)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann (1993)
Yildiz, O.T., Dikmen, O.: Parallel univariate decision trees. Pattern Recogn. Lett. 28(7), 825–832 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Cal, P., Woźniak, M. (2013). Parallel Hoeffding Decision Tree for Streaming Data. In: Omatu, S., Neves, J., Rodriguez, J., Paz Santana, J., Gonzalez, S. (eds) Distributed Computing and Artificial Intelligence. Advances in Intelligent Systems and Computing, vol 217. Springer, Cham. https://doi.org/10.1007/978-3-319-00551-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-00551-5_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-00550-8
Online ISBN: 978-3-319-00551-5
eBook Packages: EngineeringEngineering (R0)