Abstract
Boosting of tree-based classifiers has been interfaced to the Geographical Information System (GIS) GRASS to create predictive classification models from digital maps. On a risk management problem in landscape ecology, the performance of the boosted tree model is better than either with a single classifier or with bagging. This results in an improved digital map of the risk of human exposure to tick-borne diseases in Trentino (Italian Alps) given sampling on 388 sites and the use of several overlaying georeferenced data bases. Margin distributions are compared for bagging and boosting. Boosting is confirmed to give the most accurate model on two additional and independent test sets of reported cases of bites on humans and of infestation measured on roe deer. An interesting feature of combining classification models within a GIS is the visualization through maps of the single elements of the combination: each boosting step map focuses on different details of data distribution. In this problem, the best performance is obtained without controlling tree sizes, which indicates that there is a strong interaction between input variables.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Breiman, L.: Bagging Predictors. Machine Learning. 24(2) (1996) 123–140
Freund, Y., and Schapire R.: Experiments with a new boosting algorithm. In: Machine Learning: Proceedings of the Thirteenth International Conference (1996) 148–156
Breiman, L.: Combining predictors. In:Sharkey, A, (ed): Combining Artificial Neural Nets: Ensemble and Modular Multi-Net Systems. Springer-Verlag, London (1999) 31–50
Schapire, R., Freund, Y., Bartlett, P., and Lee W.: Boosting the margin: a new explanation for the effectiveness of voting methods. The Annals of Statistics 26(5) (1998) 1651–1686
Friedman, J., Hastie, T., and Tibshirani R.: Additive logistic regression: a statistical view of boosting. Technical report, Stanford University, (1999)
Merler, S., Furlanello, C., Chemini, C., and Nicolini, G.: Classification tree methods for analysis of mesoscale distribution of ixodes ricinus (acari: ixodidae) in Trentino, Italian Alps. Journal of Medical Entomology 33(6) (1996) 888–8937.
Efron, B., and Tibshirani, R.: Cross-validation and the bootstrap: estimating the error rate of a prediction rule. Technical report, Standford University, (1995)
Merler, S., and Furlanello, C.: Selection of tree-based classifiers with the bootstrap 632+ rule. Biometrical Journal 39(2) (1997) 1–14
Furlanello, C., Merler, S., Rizzoli, A., Chemini, C., and Genchi, C.: Bagging as a predictive method for landscape epidemiology of Lyme disease. Giornale Italiano di Cardiologia 29(5) (1999) 143–147
Furlanello, C., Merler, S., and Chemini, C.: Tree-based classifiers and GIS for biological risk forecasting. In:Morabito, F., (ed): Advanced in Intelligent Systems. IOS Press, Amsterdam (1997) 316–323
Dietterich, T.G: An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization. Machine Learning (1999) (to appear).
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Furlanello, C., Merle, S. (2000). Boosting of Tree-Based Classifiers for Predictive Risk Modeling in GIS. In: Multiple Classifier Systems. MCS 2000. Lecture Notes in Computer Science, vol 1857. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45014-9_21
Download citation
DOI: https://doi.org/10.1007/3-540-45014-9_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67704-8
Online ISBN: 978-3-540-45014-6
eBook Packages: Springer Book Archive