Use of the Decision Tree Technique to Estimate Sugarcane Productivity Under Edaphoclimatic Conditions
- 149 Downloads
A number of biometric evaluations are performed during harvest for measuring the growth and development of the sugarcane crop. From these evaluations, hundreds of data values are generated, containing certain information on the productivity of the culture in that crop and edaphoclimatic region. Accordingly, the objective of this work was to identify, using a decision tree classification technique, the biometric attribute having the greatest effect on the productivity of the plant cane in different planting configurations and edaphoclimatic conditions. To accomplish this, data were evaluated from four experiments with sugarcane, located within the São Paulo municipalities of Teodoro Sampaio, Guaíra, Iracemápolis, and Lençóis Paulista. The classification model was generated using the decision tree technique, a type of intuitive learning that creates a hypothesis based on particular instances that results in general conclusions. The decision trees applied to the data of the four sites showed that the population of plants per hectare has the highest information gain (split attribute) on the class attribute (productivity). Using the “Chi-square” method of attribute selection, the population of plants per hectare was observed to have the largest correlation with the final productivity of the culture. Therefore, the decision tree indicates that the attribute “plant population per area” should be used as the method to evaluate the productive potential of the culture during its growth cycle. It has the best correlation with the final productivity of the crop, in addition to being an attribute easy to measure in the field.
KeywordsProductivity classification Selection of biometric attributes Methods for data classification Saccharum spp
We thank sugarcane mills Alcídia, Guaíra, Iracema, Porto das Águas, and Zilor for their support in the execution of field experiments, and the BNDES Project/Jet/CTBE for financing the data collection in the field.
This study was funded by a project of CTBE (Brazilian Bioethanol Science and Technology Centre) and Jacto by Funtec of the BNDES. The author João Rossi Neto received a CAPES masters scholarship during the achievement of the project.
Compliance with Ethical Standards
Conflict of interest
The authors declare that they have no conflict of interest.
- Cabena, P., P. Hadjinian, R. Stadler, J. Verhees, and A. Zanasi. 1998. Discovering data mining: From concept to implementation, 1st ed. Upper Saddle River, NJ: Prentice-Hall Publishing.Google Scholar
- Cavalett, O., T.L. Junqueira, M.O.S. Dias, C.D. Jesus, P.E. Mantelatto, M.P. Cunha, H.C.J. Franco, T.F. Cardoso, R. Maciel Filho, C.E.V. Rossell, and A. Bonomi. 2012. Environmental and economic assessment of sugarcane first generation biorefineries in Brazil. Clean Technologies and Environmental Policy 14: 399–410. doi: 10.1007/s10098-011-0424-7.CrossRefGoogle Scholar
- Dalchiavon, F., M.P. Carvalho, R. Montanari, M. Andreotti, and A.R. Panosso. 2014. Produtividade da cana-de-açúcar: variabilidade linear e espacial entre componentes tecnológicos e da produção. Bioscience Journal 30: 390–400.Google Scholar
- Ehsanullah, K.J., M. Jamil, and A. Ghafar. 2011. Optimizing the row spacing and seeding density to improve yield and quality of sugarcane. Crop and Environment 2: 1–5.Google Scholar
- Elmasri, R., and S.B. Navathe. 2005. Sistemas de Banco de Dados, 4th ed. São Paulo, SP: Pearson Education.Google Scholar
- Miller, D., J. Mccarthy, and A. Zakzeski. 2009. A fresh approach to agricultural statistics: Data mining and remote sensing. In 2009 joint statistical meetings, 3144–3155. American Statistical Association, Washington, DC.Google Scholar
- Quinlan, J.R. 1993. C4.5: Programs for machine learning, 1st ed. Burlington, MA: Morgan Kaufmann Press.Google Scholar
- Witten, I.H., and E. Frank. 2005. Data mining: Practical machine learning tools and techniques, 2nd ed. Burlington, MA: Morgan Kaufmann Press.Google Scholar
- Witten, I.H., E. Frank, and M.A. Hall. 2011. Data mining: Practical machine learning tools and techniques, 3rd ed. Burlington, MA: Morgan Kaufmann Press.Google Scholar