SciELO - Scientific Electronic Library Online

 
vol.29 issue54Optimization of the Effect of Temperature and Bed Height on Cr (VI) Bioadsorption in Continuous SystemExploratory Study on Wetlands Area Decrease in Bogota due to Construction Activity: 1950-2016 author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • On index processCited by Google
  • Have no similar articlesSimilars in SciELO
  • On index processSimilars in Google

Share


Revista Facultad de Ingeniería

Print version ISSN 0121-1129On-line version ISSN 2357-5328

Abstract

LAMOS-DIAZ, Henry; PUENTES-GARZON, David-Esteban  and  ZARATE-CAICEDO, Diego-Alejandro. Comparison Between Machine Learning Models for Yield Forecast in Cocoa Crops in Santander, Colombia. Rev. Fac. ing. [online]. 2020, vol.29, n.54, e10853.  Epub July 30, 2020. ISSN 0121-1129.  https://doi.org/10.19053/01211129.v29.n54.2020.10853.

The identification of influencing factors in crop yield (kg·ha-1) provides essential information for decision-making processes related to the prediction and improvement of productivity, which gives farmers the opportunity to increase their income. The current study investigates the application of multiple machine learning algorithms for cocoa yield prediction and influencing factors identification. The Support Vector Machines (SVM) and Ensemble Learning Models (Random Forests, Gradient Boosting) are compared with Least Absolute Shrinkage and Selection Operator (LASSO) regression models. The considered predictors were climate conditions, cocoa variety, fertilization level and sun exposition in an experimental crop located in Rionegro, Santander. Results showed that Gradient Boosting is the best prediction alternative with Coefficient of determination (R2) = 68%, Mean Absolute Error (MAE) = 13.32, and Root Mean Square Error (RMSE) = 20.41. The crop yield variability is explained mainly by the radiation one month before harvest, the accumulated rainfall on the harvest month, and the temperature one month before harvest. Likewise, the crop yields are evaluated based on the kind of sun exposure, and it was found that radiation one month before harvest is the most influential factor in shade-grown plants. On the other hand, rainfall and soil moisture are determining variables in sun-grown plants, which is associated with the water requirements. These results suggest a differentiated management for crops depending on the kind of sun exposure to avoid compromising productivity, since there is no significant difference in the yield of both agricultural managements.

Keywords : agricultural yield; agroforestry system; cocoa; machine learning; prediction; productivity.

        · abstract in Spanish | Portuguese     · text in English     · English ( pdf )