An Experimental Analysis of Traditional Machine Learning Algorithms for Maize Yield Prediction
DOI:
https://doi.org/10.37256/cm.5420244481Keywords:
classic machine learning, ensemble learning, maize, yield prediction, secondary dataAbstract
Maize plays a significant role in the African diet and is one of the main staple foods in many parts of the continent. Accurate yield estimations ensure an adequate food supply, contributing to food security and reducing the risk of food shortages. They also enable market planning and price setting. Machine learning is well known as one of the most advanced statistical methods for predicting crop yields. This paper provides extensive experiment results of machine learning models on maize production. Thirteen basic supervised learning algorithms classified into classic and ensemble learning are compared using three datasets of different sizes and from various sources (Kaggle, Zenodo). These datasets are from three main origins: experimentation, specifically covering crop data with 240 observations; predictions on crop yield from the FAO (Food and Agriculture Organization) and World Data Bank with 4,121 observations; and historical data from China with 975 observations. The metrics used to evaluate the models are the coefficient of determination, the mean absolute error, the root mean square error, and the explained variance score. Moreover, permutation importance is used on the best models to identify the most relevant predictors for the models according to the data. The results show that extremely randomized trees (ERT) and extreme gradient boosting (XGBoost) are more suitable for predicting maize yield with a coefficient of determination between 0.75 and 0.96 and 0.73 and 0.96, respectively. With the other metrics, the ERT model shows a low performance. Its training time varies between 2,547 and 7,814 seconds as obtained from a computer with characteristics of HP core i5, CPU @ 1.00 GHz, 1.9 GHz, and 8 GB RAM under 134 Windows 10. ERT and XGBoost are best suited to these databases of varying dimensions, making them perfect for predicting maize yield and streamlining decision-making processes.
Downloads
Published
How to Cite
Issue
Section
Categories
License
Copyright (c) 2024 Souand P.G. Tahi, et al.
This work is licensed under a Creative Commons Attribution 4.0 International License.