EXPLAINABILITY OF END AND MID-SEASON COTTON YIELD PREDICTORS IN CONUS


Çelik M. F., Işık M. S., Erten E., Camps-Valls G.

2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023, California, United States of America, 16 - 21 July 2023

  • Publication Type: Conference Paper / Full Text
  • DOI Number: 10.1109/igarss52108.2023.10283252
  • City: California
  • Country: United States of America
  • Istanbul Technical University Affiliated: Yes

Abstract

In this study, we examined the effectiveness of integrating satellite-based crop biophysical parameters, meteorological conditions, and soil properties for end-of-season and mid-season cotton yield prediction in the continental United States (CONUS). We employed six machine learning algorithms: decision tree (DT), random forest (RF), adaptive boosting (AdaBoost), gradient boosting (GB), light gradient boosting machine (LightGBM), and extreme gradient boosting machine (XGBoost). With hyperparameters tuned by Bayesian optimization, XGBoost was found to be the best method for both mid-season and end-of-season cotton yield prediction. Furthermore, we investigated the global importance of temporal and static features using the Shapley Additive Global importancE (SAGE) method to understand the driving factors of cotton yield prediction. The global feature importance analysis identified precipitation (P), enhanced vegetation index (EVI), and leaf area index (LAI) as the most important temporal features, and silt and pH as the most important soil properties.
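
The idea behind a Shapley-based global importance score such as SAGE can be illustrated with a minimal sketch. It is not the authors' implementation and does not use the SAGE library. Everything in it is an assumption made for illustration: the data are synthetic, the "model" is a fixed linear function, the feature named "noise" is a hypothetical placeholder, and missing features are marginalized by simple mean imputation. The value of a feature subset is taken to be the reduction in mean squared error obtained when the model sees those features, and each feature's importance is its exact Shapley value under that value function.

```python
import random
from itertools import permutations

random.seed(0)

# Hypothetical feature names; "noise" is deliberately unrelated to the target.
FEATURES = ["P", "EVI", "noise"]

# Synthetic, independent standard-normal features (not data from the paper).
rows = [{f: random.gauss(0.0, 1.0) for f in FEATURES} for _ in range(500)]


def model(x):
    """Stand-in for a trained regressor: a fixed linear function."""
    return 2.0 * x["P"] + 1.0 * x["EVI"]


# Synthetic target: the model output plus a small amount of noise.
targets = [model(r) + random.gauss(0.0, 0.1) for r in rows]

# Sample means, used to "remove" a feature by mean imputation.
means = {f: sum(r[f] for r in rows) / len(rows) for f in FEATURES}


def loss_with(subset):
    """Mean squared error when only the features in `subset` are visible."""
    total = 0.0
    for r, y in zip(rows, targets):
        x = {f: (r[f] if f in subset else means[f]) for f in FEATURES}
        total += (model(x) - y) ** 2
    return total / len(rows)


def value(subset):
    """Loss reduction relative to seeing no features at all."""
    return loss_with(frozenset()) - loss_with(subset)


def shapley_values(features, value_fn):
    """Exact Shapley values: average marginal contribution over all orderings."""
    phi = {f: 0.0 for f in features}
    orderings = list(permutations(features))
    for order in orderings:
        coalition = frozenset()
        for f in order:
            with_f = coalition | {f}
            phi[f] += value_fn(with_f) - value_fn(coalition)
            coalition = with_f
    return {f: total / len(orderings) for f, total in phi.items()}


phi = shapley_values(FEATURES, value)

# Print features from most to least important.
for name, score in sorted(phi.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {score:.3f}")
```

Exact enumeration over all orderings is feasible only for a handful of features. With many temporal and static predictors, the Shapley values would need to be approximated by sampling, and mean imputation would need to be replaced by a more careful treatment of correlated features.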