Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches

Wheat is the most important staple crop grown in Australia, and Australia is one of the top wheat exporting countries globally. Timely and reliable wheat yield prediction in Australia is important for regional and global food security. Prior studies use either climate data, or satellite data, or a c...

Full description

Bibliographic Details
Main Authors: Cai, Y., Guan, K., Lobell, D. B., Potgieter, A.B., Wang, S., You, Liangzhi
Format: Journal Article
Language:Inglés
Published: Elsevier 2019
Subjects:
Online Access:https://hdl.handle.net/10568/108339
_version_ 1855517143192305664
author Cai, Y.
Guan, K.
Lobell, D. B.
Potgieter, A.B.
Wang, S.
You, Liangzhi
author_browse Cai, Y.
Guan, K.
Lobell, D. B.
Potgieter, A.B.
Wang, S.
You, Liangzhi
author_facet Cai, Y.
Guan, K.
Lobell, D. B.
Potgieter, A.B.
Wang, S.
You, Liangzhi
author_sort Cai, Y.
collection Repository of Agricultural Research Outputs (CGSpace)
description Wheat is the most important staple crop grown in Australia, and Australia is one of the top wheat exporting countries globally. Timely and reliable wheat yield prediction in Australia is important for regional and global food security. Prior studies use either climate data, or satellite data, or a combination of these two to build empirical models to predict crop yield. However, though the performance of yield prediction using empirical methods is improved by combining the use of climate and satellite data, the contributions from different data sources are still not clear. In addition, how the regression-based methods compare with various machine-learning based methods in their performance in yield prediction is also not well understood and needs in-depth investigation. This work integrated various sources of data to predict wheat yield across Australia from 2000 to 2014 at the statistical division (SD) level. We adopted a well-known regression method (LASSO, as a benchmark) and three mainstream machine learning methods (support vector machine, random forest, and neural network) to build various empirical models for yield prediction. For satellite data, we used the enhanced vegetation index (EVI) from MODIS and solar-induced chlorophyll fluorescence (SIF) from GOME-2 and SCIAMACHY as metrics to approximate crop productivity. The machine-learning based methods outperform the regression method in modeling crop yield. Our results confirm that combining climate and satellite data can achieve high performance of yield prediction at the SD level (R2 ˜ 0.75). The satellite data track crop growth condition and gradually capture the variability of yield evolving with the growing season, and their contributions to yield prediction usually saturate at the peak of the growing season. Climate data provide extra and unique information beyond what the satellite data have offered for yield prediction, and our empirical modeling work shows the added values of climate variables exist across the whole season, not only at some certain stages. We also find that using EVI as an input can achieve better performance in yield prediction than SIF, primarily due to the large noise in the satellite-based SIF data (i.e. coarse resolution in both space and time). In addition, we also explored the potential for timely wheat yield prediction in Australia, and we can achieve the optimal prediction performance with approximately two-month lead time before wheat maturity. The proposed methodology in this paper can be extended to different crops and different regions for crop yield prediction.
format Journal Article
id CGSpace108339
institution CGIAR Consortium
language Inglés
publishDate 2019
publishDateRange 2019
publishDateSort 2019
publisher Elsevier
publisherStr Elsevier
record_format dspace
spelling CGSpace1083392025-12-08T10:11:39Z Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches Cai, Y. Guan, K. Lobell, D. B. Potgieter, A.B. Wang, S. You, Liangzhi climatic data satellites wheat crop yield food security forestry spatial data data agricultural production forecasting technology machine learning fluorescence climate change Wheat is the most important staple crop grown in Australia, and Australia is one of the top wheat exporting countries globally. Timely and reliable wheat yield prediction in Australia is important for regional and global food security. Prior studies use either climate data, or satellite data, or a combination of these two to build empirical models to predict crop yield. However, though the performance of yield prediction using empirical methods is improved by combining the use of climate and satellite data, the contributions from different data sources are still not clear. In addition, how the regression-based methods compare with various machine-learning based methods in their performance in yield prediction is also not well understood and needs in-depth investigation. This work integrated various sources of data to predict wheat yield across Australia from 2000 to 2014 at the statistical division (SD) level. We adopted a well-known regression method (LASSO, as a benchmark) and three mainstream machine learning methods (support vector machine, random forest, and neural network) to build various empirical models for yield prediction. For satellite data, we used the enhanced vegetation index (EVI) from MODIS and solar-induced chlorophyll fluorescence (SIF) from GOME-2 and SCIAMACHY as metrics to approximate crop productivity. The machine-learning based methods outperform the regression method in modeling crop yield. Our results confirm that combining climate and satellite data can achieve high performance of yield prediction at the SD level (R2 ˜ 0.75). The satellite data track crop growth condition and gradually capture the variability of yield evolving with the growing season, and their contributions to yield prediction usually saturate at the peak of the growing season. Climate data provide extra and unique information beyond what the satellite data have offered for yield prediction, and our empirical modeling work shows the added values of climate variables exist across the whole season, not only at some certain stages. We also find that using EVI as an input can achieve better performance in yield prediction than SIF, primarily due to the large noise in the satellite-based SIF data (i.e. coarse resolution in both space and time). In addition, we also explored the potential for timely wheat yield prediction in Australia, and we can achieve the optimal prediction performance with approximately two-month lead time before wheat maturity. The proposed methodology in this paper can be extended to different crops and different regions for crop yield prediction. 2019-08 2020-05-27T15:20:26Z 2020-05-27T15:20:26Z Journal Article https://hdl.handle.net/10568/108339 en Limited Access Elsevier Cai, Y., Guan, K., Lobell, D. B., Potgieter, A. B., Wang, S., You, Liangzhi, et al. (2019). Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches. Agricultural and Forest Meteorology, 274:144-159. https://doi.org/10.1016/j.agrformet.2019.03.010
spellingShingle climatic data
satellites
wheat
crop yield
food security
forestry
spatial data
data
agricultural production
forecasting
technology
machine learning
fluorescence
climate change
Cai, Y.
Guan, K.
Lobell, D. B.
Potgieter, A.B.
Wang, S.
You, Liangzhi
Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
title Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
title_full Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
title_fullStr Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
title_full_unstemmed Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
title_short Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
title_sort integrating satellite and climate data to predict wheat yield in australia using machine learning approaches
topic climatic data
satellites
wheat
crop yield
food security
forestry
spatial data
data
agricultural production
forecasting
technology
machine learning
fluorescence
climate change
url https://hdl.handle.net/10568/108339
work_keys_str_mv AT caiy integratingsatelliteandclimatedatatopredictwheatyieldinaustraliausingmachinelearningapproaches
AT guank integratingsatelliteandclimatedatatopredictwheatyieldinaustraliausingmachinelearningapproaches
AT lobelldb integratingsatelliteandclimatedatatopredictwheatyieldinaustraliausingmachinelearningapproaches
AT potgieterab integratingsatelliteandclimatedatatopredictwheatyieldinaustraliausingmachinelearningapproaches
AT wangs integratingsatelliteandclimatedatatopredictwheatyieldinaustraliausingmachinelearningapproaches
AT youliangzhi integratingsatelliteandclimatedatatopredictwheatyieldinaustraliausingmachinelearningapproaches