Multivariate random forest prediction of poverty and malnutrition prevalence

Advances in remote sensing and machine learning enable increasingly accurate, inexpensive, and timely estimation of poverty and malnutrition indicators to guide development and humanitarian agencies’ programming. However, state of the art models often rely on proprietary data and/or deep or transfer...

Full description

Bibliographic Details
Main Authors: Browne, Chris, Matteson, David S., McBride, Linden, Hu, Leiqiu, Liu, Yanyan, Sun, Ying, Wen, Jiaming, Barrett, Christopher B.
Format: Journal Article
Language:Inglés
Published: Public Library of Science 2021
Subjects:
Online Access:https://hdl.handle.net/10568/142847
Description
Summary:Advances in remote sensing and machine learning enable increasingly accurate, inexpensive, and timely estimation of poverty and malnutrition indicators to guide development and humanitarian agencies’ programming. However, state of the art models often rely on proprietary data and/or deep or transfer learning methods whose underlying mechanics may be challenging to interpret. We demonstrate how interpretable random forest models can produce estimates of a set of (potentially correlated) malnutrition and poverty prevalence measures using free, open access, regularly updated, georeferenced data. We demonstrate two use cases: contemporaneous prediction, which might be used for poverty mapping, geographic targeting, or monitoring and evaluation tasks, and a sequential nowcasting task that can inform early warning systems. Applied to data from 11 low and lower-middle income countries, we find predictive accuracy broadly comparable for both tasks to prior studies that use proprietary data and/or deep or transfer learning methods.