A Bayesian methodology for building consistent datasets for structural modeling

Simulation models are powerful tools that help us understand, analyze, and explain dynamic, complex systems. They provide empirical methodologies to explore how systems and agents behave and consider how they may change when responding to shocks and stresses. The power of these tools, however, depen...

Descripción completa

Detalles Bibliográficos
Autores principales: Mason-D'Croz, Daniel, Robinson, Sherman, Dunston, Shahnila, Sulser, Timothy B.
Formato: Conference Paper
Lenguaje:Inglés
Publicado: 2018
Materias:
Acceso en línea:https://hdl.handle.net/10568/145843
_version_ 1855526529973354496
author Mason-D'Croz, Daniel
Robinson, Sherman
Dunston, Shahnila
Sulser, Timothy B.
author_browse Dunston, Shahnila
Mason-D'Croz, Daniel
Robinson, Sherman
Sulser, Timothy B.
author_facet Mason-D'Croz, Daniel
Robinson, Sherman
Dunston, Shahnila
Sulser, Timothy B.
author_sort Mason-D'Croz, Daniel
collection Repository of Agricultural Research Outputs (CGSpace)
description Simulation models are powerful tools that help us understand, analyze, and explain dynamic, complex systems. They provide empirical methodologies to explore how systems and agents behave and consider how they may change when responding to shocks and stresses. The power of these tools, however, depends on the quality of the data on which they are built. Many complex systems studied in the social sciences, including economic systems, are characterized by sparseness of available data on behavioral characteristics and system outcomes. Generally, there is no single data source that can provide all the necessary information and detail for building a complex, structural, simulation model. Even where good data are available, few datasets are “model ready” without a lot of processing and cleaning. To populate models with data requires significant effort to stitch together a complete, coherent, and model-consistent dataset from a multitude of sources that vary in scope, time-scale, completeness, and quality. Due to information scarcity and variable quality, this challenge is well-suited to a Bayesian approach to efficiently use all available data. To this end, we present a data management system where we apply information theoretic, cross-entropy estimation methods to various FAO agricultural datasets to generate a complete global database of agricultural production, demand, and trade for use in IFPRI’s IMPACT model, a global agricultural partial equilibrium multi-market model. We will describe the information theory that serves as the foundation of this methodology, as well as the practical implementation for use in IMPACT. This data estimation methodology was developed for a partial equilibrium modeling framework, but the principals presented, are applicable to other data processing problems, where there is sparse and poor-quality data (e.g., data for computable general equilibrium models).
format Conference Paper
id CGSpace145843
institution CGIAR Consortium
language Inglés
publishDate 2018
publishDateRange 2018
publishDateSort 2018
record_format dspace
spelling CGSpace1458432025-12-08T10:11:39Z A Bayesian methodology for building consistent datasets for structural modeling Mason-D'Croz, Daniel Robinson, Sherman Dunston, Shahnila Sulser, Timothy B. simulation models models data management data agricultural economics Simulation models are powerful tools that help us understand, analyze, and explain dynamic, complex systems. They provide empirical methodologies to explore how systems and agents behave and consider how they may change when responding to shocks and stresses. The power of these tools, however, depends on the quality of the data on which they are built. Many complex systems studied in the social sciences, including economic systems, are characterized by sparseness of available data on behavioral characteristics and system outcomes. Generally, there is no single data source that can provide all the necessary information and detail for building a complex, structural, simulation model. Even where good data are available, few datasets are “model ready” without a lot of processing and cleaning. To populate models with data requires significant effort to stitch together a complete, coherent, and model-consistent dataset from a multitude of sources that vary in scope, time-scale, completeness, and quality. Due to information scarcity and variable quality, this challenge is well-suited to a Bayesian approach to efficiently use all available data. To this end, we present a data management system where we apply information theoretic, cross-entropy estimation methods to various FAO agricultural datasets to generate a complete global database of agricultural production, demand, and trade for use in IFPRI’s IMPACT model, a global agricultural partial equilibrium multi-market model. We will describe the information theory that serves as the foundation of this methodology, as well as the practical implementation for use in IMPACT. This data estimation methodology was developed for a partial equilibrium modeling framework, but the principals presented, are applicable to other data processing problems, where there is sparse and poor-quality data (e.g., data for computable general equilibrium models). 2018-05-24 2024-06-21T09:05:10Z 2024-06-21T09:05:10Z Conference Paper https://hdl.handle.net/10568/145843 en Open Access Mason-D'Croz, Daniel; Robinson, Sherman; Dunston, Shahnila; and Sulser, Timothy. 2018. A Bayesian methodology for building consistent datasets for structural modeling. Presented at the 21st Annual Conference on Global Economic Analysis, in Cartagena de Indias Convention Center, Cartagena, Colombia, June 13-15, 2018. https://www.gtap.agecon.purdue.edu/resources/res_display.asp?RecordID=5545
spellingShingle simulation models
models
data management
data
agricultural economics
Mason-D'Croz, Daniel
Robinson, Sherman
Dunston, Shahnila
Sulser, Timothy B.
A Bayesian methodology for building consistent datasets for structural modeling
title A Bayesian methodology for building consistent datasets for structural modeling
title_full A Bayesian methodology for building consistent datasets for structural modeling
title_fullStr A Bayesian methodology for building consistent datasets for structural modeling
title_full_unstemmed A Bayesian methodology for building consistent datasets for structural modeling
title_short A Bayesian methodology for building consistent datasets for structural modeling
title_sort bayesian methodology for building consistent datasets for structural modeling
topic simulation models
models
data management
data
agricultural economics
url https://hdl.handle.net/10568/145843
work_keys_str_mv AT masondcrozdaniel abayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT robinsonsherman abayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT dunstonshahnila abayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT sulsertimothyb abayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT masondcrozdaniel bayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT robinsonsherman bayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT dunstonshahnila bayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling
AT sulsertimothyb bayesianmethodologyforbuildingconsistentdatasetsforstructuralmodeling