Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction

The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However,...

Full description

Bibliographic Details
Main Authors: Yunbi Xu, Zhang Xingping, Huihui Li, Hongjian Zheng, Jianan Zhang, Olsen, Michael, Varshney, Rajeev K., Boddupalli, P.M., Qian Qian
Format: Journal Article
Language:Inglés
Published: Elsevier 2022
Subjects:
Online Access:https://hdl.handle.net/10568/127374
_version_ 1855529899271389184
author Yunbi Xu
Zhang Xingping
Huihui Li
Hongjian Zheng
Jianan Zhang
Olsen, Michael
Varshney, Rajeev K.
Boddupalli, P.M.
Qian Qian
author_browse Boddupalli, P.M.
Hongjian Zheng
Huihui Li
Jianan Zhang
Olsen, Michael
Qian Qian
Varshney, Rajeev K.
Yunbi Xu
Zhang Xingping
author_facet Yunbi Xu
Zhang Xingping
Huihui Li
Hongjian Zheng
Jianan Zhang
Olsen, Michael
Varshney, Rajeev K.
Boddupalli, P.M.
Qian Qian
author_sort Yunbi Xu
collection Repository of Agricultural Research Outputs (CGSpace)
description The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.
format Journal Article
id CGSpace127374
institution CGIAR Consortium
language Inglés
publishDate 2022
publishDateRange 2022
publishDateSort 2022
publisher Elsevier
publisherStr Elsevier
record_format dspace
spelling CGSpace1273742025-12-08T10:11:39Z Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction Yunbi Xu Zhang Xingping Huihui Li Hongjian Zheng Jianan Zhang Olsen, Michael Varshney, Rajeev K. Boddupalli, P.M. Qian Qian breeding marker-assisted selection crops machine learning data artificial intelligence The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support. 2022-11 2023-01-18T10:38:42Z 2023-01-18T10:38:42Z Journal Article https://hdl.handle.net/10568/127374 en Open Access application/pdf Elsevier Xu, Y., Zhang, X., Li, H., Zheng, H., Zhang, J., Olsen, M. S., Varshney, R. K., Prasanna, B. M., & Qian, Q. (2022). Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. Molecular Plant, 15(11), 1664–1695. https://doi.org/10.1016/j.molp.2022.09.001
spellingShingle breeding
marker-assisted selection
crops
machine learning
data
artificial intelligence
Yunbi Xu
Zhang Xingping
Huihui Li
Hongjian Zheng
Jianan Zhang
Olsen, Michael
Varshney, Rajeev K.
Boddupalli, P.M.
Qian Qian
Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction
title Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction
title_full Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction
title_fullStr Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction
title_full_unstemmed Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction
title_short Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction
title_sort smart breeding driven by big data artificial intelligence and integrated genomic enviromic prediction
topic breeding
marker-assisted selection
crops
machine learning
data
artificial intelligence
url https://hdl.handle.net/10568/127374
work_keys_str_mv AT yunbixu smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT zhangxingping smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT huihuili smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT hongjianzheng smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT jiananzhang smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT olsenmichael smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT varshneyrajeevk smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT boddupallipm smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction
AT qianqian smartbreedingdrivenbybigdataartificialintelligenceandintegratedgenomicenviromicprediction