Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks

Germplasm conserved in gene banks is underutilized, owing mainly to the cost of characterization. Genomic prediction can be applied to predict the genetic merit of germplasm. Germplasm utilization could be greatly accelerated if prediction accuracy were sufficiently high with a training population o...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Sang, Liu, Hongyan, Zhan, Junhui, Meng, Yun, Wang, Yamei, Wang, Feng, Ye, Guoyou
Formato: Journal Article
Lenguaje:Inglés
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://hdl.handle.net/10568/164042
_version_ 1855538794237788160
author He, Sang
Liu, Hongyan
Zhan, Junhui
Meng, Yun
Wang, Yamei
Wang, Feng
Ye, Guoyou
author_browse He, Sang
Liu, Hongyan
Meng, Yun
Wang, Feng
Wang, Yamei
Ye, Guoyou
Zhan, Junhui
author_facet He, Sang
Liu, Hongyan
Zhan, Junhui
Meng, Yun
Wang, Yamei
Wang, Feng
Ye, Guoyou
author_sort He, Sang
collection Repository of Agricultural Research Outputs (CGSpace)
description Germplasm conserved in gene banks is underutilized, owing mainly to the cost of characterization. Genomic prediction can be applied to predict the genetic merit of germplasm. Germplasm utilization could be greatly accelerated if prediction accuracy were sufficiently high with a training population of practical size. Large-scale resequencing projects in rice have generated high quality genome-wide variation information for many diverse accessions, making it possible to investigate the potential of genomic prediction in rice germplasm management and exploitation. We phenotyped six traits in nearly 2000 indica (XI) and japonica (GJ) accessions from the Rice 3K project and investigated different scenarios for forming training populations. A composite core training set was considered in two levels which targets used for prediction of subpopulations within subspecies or prediction across subspecies. Composite training sets incorporating 400 or 200 accessions from either subpopulation of XI or GJ showed satisfactory prediction accuracy. A composite training set of 600 XI and GJ accessions showed sufficiently high prediction accuracy for both XI and GJ subspecies. Comparable or even higher prediction accuracy was observed for the composite training set than for the corresponding homogeneous training sets comprising accessions only of specific subpopulations of XI or GJ (within-subspecies level) or pure XI or GJ accessions (across-subspecies level) that were included in the composite training set. Validation using an independent population of 281 rice cultivars supported the predictive ability of the composite training set. Reliability, which reflects the robustness of a training set, was markedly higher for the composite training set than for the corresponding homogeneous training sets. A core training set formed from diverse accessions could accurately predict the genetic merit of rice germplasm.
format Journal Article
id CGSpace164042
institution CGIAR Consortium
language Inglés
publishDate 2022
publishDateRange 2022
publishDateSort 2022
publisher Elsevier
publisherStr Elsevier
record_format dspace
spelling CGSpace1640422024-12-19T14:13:31Z Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks He, Sang Liu, Hongyan Zhan, Junhui Meng, Yun Wang, Yamei Wang, Feng Ye, Guoyou plant science agronomy and crop science Germplasm conserved in gene banks is underutilized, owing mainly to the cost of characterization. Genomic prediction can be applied to predict the genetic merit of germplasm. Germplasm utilization could be greatly accelerated if prediction accuracy were sufficiently high with a training population of practical size. Large-scale resequencing projects in rice have generated high quality genome-wide variation information for many diverse accessions, making it possible to investigate the potential of genomic prediction in rice germplasm management and exploitation. We phenotyped six traits in nearly 2000 indica (XI) and japonica (GJ) accessions from the Rice 3K project and investigated different scenarios for forming training populations. A composite core training set was considered in two levels which targets used for prediction of subpopulations within subspecies or prediction across subspecies. Composite training sets incorporating 400 or 200 accessions from either subpopulation of XI or GJ showed satisfactory prediction accuracy. A composite training set of 600 XI and GJ accessions showed sufficiently high prediction accuracy for both XI and GJ subspecies. Comparable or even higher prediction accuracy was observed for the composite training set than for the corresponding homogeneous training sets comprising accessions only of specific subpopulations of XI or GJ (within-subspecies level) or pure XI or GJ accessions (across-subspecies level) that were included in the composite training set. Validation using an independent population of 281 rice cultivars supported the predictive ability of the composite training set. Reliability, which reflects the robustness of a training set, was markedly higher for the composite training set than for the corresponding homogeneous training sets. A core training set formed from diverse accessions could accurately predict the genetic merit of rice germplasm. 2022-08 2024-12-19T12:53:21Z 2024-12-19T12:53:21Z Journal Article https://hdl.handle.net/10568/164042 en Open Access Elsevier He, Sang; Liu, Hongyan; Zhan, Junhui; Meng, Yun; Wang, Yamei; Wang, Feng and Ye, Guoyou. 2022. Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks. The Crop Journal, Volume 10 no. 4 p. 1073-1082
spellingShingle plant science
agronomy and crop science
He, Sang
Liu, Hongyan
Zhan, Junhui
Meng, Yun
Wang, Yamei
Wang, Feng
Ye, Guoyou
Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
title Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
title_full Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
title_fullStr Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
title_full_unstemmed Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
title_short Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
title_sort genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks
topic plant science
agronomy and crop science
url https://hdl.handle.net/10568/164042
work_keys_str_mv AT hesang genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks
AT liuhongyan genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks
AT zhanjunhui genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks
AT mengyun genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks
AT wangyamei genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks
AT wangfeng genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks
AT yeguoyou genomicpredictionusingcompositetrainingsetsisaneffectivemethodforexploitinggermplasmconservedinricegenebanks