Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding

Genomic selection (GS) is a predictive methodology that trains statistical machine-learning models with a reference population that is used to perform genome-enabled predictions of new lines. In plant breeding, it has the potential to increase the speed and reduce the cost of selection. However, to...

Full description

Bibliographic Details
Main Authors: Montesinos López, Osval A., Montesinos López, Abelardo, Acosta, Ricardo, Varshney, Rajeev K., Bentley, Alison R., Crossa, José
Format: Journal Article
Language:Inglés
Published: Wiley 2022
Subjects:
Online Access:https://hdl.handle.net/10568/126502
_version_ 1855540160963280896
author Montesinos López, Osval A.
Montesinos López, Abelardo
Acosta, Ricardo
Varshney, Rajeev K.
Bentley, Alison R.
Crossa, José
author_browse Acosta, Ricardo
Bentley, Alison R.
Crossa, José
Montesinos López, Abelardo
Montesinos López, Osval A.
Varshney, Rajeev K.
author_facet Montesinos López, Osval A.
Montesinos López, Abelardo
Acosta, Ricardo
Varshney, Rajeev K.
Bentley, Alison R.
Crossa, José
author_sort Montesinos López, Osval A.
collection Repository of Agricultural Research Outputs (CGSpace)
description Genomic selection (GS) is a predictive methodology that trains statistical machine-learning models with a reference population that is used to perform genome-enabled predictions of new lines. In plant breeding, it has the potential to increase the speed and reduce the cost of selection. However, to optimize resources, sparse testing methods have been proposed. A common approach is to guarantee a proportion of nonoverlapping and overlapping lines allocated randomly in locations, that is, lines appearing in some locations but not in all. In this study we propose using incomplete block designs (IBD), principally, for the allocation of lines to locations in such a way that not all lines are observed in all locations. We compare this allocation with a random allocation of lines to locations guaranteeing that the lines are allocated to the same number of locations as under the IBD design. We implemented this benchmarking on several crop data sets under the Bayesian genomic best linear unbiased predictor (GBLUP) model, finding that allocation under the principle of IBD outperformed random allocation by between 1.4% and 26.5% across locations, traits, and data sets in terms of mean square error. Although a wide range of performance improvements were observed, our results provide evidence that using IBD for the allocation of lines to locations can help improve predictive performance compared with random allocation. This has the potential to be applied to large-scale plant breeding programs.
format Journal Article
id CGSpace126502
institution CGIAR Consortium
language Inglés
publishDate 2022
publishDateRange 2022
publishDateSort 2022
publisher Wiley
publisherStr Wiley
record_format dspace
spelling CGSpace1265022025-11-12T05:35:08Z Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding Montesinos López, Osval A. Montesinos López, Abelardo Acosta, Ricardo Varshney, Rajeev K. Bentley, Alison R. Crossa, José marker-assisted selection plant breeding genetic improvement programmes Genomic selection (GS) is a predictive methodology that trains statistical machine-learning models with a reference population that is used to perform genome-enabled predictions of new lines. In plant breeding, it has the potential to increase the speed and reduce the cost of selection. However, to optimize resources, sparse testing methods have been proposed. A common approach is to guarantee a proportion of nonoverlapping and overlapping lines allocated randomly in locations, that is, lines appearing in some locations but not in all. In this study we propose using incomplete block designs (IBD), principally, for the allocation of lines to locations in such a way that not all lines are observed in all locations. We compare this allocation with a random allocation of lines to locations guaranteeing that the lines are allocated to the same number of locations as under the IBD design. We implemented this benchmarking on several crop data sets under the Bayesian genomic best linear unbiased predictor (GBLUP) model, finding that allocation under the principle of IBD outperformed random allocation by between 1.4% and 26.5% across locations, traits, and data sets in terms of mean square error. Although a wide range of performance improvements were observed, our results provide evidence that using IBD for the allocation of lines to locations can help improve predictive performance compared with random allocation. This has the potential to be applied to large-scale plant breeding programs. 2022-03 2023-01-03T14:13:05Z 2023-01-03T14:13:05Z Journal Article https://hdl.handle.net/10568/126502 en Open Access application/pdf Wiley Montesinos‐Lopez, O. A., Montesinos‐Lopez, A., Acosta, R., Varshney, R. K., Bentley, A., & Crossa, J. (2022). Using an incomplete block design to allocate lines to environments improves sparse genome‐based prediction in plant breeding. The Plant Genome, 15(1). Portico. https://doi.org/10.1002/tpg2.20194
spellingShingle marker-assisted selection
plant breeding
genetic improvement
programmes
Montesinos López, Osval A.
Montesinos López, Abelardo
Acosta, Ricardo
Varshney, Rajeev K.
Bentley, Alison R.
Crossa, José
Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding
title Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding
title_full Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding
title_fullStr Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding
title_full_unstemmed Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding
title_short Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding
title_sort using an incomplete block design to allocate lines to environments improves sparse genome based prediction in plant breeding
topic marker-assisted selection
plant breeding
genetic improvement
programmes
url https://hdl.handle.net/10568/126502
work_keys_str_mv AT montesinoslopezosvala usinganincompleteblockdesigntoallocatelinestoenvironmentsimprovessparsegenomebasedpredictioninplantbreeding
AT montesinoslopezabelardo usinganincompleteblockdesigntoallocatelinestoenvironmentsimprovessparsegenomebasedpredictioninplantbreeding
AT acostaricardo usinganincompleteblockdesigntoallocatelinestoenvironmentsimprovessparsegenomebasedpredictioninplantbreeding
AT varshneyrajeevk usinganincompleteblockdesigntoallocatelinestoenvironmentsimprovessparsegenomebasedpredictioninplantbreeding
AT bentleyalisonr usinganincompleteblockdesigntoallocatelinestoenvironmentsimprovessparsegenomebasedpredictioninplantbreeding
AT crossajose usinganincompleteblockdesigntoallocatelinestoenvironmentsimprovessparsegenomebasedpredictioninplantbreeding