Analysis of historical data for optimization of genomic selection pipeline in cassava

Breeding for high-yielding and broadly adapted varieties of cassava has been the primary target of the International Institute of Tropical Agriculture (IITA) cassava breeding program based in Nigeria. However, this target has been hindered due to the presence of genotype-by-environment interaction (...

Bibliographic Details
Main Author:	Bakare, M.A.
Format:	Thesis
Language:	English
Published:	Cornell University 2023
Subjects:	cassava breeding varieties yields
Online Access:	https://hdl.handle.net/10568/138339

_version_	1855527260456484864
author	Bakare, M.A.
author_browse	Bakare, M.A.
author_facet	Bakare, M.A.
author_sort	Bakare, M.A.
collection	Repository of Agricultural Research Outputs (CGSpace)
description	Breeding high-yielding and broadly adapted varieties of cassava has been the primary target of the International Institute of Tropical Agriculture (IITA) cassava breeding program based in Nigeria. However, progress toward this target has been hindered by genotype-by-environment interaction (GEI) and by reliance on phenotypic recurrent selection as the traditional breeding approach. This approach generates gains slowly because of the long breeding cycle, resulting in a low rate of realized genetic gain per unit time for complex traits such as fresh root yield. Taking advantage of recent advances in computational resources, this study explored historical data using advanced statistical techniques and stochastic simulation. The main objective was to identify a breeding scheme that optimizes the cost of field operations and the rate of genetic gain. First, I used a classical linear-bilinear model to dissect existing patterns of GEI among 36 elite cassava clones evaluated in 11 locations over 3 growing seasons. The aim was to identify the optimum number of environments from the target population of environments (TPE) for future testing of genetic lines for key traits such as fresh root yield, dry matter content, and top yield. Second, I exploited the complex pattern of GEI from 96 varieties assessed in 48 trials, fitting variance structure models to fresh root yield to identify an optimal model that captures GEI, to identify stable clones and mega-environments, and to identify the key environmental covariables driving GEI. Lastly, I used stochastic simulation to assess different breeding scenarios and identify an optimal breeding scheme that maximizes genetic gain for cassava in Nigeria, comparing investment of the breeding resources in one breeding program for broad adaptation with splitting the resources between two sets of testing locations for narrow adaptation.
Key lessons from these studies include: (1) Regardless of the number of environments sampled to represent the TPE, the prediction accuracy of fresh root yield is lower than that of dry matter content and top yield. (2) Testing locations within the same geographic region clustered together and were dissimilar from locations in other regions, indicating that some locations within each cluster may be dropped from field trials to make better use of the budget. (3) A factor analytic model with three factors was identified as the parsimonious model, with common latent factors capturing 79.0% of total genetic variability. (4) Maximizing the covariance between latent factor loadings and weather variables was an effective approach for identifying the weather conditions driving genotypic response to testing environments. (5) The rate of genetic gain per unit time from genomic-enabled breeding programs was consistently higher than that of the phenotypic-based conventional breeding program.
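The link the abstract draws between breeding cycle length and the rate of genetic gain per unit time can be illustrated with the standard breeder's equation, gain per year = i * r * sigma_A / L. The sketch below is not the thesis's stochastic simulation; the function name and every input value are hypothetical and chosen only to show how a shorter cycle can outweigh lower selection accuracy.

```python
def gain_per_year(intensity, accuracy, sigma_a, cycle_years):
    """Expected genetic gain per year from the breeder's equation.

    intensity   : standardized selection intensity (i)
    accuracy    : correlation between selection criterion and breeding value (r)
    sigma_a     : additive genetic standard deviation (sigma_A)
    cycle_years : length of one breeding cycle in years (L)
    """
    if cycle_years <= 0:
        raise ValueError("cycle_years must be positive")
    return intensity * accuracy * sigma_a / cycle_years


# Hypothetical inputs, NOT values from the thesis.
# i = 1.755 corresponds to selecting the top 10% of candidates.
phenotypic = gain_per_year(intensity=1.755, accuracy=0.6, sigma_a=1.0, cycle_years=5)
genomic = gain_per_year(intensity=1.755, accuracy=0.4, sigma_a=1.0, cycle_years=2)

print(f"phenotypic scheme: {phenotypic:.4f} sigma_A units per year")
print(f"genomic scheme:    {genomic:.4f} sigma_A units per year")
```

With these assumed inputs the genomic scheme comes out ahead despite its lower accuracy, because the gain is divided by a much shorter cycle. Real comparisons depend on the actual accuracies, intensities, and cycle lengths of each scheme.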
format	Thesis
id	CGSpace138339
institution	CGIAR Consortium
language	English
publishDate	2023
publishDateRange	2023
publishDateSort	2023
publisher	Cornell University
publisherStr	Cornell University
record_format	dspace
spelling	CGSpace138339 2024-01-24T00:28:47Z Analysis of historical data for optimization of genomic selection pipeline in cassava Bakare, M.A. cassava breeding varieties yields 2023-08 2024-01-23T11:37:38Z 2024-01-23T11:37:38Z Thesis https://hdl.handle.net/10568/138339 en Limited Access Cornell University Bakare, M.A. (2023). Analysis of historical data for optimization of genomic selection pipeline in cassava. Ithaca, United States: Cornell University. (196 p.).
spellingShingle	cassava breeding varieties yields Bakare, M.A. Analysis of historical data for optimization of genomic selection pipeline in cassava
title	Analysis of historical data for optimization of genomic selection pipeline in cassava
title_full	Analysis of historical data for optimization of genomic selection pipeline in cassava
title_fullStr	Analysis of historical data for optimization of genomic selection pipeline in cassava
title_full_unstemmed	Analysis of historical data for optimization of genomic selection pipeline in cassava
title_short	Analysis of historical data for optimization of genomic selection pipeline in cassava
title_sort	analysis of historical data for optimization of genomic selection pipeline in cassava
topic	cassava breeding varieties yields
url	https://hdl.handle.net/10568/138339
work_keys_str_mv	AT bakarema analysisofhistoricaldataforoptimizationofgenomicselectionpipelineincassava

Similar Items