Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations

Several studies have evaluated computational methods that infer the haplotypes from population genotype data in European cattle populations. However, little is known about how well they perform in African indigenous and crossbred populations. This study investigates: (1) global and local ancestry in...

Descripción completa

Detalles Bibliográficos
Autores principales: Zergaw, N., Aliloo, H., Strucken, E.M., Marshall, Karen, Al Kalaldeh, M., Missohou, A., Gibson, J.P.
Formato: Journal Article
Lenguaje:Inglés
Publicado: Frontiers Media 2021
Materias:
Acceso en línea:https://hdl.handle.net/10568/113359
_version_ 1855519635785383936
author Zergaw, N.
Aliloo, H.
Strucken, E.M.
Marshall, Karen
Al Kalaldeh, M.
Missohou, A.
Gibson, J.P.
author_browse Al Kalaldeh, M.
Aliloo, H.
Gibson, J.P.
Marshall, Karen
Missohou, A.
Strucken, E.M.
Zergaw, N.
author_facet Zergaw, N.
Aliloo, H.
Strucken, E.M.
Marshall, Karen
Al Kalaldeh, M.
Missohou, A.
Gibson, J.P.
author_sort Zergaw, N.
collection Repository of Agricultural Research Outputs (CGSpace)
description Several studies have evaluated computational methods that infer the haplotypes from population genotype data in European cattle populations. However, little is known about how well they perform in African indigenous and crossbred populations. This study investigates: (1) global and local ancestry inference; (2) heterozygosity proportion estimation; and (3) genotype imputation in West African indigenous and crossbred cattle populations. Principal component analysis (PCA), ADMIXTURE, and LAMP-LD were used to analyse a medium-density single nucleotide polymorphism (SNP) dataset from Senegalese crossbred cattle. Reference SNP data of East and West African indigenous and crossbred cattle populations were used to investigate the accuracy of imputation from low to medium-density and from medium to high-density SNP datasets using Minimac v3. The first two principal components differentiatedBos indicusfrom EuropeanBos taurusand AfricanBos taurusfrom other breeds. Irrespective of assuming two or three ancestral breeds for the Senegalese crossbreds, breed proportion estimates from ADMIXTURE and LAMP-LD showed a high correlation (r≥ 0.981). The observed ancestral origin heterozygosity proportion in putative F1 crosses was close to the expected value of 1.0, and clearly differentiated F1 from all other crosses. The imputation accuracies (estimated as correlation) between imputed and the real data in crossbred animals ranged from 0.142 to 0.717 when imputing from low to medium-density, and from 0.478 to 0.899 for imputation from medium to high-density. The imputation accuracy was generally higher when the reference data came from the same geographical region as the target population, and when crossbred reference data was used to impute crossbred genotypes. The lowest imputation accuracies were observed for indigenous breed genotypes. This study shows that ancestral origin heterozygosity can be estimated with high accuracy and will be far superior to the use of observed individual heterozygosity for estimating heterosis in African crossbred populations. It was not possible to achieve high imputation accuracy in West African crossbred or indigenous populations based on reference data sets from East Africa, and population-specific genotyping with high-density SNP assays is required to improve imputation.
format Journal Article
id CGSpace113359
institution CGIAR Consortium
language Inglés
publishDate 2021
publishDateRange 2021
publishDateSort 2021
publisher Frontiers Media
publisherStr Frontiers Media
record_format dspace
spelling CGSpace1133592024-05-01T08:16:04Z Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations Zergaw, N. Aliloo, H. Strucken, E.M. Marshall, Karen Al Kalaldeh, M. Missohou, A. Gibson, J.P. cattle livestock animal breeding genetics Several studies have evaluated computational methods that infer the haplotypes from population genotype data in European cattle populations. However, little is known about how well they perform in African indigenous and crossbred populations. This study investigates: (1) global and local ancestry inference; (2) heterozygosity proportion estimation; and (3) genotype imputation in West African indigenous and crossbred cattle populations. Principal component analysis (PCA), ADMIXTURE, and LAMP-LD were used to analyse a medium-density single nucleotide polymorphism (SNP) dataset from Senegalese crossbred cattle. Reference SNP data of East and West African indigenous and crossbred cattle populations were used to investigate the accuracy of imputation from low to medium-density and from medium to high-density SNP datasets using Minimac v3. The first two principal components differentiatedBos indicusfrom EuropeanBos taurusand AfricanBos taurusfrom other breeds. Irrespective of assuming two or three ancestral breeds for the Senegalese crossbreds, breed proportion estimates from ADMIXTURE and LAMP-LD showed a high correlation (r≥ 0.981). The observed ancestral origin heterozygosity proportion in putative F1 crosses was close to the expected value of 1.0, and clearly differentiated F1 from all other crosses. The imputation accuracies (estimated as correlation) between imputed and the real data in crossbred animals ranged from 0.142 to 0.717 when imputing from low to medium-density, and from 0.478 to 0.899 for imputation from medium to high-density. The imputation accuracy was generally higher when the reference data came from the same geographical region as the target population, and when crossbred reference data was used to impute crossbred genotypes. The lowest imputation accuracies were observed for indigenous breed genotypes. This study shows that ancestral origin heterozygosity can be estimated with high accuracy and will be far superior to the use of observed individual heterozygosity for estimating heterosis in African crossbred populations. It was not possible to achieve high imputation accuracy in West African crossbred or indigenous populations based on reference data sets from East Africa, and population-specific genotyping with high-density SNP assays is required to improve imputation. 2021-03-23 2021-04-15T06:44:21Z 2021-04-15T06:44:21Z Journal Article https://hdl.handle.net/10568/113359 en Open Access Frontiers Media Zergaw, N., Aliloo, H., Strucken, E.M., Marshall, K., Al Kalaldeh, M., Missohou, A. and Gibson, J.P. 2021. Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations. Frontiers in Genetics 12:584355.
spellingShingle cattle
livestock
animal breeding
genetics
Zergaw, N.
Aliloo, H.
Strucken, E.M.
Marshall, Karen
Al Kalaldeh, M.
Missohou, A.
Gibson, J.P.
Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations
title Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations
title_full Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations
title_fullStr Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations
title_full_unstemmed Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations
title_short Inference of ancestries and heterozygosity proportion and genotype imputation in West African cattle populations
title_sort inference of ancestries and heterozygosity proportion and genotype imputation in west african cattle populations
topic cattle
livestock
animal breeding
genetics
url https://hdl.handle.net/10568/113359
work_keys_str_mv AT zergawn inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations
AT alilooh inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations
AT struckenem inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations
AT marshallkaren inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations
AT alkalaldehm inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations
AT missohoua inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations
AT gibsonjp inferenceofancestriesandheterozygosityproportionandgenotypeimputationinwestafricancattlepopulations