Discovering ancestors and connecting relatives in large genomic databases

Genomic evaluation has improved both plant and animal breeding by allowing more accurate estimation of an individual's genetic potential. Because often only a small proportion of the population to be evaluated has been genotyped, genomic estimations rely heavily on complete pedigree information. Con...

Descripción completa

Detalles Bibliográficos
Autores principales: Nani, Juan Pablo, Bacheller, Lillian R., Cole, John B., VanRaden, Paul M.
Formato: Artículo
Lenguaje:Inglés
Publicado: Elsevier; American Dairy Science Association 2020
Materias:
Acceso en línea:http://hdl.handle.net/20.500.12123/7030
https://www.journalofdairyscience.org/article/S0022-0302(19)31091-4/fulltext
https://doi.org/10.3168/jds.2019-17580
_version_ 1855483928745345024
author Nani, Juan Pablo
Bacheller, Lillian R.
Cole, John B.
VanRaden, Paul M.
author_browse Bacheller, Lillian R.
Cole, John B.
Nani, Juan Pablo
VanRaden, Paul M.
author_facet Nani, Juan Pablo
Bacheller, Lillian R.
Cole, John B.
VanRaden, Paul M.
author_sort Nani, Juan Pablo
collection INTA Digital
description Genomic evaluation has improved both plant and animal breeding by allowing more accurate estimation of an individual's genetic potential. Because often only a small proportion of the population to be evaluated has been genotyped, genomic estimations rely heavily on complete pedigree information. Confirmation, discovery, and correction of parentage and connected relatives allow the creation of more complete pedigrees, which in turn increase the number of usable phenotypic records and prediction accuracy. Previous methods accounted for parent-progeny conflicts using SNP. More recently haplotype methods allowed discovery of distant relationships such as maternal grandsire (MGS) and maternal great-grandsire (MGGS) with improved accuracy. However, discovered MGS and MGGS often were not used, because no dam information was available to link them to the calf. An automated procedure to discover and fill missing maternal identification information was developed, allowing discovered MGS and MGGS to be used in imputation as well as in calculating breeding values for animals in the US dairy cattle database. An MGS was discovered for 295,136 animals with unknown dam, and the MGGS was discovered for 153,909 of these animals. A virtual maternal identification was added for animals with missing information. The effect of pedigree completion on progeny inbreeding, breeding values, and reliabilities was examined. Mean inbreeding of animals with missing maternal pedigree information was 6.69% before and 6.87% after pedigree assignment; expected future inbreeding was 7.24% before and 7.20% after assignment. Reliabilities for traditional breeding values increased from 26.6 to 32.6% for milk yield, 25.9 to 32.0% for fat yield, and 26.9 to 32.9% for protein yield; genomic reliabilities also increased slightly from 76.2 to 77.1% for milk, 76.0 to 76.9% for fat, and 76.3 to 77.3% for protein. The procedure developed for pedigree completion is a useful tool for improving accuracy of national and international evaluations and aiding producers in making better mating decisions.
format Artículo
id INTA7030
institution Instituto Nacional de Tecnología Agropecuaria (INTA -Argentina)
language Inglés
publishDate 2020
publishDateRange 2020
publishDateSort 2020
publisher Elsevier; American Dairy Science Association
publisherStr Elsevier; American Dairy Science Association
record_format dspace
spelling INTA70302020-04-02T19:20:14Z Discovering ancestors and connecting relatives in large genomic databases Nani, Juan Pablo Bacheller, Lillian R. Cole, John B. VanRaden, Paul M. Ancestry Genetic Inheritance Pedigree Livestock Genomics Genotypes Ascendencia Herencia Genética Pedigrí Genómica Genotipos Genomic evaluation has improved both plant and animal breeding by allowing more accurate estimation of an individual's genetic potential. Because often only a small proportion of the population to be evaluated has been genotyped, genomic estimations rely heavily on complete pedigree information. Confirmation, discovery, and correction of parentage and connected relatives allow the creation of more complete pedigrees, which in turn increase the number of usable phenotypic records and prediction accuracy. Previous methods accounted for parent-progeny conflicts using SNP. More recently haplotype methods allowed discovery of distant relationships such as maternal grandsire (MGS) and maternal great-grandsire (MGGS) with improved accuracy. However, discovered MGS and MGGS often were not used, because no dam information was available to link them to the calf. An automated procedure to discover and fill missing maternal identification information was developed, allowing discovered MGS and MGGS to be used in imputation as well as in calculating breeding values for animals in the US dairy cattle database. An MGS was discovered for 295,136 animals with unknown dam, and the MGGS was discovered for 153,909 of these animals. A virtual maternal identification was added for animals with missing information. The effect of pedigree completion on progeny inbreeding, breeding values, and reliabilities was examined. Mean inbreeding of animals with missing maternal pedigree information was 6.69% before and 6.87% after pedigree assignment; expected future inbreeding was 7.24% before and 7.20% after assignment. Reliabilities for traditional breeding values increased from 26.6 to 32.6% for milk yield, 25.9 to 32.0% for fat yield, and 26.9 to 32.9% for protein yield; genomic reliabilities also increased slightly from 76.2 to 77.1% for milk, 76.0 to 76.9% for fat, and 76.3 to 77.3% for protein. The procedure developed for pedigree completion is a useful tool for improving accuracy of national and international evaluations and aiding producers in making better mating decisions. EEA Rafaela Fil: Nani, Juan Pablo. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Rafaela; Argentina. Agricultural Research Service. Animal Genomics and Improvement Laboratory; Estados Unidos Fil: Bacheller, Lillian R. Council on Dairy Cattle Breeding; Estados Unidos Fil: Cole, John B. Agricultural Research Service. Animal Genomics and Improvement Laboratory; Estados Unidos Fil: VanRaden, Paul M. Agricultural Research Service. Animal Genomics and Improvement Laboratory; Estados Unidos 2020-04-02T16:03:20Z 2020-04-02T16:03:20Z 2020-02 info:ar-repo/semantics/artículo info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://hdl.handle.net/20.500.12123/7030 https://www.journalofdairyscience.org/article/S0022-0302(19)31091-4/fulltext 0022-0302 https://doi.org/10.3168/jds.2019-17580 eng info:eu-repo/semantics/restrictedAccess application/pdf Elsevier; American Dairy Science Association Journal of dairy science 103 (2) : 1729 - 1734. (February 2020)
spellingShingle Ancestry
Genetic Inheritance
Pedigree Livestock
Genomics
Genotypes
Ascendencia
Herencia Genética
Pedigrí
Genómica
Genotipos
Nani, Juan Pablo
Bacheller, Lillian R.
Cole, John B.
VanRaden, Paul M.
Discovering ancestors and connecting relatives in large genomic databases
title Discovering ancestors and connecting relatives in large genomic databases
title_full Discovering ancestors and connecting relatives in large genomic databases
title_fullStr Discovering ancestors and connecting relatives in large genomic databases
title_full_unstemmed Discovering ancestors and connecting relatives in large genomic databases
title_short Discovering ancestors and connecting relatives in large genomic databases
title_sort discovering ancestors and connecting relatives in large genomic databases
topic Ancestry
Genetic Inheritance
Pedigree Livestock
Genomics
Genotypes
Ascendencia
Herencia Genética
Pedigrí
Genómica
Genotipos
url http://hdl.handle.net/20.500.12123/7030
https://www.journalofdairyscience.org/article/S0022-0302(19)31091-4/fulltext
https://doi.org/10.3168/jds.2019-17580
work_keys_str_mv AT nanijuanpablo discoveringancestorsandconnectingrelativesinlargegenomicdatabases
AT bachellerlillianr discoveringancestorsandconnectingrelativesinlargegenomicdatabases
AT colejohnb discoveringancestorsandconnectingrelativesinlargegenomicdatabases
AT vanradenpaulm discoveringancestorsandconnectingrelativesinlargegenomicdatabases