Optimal sampling strategy and core collection size of Andean tetraploid potato based on isozyme data: a simulation study

Selection of an appropriate sampling strategy is an important prerequisite to establish core collections of appropriate size in order to adequately represent the genetic spectrum and maximally capture the genetic diversity in available crop collections. We developed a simulation approach to identify...

Bibliographic Details
Main authors: Chandra, S., Huaman, Z., Hari Krishna, S., Ortíz, R.
Format: Journal Article
Language: English
Published: Springer 2002
Subjects: potatoes, germplasm, genetic markers, alleles, genomes, genetics, biotechnology
Online access: https://hdl.handle.net/10568/99977
author Chandra, S.
Huaman, Z.
Hari Krishna, S.
Ortíz, R.
author_sort Chandra, S.
collection Repository of Agricultural Research Outputs (CGSpace)
description Selection of an appropriate sampling strategy is an important prerequisite to establish core collections of appropriate size in order to adequately represent the genetic spectrum and maximally capture the genetic diversity in available crop collections. We developed a simulation approach to identify an optimal sampling strategy and core-collection size, using isozyme data from a CIP germplasm collection of Andean tetraploid potato. Five sampling strategies, constant (C), proportional (P), logarithmic (L), square-root (S) and random (R), were tested on isozyme data from 9,396 Andean tetraploid potato accessions characterized for nine isozyme loci having a total of 38 alleles. The 9,396 accessions, though comprising 2,379 morphologically distinct accessions, were found to represent 1,910 genetically distinct groups of accessions for the nine isozyme loci using a sort-and-duplicate-search algorithm. From each group, one accession was randomly selected to form a genetically refined entire collection (GREC) of size 1,910. The GREC was used to test the five sampling strategies. To assess the behavior of the results in repeated sampling, k = 1,500 and 5,000 independent random samples (without replacement) of admissible sizes n = 50(50)1,000 (that is, 50 to 1,000 in steps of 50) for each strategy were drawn from GREC. Allele frequencies (AF) for the 38 alleles and locus heterozygosity (LH) for the nine loci were estimated for each sample. The goodness of fit of sample AF and LH with those from GREC was tested using the χ² test. A core collection of size n = 600, selected using either the P or the R sampling strategy, was found to represent the GREC adequately for both AF and LH. As similar results were obtained at k = 1,500 and 5,000, it seems adequate to draw 1,500 independent random samples of different sizes to test the behavior of different sampling strategies in order to identify an appropriate sampling approach, as well as to determine an optimal core collection size.
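The abstract names the C, P, L and S strategies without giving their formulas or saying which grouping of accessions they allocate over. As a rough illustration only, the sketch below assumes the conventional stratified definitions (equal share per group, or share proportional to the group size, its logarithm, or its square root) and a hypothetical dictionary of group sizes. The function name `allocate`, the largest-remainder rounding, and the example numbers are assumptions, not taken from the paper. The R strategy needs no allocation, since it is a simple random sample from the whole collection.

```python
import math

# Assumed weighting rules for the four stratified strategies (not from the paper).
WEIGHTS = {
    "C": lambda size: 1.0,              # constant: same share for every group
    "P": lambda size: float(size),      # proportional to group size
    "L": lambda size: math.log(size),   # proportional to log of group size
    "S": lambda size: math.sqrt(size),  # proportional to square root of group size
}

def allocate(group_sizes, n, strategy):
    """Split a core collection of size n across groups.

    group_sizes maps a (hypothetical) group label to its number of accessions.
    Quotas are rounded with the largest-remainder method so they sum to n.
    Quotas are not capped at the group size; a real implementation would
    need to redistribute any excess.
    """
    weights = {g: WEIGHTS[strategy](s) for g, s in group_sizes.items()}
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("all group weights are zero")
    raw = {g: n * w / total for g, w in weights.items()}
    quota = {g: math.floor(r) for g, r in raw.items()}
    leftover = n - sum(quota.values())
    # Give the remaining slots to the groups with the largest fractional parts.
    by_fraction = sorted(raw, key=lambda g: raw[g] - quota[g], reverse=True)
    for g in by_fraction[:leftover]:
        quota[g] += 1
    return quota

if __name__ == "__main__":
    sizes = {"A": 100, "B": 400, "C": 900}  # made-up group sizes
    for strategy in "CPLS":
        print(strategy, allocate(sizes, 140, strategy))
```

Under these assumptions the P strategy mirrors the group sizes in the collection, while C, L and S progressively favor the smaller groups. Once quotas are fixed, the accessions within each group would be drawn at random without replacement.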
format Journal Article
id CGSpace99977
institution CGIAR Consortium
language English
publishDate 2002
publisher Springer
record_format dspace
spelling CGSpace99977 2024-08-27T10:35:00Z 2002-06 2019-03-03T05:54:26Z 2019-03-03T05:54:26Z Journal Article https://hdl.handle.net/10568/99977 en Limited Access Springer Chandra, S., Huaman, Z., Hari Krishna, S. & Ortiz, R. (2002) Optimal sampling strategy and core collection size of Andean tetraploid potato based on isozyme data – a simulation study. Theoretical and Applied Genetics. 104, 1325–1334.
title Optimal sampling strategy and core collection size of Andean tetraploid potato based on isozyme data: a simulation study
topic potatoes
germplasm
genetic markers
alleles
genomes
genetics
biotechnology
url https://hdl.handle.net/10568/99977