Identifying and reducing AFLP genotyping error: an example of tradeoffs when comparing population structure in broadcast spawning versus brooding oysters
- 25 January 2012
- journal article
- research article
- Published by Springer Science and Business Media LLC in Heredity
- Vol. 108 (6), 616-625
- https://doi.org/10.1038/hdy.2011.132
Abstract
Phylogeographic inferences about gene flow are strengthened through comparison of co-distributed taxa, but also depend on adequate genomic sampling. Amplified fragment length polymorphisms (AFLPs) provide a rapid and inexpensive source of multilocus allele frequency data for making genomically robust inferences. Every AFLP study initially generates markers with a range of locus-specific genotyping error rates and applies criteria to select a subset for analysis. However, there has been very little empirical evaluation of the best tradeoff between culling all but the lowest-error loci to minimize overall genotyping error versus the potential for increasing population genetic signal by retaining more loci. Here, we used AFLPs to compare population structure in co-distributed broadcast spawning (Crassostrea virginica) and brooding (Ostrea equestris) oyster species. Using existing methods for almost entirely automated marker selection and scoring, genotyping error tradeoffs were evaluated by comparing results across a nested series of data sets with mean mismatch errors of 0, 1, 2, 3, 4 and >4%. Artifactual population structure was diagnosed in high-error data sets and we assessed the low-error point at which expected population substructure signal was lost. In both species, we identified substructure patterns deemed to be inaccurate at average mismatch error rates <2 and >4%. In the species comparison, the optimum data sets showed higher gene flow for the brooding oyster with more oceanic salinity tolerances. AFLP tradeoffs may differ among studies, but our results suggest that important signal may be lost in the pursuit of ‘acceptable’ error levels and our procedures provide a general method for empirically exploring these tradeoffs.
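The idea of a nested series of data sets with increasing mean mismatch error can be illustrated with a minimal Python sketch. The abstract does not say how the authors computed mismatch error or assembled the nested sets, so everything here is an assumption made for illustration: mismatch error is taken as the fraction of individuals whose presence/absence call differs between an original and a replicate run, loci are ranked from lowest to highest error, and each data set keeps the longest low-error prefix whose mean error stays at or below a threshold. The function names `locus_mismatch_rates` and `nested_locus_sets` are hypothetical.

```python
from typing import Dict, List, Sequence


def locus_mismatch_rates(
    original: Sequence[Sequence[int]],
    replicate: Sequence[Sequence[int]],
) -> List[float]:
    """Per-locus mismatch rate between two 0/1 genotype matrices.

    Rows are individuals and columns are loci. This definition of
    mismatch error is an assumption, not taken from the paper.
    """
    n_individuals = len(original)
    n_loci = len(original[0])
    rates = []
    for j in range(n_loci):
        mismatches = sum(
            1 for i in range(n_individuals) if original[i][j] != replicate[i][j]
        )
        rates.append(mismatches / n_individuals)
    return rates


def nested_locus_sets(
    rates: Sequence[float],
    thresholds: Sequence[float],
) -> Dict[float, List[int]]:
    """For each threshold, keep the lowest-error loci whose mean rate <= threshold.

    Loci are added in ascending order of error, so the running mean never
    decreases and every set is a prefix of the same ranking. A set built
    with a larger threshold therefore contains every set built with a
    smaller one (the sets are nested).
    """
    order = sorted(range(len(rates)), key=lambda j: rates[j])
    result: Dict[float, List[int]] = {}
    for threshold in thresholds:
        kept: List[int] = []
        total = 0.0
        for j in order:
            # Small tolerance guards against floating-point rounding.
            if (total + rates[j]) / (len(kept) + 1) <= threshold + 1e-12:
                kept.append(j)
                total += rates[j]
            else:
                break
        result[threshold] = kept
    return result


if __name__ == "__main__":
    # Toy data: 4 individuals scored at 3 loci in two independent runs.
    original = [[1, 0, 1], [0, 0, 1], [1, 1, 0], [1, 0, 0]]
    replicate = [[1, 0, 0], [0, 0, 1], [1, 1, 0], [1, 1, 0]]
    rates = locus_mismatch_rates(original, replicate)
    print(nested_locus_sets(rates, [0.0, 0.10, 0.20]))
```

Raising the threshold only ever adds loci, so downstream analyses of population structure can be rerun on each set to see where signal appears or where artifacts begin.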