Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 9;5(6):e11034.
doi: 10.1371/journal.pone.0011034.

In vitro vs in silico detected SNPs for the development of a genotyping array: what can we learn from a non-model species?

Affiliations

In vitro vs in silico detected SNPs for the development of a genotyping array: what can we learn from a non-model species?

Camille Lepoittevin et al. PLoS One. .

Abstract

Background: There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size ( approximately 23.8 Gb/C).

Methodology/principal findings: A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates).

Conclusions/significance: This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Examples of clustering observed for the P. pinaster SNP array.
Each dot represents the mean intensity derived from a population of beads for a single sample. The normalized R (y axis) is the normalized sum of intensities of the two dyes (Cy3 and Cy5), and the normalized Theta (x axis) is ((2/Л)Tan−1 (Cy5/Cy3)), where a normalized Theta value nearest 0 is a homozygous for allele A and a Theta value nearest 1 is homozygous for allele B. A/ classical pattern with three clusters for a SNP considered as successful and polymorphic. B and C/ “cluster compression” when both homozygous clusters are closer to each other than expected. In panel B, the clustering algorithm is able to distinguish the three clusters and gives a GenTrain score of 0.58, however this kind of pattern was considered as a genotyping failure in our analysis because one of the homozygous cluster normalized Theta value does not fall in the [0, 0.1] or [0.9, 1] ranges. In panel C the clustering algorithm was not able to distinguish the three clusters because of low separation scores, and the SNP was automatically considered as a genotyping failure because of its low GenTrain score. D and E/ SNPs interpreted as genotyping failures either because of abnormal Theta values (D) or because of the presence of subgroups in a cluster (E).
Figure 2
Figure 2. Distribution of the 200 in silico SNPs according to the number of ESTs considered for the detection.
Figure 3
Figure 3. Allele frequency spectrum for 257 successfully genotyped in vitro and in silico SNPs.
Figure 4
Figure 4. Genotyping success rate according to functionality score for the 384 SNPs of the assay.
The number of SNPs in each functionality score class is indicated above each bar.

Similar articles

Cited by

References

    1. Henry RJ. Plant Genotyping II: SNP Technology. New York: Oxford University Press; 2008. 285
    1. Rafalski A. Applications of single nucleotide polymorphisms in crop genetics. Current Opinion in Plant Biology. 2002;5:94–100. - PubMed
    1. Rafalski JA. Novel genetic mapping tools in plants: SNPs and LD-based approaches. Plant Science. 2002;162:329–333.
    1. Gupta PK, Roy JK, Prasad M. Single nucleotide polymorphisms: a new paradigm for molecular marker technology and DNA polymorphism detection with emphasis on their use in plants. Current Science. 2001;80:524–535.
    1. Gibbs JR, Singleton A. Application of genome-wide single nucleotide polymorphism typing: simple association and beyond. PLoS Genetics. 2006;2(10):e150. - PMC - PubMed

Publication types