Selection of oligonucleotide probes for protein coding sequences
- PMID: 12724288
- DOI: 10.1093/bioinformatics/btg086
Abstract
Motivation: Large arrays of oligonucleotide probes have become popular tools for analyzing RNA expression. However, to date, most oligo collections contain poorly validated sequences or are biased toward untranslated regions (UTRs). Here we present a strategy for picking oligos for microarrays that focuses on a design universe consisting exclusively of protein coding regions. We describe the constraints on oligo design that this strategy imposes, as well as a software tool that allows the strategy to be applied broadly.
Results: In this work we sequentially apply a variety of simple filters to candidate sequences for oligo probes. The primary filter rejects any probe that shares a stretch of contiguous identity with another sequence in the design universe longer than a pre-established threshold. We find that rejecting oligos that contain 15 bases of perfect match with other sequences in the design universe is a feasible strategy for selecting oligos for probe arrays designed to interrogate mammalian RNA populations. Filters that remove sequences with low complexity or predicted poor probe accessibility narrow the candidate probe space only slightly. Rejection based on global sequence alignment is performed as a secondary, rather than primary, test, which makes the algorithm computationally efficient. Splice isoforms pose unique challenges, and we find that isoform prevalence will for the most part have to be determined by analyzing the hybridization patterns of partially redundant oligonucleotides.
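The sequential filtering described above can be illustrated with a minimal Python sketch. It is not the OligoPicker implementation: the function names, the k-mer index, the 70-base default probe length, and the single-base-fraction test used as a stand-in for the low-complexity filter are all assumptions made for illustration. Only the 15-base contiguous-match threshold comes from the abstract. The accessibility filter and the secondary global-alignment test are omitted.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every contiguous substring of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_kmer_index(universe, k):
    """Map each k-mer to the set of sequence IDs that contain it."""
    index = defaultdict(set)
    for seq_id, seq in universe.items():
        for kmer in kmers(seq.upper(), k):
            index[kmer].add(seq_id)
    return index


def is_low_complexity(probe, max_base_fraction=0.5):
    """Crude stand-in filter (assumption): one base dominates the probe."""
    top = max(probe.count(base) for base in "ACGT")
    return top / len(probe) > max_base_fraction


def pick_probes(universe, target_id, probe_len=70, k=15):
    """Return (start, probe) pairs from the target that pass both filters.

    probe_len=70 is an assumed default; k=15 is the contiguous-match
    threshold reported in the abstract.
    """
    index = build_kmer_index(universe, k)
    target = universe[target_id].upper()
    accepted = []
    for start in range(len(target) - probe_len + 1):
        probe = target[start:start + probe_len]
        # Primary filter: reject if any k-mer also occurs in another sequence.
        if any(index.get(km, set()) - {target_id} for km in kmers(probe, k)):
            continue
        # Later filter: reject low-complexity candidates.
        if is_low_complexity(probe):
            continue
        accepted.append((start, probe))
    return accepted
```

Because the contiguous-match test reduces to set lookups on a prebuilt index, it is cheap enough to run first on every candidate window, leaving any costlier alignment-based check for the survivors. A call such as `pick_probes(universe, "geneA")`, where `universe` is a dict mapping sequence IDs to coding sequences, returns the candidate windows of `geneA` that pass both filters.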
Availability: The oligo design program OligoPicker and its source code are freely available at our website.