Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Nov 1;29(21):E102-2.
doi: 10.1093/nar/29.21.e102.

TissueInfo: high-throughput identification of tissue expression profiles and specificity

Affiliations

TissueInfo: high-throughput identification of tissue expression profiles and specificity

L Skrabanek et al. Nucleic Acids Res. .

Abstract

We describe TissueInfo, a knowledge-based method for the high-throughput identification of tissue expression profiles and tissue specificity. TissueInfo defines a set of tissue information calculations that can be computed for large numbers of genes, expressed sequence tags (ESTs) or proteins. Tissue information records that result from the TissueInfo calculations are used to generate tables suitable for data mining and for the selection of genes according to a given expression profile or specificity. When benchmarked against a test set of 116 proteins and literature information, TissueInfo was found to be accurate for 69% of identified tissue specificities and for 80% of expression profiles. The accuracy of the identifications can be increased if query sequences for which little information is available from dbEST are ignored. Thus, with 80% coverage, TissueInfo achieves an accuracy of 76% for specificity and 89% for expression. For the same set of proteins, the curated tissue specificity offered in SWISS-PROT was accurate in 78% of cases. TissueInfo can be useful for the selection of clones for custom microarrays, selection of training sets for ab initio identification of tissue information, gene discovery and genome-wide predictions. Further information about the program can be found at http://icb.mssm.edu/tissueinfo.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The components of TissueInfo are illustrated and their relation to a local infrastructure. Arrows represent either a flow of data or relationships among components that provide or use services. Information Management imports raw data from dbEST, stores it in a relational database and supports the curation of such data. Curated information is made available as objects empowered with methods necessary to implement the algorithms described in Table 1. See the text for a description of the other components. TissueInfo requires access to dbEST, MegaBLAST and BLAST, which must be provided by the local bioinformatics infrastructure. TissueInfo records are generated for each query sequence going through the pipeline. A suitable data mining environment helps navigate data sets produced when large numbers of query sequences are annotated.
Figure 2
Figure 2
Reliability of predictions. A summary of the number of times the ‘correct’ or ‘agree’ tissue specificity or tissue expression is found when predictions are compared to SWISS-PROT and the literature. First line, accuracy of specificity prediction. Second line, accuracy of expression prediction. Each column represents a protein, an X indicates an incorrect prediction and a period represents a ‘correct’ or ‘agree’ prediction. (See text for evaluation details.) Third and following lines, adjusted number of hits, i.e. number of hits for which information is available in TissueInfo. The adjusted number of hits is given reading downwards at every point. Predictions are sorted according to increasing adjusted hit number.

Similar articles

Cited by

References

    1. 1 Wolfsberg T.G. and Landsman,D. (1997) A comparison of expressed sequence tags (ESTs) to human genomic sequences. Nucleic Acids Res., 25, 1626–1632. - PMC - PubMed
    1. Schuler G.D. (1997) Pieces of the puzzle: expressed sequence tags and the catalog of human genes. J. Mol. Med., 75, 694–698. - PubMed
    1. Marra M.A., Hillier,L. and Waterston,R.H. (1998) Expressed sequence tags—ESTablishing bridges between genomes. Trends Genet., 14, 4–7. - PubMed
    1. Putney S.D., Herlihy,W.C. and Schimmel,P. (1983) A new troponin T and cDNA clones for 13 different muscle proteins, found by shotgun sequencing. Nature, 302, 718–721. - PubMed
    1. Boguski M.S., Lowe,T.M. and Tolstoshev,C.M. (1993) dbEST—database for ‘expressed sequence tags’. Nature Genet., 4, 332–333. - PubMed

Publication types