Integration of genomic datasets to predict protein complexes in yeast

doi:10.1023/a:1020495201615

. 2002;2(2):71-81.

doi: 10.1023/a:1020495201615.

Integration of genomic datasets to predict protein complexes in yeast

Ronald Jansen¹, Ning Lan, Jiang Qian, Mark Gerstein

Affiliations

PMID: 12836664
DOI: 10.1023/a:1020495201615

Integration of genomic datasets to predict protein complexes in yeast

Ronald Jansen et al. J Struct Funct Genomics. 2002.

. 2002;2(2):71-81.

doi: 10.1023/a:1020495201615.

Authors

Ronald Jansen¹, Ning Lan, Jiang Qian, Mark Gerstein

Affiliation

¹ Department of Molecular Biophysics & Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA.

PMID: 12836664
DOI: 10.1023/a:1020495201615

Abstract

The ultimate goal of functional genomics is to define the function of all the genes in the genome of an organism. A large body of information of the biological roles of genes has been accumulated and aggregated in the past decades of research, both from traditional experiments detailing the role of individual genes and proteins, and from newer experimental strategies that aim to characterize gene function on a genomic scale. It is clear that the goal of functional genomics can only be achieved by integrating information and data sources from the variety of these different experiments. Integration of different data is thus an important challenge for bioinformatics. The integration of different data sources often helps to uncover non-obvious relationships between genes, but there are also two further benefits. First, it is likely that whenever information from multiple independent sources agrees, it should be more valid and reliable. Secondly, by looking at the union of multiple sources, one can cover larger parts of the genome. This is obvious for integrating results from multiple single gene or protein experiments, but also necessary for many of the results from genome-wide experiments since they are often confined to certain (although sizable) subsets of the genome. In this paper, we explore an example of such a data integration procedure. We focus on the prediction of membership in protein complexes for individual genes. For this, we recruit six different data sources that include expression profiles, interaction data, essentiality and localization information. Each of these data sources individually contains some weakly predictive information with respect to protein complexes, but we show how this prediction can be improved by combining all of them. Supplementary information is available at http:// bioinfo.mbb.yale.edu/integrate/interactions/.

PubMed Disclaimer

Cited by

Mining and state-space modeling and verification of sub-networks from large-scale biomolecular networks.
Hu X, Wu FX. Hu X, et al. BMC Bioinformatics. 2007 Aug 31;8:324. doi: 10.1186/1471-2105-8-324. BMC Bioinformatics. 2007. PMID: 17764552 Free PMC article.
Incorporating Ontology-Driven Similarity Knowledge into Functional Genomics: An Exploratory Study.
Azuaje F, Bodenreider O. Azuaje F, et al. BIBE 2004. 2004 May;2004:317-324. doi: 10.1109/BIBE.2004.1317360. BIBE 2004. 2004. PMID: 25635264 Free PMC article.
Single-Cell Co-expression Analysis Reveals Distinct Functional Modules, Co-regulation Mechanisms and Clinical Outcomes.
Wang J, Xia S, Arand B, Zhu H, Machiraju R, Huang K, Ji H, Qian J. Wang J, et al. PLoS Comput Biol. 2016 Apr 21;12(4):e1004892. doi: 10.1371/journal.pcbi.1004892. eCollection 2016 Apr. PLoS Comput Biol. 2016. PMID: 27100869 Free PMC article.
Motifs, themes and thematic maps of an integrated Saccharomyces cerevisiae interaction network.
Zhang LV, King OD, Wong SL, Goldberg DS, Tong AH, Lesage G, Andrews B, Bussey H, Boone C, Roth FP. Zhang LV, et al. J Biol. 2005;4(2):6. doi: 10.1186/jbiol23. Epub 2005 Jun 1. J Biol. 2005. PMID: 15982408 Free PMC article.
Predicting co-complexed protein pairs using genomic and proteomic data integration.
Zhang LV, Wong SL, King OD, Roth FP. Zhang LV, et al. BMC Bioinformatics. 2004 Apr 16;5:38. doi: 10.1186/1471-2105-5-38. BMC Bioinformatics. 2004. PMID: 15090078 Free PMC article.

See all "Cited by" articles

References

1. Genome Res. 1999 Nov;9(11):1106-15 - PubMed
1. Nat Genet. 2001 Feb;27(2):167-71 - PubMed
1. Nucleic Acids Res. 2000 Mar 15;28(6):1481-8 - PubMed
1. Nucleic Acids Res. 1999 Jan 1;27(1):69-73 - PubMed
1. Genome Res. 1996 Jul;6(7):639-45 - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
- Ovid Technologies, Inc.
Molecular Biology Databases
- Saccharomyces Genome Database
Miscellaneous
- NCI CPTAC Assay Portal

[1] Genome Res. 1999 Nov;9(11):1106-15 - PubMed

[2] Genome Res. 1999 Nov;9(11):1106-15 - PubMed

[3] Nat Genet. 2001 Feb;27(2):167-71 - PubMed

[4] Nat Genet. 2001 Feb;27(2):167-71 - PubMed

[5] Nucleic Acids Res. 2000 Mar 15;28(6):1481-8 - PubMed

[6] Nucleic Acids Res. 2000 Mar 15;28(6):1481-8 - PubMed

[7] Nucleic Acids Res. 1999 Jan 1;27(1):69-73 - PubMed

[8] Nucleic Acids Res. 1999 Jan 1;27(1):69-73 - PubMed

[9] Genome Res. 1996 Jul;6(7):639-45 - PubMed

[10] Genome Res. 1996 Jul;6(7):639-45 - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Integration of genomic datasets to predict protein complexes in yeast

Affiliation

Integration of genomic datasets to predict protein complexes in yeast

Authors

Affiliation

Abstract

Similar articles

Cited by

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Molecular Biology Databases

Miscellaneous