Apparently low reproducibility of true differential expression discoveries in microarray studies
- PMID: 18632747
- DOI: 10.1093/bioinformatics/btn365
Abstract
Motivation: Differentially expressed gene (DEG) lists detected in different microarray studies of the same disease are often highly inconsistent. Even in technical replicate tests using identical samples, DEG detection shows very low reproducibility. It is therefore often believed that current small microarray studies largely produce false discoveries.
Results: Based on a statistical model, we show that even in technical replicate tests using identical samples, the selected DEG lists are highly likely to be very inconsistent in the presence of small measurement variations. Therefore, the apparently low reproducibility of DEG detection in current technical replicate tests does not indicate low quality of microarray technology. We also demonstrate that heterogeneous biological variations in real cancer data further reduce the overall reproducibility of DEG detection. Nevertheless, in small subsamples from both simulated and real data, the actual false discovery rate (FDR) of each DEG list tends to be low, suggesting that each separately determined list may comprise mostly true DEGs. Rather than simply counting the overlaps of the discovery lists from different studies of a complex disease, novel metrics are needed for evaluating the reproducibility of discoveries characterized by correlated molecular changes.
Supplementary information: Supplementary data are available at Bioinformatics online.
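The central claim of the Results (two small studies can each yield a list of mostly true DEGs and still share few genes) can be illustrated with a toy simulation. The sketch below is not the authors' statistical model. It assumes independent Gaussian expression values, a large fraction of true DEGs with a common moderate effect size, five samples per group, and selection of the top 200 genes by absolute pooled-variance t-statistic. All parameter values and function names (`simulate_study`, `top_k_by_t`) are hypothetical choices made for illustration.

```python
import numpy as np


def simulate_study(rng, effects, n_per_group):
    """Draw one small case/control study (genes x samples), unit-variance noise."""
    n_genes = effects.size
    ctrl = rng.normal(0.0, 1.0, size=(n_genes, n_per_group))
    case = rng.normal(effects[:, None], 1.0, size=(n_genes, n_per_group))
    return case, ctrl


def top_k_by_t(case, ctrl, k):
    """Return the indices of the k genes with the largest |t| (pooled variance)."""
    n1, n2 = case.shape[1], ctrl.shape[1]
    diff = case.mean(axis=1) - ctrl.mean(axis=1)
    pooled_var = (
        (n1 - 1) * case.var(axis=1, ddof=1) + (n2 - 1) * ctrl.var(axis=1, ddof=1)
    ) / (n1 + n2 - 2)
    t = diff / np.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return set(np.argsort(-np.abs(t))[:k].tolist())


# Assumed (illustrative) settings: many true DEGs, moderate effect, tiny studies.
rng = np.random.default_rng(0)
n_genes, n_true, effect, n_per_group, k = 5000, 2000, 1.5, 5, 200

effects = np.zeros(n_genes)
effects[:n_true] = effect          # the first n_true genes are truly differential
true_degs = set(range(n_true))

# Two independent small studies drawn from the same underlying population.
list_a = top_k_by_t(*simulate_study(rng, effects, n_per_group), k)
list_b = top_k_by_t(*simulate_study(rng, effects, n_per_group), k)

overlap = len(list_a & list_b) / k   # fraction of genes shared by both lists
fdr_a = len(list_a - true_degs) / k  # actual FDR of list A (truth is known here)
fdr_b = len(list_b - true_degs) / k  # actual FDR of list B

print(f"overlap between lists: {overlap:.2%}")
print(f"actual FDR, list A: {fdr_a:.2%}; list B: {fdr_b:.2%}")
```

Under these assumptions each study has low power for any individual gene, so the two lists sample different subsets of a large pool of true DEGs. The overlap is small even though few null genes enter either list. Changing `n_true`, `effect`, or `n_per_group` shows how sensitive the overlap count is to study size and to the number of true DEGs.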