Apparently low reproducibility of true differential expression discoveries in microarray studies
- PMID: 18632747
- DOI: 10.1093/bioinformatics/btn365
Apparently low reproducibility of true differential expression discoveries in microarray studies
Abstract
Motivation: Differentially expressed gene (DEG) lists detected from different microarray studies for a same disease are often highly inconsistent. Even in technical replicate tests using identical samples, DEG detection still shows very low reproducibility. It is often believed that current small microarray studies will largely introduce false discoveries.
Results: Based on a statistical model, we show that even in technical replicate tests using identical samples, it is highly likely that the selected DEG lists will be very inconsistent in the presence of small measurement variations. Therefore, the apparently low reproducibility of DEG detection from current technical replicate tests does not indicate low quality of microarray technology. We also demonstrate that heterogeneous biological variations existing in real cancer data will further reduce the overall reproducibility of DEG detection. Nevertheless, in small subsamples from both simulated and real data, the actual false discovery rate (FDR) for each DEG list tends to be low, suggesting that each separately determined list may comprise mostly true DEGs. Rather than simply counting the overlaps of the discovery lists from different studies for a complex disease, novel metrics are needed for evaluating the reproducibility of discoveries characterized with correlated molecular changes. Supplementaty information: Supplementary data are available at Bioinformatics online.
Similar articles
-
Evaluating reproducibility of differential expression discoveries in microarray studies by considering correlated molecular changes.Bioinformatics. 2009 Jul 1;25(13):1662-8. doi: 10.1093/bioinformatics/btp295. Epub 2009 May 5. Bioinformatics. 2009. PMID: 19417058 Free PMC article.
-
False discovery rate, sensitivity and sample size for microarray studies.Bioinformatics. 2005 Jul 1;21(13):3017-24. doi: 10.1093/bioinformatics/bti448. Epub 2005 Apr 19. Bioinformatics. 2005. PMID: 15840707
-
Novel statistical framework to identify differentially expressed genes allowing transcriptomic background differences.Bioinformatics. 2010 Jun 1;26(11):1431-6. doi: 10.1093/bioinformatics/btq163. Epub 2010 Apr 16. Bioinformatics. 2010. PMID: 20400756
-
Empirical Bayes screening of many p-values with applications to microarray studies.Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2. Bioinformatics. 2005. PMID: 15691856
-
Stability and aggregation of ranked gene lists.Brief Bioinform. 2009 Sep;10(5):556-68. doi: 10.1093/bib/bbp034. Brief Bioinform. 2009. PMID: 19679825 Review.
Cited by
-
Biological impact of missing-value imputation on downstream analyses of gene expression profiles.Bioinformatics. 2011 Jan 1;27(1):78-86. doi: 10.1093/bioinformatics/btq613. Epub 2010 Nov 2. Bioinformatics. 2011. PMID: 21045072 Free PMC article.
-
Extracting consistent knowledge from highly inconsistent cancer gene data sources.BMC Bioinformatics. 2010 Feb 5;11:76. doi: 10.1186/1471-2105-11-76. BMC Bioinformatics. 2010. PMID: 20137077 Free PMC article.
-
Genes dysregulated to different extent or oppositely in estrogen receptor-positive and estrogen receptor-negative breast cancers.PLoS One. 2013 Jul 18;8(7):e70017. doi: 10.1371/journal.pone.0070017. Print 2013. PLoS One. 2013. PMID: 23875016 Free PMC article.
-
Pitfalls in experimental designs for characterizing the transcriptional, methylational and copy number changes of oncogenes and tumor suppressor genes.PLoS One. 2013;8(3):e58163. doi: 10.1371/journal.pone.0058163. Epub 2013 Mar 5. PLoS One. 2013. PMID: 23472150 Free PMC article.
-
Sequential analysis of myocardial gene expression with phenotypic change: Use of cross-platform concordance to strengthen biologic relevance.PLoS One. 2019 Aug 30;14(8):e0221519. doi: 10.1371/journal.pone.0221519. eCollection 2019. PLoS One. 2019. PMID: 31469842 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources