Soft truncation thresholding for gene set analysis of RNA-seq data: application to a vaccine study
- PMID: 24104466
- PMCID: PMC3793215
- DOI: 10.1038/srep02898
Soft truncation thresholding for gene set analysis of RNA-seq data: application to a vaccine study
Abstract
Gene set analysis (GSA) has been used for analysis of microarray data to aid the interpretation and to increase statistical power. With the advent of next-generation sequencing, the use of GSA is even more relevant, as studies are often conducted on a small number of samples. We propose the use of soft truncation thresholding and the Gamma Method (GM) to determine significant gene set (GS), where a generalized linear model is used to assess per-gene significance. The approach was compared to other methods using an extensive simulation study and RNA-seq data from smallpox vaccine study. The GM was found to outperform other proposed methods. Application of the GM to the smallpox vaccine study found the GSs to be moderately associated with response, including focal adhesion (p = 0.04) and extracellular matrix receptor interaction (p = 0.05). The application of GSA to RNA-seq data will provide new insights into the genomic basis of complex traits.
Conflict of interest statement
GAP is the chair of a Safety Evaluation Committee for novel investigational vaccine trials being conducted by Merck Research Laboratories. GAP offers consultative advice on vaccine development to Merck & Co. Inc., CSL Biotherapies, Avianax, Sanofi Pasteur, Dynavax, Novartis Vaccines and Therapeutics, and PAXVAX Inc. These activities have been reviewed by the Mayo Clinic Conflict of Interest Review Board and are conducted in compliance with Mayo Clinic Conflict of Interest policies. All other authors have no disclosures to disclose.
Figures
Similar articles
-
Comparative evaluation of gene set analysis approaches for RNA-Seq data.BMC Bioinformatics. 2014 Dec 5;15(1):397. doi: 10.1186/s12859-014-0397-8. BMC Bioinformatics. 2014. PMID: 25475910 Free PMC article.
-
Deterministic column subset selection for single-cell RNA-Seq.PLoS One. 2019 Jan 25;14(1):e0210571. doi: 10.1371/journal.pone.0210571. eCollection 2019. PLoS One. 2019. PMID: 30682053 Free PMC article.
-
Statistical detection of differentially expressed genes based on RNA-seq: from biological to phylogenetic replicates.Brief Bioinform. 2016 Mar;17(2):243-8. doi: 10.1093/bib/bbv035. Epub 2015 Jun 24. Brief Bioinform. 2016. PMID: 26108230 Review.
-
A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017. PLoS One. 2017. PMID: 28459823 Free PMC article.
-
From RNA-seq reads to differential expression results.Genome Biol. 2010;11(12):220. doi: 10.1186/gb-2010-11-12-220. Epub 2010 Dec 22. Genome Biol. 2010. PMID: 21176179 Free PMC article. Review.
Cited by
-
Fifteen Years of Gene Set Analysis for High-Throughput Genomic Data: A Review of Statistical Approaches and Future Challenges.Entropy (Basel). 2020 Apr 10;22(4):427. doi: 10.3390/e22040427. Entropy (Basel). 2020. PMID: 33286201 Free PMC article. Review.
-
Gene signatures associated with adaptive humoral immunity following seasonal influenza A/H1N1 vaccination.Genes Immun. 2016 Dec;17(7):371-379. doi: 10.1038/gene.2016.34. Epub 2016 Aug 18. Genes Immun. 2016. PMID: 27534615 Free PMC article.
-
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline.Brief Bioinform. 2016 May;17(3):393-407. doi: 10.1093/bib/bbv069. Epub 2015 Sep 4. Brief Bioinform. 2016. PMID: 26342128 Free PMC article.
-
DOT: Gene-set analysis by combining decorrelated association statistics.PLoS Comput Biol. 2020 Apr 14;16(4):e1007819. doi: 10.1371/journal.pcbi.1007819. eCollection 2020 Apr. PLoS Comput Biol. 2020. PMID: 32287273 Free PMC article.
-
Whole Transcriptome Profiling Identifies CD93 and Other Plasma Cell Survival Factor Genes Associated with Measles-Specific Antibody Response after Vaccination.PLoS One. 2016 Aug 16;11(8):e0160970. doi: 10.1371/journal.pone.0160970. eCollection 2016. PLoS One. 2016. PMID: 27529750 Free PMC article.
References
-
- Goeman J. J. & Buhlmann P. Analyzing gene expression data in terms of gene sets: methodological issues. Bioinformatics 23, 980–987 (2007). - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Miscellaneous