Genome-wide functional analysis of the cotton transcriptome by creating an integrated EST database
- PMID: 22087239
- PMCID: PMC3210780
- DOI: 10.1371/journal.pone.0026980
Abstract
A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton EST contigs were clustered into 5,461 groups with a maximum cluster size of 196 members. A total of 27,956 indel mutants and 149,616 single nucleotide polymorphisms (SNPs) were identified from consensus contigs. Interestingly, many contigs with significantly high frequencies of indels or SNPs encode transcription factors and protein kinases. In a comparison with six model plant species, cotton ESTs show the highest overall similarity to grape. A total of 87 cotton miRNAs were identified; 59 of these have not been reported previously from experimental or bioinformatics investigations. We also predicted 3,260 genes as miRNA targets, which are associated with multiple biological functions, including stress response, metabolism, hormone signal transduction and fiber development. We identified 151 and 4,214 EST-simple sequence repeats (SSRs) from contigs and raw ESTs, respectively. To make these data widely available, and to facilitate access to EST-related genetic information, we integrated our results into a comprehensive, fully downloadable web-based cotton EST database (www.leonxie.com).
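As an illustration of the EST-SSR identification step mentioned above, the following is a minimal sketch of how perfect simple sequence repeats can be located in a nucleotide sequence with a regular expression. The abstract does not state which tool or thresholds the authors used, so the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen only for demonstration.

```python
import re

# Hypothetical thresholds (motif length -> minimum number of tandem repeats).
# These are illustrative assumptions, not the values used in the paper.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of fixed length followed by at least (min_rep - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT"),
            # since those are reported under the shorter motif length.
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return sorted(hits)


# Toy sequence: a 7-unit AT repeat and a 5-unit AAG repeat.
example = "GG" + "AT" * 7 + "CC" + "AAG" * 5 + "TT"
ssrs = find_ssrs(example)
```

Applied to each contig or raw EST in turn, a scan of this kind yields the positions and motifs from which SSR counts can be tallied; dedicated SSR tools additionally handle compound and imperfect repeats, which this sketch ignores.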