Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli
- PMID: 8805245
- DOI: 10.1016/s0960-9822(02)00478-5
Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli
Abstract
Background: The 1.83 Megabase (Mb) sequence of the Haemophilus influenzae chromosome, the first completed genome sequence of a cellular life form, has been recently reported. Approximately 75 % of the 4.7 Mb genome sequence of Escherichia coli is also available. The life styles of the two bacteria are very different - H. influenzae is an obligate parasite that lives in human upper respiratory mucosa and can be cultivated only on rich media, whereas E. coli is a saprophyte that can grow on minimal media. A detailed comparison of the protein products encoded by these two genomes is expected to provide valuable insights into bacterial cell physiology and genome evolution.
Results: We describe the results of computer analysis of the amino-acid sequences of 1703 putative proteins encoded by the complete genome of H. influenzae. We detected sequence similarity to proteins in current databases for 92 % of the H. influenzae protein sequences, and at least a general functional prediction was possible for 83 %. A comparison of the H. influenzae protein sequences with those of 3010 proteins encoded by the sequenced 75 % of the E. coli genome revealed 1128 pairs of apparent orthologs, with an average of 59 % identity. In contrast to the high similarity between orthologs, the genome organization and the functional repertoire of genes in the two bacteria were remarkably different. The smaller genome size of H. influenzae is explained, to a large extent, by a reduction in the number of paralogous genes. There was no long range colinearity between the E. coli and H. influenzae gene orders, but over 70 % of the orthologous genes were found in short conserved strings, only about half of which were operons in E. coli. Superposition of the H. influenzae enzyme repertoire upon the known E. coli metabolic pathways allowed us to reconstruct similar and alternative pathways in H. influenzae and provides an explanation for the known nutritional requirements.
Conclusions: By comparing proteins encoded by the two bacterial genomes, we have shown that extensive gene shuffling and variation in the extent of gene paralogy are major trends in bacterial evolution; this comparison has also allowed us to deduce crucial aspects of the largely uncharacterized metabolism of H. influenzae.
Similar articles
-
The evolutionary relationships between the two bacteria Escherichia coli and Haemophilus influenzae and their putative last common ancestor.Mol Biol Evol. 1998 Jan;15(1):17-27. doi: 10.1093/oxfordjournals.molbev.a025843. Mol Biol Evol. 1998. PMID: 9491601
-
Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea.Mol Microbiol. 1997 Aug;25(4):619-37. doi: 10.1046/j.1365-2958.1997.4821861.x. Mol Microbiol. 1997. PMID: 9379893
-
A minimal gene set for cellular life derived by comparison of complete bacterial genomes.Proc Natl Acad Sci U S A. 1996 Sep 17;93(19):10268-73. doi: 10.1073/pnas.93.19.10268. Proc Natl Acad Sci U S A. 1996. PMID: 8816789 Free PMC article.
-
Complete genome sequences of cellular life forms: glimpses of theoretical evolutionary genomics.Curr Opin Genet Dev. 1996 Dec;6(6):757-62. doi: 10.1016/s0959-437x(96)80032-3. Curr Opin Genet Dev. 1996. PMID: 8994848 Review.
-
Novel PTS proteins revealed by bacterial genome sequencing: a unique fructose-specific phosphoryl transfer protein with two HPr-like domains in Haemophilus influenzae.Res Microbiol. 1996 May;147(4):209-15. doi: 10.1016/0923-2508(96)81381-7. Res Microbiol. 1996. PMID: 8763608 Review.
Cited by
-
The environmentally-regulated interplay between local three-dimensional chromatin organisation and transcription of proVWX in E. coli.Nat Commun. 2023 Nov 17;14(1):7478. doi: 10.1038/s41467-023-43322-y. Nat Commun. 2023. PMID: 37978176 Free PMC article.
-
Optimization based automated curation of metabolic reconstructions.BMC Bioinformatics. 2007 Jun 20;8:212. doi: 10.1186/1471-2105-8-212. BMC Bioinformatics. 2007. PMID: 17584497 Free PMC article.
-
Congruent evolution of different classes of non-coding DNA in prokaryotic genomes.Nucleic Acids Res. 2002 Oct 1;30(19):4264-71. doi: 10.1093/nar/gkf549. Nucleic Acids Res. 2002. PMID: 12364605 Free PMC article.
-
Comparative genome analysis of the pathogenic spirochetes Borrelia burgdorferi and Treponema pallidum.Infect Immun. 2000 Mar;68(3):1633-48. doi: 10.1128/IAI.68.3.1633-1648.2000. Infect Immun. 2000. PMID: 10678983 Free PMC article.
-
The past, present and future of genome-wide re-annotation.Genome Biol. 2002;3(2):COMMENT2001. doi: 10.1186/gb-2002-3-2-comment2001. Epub 2002 Jan 31. Genome Biol. 2002. PMID: 11864365 Free PMC article. Review.
Publication types
MeSH terms
Substances
Associated data
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous