Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Dec 15;364(5):964-73.
doi: 10.1016/j.jmb.2006.09.074. Epub 2006 Oct 3.

Mosaic structure of human coronavirus NL63, one thousand years of evolution

Affiliations

Mosaic structure of human coronavirus NL63, one thousand years of evolution

Krzysztof Pyrc et al. J Mol Biol. .

Abstract

Before the SARS outbreak only two human coronaviruses (HCoV) were known: HCoV-OC43 and HCoV-229E. With the discovery of SARS-CoV in 2003, a third family member was identified. Soon thereafter, we described the fourth human coronavirus (HCoV-NL63), a virus that has spread worldwide and is associated with croup in children. We report here the complete genome sequence of two HCoV-NL63 clinical isolates, designated Amsterdam 57 and Amsterdam 496. The genomes are 27,538 and 27,550 nucleotides long, respectively, and share the same genome organization. We identified two variable regions, one within the 1a and one within the S gene, whereas the 1b and N genes were most conserved. Phylogenetic analysis revealed that HCoV-NL63 genomes have a mosaic structure with multiple recombination sites. Additionally, employing three different algorithms, we assessed the evolutionary rate for the S gene of group Ib coronaviruses to be approximately 3 x 10(-4) substitutions per site per year. Using this evolutionary rate we determined that HCoV-NL63 diverged in the 11th century from its closest relative HCoV-229E.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Molecular variability of HCoV-NL63 along the genome. (a) Frequency of polymorphic sites at the nucleotide level among four isolates of HCoV-NL63. (b)Frequency of polymorphic sites on the synonymous and non-synonymous positions among four isolates of HCoV-NL63. The analysis was done with a 500 nt non-overlapping window.
Figure 2
Figure 2
Discordance in phylogenetic clustering of different isolates of HCoV-NL63 at regions 3K, 6K and 21K. Phylogenetic trees were constructed as described in Materials and Methods using an UPGMA algorithm. The scale bar unit represents a 0.002 substitution per site. The trees were rooted with the sequences of it's closest relative: for 3K and 6K the HCoV-229E and for the 21K region the PEDV. Four completely sequenced isolates are marked with colored boxes, illustrating the discordance in clustering in different regions of the genome.
Figure 3
Figure 3
Discordance in clustering of different isolates of HCoV-NL63 at regions 3K, 6K and 21K. Three alignments of only variable sites subtracted from the original sequence with DnaSP 4.0 software, shows the discordance in clustering at different regions of the genome. Groups A and B were created arbitrarily to show the discordance. Group A was defined as the group that contains the Amsterdam 1 isolate.
Figure 4
Figure 4
Identification of recombination sites in the S gene. The alignment includes only subtracted variable sites. These variable sites were subtracted with DnaSP 4.0 software. The change of color represents the alternation of genetic clustering between isolates. The numbers at the top represent the beginning of the S gene (nt 20,472 in the Amsterdam 1 isolate), coordinates of recombination spots inside the S gene (nt 21,061–21,072 and 21,575–21,576 in the Amsterdam 1 isolate) and the 3′ terminus of the S gene (nt 24,542 in the Amsterdam 1 isolate).
Figure 5
Figure 5
Signs of interspecies recombination in the M gene. Similarity plot (a) and bootscan analysis with the Kimura (two-parameter) distance model, neighbor-joining tree model and 500 bootstrap replicates (b) of the HCoV-NL63 M protein. (c) Analysis of the substitution pattern on the synonymous and non-synonymous level between the M gene of HCoV-NL63 and HCoV-229E. The window used was 40 nucleotides with a step of one nucleotide.

Similar articles

Cited by

References

    1. Guy J.S., Breslin J.J., Breuhaus B., Vivrette S., Smith L.G. Characterization of a coronavirus isolated from a diarrheic foal. J. Clin. Microbiol. 2000;38:4523–4526. - PMC - PubMed
    1. Lai M.M.C., Holmes K.V. Coronaviridae and their replication. In: Howley P., Griffin D., Lamb R., Martin M., Roizman B., Straus S., Knipe D., editors. Fields-Virology. Lippincott Williams and Wilkins; London: 2005. pp. 1163–1185.
    1. Cavanagh D., Brian D.A., Brinton M.A., Enjuanes L., Holmes K.V., Horzinek M.C. The Coronaviridae now comprises two genera, coronavirus and torovirus: report of the Coronaviridae Study Group. Adv. Exp. Med. Biol. 1993;342:255–257. - PubMed
    1. van der Hoek L., Pyrc K., Jebbink M.F., Vermeulen-Oost W., Berkhout R.J., Wolthers K.C. Identification of a new human coronavirus. Nature Med. 2004;10:368–373. - PMC - PubMed
    1. Pyrc K., Jebbink M.F., Berkhout B., van der Hoek L. Genome structure and transcriptional regulation of human coronavirus NL63. Virol. J. 2004;1:7. - PMC - PubMed

Publication types

Associated data

LinkOut - more resources