Abstract
Heterogeneous nuclear ribonucleoproteins (hnRNPs) are thought to influence the structure of hnRNA and participate in the processing of hnRNA to mRNA. The hnRNP U protein is an abundant nucleoplasmic phosphoprotein that is the largest of the major hnRNP proteins (120 kDa by SDS-PAGE). HnRNP U binds pre-mRNA in vivo and binds both RNA and ssDNA in vitro. Here we describe the cloning and sequencing of a cDNA encoding the hnRNP U protein, the determination of its amino acid sequence and the delineation of a region in this protein that confers RNA binding. The predicted amino acid sequence of hnRNP U contains 806 amino acids (88,939 Daltons), and shows no extensive homology to any known proteins. The N-terminus is rich in acidic residues and the C-terminus is glycine-rich. In addition, a glutamine-rich stretch, a putative NTP binding site and a putative nuclear localization signal are present. It could not be defined from the sequence what segment of the protein confers its RNA binding activity. We identified an RNA binding activity within the C-terminal glycine-rich 112 amino acids. This region, designated U protein glycine-rich RNA binding region (U-gly), can by itself bind RNA. Furthermore, fusion of U-gly to a heterologous bacterial protein (maltose binding protein) converts this fusion protein into an RNA binding protein. A 26 amino acid peptide within U-gly is necessary for the RNA binding activity of the U protein. Interestingly, this peptide contains a cluster of RGG repeats with characteristic spacing and this motif is found also in several other RNA binding proteins. We have termed this region the RGG box and propose that it is an RNA binding motif and a predictor of RNA binding activity.
Full text
PDFImages in this article
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Adam S. A., Nakagawa T., Swanson M. S., Woodruff T. K., Dreyfuss G. mRNA polyadenylate-binding protein: gene isolation and sequencing and identification of a ribonucleoprotein consensus sequence. Mol Cell Biol. 1986 Aug;6(8):2932–2943. doi: 10.1128/mcb.6.8.2932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aris J. P., Blobel G. cDNA cloning and sequencing of human fibrillarin, a conserved nucleolar protein recognized by autoimmune antisera. Proc Natl Acad Sci U S A. 1991 Feb 1;88(3):931–935. doi: 10.1073/pnas.88.3.931. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bandziulis R. J., Swanson M. S., Dreyfuss G. RNA-binding proteins as developmental regulators. Genes Dev. 1989 Apr;3(4):431–437. doi: 10.1101/gad.3.4.431. [DOI] [PubMed] [Google Scholar]
- Bourbon H. M., Lapeyre B., Amalric F. Structure of the mouse nucleolin gene. The complete sequence reveals that each RNA binding domain is encoded by two independent exons. J Mol Biol. 1988 Apr 20;200(4):627–638. doi: 10.1016/0022-2836(88)90476-7. [DOI] [PubMed] [Google Scholar]
- Brennan C. A., Platt T. Mutations in an RNP1 consensus sequence of Rho protein reduce RNA binding affinity but facilitate helicase turnover. J Biol Chem. 1991 Sep 15;266(26):17296–17305. [PubMed] [Google Scholar]
- Burd C. G., Matunis E. L., Dreyfuss G. The multiple RNA-binding domains of the mRNA poly(A)-binding protein have different RNA-binding activities. Mol Cell Biol. 1991 Jul;11(7):3419–3424. doi: 10.1128/mcb.11.7.3419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burd C. G., Swanson M. S., Görlach M., Dreyfuss G. Primary structures of the heterogeneous nuclear ribonucleoprotein A2, B1, and C2 proteins: a diversity of RNA binding proteins is generated by small peptide inserts. Proc Natl Acad Sci U S A. 1989 Dec;86(24):9788–9792. doi: 10.1073/pnas.86.24.9788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buvoli M., Biamonti G., Tsoulfas P., Bassi M. T., Ghetti A., Riva S., Morandi C. cDNA cloning of human hnRNP protein A1 reveals the existence of multiple mRNA isoforms. Nucleic Acids Res. 1988 May 11;16(9):3751–3770. doi: 10.1093/nar/16.9.3751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caizergues-Ferrer M., Mariottini P., Curie C., Lapeyre B., Gas N., Amalric F., Amaldi F. Nucleolin from Xenopus laevis: cDNA cloning and expression during development. Genes Dev. 1989 Mar;3(3):324–333. doi: 10.1101/gad.3.3.324. [DOI] [PubMed] [Google Scholar]
- Calnan B. J., Biancalana S., Hudson D., Frankel A. D. Analysis of arginine-rich peptides from the HIV Tat protein reveals unusual features of RNA-protein recognition. Genes Dev. 1991 Feb;5(2):201–210. doi: 10.1101/gad.5.2.201. [DOI] [PubMed] [Google Scholar]
- Choi Y. D., Dreyfuss G. Isolation of the heterogeneous nuclear RNA-ribonucleoprotein complex (hnRNP): a unique supramolecular assembly. Proc Natl Acad Sci U S A. 1984 Dec;81(23):7471–7475. doi: 10.1073/pnas.81.23.7471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choi Y. D., Dreyfuss G. Monoclonal antibody characterization of the C proteins of heterogeneous nuclear ribonucleoprotein complexes in vertebrate cells. J Cell Biol. 1984 Dec;99(6):1997–1204. doi: 10.1083/jcb.99.6.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chou P. Y., Fasman G. D. Prediction of the secondary structure of proteins from their amino acid sequence. Adv Enzymol Relat Areas Mol Biol. 1978;47:45–148. doi: 10.1002/9780470122921.ch2. [DOI] [PubMed] [Google Scholar]
- Christensen M. E., Fuxa K. P. The nucleolar protein, B-36, contains a glycine and dimethylarginine-rich sequence conserved in several other nuclear RNA-binding proteins. Biochem Biophys Res Commun. 1988 Sep 30;155(3):1278–1283. doi: 10.1016/s0006-291x(88)81279-8. [DOI] [PubMed] [Google Scholar]
- Cobianchi F., Karpel R. L., Williams K. R., Notario V., Wilson S. H. Mammalian heterogeneous nuclear ribonucleoprotein complex protein A1. Large-scale overproduction in Escherichia coli and cooperative binding to single-stranded nucleic acids. J Biol Chem. 1988 Jan 15;263(2):1063–1071. [PubMed] [Google Scholar]
- Cobianchi F., SenGupta D. N., Zmudzka B. Z., Wilson S. H. Structure of rodent helix-destabilizing protein revealed by cDNA cloning. J Biol Chem. 1986 Mar 15;261(8):3536–3543. [PubMed] [Google Scholar]
- Davis R. L., Cheng P. F., Lassar A. B., Weintraub H. The MyoD DNA binding domain contains a recognition code for muscle-specific gene activation. Cell. 1990 Mar 9;60(5):733–746. doi: 10.1016/0092-8674(90)90088-v. [DOI] [PubMed] [Google Scholar]
- Dorer D. R., Christensen A. C., Johnson D. H. A novel RNA helicase gene tightly linked to the Triplo-lethal locus of Drosophila. Nucleic Acids Res. 1990 Sep 25;18(18):5489–5494. doi: 10.1093/nar/18.18.5489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dreyfuss G., Adam S. A., Choi Y. D. Physical change in cytoplasmic messenger ribonucleoproteins in cells treated with inhibitors of mRNA transcription. Mol Cell Biol. 1984 Mar;4(3):415–423. doi: 10.1128/mcb.4.3.415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dreyfuss G., Choi Y. D., Adam S. A. Characterization of heterogeneous nuclear RNA-protein complexes in vivo with monoclonal antibodies. Mol Cell Biol. 1984 Jun;4(6):1104–1114. doi: 10.1128/mcb.4.6.1104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dreyfuss G. Structure and function of nuclear and cytoplasmic ribonucleoprotein particles. Annu Rev Cell Biol. 1986;2:459–498. doi: 10.1146/annurev.cb.02.110186.002331. [DOI] [PubMed] [Google Scholar]
- Dreyfuss G., Swanson M. S., Piñol-Roma S. Heterogeneous nuclear ribonucleoprotein particles and the pathway of mRNA formation. Trends Biochem Sci. 1988 Mar;13(3):86–91. doi: 10.1016/0968-0004(88)90046-1. [DOI] [PubMed] [Google Scholar]
- Garcia-Bustos J., Heitman J., Hall M. N. Nuclear protein localization. Biochim Biophys Acta. 1991 Mar 7;1071(1):83–101. doi: 10.1016/0304-4157(91)90013-m. [DOI] [PubMed] [Google Scholar]
- Henríquez R., Blobel G., Aris J. P. Isolation and sequencing of NOP1. A yeast gene encoding a nucleolar protein homologous to a human autoimmune antigen. J Biol Chem. 1990 Feb 5;265(4):2209–2215. [PubMed] [Google Scholar]
- Hoffman D. W., Query C. C., Golden B. L., White S. W., Keene J. D. RNA-binding domain of the A protein component of the U1 small nuclear ribonucleoprotein analyzed by NMR spectroscopy is structurally similar to ribosomal proteins. Proc Natl Acad Sci U S A. 1991 Mar 15;88(6):2495–2499. doi: 10.1073/pnas.88.6.2495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hope I. A., Struhl K. Functional dissection of a eukaryotic transcriptional activator protein, GCN4 of yeast. Cell. 1986 Sep 12;46(6):885–894. doi: 10.1016/0092-8674(86)90070-x. [DOI] [PubMed] [Google Scholar]
- Iggo R. D., Jamieson D. J., MacNeill S. A., Southgate J., McPheat J., Lane D. P. p68 RNA helicase: identification of a nucleolar form and cloning of related genes containing a conserved intron in yeasts. Mol Cell Biol. 1991 Mar;11(3):1326–1333. doi: 10.1128/mcb.11.3.1326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jessen T. H., Oubridge C., Teo C. H., Pritchard C., Nagai K. Identification of molecular contacts between the U1 A small nuclear ribonucleoprotein and U1 RNA. EMBO J. 1991 Nov;10(11):3447–3456. doi: 10.1002/j.1460-2075.1991.tb04909.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jong A. Y., Clark M. W., Gilbert M., Oehm A., Campbell J. L. Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins. Mol Cell Biol. 1987 Aug;7(8):2947–2955. doi: 10.1128/mcb.7.8.2947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kay B. K., Sawhney R. K., Wilson S. H. Potential for two isoforms of the A1 ribonucleoprotein in Xenopus laevis. Proc Natl Acad Sci U S A. 1990 Feb;87(4):1367–1371. doi: 10.1073/pnas.87.4.1367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kenan D. J., Query C. C., Keene J. D. RNA recognition: towards identifying determinants of specificity. Trends Biochem Sci. 1991 Jun;16(6):214–220. doi: 10.1016/0968-0004(91)90088-d. [DOI] [PubMed] [Google Scholar]
- Kozak M. Comparison of initiation of protein synthesis in procaryotes, eucaryotes, and organelles. Microbiol Rev. 1983 Mar;47(1):1–45. doi: 10.1128/mr.47.1.1-45.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lapeyre B., Bourbon H., Amalric F. Nucleolin, the major nucleolar protein of growing eukaryotic cells: an unusual protein structure revealed by the nucleotide sequence. Proc Natl Acad Sci U S A. 1987 Mar;84(6):1472–1476. doi: 10.1073/pnas.84.6.1472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lazinski D., Grzadzielska E., Das A. Sequence-specific recognition of RNA hairpins by bacteriophage antiterminators requires a conserved arginine-rich motif. Cell. 1989 Oct 6;59(1):207–218. doi: 10.1016/0092-8674(89)90882-9. [DOI] [PubMed] [Google Scholar]
- Lee W. C., Xue Z. X., Mélèse T. The NSR1 gene encodes a protein that specifically binds nuclear localization sequences and has two RNA recognition motifs. J Cell Biol. 1991 Apr;113(1):1–12. doi: 10.1083/jcb.113.1.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lipman D. J., Pearson W. R. Rapid and sensitive protein similarity searches. Science. 1985 Mar 22;227(4693):1435–1441. doi: 10.1126/science.2983426. [DOI] [PubMed] [Google Scholar]
- Lutz-Freyermuth C., Query C. C., Keene J. D. Quantitative determination that one of two potential RNA-binding domains of the A protein component of the U1 small nuclear ribonucleoprotein complex binds with high affinity to stem-loop II of U1 RNA. Proc Natl Acad Sci U S A. 1990 Aug;87(16):6393–6397. doi: 10.1073/pnas.87.16.6393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maridor G., Krek W., Nigg E. A. Structure and developmental expression of chicken nucleolin and NO38: coordinate expression of two abundant non-ribosomal nucleolar proteins. Biochim Biophys Acta. 1990 Jun 21;1049(2):126–133. doi: 10.1016/0167-4781(90)90032-w. [DOI] [PubMed] [Google Scholar]
- Matunis E. L., Matunis M. J., Dreyfuss G. Characterization of the major hnRNP proteins from Drosophila melanogaster. J Cell Biol. 1992 Jan;116(2):257–269. doi: 10.1083/jcb.116.2.257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matunis M. J., Michael W. M., Dreyfuss G. Characterization and primary structure of the poly(C)-binding heterogeneous nuclear ribonucleoprotein complex K protein. Mol Cell Biol. 1992 Jan;12(1):164–171. doi: 10.1128/mcb.12.1.164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nadler S. G., Merrill B. M., Roberts W. J., Keating K. M., Lisbin M. J., Barnett S. F., Wilson S. H., Williams K. R. Interactions of the A1 heterogeneous nuclear ribonucleoprotein and its proteolytic derivative, UP1, with RNA and DNA: evidence for multiple RNA binding domains and salt-dependent binding mode transitions. Biochemistry. 1991 Mar 19;30(11):2968–2976. doi: 10.1021/bi00225a034. [DOI] [PubMed] [Google Scholar]
- Nagai K., Oubridge C., Jessen T. H., Li J., Evans P. R. Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A. Nature. 1990 Dec 6;348(6301):515–520. doi: 10.1038/348515a0. [DOI] [PubMed] [Google Scholar]
- Nakagawa T. Y., Swanson M. S., Wold B. J., Dreyfuss G. Molecular cloning of cDNA for the nuclear ribonucleoprotein particle C proteins: a conserved gene family. Proc Natl Acad Sci U S A. 1986 Apr;83(7):2007–2011. doi: 10.1073/pnas.83.7.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Omenn G. S., Fontana A., Anfinsen C. B. Modification of the single tryptophan residue of staphylococcal nuclease by a new mild oxidizing agent. J Biol Chem. 1970 Apr 25;245(8):1895–1902. [PubMed] [Google Scholar]
- Pavletich N. P., Pabo C. O. Zinc finger-DNA recognition: crystal structure of a Zif268-DNA complex at 2.1 A. Science. 1991 May 10;252(5007):809–817. doi: 10.1126/science.2028256. [DOI] [PubMed] [Google Scholar]
- Piñol-Roma S., Choi Y. D., Matunis M. J., Dreyfuss G. Immunopurification of heterogeneous nuclear ribonucleoprotein particles reveals an assortment of RNA-binding proteins. Genes Dev. 1988 Feb;2(2):215–227. doi: 10.1101/gad.2.2.215. [DOI] [PubMed] [Google Scholar]
- Rossmann M. G., Moras D., Olsen K. W. Chemical and biological evolution of nucleotide-binding protein. Nature. 1974 Jul 19;250(463):194–199. doi: 10.1038/250194a0. [DOI] [PubMed] [Google Scholar]
- Sanger F., Nicklen S., Coulson A. R. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977 Dec;74(12):5463–5467. doi: 10.1073/pnas.74.12.5463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scherly D., Boelens W., van Venrooij W. J., Dathan N. A., Hamm J., Mattaj I. W. Identification of the RNA binding segment of human U1 A protein and definition of its binding site on U1 snRNA. EMBO J. 1989 Dec 20;8(13):4163–4170. doi: 10.1002/j.1460-2075.1989.tb08601.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Srivastava M., Fleming P. J., Pollard H. B., Burns A. L. Cloning and sequencing of the human nucleolin cDNA. FEBS Lett. 1989 Jun 19;250(1):99–105. doi: 10.1016/0014-5793(89)80692-1. [DOI] [PubMed] [Google Scholar]
- Steitz T. A. Structural studies of protein-nucleic acid interaction: the sources of sequence-specific binding. Q Rev Biophys. 1990 Aug;23(3):205–280. doi: 10.1017/s0033583500005552. [DOI] [PubMed] [Google Scholar]
- Suzuki K., Olvera J., Wool I. G. Primary structure of rat ribosomal protein S2. A ribosomal protein with arginine-glycine tandem repeats and RGGF motifs that are associated with nucleolar localization and binding to ribonucleic acids. J Biol Chem. 1991 Oct 25;266(30):20007–20010. [PubMed] [Google Scholar]
- Swanson M. S., Dreyfuss G. Classification and purification of proteins of heterogeneous nuclear ribonucleoprotein particles by RNA-binding specificities. Mol Cell Biol. 1988 May;8(5):2237–2241. doi: 10.1128/mcb.8.5.2237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swanson M. S., Nakagawa T. Y., LeVan K., Dreyfuss G. Primary structure of human nuclear ribonucleoprotein particle C proteins: conservation of sequence and domain structures in heterogeneous nuclear RNA, mRNA, and pre-rRNA-binding proteins. Mol Cell Biol. 1987 May;7(5):1731–1739. doi: 10.1128/mcb.7.5.1731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Szewczyk B., Summers D. F. Preparative elution of proteins blotted to Immobilon membranes. Anal Biochem. 1988 Jan;168(1):48–53. doi: 10.1016/0003-2697(88)90008-5. [DOI] [PubMed] [Google Scholar]
- Vinson C. R., Sigler P. B., McKnight S. L. Scissors-grip model for DNA recognition by a family of leucine zipper proteins. Science. 1989 Nov 17;246(4932):911–916. doi: 10.1126/science.2683088. [DOI] [PubMed] [Google Scholar]
- Walker J. E., Saraste M., Runswick M. J., Gay N. J. Distantly related sequences in the alpha- and beta-subunits of ATP synthase, myosin, kinases and other ATP-requiring enzymes and a common nucleotide binding fold. EMBO J. 1982;1(8):945–951. doi: 10.1002/j.1460-2075.1982.tb01276.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wechsler S. L., Nesburn A. B., Zwaagstra J., Ghiasi H. Sequence of the latency-related gene of herpes simplex virus type 1. Virology. 1989 Jan;168(1):168–172. doi: 10.1016/0042-6822(89)90416-9. [DOI] [PubMed] [Google Scholar]
- Wolberger C., Vershon A. K., Liu B., Johnson A. D., Pabo C. O. Crystal structure of a MAT alpha 2 homeodomain-operator complex suggests a general model for homeodomain-DNA interactions. Cell. 1991 Nov 1;67(3):517–528. doi: 10.1016/0092-8674(91)90526-5. [DOI] [PubMed] [Google Scholar]