Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Sep 19;367(1602):2574-83.
doi: 10.1098/rstb.2012.0010.

Protein kinases display minimal interpositional dependence on substrate sequence: potential implications for the evolution of signalling networks

Affiliations

Protein kinases display minimal interpositional dependence on substrate sequence: potential implications for the evolution of signalling networks

Brian A Joughin et al. Philos Trans R Soc Lond B Biol Sci. .

Abstract

Characterization of in vitro substrates of protein kinases by peptide library screening provides a wealth of information on the substrate specificity of kinases for amino acids at particular positions relative to the site of phosphorylation, but provides no information concerning interdependence among positions. High-throughput techniques have recently made it feasible to identify large numbers of in vivo kinase substrates. We used data from experiments on the kinases ATM/ATR and CDK1, and curated CK2 substrates to evaluate the prevalence of interactions between substrate positions within a motif and the utility of these interactions in predicting kinase substrates. Among these data, evidence of interpositional sequence dependencies is strikingly rare, and what dependency exists does little to aid in the prediction of novel kinase substrates. Significant increases in the ability of models to predict kinase-substrate specificity beyond position-independent models must come largely from inclusion of elements of biological and cellular context, rather than further analysis of substrate sequences alone. Our results suggest that, evolutionarily, kinase substrate fitness exists in a smooth energetic landscape. Taken with results from others indicating that phosphopeptide-binding domains do exhibit interpositional dependence, our data suggest that incorporation of new substrate molecules into phospho-signalling networks may be rate-limited by the evolution of suitability for binding by phosphopeptide-binding domains.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A comparison of the ability of first- and second-order models to correctly identify true kinase substrates. A field of true kinase substrates withheld from training was hidden among mock substrates for each kinase. A field of 10% true and 90% mock substrates was scored using first- and second-order models, and the fraction of true substrates in the top 10% of highest-scoring sequences was counted. The procedure was repeated 1000 times. Plotted boxes span the 25th to 75th percentile of values, with the red line in the boxes marking the medians. Whiskers extend 1.5 times the distance between the 25th and 75th percentiles, and any points more distant from the median are explicitly plotted. Red- and blue-dashed lines at the values 10% and 100% represent the fraction of true substrates expected in the top 10% of scores expected if true and mock substrates were scored randomly, and the maximum possible fraction of true substrates in the top 10% of scores, respectively. (a) Mock substrates generated by shuffling true substrates to maintain the probability of each amino acid at each position while breaking interpositional dependencies. (b) Mock substrates chosen by randomly selecting sequences from the human proteome conforming to basic known elements of kinase specificity: ‘pS/pT-P’ for ATM/ATR, ‘pS/pT-P’ for CDK1/Cyclin B and ‘pS/pT-X-X-D/E’ for CK2. Because CK2 phosphorylates a number of true substrates that do not have +3 D/E, the CK2 models were trained and tested both with all substrate sequences and with only +3 D/E sequences included.
Figure 2.
Figure 2.
A comparison of the ability of first- and second-order models to correctly identify true kinase substrates, correcting for occurrence of amino acid pairs not present among training data. True kinase substrates were predicted as in figure 1, but rather than examining the top 10% of scored test sequences, a number of sequences for each random splitting of test and training data was examined equal to the least of: 10% of the tested sequences, or the number of tested sequences given a non-zero score under the first- or second-order model. (a) Mock substrates generated by position-wise shuffling of true substrates. (b) Mock substrates chosen by randomly selecting sequences from the human proteome conforming to basic known elements of kinase specificity.
Figure 3.
Figure 3.
A model of evolutionary fitness landscapes for substrates of kinases and phosphopeptide-binding domains. (a) Data presented in this paper indicate that kinase substrate fitness may be position-wise independent in the substrate amino acid sequence, and therefore favourable regions of the substrate fitness space may be accessed relatively easily by a chain of sequential single random mutations. (b) Data presented elsewhere [–23] indicate that phosphopeptide-binding domains express significant interpositional dependencies, indicating that favourable regions of ligand fitness may be separated by energetic barriers.

Similar articles

Cited by

References

    1. Amanchy R., Periaswamy B., Mathivanan S., Reddy R., Tattikota S. G., Pandey A. 2007. A curated compendium of phosphorylation motifs. Nat. Biotechnol. 25, 285–286 10.1038/nbt0307-285 (doi:10.1038/nbt0307-285) - DOI - DOI - PubMed
    1. Alexander J., et al. 2011. Spatial exclusivity combined with positive and negative selection of phosphorylation motifs is the basis for context-dependent mitotic signaling. Sci. Signal. 4, ra42 10.1126/scisignal.2001796 (doi:10.1126/scisignal.2001796) - DOI - DOI - PMC - PubMed
    1. Obenauer J. C., Cantley L. C., Yaffe M. B. 2003. Scansite 2.0: proteome-wide prediction of cell signaling interactions using short sequence motifs. Nucleic Acids Res. 31, 3635–3741 10.1093/nar/gkg584 (doi:10.1093/nar/gkg584) - DOI - DOI - PMC - PubMed
    1. Blom N., Sicheritz-Ponten T., Gupta R., Gammeltoft S., Brunak S. 2004. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4, 1633–1649 10.1002/pmic.200300771 (doi:10.1002/pmic.200300771) - DOI - DOI - PubMed
    1. Miller M. L., et al. 2008. Linear motif atlas for phosphorylation-dependent signaling. Sci Signal. 1, ra2 10.1126/scisignal.1159433 (doi:10.1126/scisignal.1159433) - DOI - DOI - PMC - PubMed

Publication types