PON-Sol: prediction of effects of amino acid substitutions on protein solubility
- PMID: 27153720
- DOI: 10.1093/bioinformatics/btw066
Abstract
Motivation: Solubility is one of the fundamental protein properties. It is of great interest because of its relevance to protein expression. Reduced solubility and protein aggregation are also associated with many diseases.
Results: We collected from the literature the largest dataset of experimentally verified solubility-affecting amino acid substitutions (AASs) and used it to train a predictor called PON-Sol. The predictor can distinguish both solubility-decreasing and solubility-increasing variants from those that do not affect solubility. PON-Sol has a normalized correct prediction ratio of 0.491 in cross-validation and 0.432 on an independent test set. Its performance was compared with that of both solubility and aggregation predictors and found to be superior. PON-Sol can be used to predict the effects of disease-related substitutions, effects on heterologous recombinant protein expression and enhanced crystallizability. One application is to investigate the effects of all possible AASs in a protein to aid protein engineering.
Availability and implementation: PON-Sol is freely available at http://structure.bmc.lu.se/PON-Sol. The training and test data are available at http://structure.bmc.lu.se/VariBench/ponsol.php.
Contact: mauno.vihinen@med.lu.se
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
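The Results mention investigating all possible AASs in a protein. As a minimal sketch of what that enumeration involves, the Python snippet below lists every single-residue substitution for a sequence. The function name, the toy sequence and the `M1A`-style variant notation are illustrative assumptions; the abstract does not specify the input format that PON-Sol expects.

```python
from typing import Iterator

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def all_single_substitutions(sequence: str) -> Iterator[str]:
    """Yield every single amino acid substitution of `sequence`.

    Variants are written as <original><1-based position><replacement>,
    e.g. "M1A". This notation is an assumption made for illustration,
    not a format documented in the abstract.
    """
    for position, original in enumerate(sequence.upper(), start=1):
        if original not in AMINO_ACIDS:
            raise ValueError(
                f"Non-standard residue {original!r} at position {position}"
            )
        for replacement in AMINO_ACIDS:
            if replacement != original:
                yield f"{original}{position}{replacement}"


# Hypothetical three-residue sequence: 3 positions x 19 alternatives.
variants = list(all_single_substitutions("MKT"))
print(len(variants))   # 57
print(variants[:3])    # ['M1A', 'M1C', 'M1D']
```

A protein of length N yields 19 × N candidate variants, so a full scan grows linearly with sequence length; each variant would then be passed to a predictor for classification.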