DeepSP: A Deep Learning Framework for Spatial Proteomics
- PMID: 37314414
- DOI: 10.1021/acs.jproteome.2c00394
Abstract
The study of protein subcellular localization (PSL) is a fundamental step toward understanding the mechanisms of protein function. The recent development of mass spectrometry (MS)-based spatial proteomics, which quantifies the distribution of proteins across subcellular fractions, provides a high-throughput approach to predicting unknown PSLs from known PSLs. However, the accuracy of PSL annotations in spatial proteomics is limited by the performance of existing PSL predictors based on traditional machine learning algorithms. In this study, we present a novel deep learning framework named DeepSP for PSL prediction on MS-based spatial proteomics data sets. DeepSP constructs a new difference-matrix feature map by capturing detailed changes between the subcellular fractions of protein occupancy profiles, and uses the convolutional block attention module to improve PSL prediction performance. Compared with current state-of-the-art machine learning predictors, DeepSP achieved significant improvements in accuracy and robustness, both on independent test sets and in the prediction of unknown PSLs. As an efficient and robust framework for PSL prediction, DeepSP is expected to facilitate spatial proteomics studies and contribute to the elucidation of protein functions and the regulation of biological processes.
Keywords: attention mechanism; deep learning; difference matrix; protein subcellular localization; spatial proteomics.
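To make the difference-matrix idea concrete, the following is a minimal sketch only. The abstract does not specify how DeepSP builds its feature map, so the construction here (all pairwise differences between fraction abundances of a single protein) is an assumption, and the function name `difference_matrix` and the example profile are hypothetical.

```python
import numpy as np


def difference_matrix(profile):
    """Pairwise differences between fractions for one protein.

    ASSUMPTION: this is an illustrative construction, not the
    published DeepSP method. `profile` is a 1-D sequence of length F
    holding the protein's abundance in each subcellular fraction.
    Returns an F x F array D with D[i, j] = profile[i] - profile[j].
    """
    p = np.asarray(profile, dtype=float)
    if p.ndim != 1:
        raise ValueError("profile must be 1-D")
    # Broadcasting a column vector against a row vector yields
    # every pairwise difference in one step.
    return p[:, None] - p[None, :]


# Hypothetical occupancy profile across four fractions.
profile = np.array([0.1, 0.4, 0.3, 0.2])
D = difference_matrix(profile)
```

Under this assumption, each protein yields a square, antisymmetric, image-like input, which is the kind of 2-D feature map a convolutional network with an attention module can consume.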
Cited by
- TransGCN: a semi-supervised graph convolution network-based framework to infer protein translocations in spatio-temporal proteomics. Brief Bioinform. 2024 Jan 22;25(2):bbae055. doi: 10.1093/bib/bbae055. PMID: 38426320. Free PMC article.
- A Review for Artificial Intelligence Based Protein Subcellular Localization. Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409. PMID: 38672426. Free PMC article. Review.