A Bioconductor workflow for processing and analysing spatial proteomics data

F1000Res. 2016 Dec 28;5:2926. doi: 10.12688/f1000research.10411.2. eCollection 2016.

Lisa M Breckels¹ ², Claire M Mulvey², Kathryn S Lilley², Laurent Gatto¹ ²

Affiliations

¹ Computational Proteomics Unit, Cambridge Systems Biology Centre, University of Cambridge, Cambridge, UK.
² Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK.

PMID: 30079225
PMCID: PMC6053703
DOI: 10.12688/f1000research.10411.2

Lisa M Breckels et al. F1000Res. 2016.

Abstract

Spatial proteomics is the systematic study of protein sub-cellular localisation. In this workflow, we describe the analysis of a typical quantitative mass spectrometry-based spatial proteomics experiment using the MSnbase and pRoloc Bioconductor package suite. To walk the user through the computational pipeline, we use a recently published experiment predicting protein sub-cellular localisation in pluripotent embryonic mouse stem cells. We describe the software infrastructure at hand, importing and processing data, quality control, sub-cellular marker definition, visualisation and interactive exploration. We then demonstrate the application and interpretation of statistical learning methods, including novelty detection using semi-supervised learning, classification, clustering and transfer learning, and conclude the pipeline with data export. The workflow is aimed at beginners who are familiar with proteomics in general and spatial proteomics in particular.

Keywords: Bioconductor; R Package; machine learning; mass spectrometry; protein sub-cellular localisation; proteomics; spatial proteomics; transfer learning.

Conflict of interest statement

Competing interests: No competing interests were disclosed.

Figures

**Figure 1. Schematic overview of the pRoloc pipeline from data import, through to data processing, machine learning and data export.**

**Figure 2. Simplified representation of the MSnSet data structure (reproduced with permission from the MSnbase vignette).**

**Figure 3. A screenshot of the data in the spreadsheet.**

**Figure 4. Heatmap of missing values.**
Note that the features are re-ordered to highlight clusters of proteins with similar numbers of missing values.

**Figure 5. PCA plot of the mouse stem cell data hl.**
Each dot represents a single protein, and clusters of proteins represent proteins residing in the same sub-cellular niche. The figure on the right bins proteins and represents the bin density to highlight the presence of protein clusters.

**Figure 6. Protein profiles and distribution of channel intensities.**
The red dots represent the mean relative intensity for each channel.

**Figure 7. Annotated PCA plots of the hl dataset, as produced with plot2D.**

**Figure 8. Mitochondrion and peroxisome protein profiles.**

**Figure 9. Using the plot3D function to visualise the hl dataset along PCs 1, 2 and 7.**

**Figure 10. Highlighting protein features of interest.**

**Figure 11. PCA plots of replicates 1 and 2.**

**Figure 12. A screenshot of the clickable interface and zoomable PCA plot of the main app in the pRolocGUI package.**

**Figure 13. The compare application, main panel.**

**Figure 14. Results of the novelty detection algorithm.**

**Figure 15. Assessment of the classification model parameter optimisation.**

**Figure 16. Classification results.**
Colours indicate class membership and point sizes are representative of the classification confidence.

**Figure 17. Visualisation of class-specific classification score distribution.**

**Figure 18. Results of the localisation predictions after thresholding.**

**Figure 19. The classify application enables the interactive exploration of classification score thresholding.**

**Figure 20. Visualisation of the transfer learning parameter optimisation procedure.**
Each row displays the frequency of observed weights (along the columns) for a specific sub-cellular class, with large dots representing higher observation frequencies.

**Figure 21. Hierarchical clustering of the average marker profiles summarising the relation between organelle profiles.**

Cited by

Reduced mitochondria provide an essential function for the cytosolic methionine cycle.
Zítek J, Füssy Z, Treitli SC, Peña-Diaz P, Vaitová Z, Zavadska D, Harant K, Hampl V. Curr Biol. 2022 Dec 5;32(23):5057-5068.e5. doi: 10.1016/j.cub.2022.10.028. Epub 2022 Nov 7. PMID: 36347252
Spatiotemporal proteomic profiling of the pro-inflammatory response to lipopolysaccharide in the THP-1 human leukaemia cell line.
Mulvey CM, Breckels LM, Crook OM, Sanders DJ, Ribeiro ALR, Geladaki A, Christoforou A, Britovšek NK, Hurrell T, Deery MJ, Gatto L, Smith AM, Lilley KS. Nat Commun. 2021 Oct 1;12(1):5773. doi: 10.1038/s41467-021-26000-9. PMID: 34599159
A Bioconductor workflow for the Bayesian analysis of spatial proteomics.
Crook OM, Breckels LM, Lilley KS, Kirk PDW, Gatto L. F1000Res. 2019 Apr 11;8:446. doi: 10.12688/f1000research.18636.1. eCollection 2019. PMID: 31119032
Mapping diversity in African trypanosomes using high resolution spatial proteomics.
Moloney NM, Barylyuk K, Tromer E, Crook OM, Breckels LM, Lilley KS, Waller RF, MacGregor P. Nat Commun. 2023 Jul 21;14(1):4401. doi: 10.1038/s41467-023-40125-z. PMID: 37479728
Comparative Analysis of Quantitative Mass Spectrometric Methods for Subcellular Proteomics.
Tannous A, Boonen M, Zheng H, Zhao C, Germain CJ, Moore DF, Sleat DE, Jadot M, Lobel P. J Proteome Res. 2020 Apr 3;19(4):1718-1730. doi: 10.1021/acs.jproteome.9b00862. Epub 2020 Mar 5. PMID: 32134668
