Figures
Abstract
Complex traits, including common disease-related traits, are affected by many different genes that function in multiple pathways and networks. The apoptosis, MAPK, Notch, and Wnt signalling pathways play important roles in development and disease progression. At the moment we have a poor understanding of how allelic variation affects gene expression in these pathways at the level of translation. Here we report the effect of natural genetic variation on transcript and protein abundance involved in developmental signalling pathways in Caenorhabditis elegans. We used selected reaction monitoring to analyse proteins from the abovementioned four pathways in a set of recombinant inbred lines (RILs) generated from the wild-type strains N2 (Bristol) and CB4856 (Hawaii) to enable quantitative trait locus (QTL) mapping. About half of the cases from the 44 genes tested showed a statistically significant change in protein abundance between various strains, most of these were however very weak (below 1.3-fold change). We detected a distant QTL on the left arm of chromosome II that affected protein abundance of the phosphatidylserine receptor protein PSR-1, and two separate QTLs that influenced embryonic and ionizing radiation-induced apoptosis on chromosome IV. Our results demonstrate that natural variation in C. elegans is sufficient to cause significant changes in signalling pathways both at the gene expression (transcript and protein abundance) and phenotypic levels.
Citation: Singh KD, Roschitzki B, Snoek LB, Grossmann J, Zheng X, Elvin M, et al. (2016) Natural Genetic Variation Influences Protein Abundances in C. elegans Developmental Signalling Pathways. PLoS ONE 11(3): e0149418. https://doi.org/10.1371/journal.pone.0149418
Editor: Suzannah Rutherford, Fred Hutchinson Cancer Research Center, UNITED STATES
Received: October 18, 2015; Accepted: January 30, 2016; Published: March 17, 2016
Copyright: © 2016 Singh et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All SRM raw data, transition lists, MS/MS library and mProphet output files are available from the PeptideAtlas (http://www.peptideatlas.org) database (accession number PASS00748). QTL (protein and apoptosis) and processed transcriptomics data files are available from the WormQTL (http://www.wormqtl.org) database (accession numbers 70-75 and 80). All microarray data files are available from the Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo) database (accession number GSE77905).
Funding: This work was supported by the Swiss National Science Foundation (grant No. 31003A_143932; http://www.snf.ch) to MOH and the European Community's Health Seventh Framework Programme under project PANACEA (project No. 222936; http://www.panaceaproject.eu) to JEK. LBS was funded by the Netherlands Organisation for Scientific Research (project No. 823.01.001; http://www.nwo.nl). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Most complex traits, including many common diseases such as cancer, neurodegenerative, and autoimmune diseases are affected by multiple genes. There is overwhelming evidence that individuals with different genotypes have a different susceptibility to various complex diseases. For instance, the genetic background influences onset and progression of cancer in mice and humans [1–3] and also plays an important role in other complex diseases such as renal failure [4], autoimmune diseases [5], and retinal degeneration [6]. Studying natural genetic variation is crucial for understanding the effect of allelic variation of gene expression and for determining the genetic basis of complex traits. However, the genetic mechanisms underlying these effects are often poorly understood.
To unravel how the genetic background affects signalling pathways that contribute to complex diseases, we used the model organism Caenorhabditis elegans [7]. Specifically, we selected genes involved in apoptosis [8], as well as genes involved in three pathways (MAPK [9], Notch [10], and Wnt [11]) that control vulva development, an important phenotypic readout in C. elegans for these pathways [12,13]. All four signalling pathways are evolutionarily conserved and many human homologs of these C. elegans signalling proteins [14] have been linked to various cancers, as well as neurodegenerative, cardiovascular, and other diseases [15–19].
Signalling pathways in C. elegans are usually studied in the canonical wild-type N2 (Bristol [20]) background through the screening for and characterization of induced mutations, which often lead to complete loss of gene function and show distinctive phenotypic defects. By contrast, natural variation usually causes more subtle phenotypic changes, as natural selection rapidly selects against mutations with strong negative impact.
To explore the effect of natural variation, we selected two highly divergent wild-type strains [21,22], N2 and CB4856 (Hawaii [23]), and a set of recombinant inbred lines (RILs) generated from them [24–27]. Previous transcriptome analysis of N2, CB4856, and RILs from this set showed significant heritable variation in gene expression at the transcript level, which could be mapped to several expression QTLs (eQTLs) [24,28–31], but very little is known about heritable variation at the protein level. Here we quantified the abundances of selected proteins from four cancer signalling pathways (apoptosis, MAPK, Notch, and Wnt) using selected reaction monitoring (SRM), a mass spectrometric technique for targeted quantitative proteomics that is characterized by its high specificity, sensitivity, and wide dynamic range [32,33]. We also measured apoptosis levels in both parental strains and RILs. We found that natural variation in C. elegans causes significant changes in signalling pathways both at the gene expression (transcript and protein abundance) and at the phenotypic levels.
Results and Discussion
We selected 156 proteins (S1 Table) that have been implicated in one of four signalling pathways in C. elegans: apoptosis (62 proteins), MAPK (59 proteins), Notch (18 proteins), and Wnt (17 proteins). These 156 proteins span over three orders of magnitude in abundance [34], with approximately half of them being in the range of 1–20 ppm (S1A Fig and S1 Table), and show a similar abundance distribution as the overall C. elegans proteome (S1B Fig).
SRM measurements
In a first step, we performed an experiment to find proteins that are differently abundant in the two parental strains N2 and CB4856. Out of 148 tested proteins (S2 Fig and S2 Table) we successfully measured the abundance of between 104 and 116 proteins in each sample; 71 proteins could be quantified in two biological replicates in both strains (S3 Fig). For most of these, the protein abundance difference was very small between the two strains (below our fold change cut-off, which we set at 1.3-fold, based on the variation in abundance that we measured between biological replicates of the same strain; S4 Fig) and also correlated poorly with the differences in transcript abundance levels (S5 Fig). The extent of conservation in protein abundance in the dataset of the four signalling pathways is very similar to the variation that can be observed at the level of the whole proteome using stable isotope labelling by amino acids in cell culture (SILAC) based shotgun mass spectrometry data from [35] (Fig 1 and S4 Fig). Taken together, these observations suggest that for most of the measured proteins, the small variation in protein abundance observed between CB4856 and N2 is not due to genetic variation between the two strains, but rather comes from measurement errors or from the innate variation that also exists between biological replicates of the same strain (S4 Fig).
Histogram with Tukey-style box plot [36] on the top for protein abundances measured in CB4856 relative to N2. Vertical dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). (A) The C. elegans shotgun proteome dataset was quantified using SILAC (data from [35]). (B) Signalling pathway proteins were quantified using SRM.
Previous work on wild isolates has shown that wild-type strains often contain hidden variation in gene expression due to compensatory mutations within the genome [37,38]. Crossing of two wild types followed by the generation of RILs can lead to significant variation due to shuffling of the parental genotypes and the segregation of compensatory mutations [30,37,38]. To test this concept at the proteome level, we analysed RILs generated from the two wild types. As was previously reported [24,28,29], many RILs show a higher level of variation in transcript abundance than either parental strain (Fig 2A). We selected four RILs (WN31, WN71, WN105, and WN186) that showed large variation in transcript abundance for the 148 selected proteins compared with the parental strains (Fig 2A) and that also showed great genetic diversity between them (Fig 2B and 2C).
Gene expression at the transcript level was quantified in N2, CB4856, and 47 RILs by two-colour microarray analysis. (A) Tukey-style box plot representing log2 scaled deviations of the gene expression value from the mean across all samples. Four genetically different RILs (WN31, WN71, WN105, and WN186) that showed large variation in transcript abundance were selected for proteome quantification. (B) Hierarchical clustering of the 47 RILs and the two parental strains based on their genotype (see “Additional file 1” of [25] for details). Clustering was done with R functions “dist” and “hclust” from the package stats (version 3.2.2) using “euclidean” distance and “ward.D2” method. Four RILs (black arrows) were chosen from different clusters to ensure maximum genetic diversity; red arrows indicate parental strains. (C) Genotype of the four selected RILs and the two parental strains. Vertical lines separate chromosomes I to V and X from left to right.
Between 71 and 85 proteins (out of 148 proteins measured) were quantified by SRM in the four selected RILs; 44 proteins (represented by 114 peptides) were quantified in all four RILs. To gain more statistical confidence, we re-measured these 44 proteins (S2 Table) in the four selected RILs as well as in the two parental strains with three biological replicates (Fig 3A). Generally, the quantified proteins showed a trend to be up-regulated in CB4856 and in the RILs relative to N2 (P ≤ 0.01; Fig 3A), and about a quarter of the protein abundance changes were both statistically significant (P ≤ 0.05) and above the fold change cut-off of 1.3 (~ 0.38 on log2 scale; Fig 3A and S6 Fig). To determine how much of the observed variation in protein levels could be attributed to genetic variation, we calculated the broad-sense heritability [39] for the 44 proteins and found that on average, about half of the measured protein level variation was due to genetic variation (Fig 3B and S1 Table). We tested the extent of variation in abundance of the 44 protein, at both the transcript and protein level (relative to N2) using the Fligner-Killeen test for homogeneity of variances. No significant variation could be detected except between CB4856 and WN105 at the transcript level (Fig 4). Indicating that for this smaller set of 44 genes, the tested RILs do not show greater overall levels of expression, at the transcript and protein levels compared to the parental strains.
Protein abundance was quantified by SRM. Identification of the true peak group was performed using the mProphet software, followed by protein significance analysis using intensity-based linear mixed-effects model implemented in MSstats. (A) Heat map showing differential abundance of 44 proteins in CB4856 and four empirically selected RILs relative to N2. Blue and red shades represent log2 scaled fold changes, grey colour shows the fold change cut-off of 1.3 (~ 0.38 on log2 scale) and number of asterisks represent BH corrected P-values. Black bands on the left side indicate proteins selected for subsequent pQTL mapping. Number of asterisks on top of each column represent BH corrected P-values from one sample t-test (H0: μ = 0 and H1: μ ≠ 0) on protein fold changes relative to N2; *P ≤ 0.05; **P ≤ 0.01; ***P ≤ 0.001; ****P ≤ 0.0001. (B) Tukey-style box plot of broad-sense heritability for the 44 proteins shown in panel A. Scatter points overlaid on the box plot represent the broad-sense heritability values for the individual proteins (solid black circles correspond to selected proteins). (C) Bar graph of selected proteins from panel A. Horizontal dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). Error bars represent SEM between three biological replicates.
Comparison of protein and transcript abundance (log2 scaled fold changes relative to N2) for 44 signalling pathway proteins in CB4856 and four selected RILs. Horizontal and vertical dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). Tukey-style box plot on top and right side represents variability in protein and transcript log2 fold changes respectively. Pearson correlation coefficient is denoted by r. Table on bottom right represents the P-values from the Fligner-Killeen test for homogeneity of variances between protein (column 1) and transcript (column 2) data for RILs compared with CB4856.
Protein QTL mapping
Most of the 156 proteins that we selected for SRM analysis have weak eQTLs at the transcript level (S1 Table) [27,40,41]. To explore the genetic basis of the protein abundance differences between parental strains and RILs, and to determine potential regulatory genes associated with the observed protein abundance differences, we performed protein quantitative trait locus (pQTL) mapping on seven proteins (PSR-1, SIR-2.1, LIN-2, LIP-1, MEK-2, SPR-5, and SUP-17; represented by 14 peptides; Figs 3A, 3C and S6) that showed significant abundance differences and fold changes above 1.3 among the six strains (two parental strains and four selected RILs). This mapping was used to determine whether genetic variation in the gene itself (or nearby location) was responsible for the observed protein abundance changes (local pQTL) or a change in another region away from the gene (distant pQTL) [42]. Mapping was done by linking the variation in abundances of the seven selected proteins (S2 Table) within the parental strains and 48 RILs (the four selected RILs and 44 additional ones; S7 Fig) with the genotypic variation in the same samples. For one of the seven proteins, PSR-1 (phosphatidylserine receptor), we found a significant distant pQTL on the left arm of chromosome II: the presence of the CB4856 allele at this position strongly correlated with a high PSR-1 protein abundance (Fig 5). Interestingly, this distant QTL is not apparent at the transcript level [24,29,43], suggesting that it likely affects a late step in the gene expression process (e.g. translation efficiency or protein stability). Early work in mouse [44] and yeast [45] showed that most of the pQTLs did not have corresponding variation at the transcript level, in contrast to this a much better agreement was found between eQTLs and corresponding pQTLs [46]. Recent studies in yeast using a new experimental design to overcome the sample size limitation of previous studies also suggest that over half of the previously reported eQTLs have corresponding pQTLs [47,48].
Blue curves show the significance of the pQTLs multiplied by the sign of the effect of the N2 allele (positive values of blue curve indicate higher protein abundance when the N2 allele is present, whereas negative values indicate higher protein abundance when the CB4856 allele is present). Horizontal orange and red dashed lines show 0.1 and 0.05 FDR thresholds respectively. Vertical dotted grey lines separate chromosomes I to V and X from left to right. Vertical magenta bands indicate the position of the gene in the genome. PSR-1 shows a significant pQTL on the left arm of chromosome II.
The large number of RIL measurements also allowed us to compare in more detail the relative variations in transcript and protein abundances for the seven selected proteins (Fig 6). Whereas some proteins (PSR-1, LIN-2, and MEK-2) showed a tendency for greater variation at the transcript level, SIR-2.1 showed a greater protein abundance variation while the remaining genes showed similar variation, at both protein and transcript levels.
Comparison of protein and transcript abundance (log2 scaled fold changes relative to N2) in CB4856 (solid orange circle) and RILs for the seven proteins used in pQTL mapping. Horizontal and vertical dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). Tukey-style box plot on top and right side represents variability in protein and transcript log2 fold changes respectively. P-value from Fligner-Killeen test for homogeneity of variances between protein and transcript data is denoted by PFK.
Natural variation in apoptosis levels
The PSR-1 protein has been implicated in the clearance of apoptotic cells in C. elegans [49,50]. To determine whether the changes in PSR-1 protein abundance might affect corpse clearance efficiency, we quantified both developmental [51] and germ line (physiological and DNA-damaged induced) [52] apoptosis levels, by counting apoptotic cell corpse numbers, both in embryos and in the adult germ line (Fig 7A–7C). Cell corpse numbers were consistently lower in CB4856 compared to N2. Although most RILs showed apoptotic cell corpse numbers between the two parental strains, some showed more extreme apoptotic levels than the parental strains (higher than N2, or lower than CB4856).
(A-C) Quantification of apoptotic cell corpse numbers in embryos (A), germ line without ionizing radiation (IR; B), and with 60 Gy IR (C). Error bars represent SEM between numbers of biological replicates indicated at the bottom of each bar. (D-F) Natural variation in apoptosis levels of parental strains and RILs does not correlate with the PSR-1 protein abundance (relative to N2). Scatter plots (CB4856 in orange and N2 in green) of PSR-1 protein abundance and apoptotic levels in embryos (D), germ line without IR (E), and with 60 Gy IR (F). Pearson correlation coefficient is denoted by r. (G-I) Embryonic and IR-induced apoptosis shows a significant QTL on chromosome IV. QTL profiles of apoptotic phenotype in embryos (G), adult germ line without IR (H) and with 60 Gy IR (I). Blue curves show the significance of the QTLs multiplied by the sign of the effect of the N2 allele (positive values of blue curve indicate higher apoptosis level when the N2 allele is present, whereas negative values indicate higher apoptosis level when the CB4856 allele is present). Horizontal orange and red dashed lines show 0.1 and 0.05 FDR thresholds respectively. Vertical dotted grey lines separate chromosomes I to V and X from left to right. Vertical magenta band indicates the position of psr-1 gene in the genome.
Unfortunately, we did not find any correlation between changes in PSR-1 protein abundance and apoptosis levels among the strains tested (Fig 7D–7F). Given that even the complete loss of PSR-1 function only results in a very mild change in cell corpse clearance [49], this result is not unexpected. We conclude that the variation in apoptosis observed between the RILs must arise from other genetic changes present between N2 and CB4856.
QTL mapping of the apoptosis levels revealed the presence of a QTL that affects ionizing radiation (IR)-induced apoptosis on chromosome IV and a QTL on the left arm of chromosome IV that affects embryonic cell death (in both cases, the presence of the N2 allele results in increased cell corpse numbers), whereas no QTLs were detected for physiological germ cell apoptosis (Fig 7G–7I). The QTL for IR-induced germ cell death is very broad, and might conceivably even consist of two (or more) linked QTLs on chromosome IV. By contrast, the embryonic cell corpse QTL is very sharp and maps close to, but can clearly be separated from, the physical position of psr-1. Interestingly, this region includes several genes that have been linked to apoptosis or cell corpse cleaning, including the engulfment gene ced-10 [53] and the E3 ubiquitin ligase eel-1 [54]. There are 8 and 20 SNPs between CB4856 and N2 for these two genes. While none of the ced-10 SNPs change the protein sequence, one of the 20 SNPs in eel-1 induces an amino acid change. Further analysis will be required to determine if any of those SNPs are responsible for the observed QTL in embryonic cell corpse apoptosis.
Conclusion
Here we developed and made available a SRM method for 148 C. elegans signalling pathway proteins represented by 295 peptides.
Our quantitative analysis of protein abundance changes across six strains revealed that statistically significant changes can be found at a fair frequency (in about half of the cases), but that most of these changes are very minor in magnitude (less than 1.3-fold). This is not surprising, as we assume that many of the tested proteins should be under significant evolutionary pressure, which likely precludes large changes in protein abundance. Nevertheless, we also identified a smaller number of genes that showed reproducible protein abundance changes in the two- to fourfold. QTL analysis on a selected set of seven proteins that showed particularly strong changes allowed us to identify a potential distant pQTL on the left arm of chromosome II for PSR-1, a protein that intrinsically acts as receptor for phosphatidylserine exposed by apoptotic cells [49,50]. We also found two separate QTLs on chromosome IV affecting embryonic and IR-induced apoptosis levels, but failed to detect any correlation between PSR-1 abundance and apoptosis levels suggesting the involvement of other genes in these events.
Materials and Methods
Strains
We used C. elegans wild-type strains N2 [20] and CB4856 [23], and a set of 54 RILs (S3 Table) generated from them [24–27]. Unless otherwise stated all experiments were performed on synchronized developmental stage L4 worms, grown as previously described [7] on nematode growth medium (NGM) agar plates seeded with Escherichia coli strain OP50 at 20°C.
Transcriptomics
From one sample of N2, CB4856 and 47 RILs (S3 Table) RNA extraction was done using RNEasy Micro Kit (Qiagen), as previously described (see “RNA isolation” from “Supplementary Materials and Methods” of [31]). Two-colour microarray was used to analyse the extracted RNA using 413 different hybridization probes (representing the 148 proteins of interest), as previously described (see “Microarray sample preparation, scanning and normalization” from “Supplementary Materials and Methods” of [31]) with the exception that we averaged normalised intensities from single channel for all probes of a gene, followed by the calculation of the log2 scale deviation of the gene expression value from the mean across all samples.
Protein Extraction
Protein extraction was done as previously described [55] with minor changes. Briefly worm pellets were homogenized with glass-beads (G1277; Sigma-Aldrich) in lysis buffer (8 M urea, 50 mM Tris-HCl pH 8.3) in a 1:1:2 ratio using a benchtop homogenizer (FastPrep-24; MP Biomedicals) at 4°C, four times for 30 s at the speed of 5 m/s. After homogenization 0.125% SDS (v/v of buffer) was added and the homogenate was incubated at room temperature for 1 h to extract proteins. Proteins were separated from cell debris by centrifugation, and the protein concentration was determined using the Bradford reagent (B6916; Sigma-Aldrich).
Database used for peptide spectrum matching
The C. elegans protein database wormpep212 (downloaded on March 2010, 24362 entries) was used for peptide spectrum matching. A decoy database generated by reversing the sequences was appended to forward database prior to the search, to facilitate the calculation of false discovery rate (FDR) [56], followed by addition of 259 common mass spectrometry contaminants yielding a total of 48983 entries.
Sample Preparation for SRM
Selection and solubilisation of hPTPs.
For each protein of interest, 1–5 tryptic proteotypic peptides (PTPs [57]; without cysteine and methionine; in total 377 PTPs; S2 Fig and S1 Table) were selected based on a C. elegans shotgun dataset [55]. These 377 PTPs were synthesized with isotopically labelled (13C and 15N) C-terminal Arginine and Lysine (SpikeTide L; JPT Peptide Technologies GmbH) as heavy labelled PTPs (hPTPs) for internal controls for SRM method development and SRM experiments. Prior to spike-in, hPTPs were dissolved in 180 μl of 20% ACN and 1% FA solvent. Either 0.4 nmol or 4 nmol of each peptide (depending on whether the abundance of the corresponding protein from PaxDb version 2.1 [34] is less than 100 ppm or greater than 100 ppm) was mixed to generate a 100× hPTPs master-mix. Separate 100× hPTPs master-mixes were prepared for experiments targeting 148, 44, and 7 proteins with including only peptides corresponding to those proteins.
Protein digestion and peptide pre-fractionation.
250 μg of total worm protein from each sample was mixed with a 100 times diluted hPTP master-mix and digested overnight with trypsin (V5111; Promega) in the ratio of 50:1 (protein:trypsin) in digestion buffer (8 M urea, 50 mM Tris-HCl pH 8.3, 0.125% SDS, pH 7–8) at 37°C. Anionic SDS was removed after digestion using strong cation exchange (SCX). Samples were applied on SCX cartridges (Applied Biosystems) in loading buffer (10 mM KH2PO4, 25% ACN, pH < 3.0), and after washing peptides were eluted with 1 ml of elution buffer (10 mM KH2PO4, 25% ACN, 350 mM KCl, pH < 3.0). The cartridge was cleaned between subsequent samples using cleaning buffer (10 mM KH2PO4, 25% ACN, 1 M KCl, pH < 3). Salt was removed by solid phase extraction (SPE). Samples were applied on C18 SPE columns (Finisterre™, Teknokroma) in loading solution (5% ACN, 0.1% TFA), and after washing peptides were eluted with 1 ml of elution solution (60% ACN, 0.1% TFA). To circumvent the problem of inefficient ionization of target peptides in the complex background of the total worm protein extract, we used offline pre-fractionation by reversed phase-high pressure liquid chromatography (RP-HPLC) at pH 11.0 on an Agilent 1100 series HPLC system using a YMC Triart C18 column (150 mm × 4.6 mm ID, particle size 5 μm, pore size 12 nm) at a flow rate of 1 ml/min. Peptide samples were applied in buffer A (20 mM K2HPO4, 5% ACN, pH 11.0), and were eluted with a gradient between buffer A and buffer B (20 mM K2HPO4, 50% ACN, pH 11.0) into 47 fractions (pooled to 10 final fractions). The gradient profile was 2% buffer B between 0–20 min, 2%-50% buffer B between 20–50 min, 50%-98% buffer B between 50–55 min and 98% buffer B between 55–60 min followed by column re-equilibration to 2% buffer B in a total 80 min run. Buffer A and Buffer B were prepared from the stock solution (200 mM K2HPO4, pH 11.0).
For the pQTL experiments, peptide samples (directly after tryptic digestion) were pre-fractionated by hydrophilic interaction liquid chromatography (HILIC) on an Agilent 1100 series HPLC system using a YMC-pack polyamine II column (250 mm × 4 mm ID, particle size 5 μm, pore size 12 nm) at a flow rate of 0.85 ml/min. Peptide samples were applied in buffer A (8 mM KH2PO4, 75% ACN, pH 4.5), and were eluted with a gradient between buffer A and buffer B (100 mM KH2PO4, 5% ACN, pH 4.5) into 14 fractions (pooled to 3 final fractions). The gradient profile was 0% buffer B between 0–7.5 min, 0%-50% buffer B between 7.5–37.5 min, 50%-100% buffer B between 37.5–42.5 min and 100% buffer B between 42.5–47.5 min followed by column re-equilibration to 0% buffer B in a total 67.5 min run. Buffer A and buffer B were prepared from the stock solution (400 mM KH2PO4, pH 4.5).
To identify in which fractions the target peptides were eluted, all fractions were analysed by shotgun proteomics.
SRM method development and analysis
Generation of spectral library.
For each peptide two transitions were selected for doubly and triply charged precursor ions using Skyline [58] with the following transition settings (Precursor charges = 2 and 3, Ion charges = 1, Ion types = y, Product ions from = m/z > precursor, Product ions to = 2, and Always add = N-Terminal to Proline enabled). Transitions with |Q3-Q1| m/z < 40 were removed to reduce interferences. Remaining transitions were used to acquire MS/MS spectra triggered by SRM (SRM-triggered MS/MS [32,59]) on a TSQ Vantage (Thermo Scientific) triple quadrupole mass spectrometer (MS) coupled with a nanoLC-ultra chromatography instrument (Eksigent Technologies). Peptides were separated on a self-packed C-18 (Magic C18 AQ, particle size 3 μm, pore size 20 nm; Michrom) column (15 cm × 75 μm ID) at a flow rate of 200 nl/min. Peptides were loaded in 3% ACN, 0.1% FA and were eluted with a gradient between solvent A (water with 0.1% FA; Biosolves) and solvent B (ACN with 0.1% FA; Biosolves). The gradient profile was 5%-40% buffer B between 0–56 min, 40%-47% buffer B between 56–60 min, 47%-97% buffer B between 60–64 min and 97% buffer B between 64–71 min followed by column re-equilibration to 5% buffer B in a total 80 min run. The MS was operated in SRM mode (Q1FWHM = 0.40, Q3FWHM = 0.70, and dwell time = 20 ms with roughly 240 transitions per run), triggering MS/MS acquisition (Q1FWHM = 0.70, Q3FWHM = 0.70, and dwell time = 50 ms) if it detected counts higher than 300. MS/MS spectra were searched against the C. elegans protein database wormpep212 using the Mascot search engine (version 2.3.02; Matrix Science) [60], with the following search parameters (Type of search = MS/MS ion search, Enzyme = Trypsin with zero missed cleavage, Variable modifications = Arg10 and Lys8, Peptide mass-tolerance = 2 Da, and Fragment mass-tolerance = 0.8 Da). This resulted in the identification of 356 (94.4%) peptides with a 3.3% FDR at peptide level.
The Mascot search results in mascot DAT file format were used to build a MS/MS spectra library for 340 (90.2%) peptides corresponding to 154 proteins using Skyline with a score cut-off setting of 0.95. From this MS/MS spectra library, either doubly or triply charged precursor ions were selected for each peptide and for each selected precursor the five most intense y-ions fulfilling the following criteria (precursor m/z > 400 and |Q3-Q1| m/z > 5) were selected to generate the final transition list 2950 transitions, including both heavy and light isoforms (S2 Fig and S2 Table).
SRM.
After peptide pre-fractionation, ten fractions from RP-HPLC at pH 11.0 or three fractions from HILIC per strain were desalted by using ZipTip C18 (Millipore). Samples were applied at a speed of 1 drop/s in washing solution (5% ACN, 0.1% TFA; sample pH was adjusted below 3, if necessary with 5% ACN, 10% TFA), and after washing peptides were eluted with 80 μl of elution solution (60% ACN, 0.1% TFA) at a speed of 1 drop/s. Peptide samples were completely dried in a centrifugal evaporator and dissolved in 3% ACN, 0.1% FA solvent. SRM measurements were also performed on a TSQ Vantage (Thermo Scientific) coupled with a nanoLC-ultra (Eksigent Technologies). The setup was as described above except that the MS was operated in scheduled SRM (sSRM) mode (Q1FWHM = 0.40, Q3FWHM = 0.70, and cycle time = 2 s with a retention time window of 10 min for each peptide). For each off-line HPLC fraction a separate method containing only transitions for target peptides in that fraction was used. iRT kit (Biognosys) peptides were used as retention time standard [61].
SRM data analysis
We used the mProphet software (version 1.0.4.1) [62] for the identification of true peak groups among each transition group record from SRM data, followed by a protein significance analysis using the intensity-based linear mixed-effects model [63] implemented in R [64] based on the package MSstats (version 0.99.0 for Fig 3A and version 1.99.0 for pQTL experiments) [65].
Briefly, at first raw data from the MS were converted into the mzXML file format [66] using command line tool ReAdW (version 4.0.2, ISB/SPC) with default options. Within mProphet, these mzXML files were mapped to transition lists using the mMap module (with parameter file = param_AQUA_heavy_stringent_ref_synthetic.def and machine type = QQQ) followed by peak picking using mQuest module (with parameter file = param_AQUA_heavy_stringent_ref_synthetic.def), and finally mQuest score optimization was done using the mProphet module (with workflow = SPIKE_IN). For each sample (based on mprophet_stat.xlsx file) an appropriate normalised discriminate score (d_score) cut-off was selected to keep the number of true positive peak groups higher and the number of false positive peak groups lower (for Fig 3A this was done by keeping a q-value of 0.05 and for pQTL analysis by keeping a q-value of 0.01). Finally, identified peak groups with a d_score lower than the selected cut-off and dummy peak groups were filtered out from the final output file of mProphet software (mProphet_peakgroups.xlsx).
For MSstats based analyses an input file (type “?RawData” in R console for details) with an intensity value for each transition of selected peak groups at both light and heavy isotopic level was generated using custom R scripts. To ensure that each peptide was analysed in the same fraction across all samples, we used fraction number concatenated with peptide as “PeptideSequence” in the input file. All fractions measured in one sample were considered to belong to the same “Run” under input file. Data analysis was done as previously described [63], considering interference in transitions and keeping “scope of biological variation” as restricted and “scope of technical variation” as expanded.
For S3 Fig, after mProphet analysis, Microsoft Excel and custom R scripts were used to sum intensity values for all transitions of a protein (including all detected peptides, if measured in all samples) at both light and heavy isotopic level separately for both replicates. These intensity values were appropriately scaled to make the median of resulted deconvoluted ratios (CB4856Light/CB4856Heavy)/(N2Light/N2Heavy) for proteins in both biological replicates equal to 1. The ratios of averages of scaled intensities of CB4856 and N2 from both replicates represent the average protein fold change in CB4856 relative to N2. Whereas log10 transformed intensities were used to perform a two-sample equal variance t-test, resulting P-values were corrected for multiple hypothesis testing using the Benjamini-Hochberg (BH) procedure [67] (type “?p.adjust” in R console for details).
QTL analysis
The QTLs were mapped using a custom R script applying linear regression on a single marker model as previously described [68]. For pQTLs, we used the log2 scale abundance of seven selected proteins in one sample of CB4856 and 48 RILs (S3 Table) relative to N2 as phenotypic data and genotypic variation in same samples using 121 markers [24]. Thresholds were determined per trait by 1000 permutations, where the trait values for the individual samples were randomly distributed before QTL mapping. Comparison to previously published eQTL data was done by extracting the data from WormQTL (http://www.wormqtl.org) [27,40,41]. QTL mapping of apoptosis levels was done as described above where average corpse number in N2, CB4856 and RILs were considered as phenotypic data.
Apoptotic cell corpse counting
For embryonic cell corpse counting, between 6 and 23 embryos (3-fold stage) from N2, CB4856 and 34 RILs (S3 Table; Fig 7A for number of biological replicates) were counted in the head region using differential interference contrast (DIC) microscopy, as previously described [69] but instead of worms, mixed staged embryos were used. For germ line cell corpse counting, between 15 and 25 synchronized young adult hermaphrodites (12 h post L4/adult molts) from N2, CB4856 and 44 RILs (S3 Table; Fig 7B and 7C for number of biological replicates) were either exposed to no ionizing radiation (No IR) or 60 Gy ionizing radiation (60 Gy IR). After 24 h germ line apoptotic cell corpses were counted by using DIC microscopy, as previously described [69].
Heritability
Broad-sense heritability [39] was calculated in R using the following formula H2 = msq.geno/(msg.geno + msq.error), where msg.geno was the mean sum of squares of genotype (between variation) and msq.error was the mean sum of squares of error (within variation).
Statistical analysis
Unless otherwise stated all statistical tests were performed in R using the package stats (version 3.2.2). Specifically, function “t.test” (type “?t.test” in R console for details) was used for one sample t-test and function “fligner.test” (type “?fligner.test” in R console for details) was used for Fligner-Killeen test for homogeneity of variances.
Supporting Information
S1 Fig. Signalling pathway proteins used in this study show a similar abundance distribution as the whole C. elegans proteome.
Relative abundance data of C. elegans proteins were extracted from PaxDb version 2.1 [34]. (A) Data shown for 156 selected signalling pathway proteins. Each square represents a protein from one of four selected pathways, black squares (both solid and open) represent the 44 quantified proteins from Fig 3A; black open squares represent the 7 proteins (mostly under 20 ppm) selected for pQTL mapping. Parenthesis on top indicates number of proteins belonging to each pathway. (B) Histogram of all C. elegans proteins from PaxDb (top) and abundance distribution of the 156 selected signalling proteins (bottom, redrawn from A).
https://doi.org/10.1371/journal.pone.0149418.s001
(TIF)
S2 Fig. Outline of the SRM method development.
See Materials and Methods for details.
https://doi.org/10.1371/journal.pone.0149418.s002
(TIF)
S3 Fig. The N2 and CB4856 parental strains do not show any strong protein abundance variation of the tested signalling pathway proteins.
Protein abundance was quantified by SRM. Identification of the true peak group was performed using the mProphet software, followed by protein significance analysis using Microsoft Excel 2010 and custom R scripts. Horizontal dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). Error bars represent SEM between two biological replicates. BH corrected P-values for all proteins from a two-sample equal variance t-test were above 0.9.
https://doi.org/10.1371/journal.pone.0149418.s003
(TIF)
S4 Fig. SILAC-based shotgun and SRM quantification show similar measurement accuracy.
Scatterplots with Tukey-style box plot representing variation in measurement of protein abundance in CB4856 relative to N2, within two biological replicates using SRM (A) and between the averages of two biological replicates using SRM with three biological replicates using SILAC-based shotgun mass spectrometry data from [35] (B) Horizontal and vertical dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). Pearson correlation coefficient is denoted by r. (C) Tukey-style box plot for protein abundance of 71 proteins using SRM, indicating that variation in protein abundance between CB4856 and N2 is not greater than between two biological replicates of one of the two parental strains. Vertical dashed lines represent the fold change cut-off of 1.3 (~ 0.38 on log2 scale). N2-1 is 1st biological replicate of N2, N2-2 is 2nd biological replicate of N2, CB4856-1 is 1st biological replicate of CB4856, and CB4856-2 is 2nd biological replicate of CB4856.
https://doi.org/10.1371/journal.pone.0149418.s004
(TIF)
S5 Fig. Correlation between difference in protein and transcript abundances between the parental strains.
Scatterplots with Tukey-style box plot show log2 fold change for the tested signalling pathway proteins and transcripts in CB4856 relative to N2. Overall and pathway specific Pearson correlation coefficient is denoted by r.
https://doi.org/10.1371/journal.pone.0149418.s005
(TIF)
S6 Fig. Differential abundance of signalling pathway proteins in CB4856 and RILs relative to N2.
Tukey-style box plot of protein abundance (log2 scaled fold changes relative to N2) redrawn from Fig 3A. Scatter points overlaid on the box plot represent the protein abundance values in CB856 and RILs. Most of the protein changes are either non-significant (P > 0.05) or below the fold change cut-off of 1.3 (~ 0.38 on log2 scale; horizontal dashed lines). Seven proteins (unfilled black boxes) with significant abundance differences and fold changes above 1.3 were selected for pQTL mapping.
https://doi.org/10.1371/journal.pone.0149418.s006
(TIF)
S7 Fig. Differential abundance of proteins selected for pQTL mapping in CB4856 and 48 RILs.
Protein abundance was quantified by SRM. Identification of the true peak group was performed using the mProphet software, followed by protein significance analysis using an intensity-based linear mixed-effects model implemented in MSstats. Number of asterisks represent BH corrected P-values as follows, *P ≤ 0.05; **P ≤ 0.01; ***P ≤ 0.001; ****P ≤ 0.0001. Blue and red shades within heat map represent log2 scaled fold changes in protein abundance relative to N2 (white boxes = no data).
https://doi.org/10.1371/journal.pone.0149418.s007
(TIF)
S1 Table. Proteins and peptides used in this study.
File with separate sheets for the list of 156 proteins (along with relative protein abundance data extracted from PaxDb version 2.1 and eQTL data extracted from WormQTL), broad-sense heritability values for 44 proteins, and the list of 377 PTPs.
https://doi.org/10.1371/journal.pone.0149418.s008
(XLSX)
S2 Table. SRM transitions used in this study.
File with separate sheets for SRM transition lists (targeting 148, 44, and 7 proteins) used in this study and a sheet with normalised retention time (iRT) of peptides.
https://doi.org/10.1371/journal.pone.0149418.s009
(XLSX)
S3 Table. Strains used in this study.
File with all the RILs along with parental strains N2 and CB4856 used in this study indicating (by “Yes”) the experiments performed on them.
https://doi.org/10.1371/journal.pone.0149418.s010
(XLSX)
Acknowledgments
We thank Lukas Reiter from Biognosys for help with the mProphet software and Claudia Fortes from the Functional Genomics Center Zurich for help with offline HPLC.
Author Contributions
Conceived and designed the experiments: KS BR LBS SPS JEK MOH. Performed the experiments: KS LBS XZ ME. Analyzed the data: KS LBS JG. Contributed reagents/materials/analysis tools: ME GBP JEK MOH. Wrote the paper: KS MOH. Helped develop SRM methods and provided SILAC-based shotgun data: PK.
References
- 1. Freeman D, Lesche R, Kertesz N, Wang S, Li G, Gao J, et al. Genetic background controls tumor development in PTEN-deficient mice. Cancer Res. 2006;66: 6492–6496. pmid:16818619
- 2. Kristensen VN, Edvardsen H, Tsalenko A, Nordgard SH, Sorlie T, Sharan R, et al. Genetic variation in putative regulatory loci controlling gene expression in breast cancer. Proc Natl Acad Sci. 2006;103: 7735–7740. pmid:16684880
- 3. Seitz S, Korsching E, Weimer J, Jacobsen A, Arnold N, Meindl A, et al. Genetic background of different cancer cell lines influences the gene set involved in chromosome 8 mediated breast tumor suppression. Genes, Chromosom Cancer. 2006;45: 612–627.
- 4. Salido EC, Li XM, Lu Y, Wang X, Santana A, Roy-Chowdhury N, et al. Alanine-glyoxylate aminotransferase-deficient mice, a model for primary hyperoxaluria that responds to adenoviral gene transfer. Proc Natl Acad Sci. 2006;103: 18249–18254. pmid:17110443
- 5. Tsuchiya N, Honda Z, Tokunaga K. Role of B cell inhibitory receptor polymorphisms in systemic lupus erythematosus: a negative times a negative makes a positive. J Hum Genet. 2006;51: 741–750. pmid:16946996
- 6. Thompson DA, Janecke AR, Lange J, Feathers KL, Hübner CA, McHenry CL, et al. Retinal degeneration associated with RDH12 mutations results from decreased 11-cis retinal synthesis due to disruption of the visual cycle. Hum Mol Genet. 2005;14: 3865–3875. pmid:16269441
- 7. Brenner S. The genetics of Caenorhabditis elegans. Genetics. 1974;77: 71–94. pmid:4366476
- 8.
Conradt B. Programmed cell death. WormBook. 2005; 1–13. https://doi.org/10.1895/wormbook.1.32.1
- 9.
Sundaram M. RTK/Ras/MAPK signaling. WormBook. 2006; 1–19. https://doi.org/10.1895/wormbook.1.80.1
- 10.
Greenwald I. LIN-12/Notch signaling in C. elegans. WormBook. 2005; 1–16. https://doi.org/10.1895/wormbook.1.10.1
- 11.
Eisenmann DM. Wnt signaling. WormBook. 2005; 1–17. https://doi.org/10.1895/wormbook.1.7.1
- 12.
Sternberg PW. Vulval development. WormBook. 2005; 1–28. https://doi.org/10.1895/wormbook.1.6.1
- 13. Schmid T, Hajnal A. Signal transduction during C. elegans vulval development: a NeverEnding story. Curr Opin Genet Dev. 2015;32: 1–9. pmid:25677930
- 14. Shaye DD, Greenwald I. OrthoList: A Compendium of C. elegans Genes with Human Orthologs. PLoS One. 2011;6: e20085. pmid:21647448
- 15. Saito RM, van den Heuvel S. Malignant Worms: What Cancer Research Can Learn from C. elegans. Cancer Invest. 2002;20: 264–275. pmid:11901546
- 16. Favaloro B, Allocati N, Graziano V, Di Ilio C, De Laurenzi V. Role of apoptosis in disease. Aging (Albany NY). 2012;4: 330–349.
- 17. Kim EK, Choi E-J. Pathological roles of MAPK signaling pathways in human diseases. Biochim Biophys Acta—Mol Basis Dis. 2010;1802: 396–405.
- 18. Penton AL, Leonard LD, Spinner NB. Notch signaling in human development and disease. Semin Cell Dev Biol. 2012;23: 450–457. pmid:22306179
- 19. Clevers H, Nusse R. Wnt/β-Catenin Signaling and Disease. Cell. 2012;149: 1192–1205. pmid:22682243
- 20. Nicholas WL, Dougherty EC, Hansen EL. AXENIC CULTIVATION OF CAENORHARDITIS BRIGGSAE (NEMATODA: RHABDITIDAE) WITH CHEMICALLY UNDEFINED SUPPLEMENTS; COMPARATIVE STUDIES WITH RELATED NEMATODES*. Ann N Y Acad Sci. 1959;77: 218–236.
- 21. Thompson OA, Snoek LB, Nijveen H, Sterken MG, Volkers RJM, Brenchley R, et al. Remarkably Divergent Regions Punctuate the Genome Assembly of the Caenorhabditis elegans Hawaiian Strain CB4856. Genetics. 2015;200: 975–989. pmid:25995208
- 22. Andersen EC, Gerke JP, Shapiro JA, Crissman JR, Ghosh R, Bloom JS, et al. Chromosome-scale selective sweeps shape Caenorhabditis elegans genomic diversity. Nat Genet. 2012;44: 285–290. pmid:22286215
- 23. Hodgkin J, Doniach T. Natural variation and copulatory plug formation in Caenorhabditis elegans. Genetics. 1997;146: 149–64. pmid:9136008
- 24. Li Y, Álvarez OA, Gutteling EW, Tijsterman M, Fu J, Riksen JAG, et al. Mapping Determinants of Gene Expression Plasticity by Genetical Genomics in C. elegans. PLoS Genet. 2006;2: e222. pmid:17196041
- 25. Elvin M, Snoek LB, Frejno M, Klemstein U, Kammenga JE, Poulin GB. A fitness assay for comparing RNAi effects across multiple C. elegans genotypes. BMC Genomics. 2011;12: 510. pmid:22004469
- 26. Rodriguez M, Snoek LB, Riksen JAG, Bevers RP, Kammenga JE. Genetic variation for stress-response hormesis in C. elegans lifespan. Exp Gerontol. 2012;47: 581–587. pmid:22613270
- 27. Snoek LB, Van der Velde KJ, Arends D, Li Y, Beyer A, Elvin M, et al. WormQTL—public archive and analysis web portal for natural variation data in Caenorhabditis spp. Nucleic Acids Res. 2013;41: D738–D743. pmid:23180786
- 28. Li Y, Breitling R, Snoek LB, van der Velde KJ, Swertz MA, Riksen J, et al. Global Genetic Robustness of the Alternative Splicing Machinery in Caenorhabditis elegans. Genetics. 2010;186: 405–410. pmid:20610403
- 29. Viñuela A, Snoek LB, Riksen JAG, Kammenga JE. Genome-wide gene expression regulation as a function of genotype and age in C. elegans. Genome Res. 2010;20: 929–937. pmid:20488933
- 30. Viñuela A, Snoek LB, Riksen JAG, Kammenga JE. Aging Uncouples Heritability and Expression-QTL in Caenorhabditis elegans. G3. 2012;2: 597–605. pmid:22670229
- 31. Snoek LB, Sterken MG, Volkers RJM, Klatter M, Bosman KJ, Bevers RPJ, et al. A rapid and massive gene expression shift marking adolescent transition in C. elegans. Sci Rep. 2014;4: 3912. pmid:24468752
- 32. Lange V, Picotti P, Domon B, Aebersold R. Selected reaction monitoring for quantitative proteomics: a tutorial. Mol Syst Biol. 2008;4: 222. pmid:18854821
- 33. Gallien S, Duriez E, Domon B. Selected reaction monitoring applied to proteomics. J Mass Spectrom. 2011;46: 298–312. pmid:21394846
- 34. Wang M, Weiss M, Simonovic M, Haertinger G, Schrimpf SP, Hengartner MO, et al. PaxDb, a Database of Protein Abundance Averages Across All Three Domains of Life. Mol Cell Proteomics. 2012;11: 492–500. pmid:22535208
- 35. Kamkina P, Snoek LB, Grossmann J, Volkers RJM, Sterken MG, Daube M, et al. Natural genetic variation differentially affects the proteome and transcriptome in C. elegans. Mol Cell Proteomics. 2015; In press.
- 36. Krzywinski M, Altman N. Points of Significance: Visualizing samples with box plots. Nat Methods. 2014;11: 119–120. pmid:24645192
- 37. Rieseberg LH, Archer MA, Wayne RK. Transgressive segregation, adaptation and speciation. Heredity (Edinb). 1999;83: 363–372.
- 38. Rieseberg LH, Widmer A, Arntz AM, Burke B. The genetic architecture necessary for transgressive segregation is common in both natural and domesticated populations. Philos Trans R Soc B Biol Sci. 2003;358: 1141–1147.
- 39. Visscher PM, Hill WG, Wray NR. Heritability in the genomics era—concepts and misconceptions. Nat Rev Genet. 2008;9: 255–266. pmid:18319743
- 40. Snoek LB, Joeri van der Velde K, Li Y, Jansen RC, Swertz MA, Kammenga JE. Worm variation made accessible: Take your shopping cart to store, link, and investigate! Worm. 2014;3: e28357. pmid:24843834
- 41. van der Velde KJ, de Haan M, Zych K, Arends D, Snoek LB, Kammenga JE, et al. WormQTLHD—a web database for linking human disease to natural variation data in C. elegans. Nucleic Acids Res. 2014;42: D794–D801. pmid:24217915
- 42. Albert FW, Kruglyak L. The role of regulatory variation in complex traits and disease. Nat Rev Genet. Nature Publishing Group; 2015;16: 197–212.
- 43. Rockman MV, Skrovanek SS, Kruglyak L. Selection at linked sites shapes heritable phenotypic variation in C. elegans. Science. 2010;330: 372–376. pmid:20947766
- 44. Ghazalpour A, Bennett B, Petyuk VA, Orozco L, Hagopian R, Mungrue IN, et al. Comparative Analysis of Proteome and Transcriptome Variation in Mouse. Snyder M, editor. PLoS Genet. 2011;7: e1001393.
- 45. Foss EJ, Radulovic D, Shaffer SA, Goodlett DR, Kruglyak L, Bedalov A. Genetic Variation Shapes Protein Networks Mainly through Non-transcriptional Mechanisms. Eisen MB, editor. PLoS Biol. 2011;9: e1001144. pmid:21909241
- 46. Skelly DA, Merrihew GE, Riffle M, Connelly CF, Kerr EO, Johansson M, et al. Integrative phenomics reveals insight into the structure of phenotypic diversity in budding yeast. Genome Res. 2013;23: 1496–1504. pmid:23720455
- 47. Albert FW, Treusch S, Shockley AH, Bloom JS, Kruglyak L. Genetics of single-cell protein abundance variation in large yeast populations. Nature. Nature Publishing Group; 2014;506: 494–497.
- 48. Parts L, Liu Y-C, Tekkedil MM, Steinmetz LM, Caudy AA, Fraser AG, et al. Heritability and genetic basis of protein level variation in an outbred population. Genome Res. 2014;24: 1363–1370. pmid:24823668
- 49. Wang X, Wu Y-C, Fadok VA, Lee M-C, Gengyo-Ando K, Cheng L-C, et al. Cell corpse engulfment mediated by C. elegans phosphatidylserine receptor through CED-5 and CED-12. Science. 2003;302: 1563–1566. pmid:14645848
- 50. Yang H, Chen Y-Z, Zhang Y, Wang X, Zhao X, Godfroy JI, et al. A lysine-rich motif in the phosphatidylserine receptor PSR-1 mediates recognition and removal of apoptotic cells. Nat Commun. 2015;6: 5717. pmid:25564762
- 51. Lettre G, Hengartner MO. Developmental apoptosis in C. elegans: a complex CEDnario. Nat Rev Mol Cell Biol. 2006;7: 97–108. pmid:16493416
- 52. Gumienny TL, Lambie E, Hartwieg E, Horvitz HR, Hengartner MO. Genetic control of programmed cell death in the Caenorhabditis elegans hermaphrodite germline. Development. 1999;126: 1011–22. pmid:9927601
- 53. Reddien PW, Horvitz HR. CED-2/CrkII and CED-10/Rac control phagocytosis and cell migration in Caenorhabditis elegans. Nat Cell Biol. 2000;2: 131–6. pmid:10707082
- 54. Page BD, Diede SJ, Tenlen JR, Ferguson EL. EEL-1, a Hect E3 ubiquitin ligase, controls asymmetry and persistence of the SKN-1 transcription factor in the early C. elegans embryo. Development. 2007;134: 2303–2314. pmid:17537795
- 55. Schrimpf SP, Weiss M, Reiter L, Ahrens CH, Jovanovic M, Malmström J, et al. Comparative Functional Analysis of the Caenorhabditis elegans and Drosophila melanogaster Proteomes. Weissman JS, editor. PLoS Biol. 2009;7: e48. pmid:19260763
- 56. Käll L, Storey JD, MacCoss MJ, Noble WS. Assigning Significance to Peptides Identified by Tandem Mass Spectrometry Using Decoy Databases. J Proteome Res. 2008;7: 29–34. pmid:18067246
- 57. Kuster B, Schirle M, Mallick P, Aebersold R. Innovation: Scoring proteomes with proteotypic peptide probes. Nat Rev Mol Cell Biol. 2005;6: 577–583. pmid:15957003
- 58. MacLean B, Tomazela DM, Shulman N, Chambers M, Finney GL, Frewen B, et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics. 2010;26: 966–968. pmid:20147306
- 59. Picotti P, Rinner O, Stallmach R, Dautel F, Farrah T, Domon B, et al. High-throughput generation of selected reaction-monitoring assays for proteins and proteomes. Nat Methods. 2010;7: 43–46. pmid:19966807
- 60. Perkins DN, Pappin DJC, Creasy DM, Cottrell JS. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999;20: 3551–3567. pmid:10612281
- 61. Escher C, Reiter L, MacLean B, Ossola R, Herzog F, Chilton J, et al. Using iRT, a normalized retention time for more targeted measurement of peptides. Proteomics. 2012;12: 1111–1121. pmid:22577012
- 62. Reiter L, Rinner O, Picotti P, Hüttenhain R, Beck M, Brusniak M-Y, et al. mProphet: automated data processing and statistical validation for large-scale SRM experiments. Nat Methods. 2011;8: 430–435. pmid:21423193
- 63. Chang C-Y, Picotti P, Hüttenhain R, Heinzelmann-Schwarz V, Jovanovic M, Aebersold R, et al. Protein Significance Analysis in Selected Reaction Monitoring (SRM) Measurements. Mol Cell Proteomics. 2012;11: M111.014662. pmid:22190732
- 64.
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria; Available: http://www.r-project.org/
- 65.
Choi M, Chang C-Y, Vitek O. MSstats: Protein Significance Analysis in DDA, SRM and DIA for Label-free or Label-based Proteomics Experiments. 2014.
- 66. Pedrioli PGA, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, et al. A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol. 2004;22: 1459–1466. pmid:15529173
- 67. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J R Stat Soc Ser B. 1995;57: 289–300.
- 68. Snoek LB, Orbidans HE, Stastna JJ, Aartse A, Rodriguez M, Riksen JAG, et al. Widespread Genomic Incompatibilities in Caenorhabditis elegans. G3. 2014;4: 1813–1823. pmid:25128438
- 69. Sendoel A, Maida S, Zheng X, Teo Y, Stergiou L, Rossi C-A, et al. DEPDC1/LET-99 participates in an evolutionarily conserved pathway for anti-tubulin drug-induced apoptosis. Nat Cell Biol. 2014;16: 812–820. pmid:25064737