Skip to main content
Advertisement
  • Loading metrics

Degradation graphs reveal hidden proteolytic activity in peptidomes

  • Erik Hartman ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    erik.hartman@med.lu.se

    Affiliation Division of Infection Medicine, Faculty of Medicine, Lund University, Lund, Sweden

  • Johan Malmström,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Division of Infection Medicine, Faculty of Medicine, Lund University, Lund, Sweden

  • Jonas Wallin

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Department of Statistics, Lund University, Lund, Sweden

Abstract

Protein degradation is a regulated process that reshapes the proteome and generates bioactive peptides. Peptidomics and degradomics enables large-scale measurement of these peptides, yet most data analyses approaches treat peptides as isolated endpoints rather than intermediates produced by sequential cleavage. Here, we introduce degradation graphs, a probabilistic framework that represents proteolysis as a directed acyclic network of cleavage events with explicit absorption. From single-snapshot peptidomes, we infer graph weights by gradient descent or linear-flow optimization, quantify flows through branches and bottlenecks, and correct a core bias in conventional quantification. Across three biological datasets, failure to model downstream trimming leads to 3–4-fold underestimation of upstream proteolytic activity. Moreover, degradation graphs provide graph-structured features that enable machine learning models to capture protease-specific signatures from both graph topology and sequence context. Taken together, these findings establish explicit degradation modeling as a practical approach to mechanistic and interpretable peptidomics, bridging the fields of degradomics and peptidomics.

Author summary

Proteins are continuously broken down into smaller fragments called peptides, a process known as proteolysis. This controlled degradation shapes the proteome, regulates signaling, and generates bioactive molecules involved in immunity, inflammation, and disease. Modern mass spectrometry techniques can measure thousands of such peptides, yet most analytical methods treat these peptides as isolated snapshots rather than as part of an ongoing proteolytic process.

In this study, we introduce a framework for modelling protein degradation as a networked process. We use degradation graphs to represent how proteins break down step by step, where each peptide is connected to all fragments it can generate. From experimental peptidomic data, these graphs can be reconstructed to quantify how proteolytic activity flows through the network, allowing more accurate estimation of enzyme activity and identification of bottlenecks in the degradation process.

Using both experimental and clinical datasets, we show that traditional analyses underestimate total proteolytic activity by up to fourfold. By treating the peptidome as a dynamic system rather than a static collection of fragments, degradation graphs bridge the gap between peptidomics and degradomics, offering a mechanistic and interpretable view of how proteins are proteolytically processed in health and disease.

1 Introduction

Protein degradation is central to cellular life. Beyond recycling proteins to maintain homeostasis, proteolysis regulates protein localization, abundance, and activity, and acts as an irreversible post-translational modification [30,37] (Fig 1a). Proteolysis also generates peptides with functional roles, including hormones [7], neurotransmitters [41,48], antimicrobial peptides [15,36,50,56], and modulators of inflammation [6,17,50]. These bioactive fragments show that degradation is not a terminal process but a dynamic one that remodels the proteome while creating new biological activities.

thumbnail
Fig 1. Overview of proteolysis and peptide-based degradomic analysis.

a Proteins are hydrolyzed into peptides by proteases with distinct cleavage modes. Endoproteases cleave internal peptide bonds, whereas exoproteases trim residues sequentially from peptide termini. Both contribute to continuous peptide turnover. b During host–pathogen interactions, proteolysis shapes immune responses on both sides. Host proteases can release antimicrobial and immunomodulatory peptides, while bacterial proteases degrade host defense and immune proteins, generating immune-inactive fragments. The balance of these processes determines the outcome of infection. c Peptidomics and degradomics use mass spectrometry to characterize proteolys......is from complementary angles. Peptidomics quantifies endogenous peptides directly, whereas degradomics, using methods such as TAILS, blocks native termini to enrich for neo-termini generated by protease activity. Peptides are analyzed by LC–MS/MS in DDA or DIA mode, searched against protein databases, and visualized as peptide ladders. d Growth of peptidomics and degradomics research over time, shown as PubMed term counts by year, illustrating the expansion of both fields, especially peptidomics. This figure was made with BioRender.

https://doi.org/10.1371/journal.pcbi.1013972.g001

The importance of proteolysis is especially evident in host–pathogen interactions, where both host and pathogen exploit proteases to gain an advantage [13]. Host proteases release antimicrobial peptides, modulate inflammation, and degrade virulence factors [2,8,35,45], while pathogens secrete proteases that disable antibodies, cytokines, and complement proteins [8,9,12,19,20,23,24,35,40,42,43,46,47,49,50,53] (Fig 1b). These examples illustrate that proteolysis is tightly regulated, context dependent, and mechanistically complex.

Two fields attempt to disentangle this complexity. Degradomics focuses on protease activities, cleavage sites, and substrate profiles, with applications to cancer [26,32,34,52], neurodegeneration [1,27], and inflammation [4,5,28,31,32,37]. Peptidomics emphasizes large-scale identification and quantification of endogenous peptides for biomarker discovery and activity profiling [10,16,18,25,29,38]. Both rely on mass spectrometry, but degradomics enriches for neo-termini [21,44,54], whereas peptidomics quantifies peptides globally [10] (Fig 1c, 1d). Despite methodological differences, both aim to understand how proteolysis shapes the peptidome.

Current data analysis strategies in degradomics and peptidomics treat peptides as static endpoints, although peptides are intermediates that undergo further proteolysis. Only a few studies have attempted to model this process. Yi et al. showed that exoprotease activity can be represented as a sequential multistep reaction using isotopically labelled fibrinogen peptide A and time-resolved quantification of its degradation products [55]. Kluge et al. introduced the first formal framework for modelling exoprotease activity as a Markov process on a graph in which edges represent cleavage events and parameters correspond to amino acid specific degradation rates [22]. Aiche et al. extended this framework to endoproteolytic activity and estimated kinetic parameters from time series data [3]. These studies provide a conceptual basis and proof-of-concept for modelling protein degradation but remain restricted to exoproteolysis or depend on temporally resolved measurements that are rarely available. They also precede modern informatics workflows, which limits their applicability to large datasets. As a result, upstream protease activities are systematically underestimated and degradomics and peptidomics remain analytically disconnected.

Here, we introduce an alternative framing of degradation graphs: a representation of proteolysis as a directed acyclic graph in which nodes are peptides, edges represent cleavage events, and each node is assigned an explicit absorption probability. This framework treats peptides as dynamic intermediates and captures the sequential nature of degradation without the need for time-series data, making it a natural extension to prior works. By first formalizing degradation graphs probabilistically, we show how degradation graphs generate marginal peptide distributions, and establish an intuitive flow-based interpretation. Second, we present two methods to reconstruct degradation graphs from single-snapshot peptidomes: (i) gradient-based optimization with closed-form gradients, and (ii) a linear-flow formulation solved with linear programming. Third, we demonstrate applications of degradation graphs using simulated proteolysis and experimental data, showing that degradation graphs unify peptidomic and degradomic goals by connecting observed peptides to the mechanistic processes that generate them.

2 Results

We introduce a framework that connects peptidomic observations to the degradomic processes that generate them. Rather than treating peptides as isolated endpoints, degradation graphs model their sequential connectivity through directed cleavage relationships. This representation captures both degradation routes and resulting peptide abundances, enabling mechanistic quantification, peptide clustering, and graph-based inference. In contrast to previous approaches, this method does not require time-resolved measurements and can model protein degradation by applying modern optimization techniques to large peptidomes generated with contemporary mass spectrometry workflows. Below, we formalize degradation graphs, show how they can be inferred from single-snapshot peptidomes, and demonstrate their applications to in vitro and in vivo data.

2.1 Formal definition of the degradation graph

A degradation graph is defined as a directed acyclic graph , where each node represents a peptide (including the full-length protein), and each directed edge, , indicates that peptide v can degrade into peptide u (Fig 2a). This graph encodes the network of cleavages that progressively fragment the intact protein into smaller peptides. To quantify this process, each edge is assigned a transition probability specifying the likelihood that v degrades into u. The difference between one and the sum of outgoing probabilities represents the chance that (v) remains stable, an absorption event in the Markov sense, captured by a self-loop, (), with a probability of

where denotes the set of child nodes. This probabilistic formulation extends earlier models of exoprotease activity [22] to general peptide degradation.

thumbnail
Fig 2. Definition and probabilistic formulation of the degradation graph.

a A biological sample is analyzed with the mass spectrometer followed by peptide identification and quantification, resulting in a peptide distribution. b A degradation graph represents proteolysis as a directed acyclic graph in which each node corresponds to a peptide (including the intact protein, Ω), and each directed edge denotes a proteolytic event where peptide v is cleaved into a shorter peptide u. The graph thereby encodes sequential cleavage relationships that describe how a protein is progressively degraded into smaller fragments. b Each edge is associated with a transition probability describing the likelihood that v degrades into u. The probability that v remains intact, its absorption probability, is modeled as , where is the set of child nodes. This defines a Markov chain in which transitions correspond to cleavage events and self-loops to peptide stability. c The overall peptide distribution, , can be obtained by propagating probability mass from the protein node Ω through the graph in a forward pass. At each node, a fraction of the mass is absorbed, and the remainder distributed according to the outgoing transition probabilities, yielding a marginal distribution that reflects the steady-state abundances of peptides generated by the degradation process.

https://doi.org/10.1371/journal.pcbi.1013972.g002

An intuitive way to interpret these probabilities is through the notion of flow. Beginning with probability mass of one at a protein node Ω, mass flows through the graph according to the assigned edge probabilities (Fig 2b). As each node v is reached, a proportion of the mass arriving at v is absorbed in v, and the remainder is distributed among its children in proportion to . Because the graph contains no cycles except for self-loops, all mass is eventually absorbed in the nodes, generating a peptide distribution P (Fig 2c). The total absorbed mass for each node can be interpreted as its overall probability of being the endpoint of a degradation pathway starting at Ω.

2.2 Identifying the degradation graph

Without high resolution time-series data, proteolysis cannot be observed directly and the degradation graph must be inferred from the measured peptide distribution (Fig 4a). Previous work reconstructed proteolysis kinetics from time-resolved data [3], but such data are rarely available from samples from in vivo experiments and from patients which constitute a large majority of peptidomics experiments. We therefore treat the graph as a latent structure inferred from a static peptidome snapshot, following the principle that the optimal graph should reproduce the empirical peptide distribution Y. Here, Y denotes the empirical peptide abundance distribution, with component corresponding to the observed abundance of peptide u.

We frame inference as an optimization problem. The model predicts a marginal peptide distribution P determined by the edge transition probabilities . These probabilities are adjusted to minimize a loss function measuring the discrepancy between modeled and observed abundances (Fig 2b). A softmax mapping enforces valid transition probabilities that sum to at most one per node, with the residual interpreted as absorption. Gradients are computed through P, and iterative updates refine the graph until its predicted distribution aligns with the data (Algorithm 1, Fig 3b–3c).

Algorithm 1. Gradient-based inference of transition probabilities .

thumbnail
Fig 3. Inferring degradation graphs from observed peptidomes.

a The degradation graph is a latent structure that describes the proteolytic relationships giving rise to the observed peptidome. Observed peptide abundances are used to infer the most plausible graph structure and transition probabilities. b Inference is formulated as an optimization problem in which the modeled marginal peptide distribution is fitted to the measured distribution. Edge transition probabilities are iteratively updated by gradient descent to minimize the loss , or equivalently inferred as a linear-flow system solved by linear programming under mass-conservation constraints. c Example of graph optimization by gradient descent. The mean-squared-error loss (blue) decreases over iterations as the inferred graph converges toward the true degradation graph, measured by the total deviation of edge weights (orange). d The coefficient of variation of edge weights when applying gradient descent to graphs of increasing sizes. The shaded region shows ± 1 standard deviation. e Experimental validation using in vitro trypsin digestion of human pharyngeal epithelial cells (Detroit 562; dataset PXD037803 [14]). Peptides were identified and quantified by LC–MS/MS and analyzed to reconstruct the underlying degradation graph. f Peptides mapped onto the β-actin backbone and colored by their number of descendants in the inferred graph, illustrating hierarchical fragmentation patterns. g Relationship between peptide abundance and total inflow (inflow) for each peptide. The solid line indicates where the modeled inflow and measured abundance are equivalent, demonstrating quantitative agreement between the inferred degradation flow and experimental intensities.

https://doi.org/10.1371/journal.pcbi.1013972.g003

An equivalent and more intuitive formulation expresses the same problem as a flow system, in which each edge carries a non-negative flow representing the probability mass transferred from u to v. Mass conservation requires that the outgoing flow and absorbed mass at each node equal its total incoming mass. When these conditions are written as linear constraints, the result is a standard optimization problem that can be solved with linear programming. The solution, provides flows whose ratios directly yield transition probabilities (Algorithm 2, Fig 4b).

Algorithm 2. Linear-flow reconstruction of degradation graphs.

Input: Degradation graph ; root (protein) Ω; observed peptide distribution Y

Output: Feasible edge flows and transition probabilities

1 Variables: Define a non-negative flow variable for each .

2 Root injection:

3 Mass conservation: For every node ,

4 Solve LP: Find a feasible satisfying the constraints above (or minimize a small regularizer, e.g. ).

5 Convert to transition probabilities: For each node u, define its total incoming mass , and set

The identification of a degradation graph is an underdetermined problem because different graph structures can produce the same observed peptide distribution (see Discussion and Methods: Formalization of key limitations and assumptions). To assess how much the inferred graphs vary, we ran gradient descent from different random initializations (five replicates) across graphs of increasing size. We measured variation in edge weights using their coefficient of variation. Although the mean variation increased with graph size, it remained below 10 percent even for the largest graphs (Fig 3d).

To illustrate the inference procedure on biological datasets, we reconstructed the degradation graph of trypsinized β-actin from human cell lysates (Fig 3e–3g). Peptide intensities were averaged across samples, and edge weights were optimized by gradient descent. The inferred graph recapitulated the expected degradation hierarchy, capturing sequential fragmentation from the intact protein toward shorter peptides. This demonstrates that degradation graphs can be recovered from single-snapshot peptidomes, providing a compact representation of degradation connectivity.

Both optimization schemes are implemented in modern frameworks like PyTorch [39] and PuLP [33] are available in our GitHub repository. The advantage of the LP formulation is fast and guarantees exact satisfaction of the flow constraints. While gradient descent is slower, it is more flexible because the objective is written as a differentiable loss, which allows future extensions such as regularization terms, priors, or custom penalties that guide the solution toward preferred graph structures. This flexibility makes gradient descent well suited for exploring alternative modeling assumptions that cannot be expressed cleanly using linear constraints. In practice, the runtime remains short even for large peptidomes, so the additional compute required for gradient descent is minimal. For this reason, we view gradient descent as the default choice (Table 1).

thumbnail
Table 1. Runtime of gradient descent and linear programming solvers across increasing graph sizes.

https://doi.org/10.1371/journal.pcbi.1013972.t001

2.3 Why degradation graphs matter

Traditional peptidomic quantification assumes that each peptide reflects a single proteolytic event, and the proteolytic activity is therefore simply calculated as the sum of peptide abundances: . In reality, peptides can themselves act as substrates for further degradation, as modelled by Yi et al. [55], meaning that their observed abundance underestimates the upstream activity that generated them. In conventional workflows, only the direct link between the protein, Ω, and its peptide, v, is considered, while degradation of v into u is ignored (Fig 4a). Consequently, the apparent cleavage weight w1 derived from peptide abundance alone is smaller than the true value w2 obtained when downstream trimming is modeled. The total protelytic events in a degradation graph can therefore be computed as the sum of flows, and the ratio by which classical modelling underestimates the proteolytic activity, Δ, can be computed as:

thumbnail
Fig 4. Why degradation graphs improve protease quantification and enable predictive modeling.

a Conventional peptidomic analyses assume that each peptide arises directly from the parent protein, neglecting that peptides can themselves undergo further degradation. This omission leads to systematic underestimation of upstream proteolytic activity. In degradation graphs, downstream cleavages (e.g., ) are explicitly modeled, revealing the true effective transition weight () and correcting activity quantification. b By representing peptide abundances as probabilistic flow through the graph, degradation graphs enable flow quantification, where the total upstream activity can be mapped to specific proteases using databases such as MEROPS or TopFIND. c Branch quantification summarizes local subgraphs, sets of related peptides sharing a degradation ancestry, into stable and interpretable biological units. Bottlenecks and branch points identify sites of proteolytic control or preferential cleavage. d The explicit graph topology also permits machine learning applications. Degradation graphs generated in silico for four proteins (beta actin, hemoglobin subunit beta, thrombin and apolipoprotein a1) digested by trypsin or elastase were encoded as graph structures and classified using a GraphConv neural network trained on node (position, abundance, length) and edge (transition weight) features. e Left: receiver operating characteristic (ROC) curves showing high classification performance across proteins (overall ROC–AUC = 0.915). Right: Kernel density estimates of predicted probabilities for trypsin- and elastase-derived graphs in the validation set, illustrating accurate separation of protease-specific degradation patterns. f Peptidome plots of β-actin in cell lysates digested with different enzymes. g ROC of the GraphConv network trained to identify the enzyme.

https://doi.org/10.1371/journal.pcbi.1013972.g004

In the β-actin data, this led to a roughly two-fold underestimation of total proteolytic activity (Fig 3f). We examined how the underestimation ratio changes as a function of degradation extent in a proteolysis simulation. As simulation time increased and more cleavages accumulated, the ratio rose (S1 Fig). This behavior is expected, as larger degradation graphs contain longer and more branched pathways, and simply summing observed peptide intensities fails to capture the cumulative loss of abundance along these routes. These results show quantitatively that conventional peptide-summation methods increasingly underestimate proteolytic activity as the peptidome becomes more complex.

By explicitly modeling the sequential flow of degradation, degradation graphs correct this bias. Each peptide’s inflow equals the mass absorbed in that peptide plus the flow routed to its descendants, ensuring that upstream events are represented in proportion to their true activity. Beyond improving quantitative accuracy, the graph structure introduces interpretable organization into the peptidome (Fig 4b–4c). Related peptides naturally cluster into branches that reflect shared ancestry, while intermediate nodes identify points of control where flow divides or accumulates. These network features provide biologically meaningful units, branches and bottlenecks, that are more stable and interpretable than single peptide intensities and can be linked directly to specific protease families or pathways.

Finally, the explicit graph topology enables graph-based inference (Fig 4d). Node and edge-level features such as position, length, abundance, and transition probability can be integrated into machine learning models that recognize characteristic degradation regimes. As an example, degradation graphs derived from in silico proteolysis of several proteins were classified by a GraphConv network, which separated trypsin from elastase-derived graphs with high accuracy (overall ROC–AUC = 0.915, Fig 4e). We also applied the GraphConv network on cell lysate peptidomes when digested with different enzymes. β-actin was chosen due to its high coverage in the samples, and the graphs were identified through gradient descent (Fig 4f). On these samples, the model achieves a near perfect ROC-AUC (Fig 4g). While these demonstrations are limited to simulated data and simple digestions, it demonstrates the concept that degradation graphs not only correct quantification but also provide a foundation for predictive analysis of protease activity.

2.4 Applications to in vivo data

To investigate the performance of degradation graphs on clinical data, we analyzed a dataset of urinary peptides from diabetic and healthy individuals [51] (Fig 5a). Uromodulin, a previously proposed biomarker candidate for diabetes, was selected as a case study. For each sample, degradation graphs were reconstructed by gradient descent to align modeled and observed peptide abundances (Fig 5b). Without accounting for downstream degradation, conventional quantification underestimated total proteolytic activity by about 3.5-fold (CI 95%: 3.52-3.58, Fig 5c). No significant difference in underestimation ratios was found between healthy and diabetic patients. Mapping total inflow along the uromodulin backbone revealed differential degradation concentrated in the C-terminal biomarker region (Fig 5d).

thumbnail
Fig 5. Degradation graph analysis of urinary peptidomes from diabetic and healthy individuals.

a Peptidomic data from urine samples of individuals with diabetes (n = 15) and healthy controls (n = 15) [51] were analyzed without enzymatic digestion using LC–MS/MS. b For each sample, degradation graphs were reconstructed by gradient descent to fit the modeled peptide distribution to observed abundances. c Comparison of total generated versus observed peptide abundance revealed that conventional quantification underestimated proteolytic activity by approximately 3.7-fold in both groups. d Mapping total inflow along the uromodulin sequence showed a localized increase in degradation flow within the biomarker-associated region, highlighting differential proteolysis between healthy and diabetic samples. This figure was made with BioRender.

https://doi.org/10.1371/journal.pcbi.1013972.g005

In a second campaign we analyzed peptidomic data from porcine wound fluids infected with either Staphylococcus aureus or Pseudomonas aeruginosa [11] (Fig 6a). Previous work identified the N-terminal region of hemoglobin subunit alpha as a discriminatory biomarker between infection caused by the two different pathogens [16]. Degradation graph analysis reproduced this signal, identifying the key cleavage point in wounds infected by P. aeruginosa, and further highlighted additional differential flow around residues 60–80 (Fig 6d). The total proteolytic activity was again underestimated by approximately 3.5-fold (CI 95%: 3.41-3.60) when sequential degradation was ignored (Fig 6c). No significant difference in underestimation ratios was found between S. aureus and P. aeruginosa infected wounds. These showcases highlight how degradation graph modelling can be used to improve peptidomic analyses.

thumbnail
Fig 6. Degradation graph analysis of porcine wound fluid peptidomes from bacterial infections.

a Peptidomic data from porcine wound fluids infected with Staphylococcus aureus (n = 38) or Pseudomonas aeruginosa (n = 33) [11] were analyzed without enzymatic digestion using LC–MS/MS. b For each sample, edge transition probabilities were optimized by gradient descent to reproduce measured peptide distributions. c The ratio between generated and observed peptide abundance showed that neglecting sequential degradation led to an average 3.5-fold underestimation of total proteolytic activity. d Visualization of total inflow along the hemoglobin subunit alpha backbone revealed differential degradation flow between infection types, with pronounced variation in the N-terminal biomarker region and an additional differential site around residues 60–80. This figure was made with BioRender.

https://doi.org/10.1371/journal.pcbi.1013972.g006

We tested how sensitive the method is to training settings by sweeping over learning rates and numbers of epochs when applying gradient descent on a random sample from the clinical datasets. For both datasets and peptidomes, when the loss converged, the underestimation ratio (Δ) increased and stabilized. The peptides with the highest inflow and the edges with the highest flows were recovered consistently across runs, especially in the smaller UMOD graph. The larger HBA graph showed more variation but still produced stable top features once training had converged (Fig 7a, 7b).

thumbnail
Fig 7. Robustness of graph identification.

Parameter sweeps across learning rates and training epochs were used to assess the stability of degradation graph inference for the UMOD dataset (a) and the HBA infection dataset (b). Heatmaps show the underestimation ratio (Δ), final mean squared error, and the stability of the top peptides and edges when ranked of inflow and flow respectively, measured as the percentage of overlap across replicate runs.

https://doi.org/10.1371/journal.pcbi.1013972.g007

3 Discussion

This work reframes peptidomic analysis by treating peptides as intermediates in degradation pathways instead of terminal observations. Conventional proteolysis quantification collapses sequential proteolysis into static peptide abundances, leading to systematic underestimation of upstream enzymatic activity. By explicit modelling of degradation graphs, we recover the mechanistic flow of proteolysis. This structure brings several advantages such as improved quantification, where accounting for downstream degradation increased inferred proteolytic activity by roughly 3 to 4-fold. Beyond correcting quantification, degradation graphs provide a natural organizational layer for the peptidome. Branches summarize coherent degradation routes, while bottlenecks highlight rate-limiting intermediates or potentially bioactive fragments. The flow-based representation links degradomic logic to peptidomic observables, thereby unifying the two fields in a single probabilistic formalism.

Because degradation graphs encode proteolysis as a directed acyclic graph with explicit node and edge features, they can be used directly in graph neural networks and other machine learning models that are designed to learn from graph-structured data, unlike earlier approaches based solely on peptide intensities. In our demonstration, a simple GraphConv architecture could classify degradation graphs with topological and sequence-derived features alone. Although we only present this application as a proof-of-concept, it opens the door to predictive models that learn protease signatures, degradation kinetics, or tissue-specific proteolytic patterns directly from data.

In practical terms, degradation graphs enable richer analysis of static peptidomes without requiring time-resolved measurements. The approach can be scaled to proteome-wide reconstructions and integrated with protease databases such as MEROPS to identify enzyme families contributing to observed degradation flows. Applications include biomarker discovery, assessment of proteolytic state, and mechanistic stratification of diseases characterized by dysregulated proteolysis.

3.1 Limitations and outlook

There are two major challenges in building degradation graphs from peptidomic data. The first is that the graphs are not unique as different graph structures can produce the same marginal distribution, so the problem is inherently underdetermined. Even so, our results show that the graphs inferred are reproducible across runs, with consistent identification of edges with high flow and nodes with high cumulative inflow.

The second challenge is inherited from the field of mass spectrometry peptidomics at large, which is the issue with peptide identification and quantification. Mass spectrometry fails to identify and quantify peptides for many reasons, including ionization efficiency, length, charge, or low abundance (see Methods: Formalization of key limitations and assumptions). If key intermediates are not detected, we do not have the true peptide distribution, and no method can recover a completely correct graph from incomplete data. Missing peptides can hide real branches in the degradation pathway or create apparent shortcuts that do not occur in biology. However, there are practical developments that could utilize the graph structure to help mitigate this issue. For example, imputation methods designed around degradation logic could infer likely missing intermediates, for example by enforcing that a parent fragment should give rise to two child fragments with compatible abundances. Further, sensitivity analyses where we add plausible missing peptides and examine how the graph changes, could show how strongly the results depend on detection gaps.

The severity of the detection problem will lessen as mass spectrometry instrumentation and workflows continue to improve, expanding the portion of the peptidome that can be reliably measured. The non-uniqueness issue will benefit from growing databases of known protease behavior and cleavage patterns. Together, these developments will make degradation graphs more accurate and more useful for understanding proteolysis in complex samples.

In summary, degradation graphs represent a step toward mechanistic, interpretable peptidomics. By embedding proteolysis into a quantitative, graph-based formalism, they connect sequence-level events to system-level outcomes. In doing so, it becomes possible to correct existing quantification biases providing conceptual bridge between degradomics and peptidomics, linking the enzymes that act and the fragments they leave behind.

4 Methods

4.2 Identifying the degradation graph using gradient-based optimization

We infer transition probabilities by optimizing the degradation graph so that the modeled peptide distribution P reproduces the observed distribution Y.

Each node u is parameterized by a vector of logits , corresponding to transitions toward its child nodes and its self-absorption. A softmax transformation ensures that outgoing transition probabilities are valid and sum to one:

where represents the absorption parameter for node u.

Given an initial mass of 1 at the root Ω, probability mass propagates through the graph in topological order. At each node u, the incoming probability mass p(u) is distributed to its children and absorbed according to :

and the absorbed probability at node u is

Collectively, these absorption probabilities define the predicted peptide distribution P.

To align the predicted distribution P with the observed data Y, we minimize the mean squared error:

Regularization can be added to enforce sparsity or smoothness on the parameters:

yielding the total objective .

After training, the final transition probabilities are obtained via the softmax transformation of optimized θ values. Because softmax enforces strictly positive values, transition probabilities cannot be reduced exactly to zero.

4.3 Identifying the degradation graph using linear programming

Alternatively, inference can be formulated as a constrained flow problem where probability mass is conserved across nodes. Instead of directly learning , we solve for flows representing the mass transferred from node u to node v.

Flow variables:

Mass conservation: At each node, the total incoming mass equals the total outgoing flow plus absorbed mass (observed abundance):

Root injection:

Non-negativity constraint:

These constraints form a linear system that can be solved via linear programming (e.g. using PuLP [33]). From the optimized flow solution, transition probabilities are computed as normalized outgoing flows:

The residual absorption probability at each node is defined as

4.4 Graph neural network for graph-based inference

To demonstrate how degradation graphs can capture proteolytic patterns, we implemented a graph neural network (GNN) classifier operating on inferred degradation graphs. Each sample is represented as a directed graph with edge weights .

Each peptide node v is described by a feature vector :

where abundance is the normalized peptide intensity, and start/end/length are position-normalized sequence descriptors.

We used a two-layer GraphConv architecture with hidden dimensionality 64 and dropout probability 0.2 after the first layer. Node embeddings were aggregated by global mean pooling to yield a graph-level representation, followed by a linear classifier with log-softmax output.

The network was trained using negative log-likelihood loss on enzyme class labels (trypsin vs. elastase), optimized with Adam (learning rate 0.005, weight decay 10−4) for 50 epochs.

4.5 Formal definitions of peptide distributions and degradation graphs

We now formalize peptide distributions, degradation transitions, and their graphical representation.

Mechanistic degradation model: Proteins degrade through sequential cleavage, producing a set of peptides. For a protein sequence Ω, degradation yields peptides observed with empirical frequencies Y.

We model degradation as a probabilistic process where a peptide v may remain intact or degrade into smaller peptides in . Each transition has probability , and self-absorption occurs with .

Graph representation: The process is represented as a directed acyclic graph , where V are peptides and E are transitions . Each node v has incoming edges from and outgoing edges to . Weights satisfy .

Marginal peptide distribution: The predicted peptide distribution P describes the steady-state absorption probabilities across the graph and can be computed recursively:

  1. The root probability is .
  2. For all other nodes:where is the cumulative probability of reaching u from the root. Using , we obtain

Conditional distribution: The distribution when starting from node k is

where denotes a one-hot vector with unit mass at node k.

Gradients: The gradient of P with respect to a weight is:

Generalization to realistic data: In real datasets, peptide coverage is incomplete. We relax the strict two-fragment splitting model and allow each peptide to transition into any valid subsequence of itself. This yields equivalent marginal distributions P under simpler connectivity assumptions.

For example, if , possible transitions include:

Weights in this simplified model can be rescaled from the idealized model to yield the same marginal distribution:

4.6 Formalization of key limitations and assumptions

The degradation graph formalism rests on several simplifying assumptions and biological constraints, detailed below.

Peptide generation assumptions: Let Y represent the set of peptides observed in a peptidomic experiment. This set is the result of multiple processes acting on an initial set of peptides derived from protein degradation. We assume the following steps in the workflow:

Let be the set of all theoretically possible peptides generated through protein degradation. This process consists of two primary operations:

  • Endoproteolytic cleavage: The splitting of peptides within the sequence, excluding terminal amino acids. Let E represent the set of such cleavages.
  • Exoproteolytic trimming: The removal of terminal amino acids from the peptide chain. Let X represent the set of terminal trimming operations.

Thus, we define as the set resulting from the application of these two processes, where the operations E and X are performed on a precursor protein sequence P. This can be expressed as , where f represents the function that maps the sequence P under the actions of E and X to the set .

Rapid degradation filter: A biological filtering process removes some peptides in that undergo rapid degradation for resource recycling (e.g., for energy and atoms). Let represent this biological filter, which results in a subset of more stable peptides denoted as . Formally, we have:

where F1 is the set of peptides subject to rapid degradation. The remaining set contains peptides that are stable.

Peptide identification filter: The observed set of peptides, , is a subset of because not all peptides in are detectable through experimental techniques such as mass spectrometry. Let represent the filter corresponding to technical limitations (e.g., detection sensitivity, ionization efficiency, etc.), yielding the final observed peptide distribution . Formally,

where F2 represents the set of peptides undetected or unquantified by the experimental apparatus.

Thus, the final observed peptide set is:

Additionally, peptide intensity is skewed by their ability to ionize effectively and fly in the mass spectrometer. This results in a possible inaccurate representation of .

Unidentifiability: Approximating degradation graphs comes with fundamental limitations. For example, if ABC degrades to B, and we observe equal abundances of AB and BC, we cannot determine whether B was degraded through AB alone, BC alone, or both. This issue underscores the overparametrization of degradation graphs.

4.7 Degradation simulation

To validate our degradation graph model and test computational methods, we developed an algorithm to simulate proteolysis. The simulator generates synthetic peptidomes by modeling endoproteolytic and exoproteolytic events according to enzyme-specific cleavage patterns.

The simulator works as follows: starting with an intact protein sequence, the simulator iteratively performs proteolytic events until a target number of peptides is generated. Each iteration randomly selects between endoproteolytic cleavage (cutting within the sequence) and exoproteolytic trimming (removing terminal residues), based on a configurable probability parameter.

For endoproteolytic events, the simulator: 1. Selects a peptide from the current pool, weighted by length and abundance 2. Determines potential cleavage sites according to the specified enzyme’s regex-based rules, e.g. (.)([K|R])([P])(.) for trypsin 3. Selects a primary cleavage site using the enzyme’s positional preferences 4. Selects a second cleavage site, with probability weighted by both enzyme specificity and distance from the first site (following a gamma distribution) 5. Generates up to three fragments from the two cuts

For exoproteolytic events, the simulator: 1. Randomly selects a peptide weighted by abundance 2. Removes one amino acid randomly from either the N- or C-terminus

The output is the degradation graph as well as the generated peptide distribution.

4.8 Application to trypsinized β-actin

We analyzed peptidomic data from trypsinized human cell lysates (PXD037803 [14]). Raw files were searched against the human proteome using PEAKS X without cleavage site restrictions, and peptide intensities were extracted and log-normalized.

For the β-actin sequence, a degradation graph was constructed and edge weights were optimized by gradient descent so that the modeled marginal distribution matched the observed peptide abundances Y. This allowed us to quantify inflows and identify bottleneck peptides, defined as intermediates with disproportionately high flow relative to their abundance.

4.9 Application to the diabetic uromodulin peptidome

We applied degradation graph modeling to a published urinary peptidome dataset of diabetic and healthy patients [51]. Uromodulin was selected as a case study. Peptidomic data were downloaded from ProteomeXchange (PXD012210) and searched against the human proteome without enzymatic constraints using PEAKS X. For each patient, peptide abundances corresponding to uromodulin were extracted and log-normalized. Missing peptides were imputed with the log intensity value corresponding to the lowest quantile of the data.

For each sample, a degradation graph was constructed and gradient descent was used to optimize edge weights so that the modeled marginal distribution matched the observed peptide abundances Y.

To quantify the systematic bias of traditional peptidomics, we defined the underestimation factor per sample as the total flow over the total abundance:

where is the total flow through edge and Ω is the full-length protein. A value indicates that the total proteolytic activity is underestimated when considering peptide abundances alone.

To localize differences between groups, we further aggregated optimized inflows to peptides and mapped their log2 fold change between diabetic and healthy samples along the uromodulin backbone.

4.10 Application to infected hemoglobin α (porcine wound fluid)

We analyzed a peptidomic dataset of porcine wound fluids infected with either Staphylococcus aureus or Pseudomonas aeruginosa [11]. Peptide intensities corresponding to hemoglobin subunit α (HBA) were extracted and log-normalized. Missing peptides were imputed with the log intensity value corresponding to the lowest quantile of the data.

For each sample, a degradation graph was constructed and edge weights were optimized with gradient descent. As before, the total proteolytic activity per sample was quantified as the ratio between the total generated peptide flow and the sum of peptide abundances.

Group-level differences were assessed by averaging peptide inflows within infection type and calculating log2 fold changes between S. aureus and P. aeruginosa samples. These values were mapped along the HBA backbone to visualize regions of differential degradation.

Supporting information

S1 Fig. Relationship between degradation extent and underestimation ratio.

a Schematic illustrating how extended simulation time produces deeper and more branched degradation pathways. b Underestimation ratio (Δ) as a function of graph size, measured by the number of edges generated during simulated proteolysis. As degradation progresses and graphs become larger, Δ increases, reflecting the growing discrepancy between total peptide intensity and the true cumulative degradation flow.

https://doi.org/10.1371/journal.pcbi.1013972.s001

(TIFF)

References

  1. 1. Abou-El-Hassan H, Sukhon F, Assaf EJ, Bahmad H, Abou-Abbass H, Jourdi H, et al. Degradomics in neurotrauma: profiling traumatic brain injury. Methods Mol Biol. 2017;1598:65–99. pmid:28508358
  2. 2. Agüero-Chapin G, Domínguez-Pérez D, Marrero-Ponce Y, Castillo-Mendieta K, Antunes A. Unveiling encrypted antimicrobial peptides from cephalopods’ salivary glands: a proteolysis-driven virtual approach. ACS Omega. 2024;9(43):43353–67. pmid:39494035
  3. 3. Aiche S, Reinert K, Schütte C, Hildebrand D, Schlüter H, Conrad TOF. Inferring proteolytic processes from mass spectrometry time series data using degradation graphs. PLoS One. 2012;7(7):e40656. pmid:22815782
  4. 4. Bhutada S, Martin DR, Cikach F, da Silva EG, Willard BB, Ramkhelawon B, et al. Integrated forward and reverse degradomics uncovers the proteolytic landscape of aortic aneurysms and the roles of MMP9 and mast cell chymase. openRxiv. 2024. https://doi.org/10.1101/2024.06.26.600914
  5. 5. Bhutada S, Hoyle A, Piuzzi NS, Apte SS. Degradomics defines proteolysis information flow from human knee osteoarthritis cartilage to matched synovial fluid and the contributions of secreted proteases ADAMTS5, MMP13 and CMA1 to articular cartilage breakdown. Osteoarthritis Cartilage. 2025;33(1):116–27. pmid:39293776
  6. 6. Cambeiro-Pérez N, Hidalgo-Cantabrana C, Moro-García MA, Blanco-Míguez A, Fdez-Riverola F, Riestra S, et al. In silico and functional analyses of immunomodulatory peptides encrypted in the human gut metaproteome. Journal of Functional Foods. 2020;70:103969.
  7. 7. Checco JW. Identifying and measuring endogenous peptides through peptidomics. ACS Chem Neurosci. 2023;14(20):3728–31. pmid:37751547
  8. 8. Cole AM, Shi J, Ceccarelli A, Kim YH, Park A, Ganz T. Inhibition of neutrophil elastase prevents cathelicidin activation and impairs clearance of bacteria from wounds. Blood. 2001;97(1):297–304. pmid:11133774
  9. 9. Engel LS, Hill JM, Caballero AR, Green LC, O’Callaghan RJ. Protease IV, a unique extracellular protease and virulence factor from Pseudomonas aeruginosa. J Biol Chem. 1998;273(27):16792–7. pmid:9642237
  10. 10. Foreman RE, George AL, Reimann F, Gribble FM, Kay RG. Peptidomics: a review of clinical applications and methodologies. J Proteome Res. 2021;20(8):3782–97. pmid:34270237
  11. 11. Forsberg F, Kjellström S, Petrlova J, Puthia M, Schmidtchen A, Malmström J, et al. Mass spectrometry peptidomics data from infected and uninfected porcine wounds. Sci Data. 2025;12(1):1533. pmid:40897740
  12. 12. Frey AM, Chaput D, Shaw LN. Insight into the human pathodegradome of the V8 protease from Staphylococcus aureus. Cell Rep. 2021;35(1):108930. pmid:33826899
  13. 13. Gallo RL, Nizet V. Endogenous production of antimicrobial peptides in innate immunity and human disease. Curr Allergy Asthma Rep. 2003;3(5):402–9. pmid:12906776
  14. 14. Gueto-Tettay C, Tang D, Happonen L, Heusel M, Khakzad H, Malmström J, et al. Multienzyme deep learning models improve peptide de novo sequencing by mass spectrometry proteomics. PLoS Comput Biol. 2023;19(1):e1010457. pmid:36668672
  15. 15. Hartman E, Wallblom K, van der Plas MJA, Petrlova J, Cai J, Saleh K, et al. Bioinformatic analysis of the wound peptidome reveals potential biomarkers and antimicrobial peptides. Front Immunol. 2021;11:620707. pmid:33613550
  16. 16. Hartman E, Forsberg F, Kjellström S, Petrlova J, Luo C, Scott A, et al. Peptide clustering enhances large-scale analyses and reveals proteolytic signatures in mass spectrometry data. Nat Commun. 2024;15(1):7128. pmid:39164298
  17. 17. Hartman E, Samsudin F, Bond PJ, Schmidtchen A, Malmström J. Mining the endogenous peptidome for peptide binders with deep learning-driven optimization and molecular simulations. openRxiv. 2025. https://doi.org/10.1101/2025.01.20.633551
  18. 18. Hellinger R, Sigurdsson A, Wu W, Romanova EV, Li L, Sweedler JV, et al. Peptidomics. Nat Rev Methods Primers. 2023;3:25. pmid:37250919
  19. 19. Ingmer H, Brøndsted L. Proteases in bacterial pathogenesis. Res Microbiol. 2009;160(9):704–10. pmid:19778606
  20. 20. Islam D, Bandholtz L, Nilsson J, Wigzell H, Christensson B, Agerberth B, et al. Downregulation of bactericidal peptides in enteric infections: a novel immune escape mechanism with bacterial DNA as a potential regulator. Nat Med. 2001;7(2):180–5. pmid:11175848
  21. 21. Kleifeld O, Doucet A, auf dem Keller U, Prudova A, Schilling O, Kainthan RK, et al. Isotopic labeling of terminal amines in complex samples identifies protein N-termini and protease cleavage products. Nat Biotechnol. 2010;28(3):281–8. pmid:20208520
  22. 22. Kluge B, Gambin A, Niemiro W. Modeling exopeptidase activity from LC-MS data. J Comput Biol. 2009;16(2):395–406. pmid:19193154
  23. 23. Kolar SL, Ibarra JA, Rivera FE, Mootz JM, Davenport JE, Stevens SM, et al. Extracellular proteases are key mediators of Staphylococcus aureus virulence via the global modulation of virulence-determinant stability. Microbiologyopen. 2013;2(1):18–34. pmid:23233325
  24. 24. Lai Y, Villaruz AE, Li M, Cha DJ, Sturdevant DE, Otto M. The human anionic antimicrobial peptide dermcidin induces proteolytic defence mechanisms in staphylococci. Mol Microbiol. 2007;63(2):497–506. pmid:17176256
  25. 25. Li L, Wu J, Lyon CJ, Jiang L, Hu TY. Clinical peptidomics: advances in instrumentation, analyses, and applications. BME Front. 2023;4:0019. pmid:37849662
  26. 26. Liotta LA, Petricoin EF. Serum peptidome for cancer detection: spinning biologic trash into diagnostic gold. J Clin Invest. 2006;116(1):26–30. pmid:16395400
  27. 27. Lualdi M, Ronci M, Zilocchi M, Corno F, Turilli ES, Sponchiado M, et al. Exploring the mitochondrial degradome by the TAILS proteomics approach in a cellular model of Parkinson’s disease. Front Aging Neurosci. 2019;11:195. pmid:31417398
  28. 28. López-Otín C, Overall CM. Protease degradomics: a new challenge for proteomics. Nat Rev Mol Cell Biol. 2002;3(7):509–19. pmid:12094217
  29. 29. Madsen CT, Refsgaard JC, Teufel FG, Kjæ rulff SK, Wang Z, Meng G, et al. Combining mass spectrometry and machine learning to discover bioactive peptides. Nat Commun. 2022;13(1):6235. pmid:36266275
  30. 30. Marshall NC, Finlay BB, Overall CM. Sharpening host defenses during infection: proteases cut to the chase. Mol Cell Proteomics. 2017;16(4 suppl 1):S161–71. pmid:28179412
  31. 31. McQuibban GA, Gong JH, Tam EM, McCulloch CA, Clark-Lewis I, Overall CM. Inflammation dampened by gelatinase A cleavage of monocyte chemoattractant protein-3. Science. 2000;289(5482):1202–6. pmid:10947989
  32. 32. Mead TJ, Bhutada S, Martin DR, Apte SS. Proteolysis: a key post-translational modification regulating proteoglycans. Am J Physiol Cell Physiol. 2022;323(3):C651–65. pmid:35785985
  33. 33. Mitchell S, O’Sullivan M, Dunning I. Pulp: a linear programming toolkit for python. 2011.
  34. 34. Neves LX, Granato DC, Busso-Lopes AF, Carnielli CM, Patroni FM de S, De Rossi T, et al. Peptidomics-driven strategy reveals peptides and predicted proteases associated with oral cancer prognosis. Mol Cell Proteomics. 2021;20:100004. pmid:33578082
  35. 35. Nizet V, Ohtake T, Lauth X, Trowbridge J, Rudisill J, Dorschner RA, et al. Innate antimicrobial peptide protects the skin from invasive bacterial infection. Nature. 2001;414(6862):454–7. pmid:11719807
  36. 36. Ong PY, Ohtake T, Brandt C, Strickland I, Boguniewicz M, Ganz T, et al. Endogenous antimicrobial peptides and skin infections in atopic dermatitis. N Engl J Med. 2002;347(15):1151–60. pmid:12374875
  37. 37. Overall CM, Blobel CP. In search of partners: linking extracellular proteases to substrates. Nat Rev Mol Cell Biol. 2007;8(3):245–57. pmid:17299501
  38. 38. Parker BL, Burchfield JG, Clayton D, Geddes TA, Payne RJ, Kiens B, et al. Multiplexed temporal quantification of the exercise-regulated plasma peptidome. Mol Cell Proteomics. 2017;16(12):2055–68. pmid:28982716
  39. 39. Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al. Pytorch: an imperative style, high-performance deep learning library. arXiv preprint 2019. https://arxiv.org/abs/1912.01703
  40. 40. Pietrocola G, Nobile G, Rindi S, Speziale P. Staphylococcus aureus manipulates innate immunity through own and host-expressed proteases. Front Cell Infect Microbiol. 2017;7:166. pmid:28529927
  41. 41. Podvin S, Jiang Z, Boyarko B, Rossitto L-A, O’Donoghue A, Rissman RA, et al. Dysregulation of neuropeptide and tau peptide signatures in human Alzheimer’s disease brain. ACS Chem Neurosci. 2022;13(13):1992–2005. pmid:35758417
  42. 42. Prasad ASB, Shruptha P, Prabhu V, Srujan C, Nayak UY, Anuradha CKR, et al. Pseudomonas aeruginosa virulence proteins pseudolysin and protease IV impede cutaneous wound healing. Lab Invest. 2020;100(12):1532–50. pmid:32801335
  43. 43. Saravanan R, Holdbrook DA, Petrlova J, Singh S, Berglund NA, Choong YK, et al. Structural basis for endotoxin neutralisation and anti-inflammatory activity of thrombin-derived C-terminal peptides. Nat Commun. 2018;9(1):2762. pmid:30018388
  44. 44. Savickas S, Kastl P, Auf dem Keller U. Combinatorial degradomics: precision tools to unveil proteolytic processes in biological systems. Biochim Biophys Acta Proteins Proteom. 2020;1868(6):140392. pmid:32087360
  45. 45. Schaller-Bals S, Schulze A, Bals R. Increased levels of antimicrobial peptides in tracheal aspirates of newborn infants during infection. Am J Respir Crit Care Med. 2002;165(7):992–5. pmid:11934727
  46. 46. Schmidtchen A, Frick IM, Björck L. Dermatan sulphate is released by proteinases of common pathogenic bacteria and inactivates antibacterial alpha-defensin. Mol Microbiol. 2001;39(3):708–13. pmid:11169110
  47. 47. Schmidtchen A, Frick I-M, Andersson E, Tapper H, Björck L. Proteinases of common pathogenic bacteria degrade and inactivate the antibacterial peptide LL-37. Mol Microbiol. 2002;46(1):157–68. pmid:12366839
  48. 48. Secher A, Kelstrup CD, Conde-Frieboes KW, Pyke C, Raun K, Wulff BS, et al. Analytic framework for peptidomics applied to large-scale neuropeptide identification. Nat Commun. 2016;7:11436. pmid:27142507
  49. 49. Singh V, Phukan UJ. Interaction of host and Staphylococcus aureus protease-system regulates virulence and pathogenicity. Med Microbiol Immunol. 2019;208(5):585–607. pmid:30483863
  50. 50. Torres MDT, Cesaro A, de la Fuente-Nunez C. Peptides from non-immune proteins target infections through antimicrobial and immunomodulatory properties. Trends Biotechnol. 2025;43(1):184–205. pmid:39472252
  51. 51. Van JAD, Clotet-Freixas S, Zhou J, Batruch I, Sun C, Glogauer M, et al. Peptidomic analysis of urine from youths with early type 1 diabetes reveals novel bioactivity of uromodulin peptides in vitro. Mol Cell Proteomics. 2020;19(3):501–17. pmid:31879271
  52. 52. Villanueva J, Shaffer DR, Philip J, Chaparro CA, Erdjument-Bromage H, Olshen AB, et al. Differential exoprotease activities confer tumor-specific serum peptidome patterns. J Clin Invest. 2006;116(1):271–84. pmid:16395409
  53. 53. Wendler J, Schroeder BO, Ehmann D, Koeninger L, Mailänder-Sánchez D, Lemberg C, et al. Proteolytic degradation of reduced human beta defensin 1 generates a novel antibiotic octapeptide. Sci Rep. 2019;9(1):3640. pmid:30842543
  54. 54. Weng SSH, Demir F, Ergin EK, Dirnberger S, Uzozie A, Tuscher D, et al. Sensitive determination of proteolytic proteoforms in limited microscale proteome samples. Mol Cell Proteomics. 2019;18(11):2335–47. pmid:31471496
  55. 55. Yi J, Liu Z, Craft D, O’Mullan P, Ju G, Gelfand CA. Intrinsic peptidase activity causes a sequential multi-step reaction (SMSR) in digestion of human plasma peptides. J Proteome Res. 2008;7(12):5112–8. pmid:19367699
  56. 56. Zhang L-J, Gallo RL. Antimicrobial peptides. Curr Biol. 2016;26(1):R14-9. pmid:26766224