Figures
Abstract
Despite the importance of grains and legumes in the human diet, little is known regarding peptide release and the temporal changes of protease activities during seed germination. LC/MS-MS peptidomic analysis of two cultivars of germinating chickpea followed by computational analyses indicated cleavage dominated by proteases with a single position preference (mainly before (P1) or after cleavage (P1’): L at P2 (cysEP-like); R or K at P1 (vignain-like), N or Q at P1 (legumain-like); and previously unidentified K, R, A and S at P1’; A at P2’). While P1 N cleavages were relatively constant, P1’ K/R preferences were high in soaked garbanzo (kabuli) seeds, declined by four days, and returned at six days, but were much rarer in the brown (desi) cultivar. Late Embryogenesis Associated (LEA) peptides were markedly released during early germination. Vicilin peptides rich in glutamic acid near their N-termini markedly increased with germination, consistent with strong proteolytic resistance, even to human digestion, as indicated by analyses of separate datasets. Thus, this first peptidomics study of seed germination proteolytic profiles unveils a complex cultivar-specific programme of sequential activation and inactivation of a series of proteases, associated with the differential release of peptides from different protein groups.
Citation: Bera I, O’Sullivan M, Scaife C, Cagney G, Shields DC (2024) Motif mapping during chickpea germination reveals a complex sequential activation of different proteolytic activities. PLoS ONE 19(10): e0307481. https://doi.org/10.1371/journal.pone.0307481
Editor: Anil Bhatia, University of California Riverside, UNITED STATES OF AMERICA
Received: January 16, 2024; Accepted: July 7, 2024; Published: October 31, 2024
Copyright: © 2024 Bera et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data is deposited at EBI-PRIDE database (https://www.ebi.ac.uk/pride/) with accession no. PXD048224. The files are named as follows: 2022_03_31_IB_T1F1S(1/2/3).raw -Tryptic Soaked samples 2022_03_31_IB_T1F12_(1/2/3).raw - Tryptic 2 days samples 2022_03_31_IB_T1F14_(1/2/3).raw - Tryptic 4 days samples 2022_03_31_IB_T1F16_(1/2/3).raw - Tryptic 6 days samples 2022_03_31_IB_BF1S(1/2/3).raw - Non-tryptic Brown chickpea soaked samples 2022_03_31_IB_BF12_(1/2/3).raw - Non-tryptic Brown chickpea 2 days samples 2022_03_31_IB_BF14_(1/2/3).raw - Non-tryptic Brown chickpea 4 days samples 2022_03_31_IB_BF16_(1/2/3).raw - Non-tryptic Brown chickpea 6 days samples 2022_02_22_IndraniBera_FI_S(1/2/3).raw - Non-tryptic Garbanzo chickpea soaked samples 2022_02_22_IndraniBera_FI_2_(1/2/3).raw - Non-tryptic Garbanzo chickpea 2 days samples 2022_02_22_IndraniBera_FI_4_(1/2/3).raw - Non-tryptic Garbanzo chickpea 4 days samples 2022_02_22_IndraniBera_FI_6_(1/2/3).raw - Non-tryptic Garbanzo chickpea 6 days samples *(1/2/3) represents 3 biological replicates. Data are available via ProteomeXchange with identifier PXD048224.
Funding: EU Horizon 2020 Marie Sklodowska-Curie and Enterprise Ireland funding grant no. Agreement No 847402. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Plant proteins are becoming increasingly important as a means of meeting nutritional requirements while minimising impacts on climate change [1] and improving health such cardiovascular aspects [2]. Plant-based proteins may potentially reduce diet-related land use by 76%, diet-related greenhouse gas emissions by 49%, eutrophication by 49%, and green and blue water use by 21% and 14%, respectively [3]. Legumes contribute strongly to plant-derived protein intake in human diets, particularly among vegetarians and vegans [4]. Among the legumes, chickpea is the most important in the diets of people in African and Asian countries [5]. Chickpeas are high in protein content (~25%), low in fat content, and rich in bioactive compounds like phenols and isoflavones [4]. Along with the presence of nutritional components, chickpeas contain anti-nutritional factors such as protease inhibitors, amylase inhibitors, phytolectins and others [6]. These anti-nutritional factors, specifically protease inhibitors, decrease the nutritional value of chickpeas by hindering the proteolysis of proteins and thereby reducing digestibility. Protease inhibitors can comprise up to 10% of seed protein, and are likely to play key roles in combating pathogens (fungi, bacteria, nematodes and viruses) and insect herbivores [7].
Transitioning to a more plant-based protein in the diet requires the development of foods which are more easily assimilated. In plants, there is large-scale protein turnover associated with germination. Germination of legumes involves extensive changes which impact on digestibility of the protein [8]. Cysteine endopeptidases are physiologically important in breaking down storage proteins during seed germination [9]. They are typically deposited within vacuoles, and prevented from carrying out inappropriate proteolysis by compartmentalisation, and by storage in a proenzyme form [10]. Over 50% of chickpea seed protein is composed of salt soluble globulins, around 10% of water soluble albumins, around 20% of acid/base soluble glutelins and 5% of alcohol/water soluble prolamins [7]. Vicilin (7S) and legumin (11S) comprise a major portion of the globulin and albumin fractions of chickpea [11], with lipoxygenase, convicilin, lectinβ and LEA (late-embryogenesis associated) proteins also being at high abundance in total protein [12]. Various studies have identified the changes in protein and amino acid content, changes in proteolytic activities with changes in protease inhibitors and antinutritional factors during germination [8, 13]. Similarly in cereals protein digestibility and protein content changes have been quantified [14]. However, the process of protein breakdown and peptide release from the different sets of major legume proteins is not well understood.
Peptidomics provides a method to describe peptides released during germination. An elegant ex vivo digestion study showed better mammalian digestibility of chickpea sprouts when compared to ungerminated seeds [15], and their peptidomics analysis characterised chickpea peptides resistant to the stomach, duodenal lumen and brush border membrane phases of mammalian digestion [15]. Peptidomics not only describes the peptides released, but also gives insights into the patterns of proteolysis, since some endoproteases have specific cleavage patterns, which will be in evidence at peptide termini. While these patterns have been used to predict enzymes involved in infant human milk digestion [16–18], and to characterise in detail the specificity of a seed protease [19], it has not been used to systematically track the changes in proteolysis occurring during seed germination, which is the aim of this study.
Here, we investigated the pattern of changes in proteolysis in germinating chickpea by monitoring shifts in the terminal preferences of peptides, and in the protein sources of released peptides. This revealed complex differences in the major proteolytic players not only over the time course of germination, but also between cultivars, involving both known and novel endoproteolytic specificities.
Materials and methods
Materials
Diethyl ether (99.5% purity), NaCl (>99.5% purity), Pierce Trypsin Mass Spectrometry grade, Iodoacetamide ultra-pure (98% purity) proteomics Grade, TCEP (98% purity), Pierce Water LC-MS Grade, Acetonitrile (99.9%), Trifluoroacetic acid (99.5%) biochemistry grade were procured from VWR chemicals and 0.1% (v/v) formic acid LC-MS grade was procured from Fisher Scientific.
Methods
The methods are described below and summarized schematically in Fig 1.
Germinating chickpeas
Garbanzo and small brown chickpeas were soaked separately for 12 hours, and then divided into 2 parts in a 30:70 ratio. The part with 30% chickpeas of the initial batch was divided into 3 glass bottles and were used as soaked samples. The remaining 70% were divided into 3 containers and germinated at 18°C. Samples of germinated chickpeas were taken out from each container after 2, 4 and 6 days of germination after 12 hours of soaking in water. Thus 4 times points; 12 hours (soaked), 60 hours (2 days germinated), 108 hours (4 days germinated) and 156 hours (6 days germinated) were considered for further studies. At each time point the samples, in glass jars, were immediately immersed in a water bath and heated to 90°C for 20 minutes to eliminate any further proteolytic activity. Following cooling the seed coats were then removed manually and the dehulled seeds chopped into pieces before freeze-drying and grinding to a fine powder for storage at -20°C before analysis.
Defatting
To defat the fine chickpea powder, 30ml diethyl ether [20, 21] was added per 5gm sample and shaken for 2 minutes, and then put in an ice bucket for 15 minutes and the supernatant discarded. This was repeated three times. The defatted samples were air dried at room temperature and then stored in 50 mL polypropylene tubes at -20°C.
Determination of Soluble Nitrogen
The crude protein content was estimated by the Kjeldahl method using the conversion factor of 6.25 [22] on three replicates and protein concentration for each sample is reported as the average of these in Table 1. In brief, 0.5 g of sample was taken into Kjeldahl tubes and a Kjeldahl digestion tablet was added to them. 25 ml concentrated H2SO4 was added to each of the tube and they were placed in digestion chamber at 200°C for 30 minutes and then at 300°C and 400° C for 30 minutes respectively. The flask was then cooled and 75 mL of water was added carefully. The tubes were placed in distillation unit where ammonia was collected in 25ml boric acid. Distillation was carried out until ~ 150ml of distillate was collected. It was then titrated against 0.1 M HCl.
Peptide extraction and filtration from non-trypsinised samples
Defatted chickpea powder from all four time points (soaked, 2,4 and 6 days with 3 replicates each) were resuspended in separate aliquots in 0.2M NaCl solution and in distilled water in glass beakers as 10%w/w solutions by dissolving 5gm of chickpea powder in NaCl and, making the total volume to 50ml to extract albumin and globulin peptides [23, 24]. The dispersions were stirred for 2 hours by a magnetic stirrer at room temperature and then placed in the refrigerator overnight. Next day, the extracts containing soluble peptides were collected by placing them in polypropylene tubes and centrifuging at 12,000g for 20 minutes at 20°C. The soluble supernatant (15mL) combined from NaCl and water extraction were transferred very carefully into 10kD MWCO centrifugal filtration module (Amicon Ultra Centrifugal Filter, 10 kDa MWCO) and centrifuged at 5000g for 20 minutes. The collected permeate fractions were stored at -20°C before further processing for injecting into the LC column for LC/MS-MS. Two technical replicates of the mass spectrometry analysis were obtained from each of the biological replicates.
Preparation of trypsinised samples
For preparing the trypsinised samples, 1ml of soluble peptide extract (supernatant after 12,000 x g centrifugation as described above) was diluted 10 times with HPLC grade water. 50μl of the peptide extract from each sample was retained for digestion. The samples were trypsinised as Pierce manufacturer’s instructions for MS grade trypsin. Each sample was first reduced by incubation for 1 hour at 60°C in the presence of 3mM TCEP (tris(2-carboxyethyl)phosphine, 98% purity)) followed by alkylation by incubation in the dark for 30 minutes at room temperature in the presence of Iodoacetamide (final concentration of 12mM, 98% purity). Trypsin was added in a ratio of 1 μg enzyme to 10μg sample, and samples were incubated overnight at 37°C. The following day the samples were dried on a vacuum evaporator (miVac Duo concentrator, Genevac UK) followed by resuspension in 20μl 0.5% TFA for de-salting using C18 Ziptips (ZTC18S096) according to the manufacturer’s guidelines. Peptides were finally resuspended in 0.1% formic acid prior to loading onto the mass spectrometer.
Peptide detection using LC-MS/MS
Peptides were characterised using Orbitrap LC-MS/MS. An equivalent amount of each sample (5μl) was injected onto a Dionex Ultimate 3000 (RSLCnano) chromatography system connected to a Thermo Scientific Q Exactive mass spectrometer. Tryptic and non-tryptic peptides from each sample were loaded onto a fused silica emitter (75 μm ID, pulled using a laser puller (Sutter Instruments P2000), packed with Reprocil Pur C18 (3 μm) reverse phase media (Dr. Maisch HPLC GmbH) and was separated by an increasing acetonitrile gradient from 1% Solvent A (0.5% acetic acid, 2.5% acetonitrile) to 30% Solvent B (0.5% acetic acid, 97.5% acetonitrile) over 60 minutes at a flow rate of 250 nL/min. The QExactive MS/MS has an accuracy of less than 3 ppm (external) RMS. The mass spectrometer was operated in data dependent TopN 12 mode, with the following settings: mass/charge (m/z) range 300-1600Th; resolution (m/Δm) for MS1 scan 70,000 at m/z = 200; Automatic Gain Control (AGC) target 3e6; resolution for MS2 scan 17,500 at m/z = 200; AGC target 5e4. The mass spectrometry data have been deposited to the PRIDE Archive (http://www.ebi.ac.uk/pride/archive/) via the PRIDE partner repository (identifier PXD048224).
Analysis of mass spectra
Raw MS/MS spectra were analyzed in MaxQuant software version 2.0.3.0 [25]. Technical replicates yielded similar total peptide counts, with greater variability in peptide counts among the biological replicates (S1 File). Accordingly, peptide counts obtained from the two technical replicates were pooled for further analysis. The raw files were searched against the Cicer arietinum protein database (Uniprot, Downloaded 27/09/2021, 38500 protein entries). Peptides between 6–25 residues were searched for, with variable modifications considered for methionine oxidation and protein N-terminal acetylation. Up to 2 missed cleavages were allowed. The protein false discovery rate (FDR) was set at ≤ 0.01 with a peptide FDR of ≤ 0.01. Match between runs (MBR) and label-free quantification (LFQ) options were selected. MS/MS tolerance was set at 20 ppm. For non-digested samples a nonspecific digestion mode was selected. For digested samples, trypsin was selected as the enzyme, and a fixed modification of carbamidomethylation on cysteine. MaxQuant output files were further analysed. Peptidomic (non-typsinised) analyses were carried out after filtering the peptides based on PEP (Posterior Error Probability) ≤0.001, where PEP is the probability of a false hit. Quantitative analyses were carried out using R [26]. pLogo [27] was used to visualise the enrichment and depletion of amino acids around N and C terminal cleavage sites separately, of the set of unique identified peptides. Letter size is proportional to statistical significance of either enrichment (above the line) or depletion (below the line), with a red line indicating the size of a statistically significant effect. Smaller illegible amino acid letters are typically insignificant. Both peptide counts and log10 of intensities provided approximate relative quantifications of peptides. Peptide and protein distribution across the top 100 identified proteins (defined by peptide count) within each cultivar were tabulated for major functional categories which we defined manually (Fig 5), since gene ontology is underdeveloped for the major protein classes in the chickpea seed.
Results
Differentiation between peptide N and C terminal motifs
We investigated the make-up of chickpea seeds at various stages of germination, searching for endogenous peptides (6–25 residues long) by LC-MS/MS. Analysis of trypsinised samples enabled us to understand the protein make-up of the seeds that are the major contributors to the identified peptides, and also to understand functional changes that may arise from changes in protein composition.
After germination, there was a modest increase in protein content as a percentage of dry matter for both garbanzo and brown chickpeas (Table 1), which may reflect both an increase of protein synthesis and a breakdown of carbohydrate and lipid. Between 600 and 1984 endogenous peptides were identified at the four germination stages in the two cultivars, yielding large enough sample sizes to characterise amino acid preferences at peptide termini.
To understand the endogenous proteolysis pattern of seed proteins and how they differ as germination progresses, we inspected four residues upstream (termed P4-P1), and four residues downstream of the peptide termini (termed P1’-P4’), since these residues commonly play roles in determining protease specificity. We first investigated the major preferences across the four time points and two cultivars in a single combined analysis. There was an avoidance of cysteines within the actual peptides (at P4-P1 of the C-terminus, and at P1’-P4’ of the N-terminus, Fig 2). A typical consequence of eukaryotic endoprotease specificity is that the strongest amino acid preferences and avoidances are seen at the P1 and P1’ sites, and very similar patterns are seen at both N and C termini (Fig 2). However, N terminal cleavages showed a striking preference for E at P4’ in the N terminal cleavage sites, and a preference for E at P3’ (Fig 2B).
Probability of residue enrichment (above line) or depletion (below line) around peptide cleavage (between positions P1 and P1’ at (a) C-termini and (b) N-termini in a merged peptidomic sample of both cultivars across all germination time points.
Since the pattern of preference for glutamic acid near the N-terminus extended out beyond the region usually playing a role in proteolysis (S1 Fig in S2 File), we considered that this preference likely reflects a selection bias after cleavage in favour of E-enriched peptides arising within the seed or during our biochemical separation of peptides. This contrast between N and C terminal cleavages was even more pronounced in chickpea, both ungerminated and three day germinated, after going through mammalian in vitro digestion, from our terminal analysis of data in an independent study [15] (S2 Fig in S2 File). Therefore, we considered the C-termini to provide a better indication of the initial endoproteolysis steps, and focussed subsequent attention on the patterns seen in the C-termini of peptides, which showed stronger amino preferences around the P1 and P1’ sites that most typically play important roles in endoproteolysis. The preferences seen at these sites may reflect motifs involving more than one amino acid, or may reflect simpler motifs dominated by a single residue site (for example, trypsin like). Principal components analysis (PCA) was used to identify a number of enriched motifs involving multiple sites (S3-S5 Figs in S2 File) but the percentage variance accounted for by these was low, with the suggested multi-site motifs, such as an alanine rich motif (S1 Table in S2 File), only accounting for a small number of peptides. Accordingly, we focussed on mapping further the single site preferences over germination, focussing the eight most enriched amino acids in C-terminal peptides (Fig 3), along with P2 Leucine, which accounted for a large number of peptide counts, and P1’ S, suggested from the PCA analysis. The pattern of motif preferences around each of these residues was then inspected (Fig 3 and S3 Table in S2 File).
The most predominant letter is significantly enriched or depleted when it crosses the red line.
Relationship of residue preferences to known endoproteolytic preferences
The preference for P1 N (Fig 2) likely reflects the action of the protease legumain, which is primarily known to cleave after asparagine [28, 29]. While a kidney bean legumain shows a preference for V, F and Y at P3 [28], there is no significant chickpea P3 residue enrichment (Fig 3), perhaps reflecting the diversity of legumain proteases encoded by different genes in plants. Although mammalian legumain shows specificity for N, and D at very low pH [30] we hypothesise that chickpea P1 N and Q cleavage may be mainly performed by legumain or legumain related protease(s), since the original paper that characterised a legumain extract confirmed its ability to cleave after both these residues in a legume [31], and their surrounding profiles of suggestive amino acid preferences are similar (Fig 3).
The preference for cleavage after arginine may reflect the activity of vignain (VPE)-related proteases, which have this activity [31]. Both vignain and legumain are associated with plant vacuoles, which are low-pH and reducing environments [28]. Another plant cysteine protease, mcII-Pa shows a 50 fold preference for arginine over lysine in the P1 position [32]. In addition, there is an apparent serine protease in chickpea that cleaves after arginine [33]. In contrast, the cleavage before R or K in the P1’ is not well mapped to known proteases, although the CPPh/CYSP1-like cysteine protease of kidney bean that digests vicilin efficiently is known to cleave between a serine and lysine [34]. P1’ R prefers R at the P1 position (Fig 3), a preference reciprocated by preferences around P1 R (Fig 3) and suggested by the PCA analysis of garbanzo (S1 Table in S2 File); but this enrichment is not seen for P1’ K, suggesting that they may arise from separate proteases.
The predominance of L at P2 (reminiscent of mammalian cathepsin cysteine proteases [29], resembles that of the castor bean cysteine proteinase enzyme cysEP, whose activity is similar to the vignain-related papain-like enzyme activity accumulating in the legume seed ricinosome which prefers L at P2, with arginine and lysine the preferred amino acids at P1 [35], as seen in Fig 3. This enzyme is associated with destruction of proteins in the endosperm during germination [35]. A more distantly related plant cysteine protease with a more extensively mapped specificity, tobacco NbCP14, has a preference for P2 L>V>I and P1 K>R>G>A [19]. This is consistent with the chickpea P2 L preference for K and R at the P1 site (Fig 3), a preference shared by the mammalian cathepsins [29].
S at P1’ could be associated with a protease resembling stem bromelain, which cleaves within a tri-serine substrate in bromelain inhibitor [36]. P1’ S shows a significant avoidance of proline at P1, suggesting that it may be generated by a distinct enzyme from the endoprotease(s) involved in cleavage before P1’A, which instead shows an avoidance of P1 L. P1’ A and P2’ A show significant enrichment for each-other and similar P1 preferences and avoidances, in particular both strongly avoiding P1 I corroborating with the PCA studies (S1 Table in S2 File).
P1 Isoleucine is the most strongly avoided amino acid at cleavage sites (Fig 2), and this avoidance is distributed across the major subsets of sites (Fig 3). This avoidance may be in common across many cysteine proteases [37]. P1’ S also does not show a significant avoidance of P1 I (Fig 3), suggesting that it may be cleaved by a different class of proteases to the other P1’ -preferring enzymes. P1’ P is strongly avoided by the vignain-like P1 [RK] sites, and also by the legumain like P1 [NQ] sites [38].
There is evidence for an aspartic protease cleaving before aspartic acid during brassica seed development [39], and a legumain-like protease of Vicia sativa cleaving after both aspartate and asparagine [10] but we do not see enrichment for cleavage before or after aspartate in chickpea (Fig 2A).
Changes in proteolytic preferences across germination timepoints and cultivars
From Fig 4 and Table 2, certain features are shared between the garbanzo and brown cultivars, with overrepresentation of P1 N strongest in soaked versus germinated seeds, and the increase of P1 K and R with germination. In other respects they differ substantially, for example the high frequency of P1’ K and R seen in soaked, 4 day and 6 day garbanzo is largely absent from 2 day garbanzo and from all four stages of brown germination. Avoidance of P1 I and V are seen at all stages in both cultivars.
Shades of green and red showing low and high values respectively in garbanzo.
The differences in apparent proteolytic signatures seen at each stage seem likely to involve multiple proteases with different specificities dominating the digestion at different stages, but overlapping each-other. In garbanzo C terminal peptides, the positively charged amino acids R and K are preferred at P1’ in soaked, at both P1 and P1’ after two days germination, but at P1 at four days. This is suggestive of one set of proteases being present earlier in germination, overlapping with another protease set which appears to dominate later in germination. This is a very significant effect in garbanzo (S3 Table in S2 File), especially at the C-termini, where at 2 days, cleavage sites are under-represented for side-by-side positive charges at both P1 and P1’ (Odds Ratio 0.47, p = 2x10-9), while at 4 days they are strongly over-represented (Odds Ratio 2.01, p = 3x10-5). This clearly indicates different proteases with positive charge preferences acting at different times. In contrast, no such enrichments for, or deficits of, two basic residues across the cleavage site were seen in brown chickpea, indicating that the set of most active proteases is very different between the cultivars (S4 Table in S2 File).
Early release of LEA proteins during germination
Gene ontology of the proteins releasing peptides during germination (S7 Fig in S2 File) suggested an increase in proteins with a role in translation during germination. However, the number of proteins with a gene ontology assignment was small, and the total number of peptides accounted for by these proteins was also small. Given the deficiencies in current gene ontology annotation for major legume seed proteins, we classified the top 100 proteins into seven major categories: storage proteins, proteases, protease inhibitors, metabolic, translation, stress and chaperones. These categories are indicated at the start of the names displayed in Fig 5. Other functions commonly seen (Fig 5) included lipid associated proteins. We analysed changes in the proteins observed in both the trypsinised garbanzo (indicative of soluble protein presence) and non-typsinised peptidomic garbanzo and brown chickpea soluble peptide samples (indicative of protein breakdown) (S8 Fig in S2 File).
Heatmap of z-scores of log10(intensity) of top 100 proteins in non-trypsinised garbanzo. Proteins were assigned to seven major categories indicated in capitals at the start of the protein name.
Peptide counts and percentage of peptides in each group for all the biological sample replicates are tabulated in S6-S8 Tables in S2 File, and summarised in Tables 3–5. Storage proteins release the greatest number of peptides from soaked and germinated seeds, and the number increases up to day six in both cultivars (Tables 3 and 4). However, the trypsinised samples show a modest reduction in the proportion of peptides derived from storage proteins with germination (Table 5), consistent with a gradual depletion of storage proteins over time. The second largest category of embryogenesis proteins contribute to about a fifth of peptides identified throughout germination in the trypsinised samples (Table 5); however embryogenesis peptide release is much more marked early in germination, especially in garbanzo (Tables 3 and 4), dropping from 28% of peptides in soaked to 10% at day 6, indicating that this class of proteins is being most efficiently hydrolysed at the start of germination. While peptide release from metabolic proteins increased over germination in both cultivars (Tables 3 and 4), there was a more complex pattern of metabolic protein representation in the trypsinised samples (Table 5). It may be that some of the metabolic proteins have completed their major function in the germinating seed, and are then being broken down at day 6, so that fewer remain in the trypsinised samples at this stage. We used Fisher’s exact test to measure the statistical significance compared to 12 hours (soaked) samples and significant p-values are indicated.
Table 3 shows that both cultivars release more embryogenesis peptides early in germination, but that the effect is far more pronounced in garbanzo than in the brown chickpea, which has fewer embryogenesis related proteins released overall. A priori we would have expected storage proteins to be gradually depleted as a proportion of released peptides over germination, but the timescale of germination studied may be too short for that in the chickpea, where there may well be substantial remaining storage protein in the original seed at day 6. The relative numbers of peptides of the defined classes show a number of pronounced shifts (Table 3) that cannot be simply explained, suggesting that the dynamics of peptide release from each class is influenced by multiple factors that vary with both time point and cultivar.
Changes in detected proteases and protease inhibitors
We tracked the peptide counts of proteases and their inhibitors from both the tryptic and non-tryptic samples. For proteases, very few have high peptide counts in the tryptic sample (S10 Table in S2 File), which may be because there is such a high preponderance of trypsin-digested storage proteins masking their detection. The most detectable putative protease was “Putative Pi starvation-induced protein” but its functional role in seeds is unknown. In the non-tryptic garbanzo sample (S8 Table in S2 File), peptides derived from aspartic protease and aspartic protease-like proteins are present throughout germination, while legumain appears at higher counts in soaked seeds and tails off during germination matching the decline in its cleavage motif (after asparagine) later in germination in both garbanzo and brown (Fig 4). Thus, while peptide counts in non-tryptic samples are an indicator of protein breakdown, they may be an indirect indicator of soluble protein concentration, although the numbers are too small to provide any reasonable quantification. Changes in protease inhibitor concentrations could also have effects on proteolysis. From the tryptic samples, the three highly detectable inhibitors Bowman-Birk type proteinase inhibitor-like, subtilisin inhibitor-like and 7S globulin-like protease inhibitors showed relatively constant levels throughout soaking and germination. Interestingly, in the non-tryptic samples, the protease inhibitors were not detectable at such high peptide counts (S8 and S9 Tables in S2 File). Together, these two observations are consistent with a relatively high stability of protease inhibitors during germination, so that they may only come to be broken down beyond day 6 of germination.
Discussion
Despite the importance of grains and legumes in the human diet, there is only patchy knowledge regarding peptide release and the temporal changes of protease activities during seed germination. We took a peptidomics approach to characterise the cleavage preferences across germination in two cultivars of chickpea. Our analysis identified a number of different endoprotease cleavage preferences, most of which were dominated by a single residue preference. Some of these we could partly align with the known cleavage preferences of known plant cysteine proteases (legumain-like P1[QN], vignain-like P1[RK], cysEP-like P2[L]), and other preferences which were difficult to map to likely proteases (K, R, A and S at P1’; A at P2’). A limitation of this study is that the proteases underpinning the cleavages are only suggestively mapped to the preferences, based on the sparse and patchy literature across a variety of legumes and other plants. Future systematic analysis of specificity by proteomics analysis of cleavage of substrates with purified enzymes [40] will in future help map out specificities more convincingly. However, the precise pH and hydration conditions in different compartments of the germinating seed may be difficult to represent in vitro. Accordingly, more extensive systematic analysis of combined cleavage of natural seed substrates in situ along with more detailed assessment of protease variation across multiple strains and conditions, can in parallel shed light on physiologically relevant protease activities.
The degree of divergence between the endoproteolytic profiles of the brown and garbanzo chickpeas was somewhat surprising. While P1 N (legumain-like) cleavages were relatively constant, the P1’ K/R preferences were high in soaked garbanzo seeds, declined by four days, and returned at six days, but overall these were much rarer in the brown cultivar. This raises very interesting questions, as to why there should be such differences, since presumably they will impact on the overall efficiency of storage protein release, impacting on seedling viability and competition. While the cultivars appear to have roughly similar distribution of proteins by 1D gel electrophoresis, they do have some biophysical differences [41], so it is possible that the proteolytic programme is optimised to differences in their storage protein substrates. However, in one study, compared to garbanzo (kabuli), brown chickpea (desi) had higher levels of both trypsin and chymotrypsin inhibitory activity, and the presence of protease inhibitors is likely to impact differentially on the suppression of certain proteases during germination [42]. Variation in protease inhibitor levels may not only reflect inherent strain differences, but may also reflect environmental differences, since exposure to a major insect pathogen of chickpea, the podborer (Helicoverpa armigera), can result in elevated expression of protease inhibitors [36]. Given that the podborer appears to have acquired the ability to proteolytically inactivate protease inhibitors in chickpea [43], it is possible to envisage very strong selection pressures on the proteolysis system in legumes and grains, where protease inhibitors whose primary function may be to inhibit pathogen or insect digestion may in turn impact on the ability of the endogenous proteases to efficiently break down the seed storage proteins.
Peptides from Late Embryogenesis Associated (LEA) proteins were markedly released during early germination, in contrast with peptides from major seed storage proteins, which increased as a proportion of the peptides released later in germination. LEAs represent an important set of under-studied chickpea seed proteins. They are compositionally highly biased, being particularly glycine rich. 22–24 kD LEA proteins accumulate during the late stage of chickpea seed formation [12]. They are likely to perform multi-functional roles, including tolerance to dehydration and cold stress, maintaining space in the mature dessicated seed, acting as membrane and protein protectors, and acting as potential anti-oxidants [44]. Thus, LEA proteins may well have multiple potential effects on yield under drought conditions, on storage stability and on vigorous germination. A key feature of these proteins is their high protein disorder [44, 45], in contrast with the more structured elements of many seed storage proteins. Thus, their rapid mobilisation during early germination may simply reflect the ease with which proteases can digest them, since neither the full proteins nor their partial digestion products are likely to form aggregates or shield hydrophobic cores from cleavage, in a well hydrated environment. For brown chickpea there was a similar, but less marked trend of early LEA peptide release during germination, indicating likely variability in cultivars with respect to these proteins or their proteolysis. Overall, their early release during germination may suggest that the primary functional role of LEA proteins is in the dormant seed, rather than in the sprout.
We noted a difference between cleavage patterns at the N and C termini of peptides, with an enrichment for glutamic acid centred around the P4’ site of N-termini that was absent from C termini (S6 Fig in S2 File). This could potentially reflect exoprotease specificity patterns, such as those of tripeptidyl aminopeptidases [46] which are abundant in seeds of the legume sesame [47]. However, given their greater representation in a separate study [15] (S2 Fig in S2 File) which subjected ungerminated and germinated chickpea to mammalian in vitro digestion (including stomach, pancreatic fluid and brush border membrane proteolysis phases), it appears that these peptides are resistant to a wide variety of proteolysis. We hypothesise that they may have undergone a modification conferring protease resistance that does not alter the overall peptide mass, such as stereo conversion of residues from L- to D- amino acids.
The peptides identified here do not include shorter di- and tri-peptides. However, it is worth noting that a study by LC-MS/MS of soybean oligopeptides [48] showed a preponderance of arginine and lysine at the termini of tripeptides that changed during germination, suggestive of their C-terminal cleavage being performed by a vignain-like protease.
Major classes of peptides identified here that are released during germination include transient LEA-protein derived peptides, and persistent, protease-resistant glutamate rich vicilin-derived peptides. It will be of interest to explore if there are any consequences for gut comfort or health arising from ingesting these peptides in the hydrolysed form generated by seed germination. The proteolysis of seed storage proteins during germination may improve digestibility in the human diet, and breakdown of certain peptides and proteins may also reduce allergenicity [49–51]. Future studies of such health implications should pay careful attention to variation within and between cultivars, some of which is likely to reflect their different growth environments and pathogen exposures.
Supporting information
S1 File. S1 File which contains 5 excel sheets containing protein list along with sequence coverages at different stages of germination and peptide files from MaxQuant output, namely: a) Peptides_Non-trypsinised brown, b) Peptides_Non-tryp Garbanzo, c) Peptides_Trypsinised Garbanzo, d) Proteins trypsinised garbanzo, e) Proteins non-trypsinised, f) Proteins non-trypsinised brown.
https://doi.org/10.1371/journal.pone.0307481.s001
(XLSX)
S2 File. S1-S10 Tables and S1-S8 Fig contain details of information regarding peptide counts, cleavage site preferences and protein expression during different stages of germination.
https://doi.org/10.1371/journal.pone.0307481.s002
(DOCX)
Acknowledgments
We thank Darragh Flynn (The Happy Pear) and Cristina Marino Buslje (Leloir Institute) for discussions, and Neeladri Roy (RCSI) and Selene Pedros-Garrido (UCD) for technical assistance. TOC is partially generated using biorender.com.
References
- 1. Lamberg‐Allardt C.; Bärebring L.; Arnesen E.; Nwaru B. I.; Þórisdóttir B.; Ramel A.; et al. Animal versus Plant-Based Protein and Risk of Cardiovascular Disease and Type 2 Diabetes: A Systematic Review of Randomized Controlled Trials and Prospective Cohort Studies. Food & Nutrition Research 2023, 67. pmid:37050925
- 2. Espinosa-Marrón A.; Adams K.; Sinno L.; Cantu-Aldana A.; Taméz M.; Marrero A.; et al. Environmental Impact of Animal-Based Food Production and the Feasibility of a Shift toward Sustainable Plant-Based Diets in the United States. Frontiers in Sustainability 2022, 3. https://doi.org/10.3389/frsus.2022.841106.
- 3. Gibbs J.; Cappuccio F. P. Plant-Based Dietary Patterns for Human and Planetary Health. Nutrients 2022, 14 (8), 1614. pmid:35458176
- 4. Boukid F. Chickpea (Cicer Arietinum L.) Protein as a Prospective Plant‐based Ingredient: A Review. International Journal of Food Science & Technology 2021, 56 (11), 5435–5444. https://doi.org/10.1111/ijfs.15046.
- 5. Sofi S. A.; Singh J.; Muzaffar K.; Majid D.; Dar B. N. Physicochemical Characteristics of Protein Isolates from Native and Germinated Chickpea Cultivars and Their Noodle Quality. International Journal of Gastronomy and Food Science 2020, 22, 100258. https://doi.org/10.1016/j.ijgfs.2020.100258.
- 6. Rachwa-Rosiak D.; Nebesny E.; Budryn G. Chickpeas—Composition, Nutritional Value, Health Benefits, Application to Bread and Snacks: A Review. Critical Reviews in Food Science and Nutrition 2015, 55 (8), 1137–1145. https://doi.org/10.1080/10408398.2012.687418.
- 7. Rodríguez‐Sifuentes L.; Marszalek J. E.; Chuck‐Hernández C.; Serna‐Saldívar S. O. Legumes Protease Inhibitors as Biopesticides and Their Defense Mechanisms against Biotic Factors. International Journal of Molecular Sciences 2020, 21 (9), 3322. pmid:32397104
- 8. Bera I.; O’Sullivan M.; Flynn D.; Shields D. C. Relationship between Protein Digestibility and the Proteolysis of Legume Proteins during Seed Germination. Molecules 2023, 28 (7), 3204. pmid:37049968
- 9. Chrispeels M. J.; Boulter D. Control of Storage Protein Metabolism in the Cotyledons of Germinating Mung Beans: Role of Endopeptidase. Plant Physiology 1975, 55 (6), 1031–1037. pmid:16659204
- 10. Müntz K. Proteases and Proteolytic Cleavage of Storage Proteins in Developing and Germinating Dicotyledonous Seeds. Journal of Experimental Botany 1996, 47 (5), 605–622. https://doi.org/10.1093/jxb/47.5.605.
- 11. Chang Y. L.; Alli I.; Konishi Y.; Ziomek E. Characterization of Protein Fractions from Chickpea (Cicer Arietinum L.) and Oat (Avena Sativa L.) Seeds Using Proteomic Techniques. Food Research International 2011. https://doi.org/10.1016/j.foodres.2011.08.001.
- 12. Mishra S.; Sanyal I.; Amla D. V. Changes in Protein Pattern during Different Developmental Stages of Somatic Embryos in Chickpea. Biologia Plantarum 2012, 56 (4), 613–619. https://doi.org/10.1007/s10535-012-0124-0.
- 13. Ohanenye I. C.; Sun X.; Sarteshnizi R. A.; Udenigwe C. C. Germination Alters the Microstructure, in Vitro Protein Digestibility, α‐Glucosidase and Dipeptidyl Peptidase‐IV Inhibitory Activities of Bioaccessible Fraction of Pigeon Pea (Cajanus Cajan) Seeds. Legume Science 2021. https://doi.org/10.1002/leg3.79.
- 14. Joye I. Protein Digestibility of Cereal Products. Foods 2019, 8 (6), 199. pmid:31181787
- 15. Nitride C.; Vegarud G. E.; Comi I.; Devold T. G.; Røseth A.; Marti A.; et al. Effect of Sprouting on the Proteome of Chickpea Flour and on Its Digestibility by Ex Vivo Gastro-Duodenal Digestion Complemented with Jejunal Brush Border Membrane Enzymes. Food Research International 2022, 154, 111012. pmid:35337570
- 16. Dallas D. C.; Guerrero A.; Parker E. A.; Robinson R. C.; Gan J.; German J. B.; et al. Current Peptidomics: Applications, Purification, Identification, Quantification, and Functional Analysis. Proteomics 2015, 15 (5–6), 1026–1038. pmid:25429922
- 17. Khaldi N.; Vijayakumar V.; Dallas D. C.; Guerrero A.; Wickramasinghe S.; Smilowitz J. T.; et al. Predicting the Important Enzymes in Human Breast Milk Digestion. Journal of Agricultural and Food Chemistry 2014, 62 (29), 7225–7232. pmid:24620897
- 18. Dallas D. C.; Citerne F.; Tian T.; Silva V. L. M.; Kalanetra K. M.; Frese S. A.; et al. Peptidomic Analysis Reveals Proteolytic Activity of Kefir Microorganisms on Bovine Milk Proteins. Food Chemistry 2016, 197, 273–284. pmid:26616950
- 19. Paireder M.; Mehofer U.; Tholen S.; Porodko A.; Schähs P.; Maresch D.; et al. The Death Enzyme CP14 Is a Unique Papain-like Cysteine Proteinase with a Pronounced S2 Subsite Selectivity. Archives of Biochemistry and Biophysics 2016, 603, 110–117. pmid:27246477
- 20. Feyzi S.; Varidi M.; Zare F.; Varidi M. J. A Comparison of Chemical, Structural and Functional Properties of Fenugreek (Trigonella Foenum Graecum) Protein Isolates Produced Using Different Defatting Solvents. International Journal of Biological Macromolecules 2017, 105, 27–35. pmid:28663151
- 21. Walczyk N. E.; Smith Penelope M.C.; Tovey E. R.; Roberts T. H Peanut Protein Extraction Conditions Strongly Influence Yield of Allergens Ara H 1 and 2 and Sensitivity of Immunoassays. Food chemistry 2017, 221, 335–344. pmid:27979211
- 22. A.O.A.C. (1990) Official Methods of Analysis. 15th Edition, Association of Official Analytical Chemist, Washington DC. 1990.
- 23. Liu L. H.; Hung T. V.; Bennett L. Extraction and Characterization of Chickpea (Cicer Arietinum) Albumin and Globulin. Journal of Food Science 2008, 73 (5), C299–C305. pmid:18576973
- 24. Grasso N.; Lynch N. L.; Arendt E. K.; O’Mahony J. A. Chickpea Protein Ingredients: A Review of Composition, Functionality, and Applications. Comprehensive Reviews in Food Science and Food Safety 2021, 21 (1), 435–452. pmid:34919328
- 25. Tyanova S.; Temu T.; Carlson A.; Sinitcyn P.; Mann M.; Cox J. Visualization of LC-MS/MS Proteomics Data in MaxQuant. Proteomics 2015, 15 (8), 1453–1456. pmid:25644178
- 26. Team R. C. R: A Language and Environment for Statistical Computing. MSOR Connections 2014, 1 (1).
- 27. O’Shea J. P.; Chou M. F.; Quader S.; Ryan J. K.; Church G. M.; Schwartz D. pLogo: A Probabilistic Approach to Visualizing Sequence Motifs. Nature Methods 2013, 10 (12), 1211–1212. pmid:24097270
- 28. Rotari V. I.; Dando P. M.; Barrett A. J. Legumain Forms from Plants and Animals Differ in Their Specificity. Biological Chemistry 2001, 382 (6). pmid:11501761
- 29. Dall E.; Brandstetter H. Structure and Function of Legumain in Health and Disease. Biochimie 2016, 122, 126–150. pmid:26403494
- 30. Vidmar R.; Vizovišek M.; Turk D.; Turk B.; Fonović M. Protease Cleavage Site Fingerprinting by Label‐free In‐gel Degradomics Reveals pH ‐dependent Specificity Switch of Legumain. The EMBO Journal 2017, 36 (16), 2455–2465. pmid:28733325
- 31. Kembhavi A. A.; Buttle D. J.; Knight C. G.; Barrett A. J. The Two Cysteine Endopeptidases of Legume Seeds: Purification and Characterization by Use of Specific Fluorometric Assays. Archives of Biochemistry and Biophysics 1993, 303 (2), 208–213. pmid:8512309
- 32. Bozhkov P. V.; Suárez M. F.; Filonova L.; Daniel G.; Zamyatnin A. A.; Rodríguez-Nieto S.; et al. Cysteine Protease mcII-Pa Executes Programmed Cell Death during Plant Embryogenesis. Proceedings of the National Academy of Sciences of the United States of America 2005, 102 (40), 14463–14468. pmid:16183741
- 33. Shamsi T. N.; Parveen R.; Sen P.; Fatima S. Purification and Characterization of a Novel Trypsin-like Protease from Green-Seeded Chickpea (Cicer Arientum). Preparative Biochemistry & Biotechnology 2017, 47 (5), 513–519. pmid:28278112
- 34. Rotari V. I.; Senyuk V.; Horstmann C.; Jivotovskaya A.; Vaĭntraub I. A. Proteinase A-like Enzyme from Germinated Kidney Bean Seeds. Its Action on Phaseolin and Vicilin. Physiologia Plantarum 1997, 100 (1), 171–177. https://doi.org/10.1111/j.1399-3054.1997.tb03469.x.
- 35. Than M. E.; Helm M.; Simpson D. J. W.; Lottspeich F.; Huber R.; Gietl C. The 2.0Å Crystal Structure and Substrate Specificity of the KDEL-Tailed Cysteine Endopeptidase Functioning in Programmed Cell Death of Ricinus Communis Endosperm. Journal of Molecular Biology 2004, 336 (5), 1103–1116. https://doi.org/10.1016/j.jmb.2003.12.075.
- 36. Sawano Y.; Hatano K.-I.; Tanokura M. Susceptibility of the Interchain Peptide of a Bromelain Inhibitor Precursor to the Target Proteases Bromelain, Chymotrypsin, and Trypsin. Biological Chemistry 2005, 386 (5). https://doi.org/10.1515/bc.2005.058.
- 37. Choe Y.; Leonetti F.; Greenbaum D. C.; Lecaille F.; Bogyo M.; Brömme D.; et al. Substrate Profiling of Cysteine Proteases Using a Combinatorial Peptide Library Identifies Functionally Unique Specificities. Journal of Biological Chemistry 2006, 281 (18), 12824–12832. pmid:16520377
- 38. Schwarz G.; Brandenburg J.; Reich M.; Burster T.; Driessen C.; Kalbacher H. Characterization of Legumain. Biological Chemistry 2002, 383 (11), 1813–1816. pmid:12530547
- 39. D’Hondt K.; Bosch D.; Van Damme J.; Goethals M.; Vandekerckhove J.; Krebbers E. An Aspartic Proteinase Present in Seeds Cleaves Arabidopsis 2 S Albumin Precursors in Vitro. Journal of Biological Chemistry 1993, 268 (28), 20884–20891. https://doi.org/10.1016/s0021-9258(19)36869-3. pmid:8407921
- 40. Biniossek M. L.; Niemer M.; Maksimchuk K.; Mayer B.; Fuchs J. E.; Huesgen P. F.; et al. Identification of Protease Specificity by Combining Proteome-Derived Peptide Libraries and Quantitative Proteomics. Molecular & Cellular Proteomics 2016, 15 (7), 2515–2524. pmid:27122596
- 41. Withana-Gamage T. S.; Wanasundara J. P. D.; Pietrasik Z.; Shand P. J. Physicochemical, Thermal and Functional Characterisation of Protein Isolates from Kabuli and Desi Chickpea (Cicer Arietinum L.): A Comparative Study with Soy (Glycine Max) and Pea (Pisum Sativum L.). Journal of the Science of Food and Agriculture 2011, 91 (6), 1022–1031. pmid:21328351
- 42. Singh U.; Jambunathan R. Studies on Desi and Kabull Chickpea (Cicer Arietinum L.) Cultivars: Levels of Protease Inhibitors, Levels of Polyphenolic Compounds and in Vitro Protein Digestibility. Journal of Food Science 1981, 46 (5), 1364–1367. https://doi.org/10.1111/j.1365-2621.1981.tb04176.x.
- 43. Giri A. P.; Harsulkar A.; Deshpande V.; Sainani M. N.; Gupta V. S.; Ranjekar P. K. Chickpea Defensive Proteinase Inhibitors Can Be Inactivated by Podborer Gut Proteinases1. Plant Physiology 1998, 116 (1), 393–401. https://doi.org/10.1104/pp.116.1.393.
- 44. Tunnacliffe A.; Wise M. J. The Continuing Conundrum of the LEA Proteins. The Science of Nature 2007, 94 (10), 791–812. pmid:17479232
- 45. Shih M.; Lin S. H.; Hsieh J.-W.; Tsou C.-H.; Chow T.-Y.; Lin T.-P.; et al. Gene Cloning and Characterization of a Soybean (Glycine Max L.) LEA Protein, GmPM16. Plant Molecular Biology 2004, 56 (5), 689–703. pmid:15803408
- 46. Shimotsuura I.; Yoshida N.; Ogata F.; Ito A. Evidence for and Properties of a Tripeptidyl Peptidase in Soybean (Glycine Max Merr.) Extract. Plant Science 1992, 87 (1), 1–7. https://doi.org/10.1016/0168-9452(92)90187-q.
- 47. Chen Y.; Zhu J.; Zhang C.; Kong X.; Hua Y. Sesame Water-Soluble Proteins Fraction Contains Endopeptidases and Exopeptidases with High Activity: A Natural Source for Plant Proteases. Food Chemistry 2021, 353, 129519. pmid:33740507
- 48. Nolasco E.; Krassovskaya I.; Hong K.; Hansen K. H.; Alvarez S.; Obata T.; et al. Sprouting Alters Metabolite and Peptide Contents in the Gastrointestinal Digest of Soybean and Enhances In-Vitro Anti-Inflammatory Activity. Journal of Functional Foods 2023, 109, 105780. https://doi.org/10.1016/j.jff.2023.105780.
- 49. Patil S.; Niphadkar P. V.; Bapat M. M. Chickpea: A Major Food Allergen in the Indian Subcontinent and Its Clinical and Immunochemical Correlation. Annals of Allergy, Asthma & Immunology 2001, 87 (2), 140–145. https://doi.org/10.1016/s1081-1206(10)62209-0.
- 50. Sánchez‐Monge R.; López‐Torrejón G.; Pascual C.; Varela J.; Martin‐Esteban M.; Salcedo G. Vicilin and Convicilin Are Potential Major Allergens from Pea. Clinical & Experimental Allergy 2004, 34 (11), 1747–1753. pmid:15544600
- 51. Dadon S. B.-E.; Pascual C.; Eshel D.; Teper‐Bamnolker P.; Ibáñez M. D.; Reifen R. Vicilin and the Basic Subunit of Legumin Are Putative Chickpea Allergens. Food Chemistry 2013, 138 (1), 13–18. pmid:23265449