Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Metabolomics and databases driven approach of identification of phytochemicals from seed of Salvia schimperi using high-performance liquid chromatography tandem mass spectrometry

  • M. Azene ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Center for Food Science, College of Natural and Computational Sciences, Addis Ababa University, Addis Ababa, Ethiopia

  • O. O. Olaokun ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Formal analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing

    oyinolaokun@yahoo.com

    Affiliation School of Science and Technology, Department of Biology and Environmental Science, Sefako Makgatho Health Sciences University, Pretoria, South Africa

  • B.C. Iweriebor ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing

    Affiliation School of Science and Technology, Department of Biology and Environmental Science, Sefako Makgatho Health Sciences University, Pretoria, South Africa

  • N.M. Mkolo ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Formal analysis, Investigation, Validation, Writing – review & editing

    Affiliation School of Science and Technology, Department of Biology and Environmental Science, Sefako Makgatho Health Sciences University, Pretoria, South Africa

  • C.L. Obi ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Resources, Validation, Visualization, Writing – review & editing

    Affiliation School of Science and Technology, Department of Biology and Environmental Science, Sefako Makgatho Health Sciences University, Pretoria, South Africa

  • W. Shibeshi ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Methodology, Resources, Writing – review & editing

    Affiliation Department of Pharmacology and Clinical Pharmacy, School of Pharmacy, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia

  • S. Assefa ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Methodology, Resources, Writing – review & editing

    Affiliation Department of Pharmacology and Clinical Pharmacy, School of Pharmacy, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia

  • Z. Abebe ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Methodology, Resources, Writing – review & editing

    Affiliation Center for Food Science, College of Natural and Computational Sciences, Addis Ababa University, Addis Ababa, Ethiopia

  • A. Habtamu ,

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliation Department of Pediatrics and Child Health, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia

  • K. Baye

    Contributed equally to this work with: M. Azene, O. O. Olaokun, B.C. Iweriebor, N.M. Mkolo, C.L. Obi, W. Shibeshi, S. Assefa, Z. Abebe, A. Habtamu, K. Baye

    Roles Conceptualization, Methodology, Resources, Validation, Writing – review & editing

    Affiliation Center for Food Science, College of Natural and Computational Sciences, Addis Ababa University, Addis Ababa, Ethiopia

Abstract

Salvia schimperi is widely used in Ethiopian folk medicine, particularly its raw and roasted seeds for treating ailments such as diarrhea. While numerous reports on its traditional uses and biological activities exist, limited chemical datasets are available on this plant. This study aimed to annotate and identify the phytochemical constituents in S. schimperi seed extracts. The ground samples of S. schimperi seeds (raw and roasted) were extracted with 80% methanol prior to metabolomic analysis using LCMS. Data processing and compound identification were conducted using MZmine, SIRIUS and XCMS platforms. Multivariate statistical analyses and biological targets prediction were carried out via XCMS, MetaboAnalyst, BindingDB and Therapeutic Target Databases. Annotation by SIRIUS based on ZODIAC, SIRIUS and confidence scores criteria, validated 99 of the 689 identified compounds. Among the compounds annotated by METLIN in XCMS, 105 were predicted to be of significant value based on multivariate analysis of MetaboAnalyst. Compounds annotated comprised of primary and secondary metabolites. Two alkaloid compounds (actinonin and indole acrylate) were identified as significant from SIRIUS and METLIN/MetaboAnalyst, with actinonin described as a potent antibacterial metabolite. To our knowledge, this study represents the first metabolomic fingerprinting of S. schimperi seed extracts, revealing diverse bioactive phytochemicals with nutritional and therapeutic potential. The consistent identification of actinonin suggests promising antibacterial applications. Roasting significantly alters the metabolite profile, reducing certain nutritional components such as isoleucine. These findings underline the importance of processing methods in determining phytochemical composition and bioactivity. Further research would explore the toxicity and potential functional food applications.

Introduction

In many developing countries, medicinal plants are essential as they offer a viable alternative in primary health care systems. The use of herbal medicine in these regions is motivated by various factors, including the perceived lower side effects and cost of phytomedicines compared to modern synthetic drugs, as well as the effectiveness of some plant remedies [1]. One of such medicinal plants is Salvia schimperi, commonly known as Schimper’s sage. This is a perennial shrub belonging to the Lamiaceae family, which includes about 900 species worldwide. Many species and varieties of the genus Salvia are cultivated for their aromatic nature and are used as flavoring and food condiments, as well as in cosmetics, perfumes, and medicine [2,3].

S. schimperi has various medicinal properties utilized in traditional medicine. Phytochemical analysis of the extracts revealed the presence of polyphenolic compounds such as flavonoids and tannins [4]. Research has reported on the chemical constituents of the essential oils of the leaves and flowers of S. schimperi [3] and the antinociceptive effects of the essential oil of its leaves [5]. The essential oils of the flowers and leaves contain compounds such as linalool, linalyl acetate, and α-terpineol [3], which contribute to their effectiveness against a range of pathogens, including Gram-positive and Gram-negative bacteria, as well as certain fungal strains [6]. The oils also demonstrate free-radical scavenging potential [6]. Additionally, compounds such as caryophyllene, bisabolol, and farnesene found in the essential oils of S. schimperi have been linked to its promising antidiabetic potential [7]. Despite the reports on its traditional uses and biological activities, limited chemical datasets are available for this plant. In particular there is a dearth of information on the phytochemicals and biological activities of the seed extracts.

Phytochemicals, or plant secondary metabolites, are essential for plants’ adaptation to environmental changes and interactions with other organisms [810]. These compounds are a sustainable source of lead compounds for drug discovery [11,12]. Characterizing phytochemicals is fundamental to the study of plant extracts, and advancements in technology over the past two decades have greatly enhanced this process [13,14]. Liquid chromatography coupled with mass spectrometry (LC-MS) has emerged as a preferred method due to its high throughput, soft ionization, and extensive coverage of phytochemicals [15]. Its versatility and sensitivity make LC-MS ideal for analyzing a wide variety of semi-polar compounds, including key secondary metabolites [13]. Tandem mass spectrometry (MS/MS) is particularly valuable as it provides detailed structural insights by capturing both precursor and fragment ion information, which are crucial for annotating, identifying, and dereplicating phytochemicals [16].

Phytochemical metabolomics analysis via LC-MS can be divided into untargeted and targeted approaches. The untargeted approach comprehensively analyzes all measurable chemicals, including unknowns, while the targeted approach focuses on specific predefined chemical groups [13]. LC-MS, a predominant tool for rapid characterization of these metabolites in herbal medicines, offers high resolution and sensitivity with multistage fragmentation data for detailed structural information [8,13]. Metabolites annotation tools such as MZmine, XCMS and SIRIUS aid in these analyses, facilitating the structural characterization of phytochemical mixtures and accelerating the discovery of novel active compounds [17,18].

Notably, annotating phytochemicals from untargeted mass spectrometry (MS) data often relies on spectral matching against limited reference databases [8]. Molecular networking (MN) has emerged as an effective approach for organizing complex MS/MS spectra by grouping structurally related compounds based on spectral similarity [19]. However, despite its advantages, the dereplication capabilities of MN are constrained by the availability of reference spectra and its limited ability to deeply annotate structurally similar compounds [20]. To address these limitations, several in silico tools such as MetFrag, MetFusion, ISIS, CFM-ID, and MAGMa have been developed [8]. Additionally, CSI:FingerID, integrated into the SIRIUS platform, uses machine learning to compute fragmentation trees and match experimental spectra with user-defined structure databases, thereby enhancing metabolite annotation coverage [21,22].

Therefore, this study aimed to annotate and identify the phytochemical constituents in S. schimperi seed extracts. The ground samples of S. schimperi seeds (both raw and roasted) were extracted with 80% methanol and subjected to untargeted High-Performance Liquid Chromatography Tandem Mass Spectrometry analysis. Data preprocessing and compounds annotations were conducted using web-based tools.

Materials and methods

Sample material

A sample of S. schimperi seeds were collected from Menz Mama Midir district, North Shewa Zone, Ethiopia between January 15 and February 14, 2022, identified and recorded as a voucher number of MA001 at National Herbarium, Addis Ababa University, College of Natural and Computational Sciences. The voucher specimen (voucher number MA001) was deposited at the National Herbarium, Addis Ababa University. The samples were collected, manually cleaned, dried and packed in hermetic plastic vessels, and stored at 5ºC until analysis. Roasting was performed at 185ºC for 15 minutes [23] using an electrical drying oven as communities living in Menz Mama Midir use the seed in roasted and raw forms.

Instruments and reagents

Ultimate 3000 LC combined with Q Exactive MS (Thermo), Temp functional Centrifugation (Eppendorf), ACQUITY UPLC HSS T3 (100 × 2.1 mm × 1.8 μm), Acetonitrile (Merck), Methanol (Merck), Formic acid (Merck) and DL-o-Chlorophenylalanine (Merck) was used.

Sample preparation

Each sample of S. schimperi seed (raw and roasted seeds) was crushed to fine powder in a 5-mL homogenizing tube, at 30 Hz with the aid of four 5-mm metal balls on a MM 400 mill mixer. Then 50 mg of each sample was precisely weighed into a tube and 800 μL of 80% methanol was added. Thereafter samples were vortexed for 30 s, followed by sonication for 30 min, 4°C. All samples were kept at −20°C for 1 h and centrifuged at 12000 rpm and 4°C for 15 min. Finally, 200 μL of supernatant and 5 μL of DL-o-Chlorophenylalanine (140 μg/mL) were transferred to the vial for LC-MS analysis.

Plant samples data acquisition using UPLC-MS/MS

The analysis of secondary metabolites was performed using an Ultimate 3000 LC system (Thermo) coupled to a Q Exactive Orbitrap mass spectrometer (Thermo Fisher Scientific) equipped with an electrospray ionization (ESI) source. Separation was achieved on an ACQUITY UPLC HSS T3 column (100 × 2.1 mm, 1.8 μm). The mobile phases consisted of solvent A (0.05% formic acid in water) and solvent B (acetonitrile). The elution gradient program was as follows: (0–1 min, 95% A; 1–12 min, linear gradient from 95% A to 5% A; 12–13.5 min, 5% A (isocratic); 13.5–13.6 min, return from 5% A to 95% A; 13.6–30 min, re-equilibration at 95% A). The flow rate was 0.3 mL·min ⁻ ¹, the column temperature was maintained at 40 °C, and the autosampler was set at 4 °C. Mass spectrometric data were acquired in positive ionization mode under the following conditions: ESI+: heater temp 300 °C, spray voltage 3.5 kV, capillary temperature 350 °C, sheath gas flow rate 45 arb units, S-Lens RF level at 30%, auxiliary gas flow rate 15 arb units, resolution 70,000 (at m/z 200), and scan range m/z 100–1200.

Data analysis software

Sample data acquired in centroid mode for the untargeted metabolomics analyses of methanol extracts from untreated (raw) and treated (roasted) seeds of S. schimperi was processed using the following software:

MZmine

MZmine software (v3.7.0, Windows version) (http://mzmine.github.io/, downloaded on October 24, 2023) [24] was utilized to preprocess the raw LC-MS spectra data. Firstly, the raw data was imported into the platform, and data processing was performed, including mass detection (centroid), chromatogram resolving, deisotoping, peak alignment, and filling in missing peak data. The noise level parameters were set for MS1 (2.0E3) and MS2 (3.0E2) before executing the ADAP chromatogram builder with a scan-to-scan accuracy set at 0.005 Da or 10 ppm, while keeping other parameters at default values. Finally, the resulting MS1 feature data was exported and save in Excel (.csv) file while the MS2 feature data was exported and saved in an MGF file in SIRIUS/CSI:FingerID format.

SIRIUS

SIRIUS software (v5.0, Windows version) (https://bio.informatik.uni-jena.de/software/sirius/, downloaded on November 6, 2023) [22] was used for the MS2 for compound annotation after the MGF file from the MZmine was imported into the software. Then for formula prediction, the program was set to search the entire database while enabling ZODIAC, CSI:FingerID, and CANOPUS other parameters were left at their default values. The “compute all” tab was selected to run the annotation by molecular formula prediction and structure elucidation step. In the Sirius overview tab where the results were displayed, the compound with the highest Sirius score was selected as the potential candidate. In addition to this, the “export summarize” option was enabled to facilitate the export of MS2 annotation search results of recommended compounds. The identified compounds were generated according to their confidence scores [25].

XCMS

The analysis of LC-MS spectra data of untreated and treated samples was facilitated using the XCMS online [26] and on R package [27]. For the XCMS online (https://xcmsonline.scripps.edu/) spectra data was imported onto the platform using the pairwise drop-down on create new job. Then HPLC/single quad parameter was selected prior to running the program. For the XCMS (v3.20.0, an R package) the command line utilized to process spectra data. XCMS was used for feature extraction, feature alignment and data analysis, Annotation of compounds was enabled by the METLIN on XCMS online [2831]. The resulting Excel csv files showcasing the total aligned peak intensity features of both XCMS analyses were compared. The XCMS online was selected for comparison of the untreated and treated samples by focusing on the feature extraction capability.

BindingDB and therapeutic target database for biological targets prediction

The prediction of biological target of compounds was conducted using the BindingDB and Therapeutic Target Database. The exported recommended compounds from the Sirius MS2 search were further screened using their Simplified Molecular Input Line Entry System (SMILES) numbers. Each compound’s canonical SMILES was imported into the BindingDB database web tool (open-source license) to identify potential protein targets by clicking the “Find My Compound’s Target” button under the “Special tools” section. The biological targets in BindingDB were filtered for 85% similarity to existing ligand molecules [32]. Additionally, drug similarity was analyzed using the Therapeutic Target Database (TTD) web tool (open source license) [33].

Statistical analyses

The resulting peak intensity features of XCMS online was subjected to a statistical analysis including using the MetaboAnalyst 5.0 [34]. The processed data was exported as CSV files containing information of the detected peaks, retention time and m/z values, to MetaboAnalyst 5.0. Data were normalized by sum, auto scaled and analyzed using principal component analysis (PCA) and orthogonal partial least-squares discriminant analysis (OPLS-DA). For the PCA, five components were analyzed to explain the variance between untreated and treated extracts. Then the OPLS-DA was used to further assess the significant difference between secondary metabolites extracted from the untreated and treated extracts. Multivariate data analyses tools were used to reveal signature metabolites that gratify the basis of p-value < 0.05, false discovery rate (FDR) corrected p-values (q-values) < 0.05, fold change (FC) > 2.0, and variable importance in projection (VIP) > 1.2. Cross-validation (CV) was performed using a tenfold CV method indicating the accuracy, Q2 and R2 values. A permutation test was also done to validate the model with the permutation p-value of p < 0.01.

Result and discussion

This study represents one of the first attempts to investigate and describe the metabolomic profile of S. schimperi seed, specifically comparing roasted (ROS) (treated) and unroasted (RAW) (untreated) seed methanol extracts. The primary goal was to identify metabolomic differences, focusing on secondary metabolites such as polyphenols (e.g., flavonoids), terpenoids, alkaloids, and other compounds, which may serve as bio-fingerprints and with possible antibacterial potential. An untargeted metabolomics analysis was conducted using LC-MS/MS (liquid chromatography–tandem mass spectrometry). This high-resolution MS data acquisition method allowed for the separation and possible detection of numerous phytochemicals, identified by their mass-to-charge ratio (m/z) and retention time (RT).

Data processing on MZmine and phytochemical annotation using SIRIUS 5.0

The LC-MS spectra obtained from the methanolic extracts of both treated and untreated S. schimperi seeds were subjected to peak detection and integration, resulting in the generation of 2D matrices. These matrices included variable indices (paired m/z-retention time), sample identifiers (observations), and peak areas. Several software packages are available for the processing of metabolomic data, but their performance can vary significantly, potentially leading to different outcomes. In this study, we evaluated the quality of the processed data matrices obtained using two freely available tools, XCMS and MZmine, by comparing them with the raw data. MZmine, XCMS Online, and the XCMS R package identified 774, 2,756, and 2,530 m/z features, respectively (Table 1). MZmine detected the fewest variables/peaks, possibly because some extracted noise peaks were removed directly through visual inspection. The total ion chromatogram (TIC) of S. schimperi extracts (Fig 1), obtained through a 30-minute gradient elution, revealed MS1 features ranging from m/z 100.0223 to 999.2317 and the maximum TIC intensity reached 2.9 × 10⁶.

thumbnail
Fig 1. Total ion chromatography of S. schimperi seed methanol extracts (treated (ROS) and untreated.

https://doi.org/10.1371/journal.pone.0335929.g001

SIRIUS platform computes candidate molecular formulas (MFs) by matching the experimental MS1 spectra against predicted isotopic patterns and assessing how well the fragmentation spectra align with candidate MFs through fragmentation trees. To enhance high degree annotation, SIRIUS integrates advanced algorithms such as ZODIAC for enhanced MF prediction, CSI:FingerID for putative structural annotation, and COSMIC for confidence scores, alongside CANOPUS for chemical class annotation [13]. For this study, the spectra features obtained from MZmine were imported into SIRIUS 5.0. Key attributes such as retention time, ion mass, adduct, confidence ranking and COSMIC confidence score were generated. Candidate structures for the compounds of S. schimperi were identified on SIRIUS platform by searching the top CSI:Finger ID hits across various databases, including Natural Products, COCONUT, and CHEBI. A confidence score (value between 0 and 1) given to each candidate predicted the probability of correct structural identification. The results generated were then manually curated, and the fingerprint vectors of the top candidates were exported for further analysis. From this workflow, a total of 689 compounds were identified in the extracts (S1 File in S1 Data) This comprised both primary and secondary metabolites, including fatty acids, carbohydrates, Shikimates and phenylpropanoids, alkaloids, terpenoids and polyamines. To ensure reliability, only candidates that passed the selection criteria thresholds were retained. Specifically, compounds with a COSMIC confidence score > 0.1 were included while no identification was made if the ZODIAC score was below 50% or if the SIRIUS score was below 50% without a corresponding ZODIAC score [3538]. Based on this selection, only 99 compounds were validated (Table 2), which were further categorized into 25 chemical classes (Table 3). In addition, the export summary function of SIRIUS, which is designed to automatically annotate and recommend compounds based on MS2 spectral data, generated annotations for ten compounds as shown in Table 4. However, two of these compounds; 2-[4-(4-methoxyphenyl)-2,2-dimethyloxan-4-yl]ethyl-[(4-methoxyphenyl)methyl]azanium, and 2-methyl-4,6-dinitro-5-(4-phenylpiperazin-1-yl)-1H-benzimidazole were absent in Table 2. This is because both compounds failed to satisfy the scoring thresholds set to ensure high-confidence identification. The complementary use of COSMIC and CANOPUS further strengthened the annotation process. COSMIC provided the confidence levels for molecular structure annotations for selected compounds, while CANOPUS offered broad classification of all measured compounds into chemical classes. This integrated approach yielded a comprehensive insight into the phytochemical profile S. schimperi. This approach has been successfully employed in previous studies, including the detailed chemical characterization of the Pingxiao capsule [39]. In the present study, it proved equally effective, revealing that the identified compounds were predominantly alkaloids, fatty acids, amino acids, and peptides.

thumbnail
Table 2. Compounds in extracts of S. schimperi based on confidence, Zodiac and Sirius scores in SIRIUS 5.0 software.

https://doi.org/10.1371/journal.pone.0335929.t002

thumbnail
Table 3. Characteristics of class of the compounds identified by SIRIUS software.

https://doi.org/10.1371/journal.pone.0335929.t003

thumbnail
Table 4. Compounds identified using the MS2 spectra results of “export summarized” option enabled by SIRIUS 5.0 software.

https://doi.org/10.1371/journal.pone.0335929.t004

Data analysis and phytochemical annotation using XCMS and METLIN

The data processing done on XCMS Online and on R packages (Table 1) resulted in the difference of 226 m/z features, for which no clear explanation could be provided. XCMS Online, a cloud-based metabolomics informatics platform, has undergone significant improvements, including the incorporation of paired two-group comparisons, higher-order meta-analysis, and multiple group comparisons [40,41]. In addition, statistical tests have been introduced into it, and the interactive visualization tools have been improved and expanded to better deconvolve complex untargeted metabolomic datasets. These statistical tests were systematically performed following feature detection and profile alignment, providing a direct interface for visualization [40]. For the S. schimperi seed extracts, feature detection produced a mean value of 1995.5 ± 86.25 features in RAW (untreated) samples and 1952.81 ± 212.02 features in ROS (treated) samples (S2 File in S1 Data). The extracted features were then subjected to principal component analysis (PCA) to explore and visualize the metabolite variation between the two treatment groups. The PCA score plot revealed that the first principal component (PC1) accounted for 56.0% of the observed variation, while the second principal component (PC2) explained 16.0% (Fig 2A), clearly distinguishing between the RAW and ROS extracts. To further evaluate sample variability, the Distance to the Model of X-space (DModX) was calculated. The results indicated that three RAW samples and two ROS samples exhibited higher DModX values (Fig 2A), suggesting possible outliers with greater deviation from the PCA model. In PCA, a high DModX value reflects a point that departs significantly from the correlation structure captured by the model, thereby flagging it as a potential outlier.

thumbnail
Fig 2. A. Visualization of features of S. schimperi seed extracts of raw (untreated) (RAW) and roasted (treated) (ROS) in Multivariate analysis PCA plot, illustrating PCA scores unit variance scaling of features based on XCMS online prediction. B. Visualization of features of S. schimperi seed extracts of raw (untreated) (RAW) and roasted (treated) (ROS) in Cloud plot of XCMS online, illustrating features significantly upregulated (color in green) and significantly downregulated (color in red). C. Visualization of features of S. schimperi seed extracts of raw (untreated) (RAW) and roasted (treated) (ROS) in Fold change on MetaboAnalyst webtool, illustrating sig. Up (significantly upregulated) (color in red), sig. Down (significantly downregulated) (color in blue), and unsig. (unsignificant) (color in grey).

https://doi.org/10.1371/journal.pone.0335929.g002

Feature-level analysis in XCMS Online identified a total of 2756 features, of which 1081 were upregulated and 1675 were downregulated (S3 File in S1 Data). When filtered using significance thresholds of p ≤ 0.01 and fold change ≥ 1.5, a total of 1333 significant features were retained, comprising 589 upregulated and 744 downregulated features (Fig 2B). In contrast, analysis using MetaboAnalyst yielded 673 significantly upregulated features, 574 significantly downregulated features, and 1509 unsignificant features (Fig 2C). This difference underscores the variance in interpretation between the two analytical webtools.

The METLIN-guided in-source annotation (MISA) algorithm annotates m/z values of in-source fragments (ISFs) using experimental low-energy MS/MS spectra from the METLIN library, and is applied to the datasets generated by XCMS Online [40,41]. Through this process, a total of 2087 compounds were predicted, of which 451 were classified as unknowns. Among the known compounds, 105 metabolites were identified as significant based on the Significance Analysis of Metabolites (SAM) performed in MetaboAnalyst (S4 File S1 Data), with some of the compounds shown in Table 5.

thumbnail
Table 5. Significant compounds based on significant Analysis of Metabolites of S. shimperi extracts by METLIN on XCMS.

https://doi.org/10.1371/journal.pone.0335929.t005

The annotation results from METLIN revealed a wider diversity of compound classes compared to SIRIUS. While both platforms detected similar classes, METLIN additionally annotated compounds such as flavonoids, saponins, and organic acids. Notably, two alkaloids, actinonin and indole acrylate, were consistently identified by both SIRIUS and METLIN. Of particular interest, actinonin has been reported as a potent antibacterial alkaloid (Table 6) [42], highlighting the pharmacological relevance of the annotated metabolites.

thumbnail
Table 6. Compounds, targets and biological activities prediction using the BindingDB and Therapeutic Target Database.

https://doi.org/10.1371/journal.pone.0335929.t006

Multivariate analysis of S. schimperi untreated and treated extracts

The principal component analysis (PCA) score plot showed distinct groupings between the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) (Fig 3A). The five principal components (PC) used in generating the PCA gave a total explained variance of 96.5% (Fig 3B,C). From these five PCs, the first two PCs [i.e., PC1 (56.7%) and PC2 (19.8%)] captured the most information from the given datasets (Fig 3B,C), which led to a clear separation between the treated and the untreated extracts.

thumbnail
Fig 3. A. Multivariate data analyses of the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) showing their principal component analysis (PCA) score plot. B. Multivariate data analyses of the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) showing their paired score. C. Multivariate data analyses of the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) showing their Scree plot variance explained.

https://doi.org/10.1371/journal.pone.0335929.g003

To further explain the groupings in the PCA, an orthogonal partial least-squares discriminant analysis (OPLS-DA) was performed (Fig 4). The score plot had a T score of 56.4%, which indicates that 56.4% of the variance between untreated and treated was explained (Fig 4A). The OPLS-DA score plot [Q2 = 0.977, R2Y = 0.999 (p < 0.01)] (Fig 4B,C) used to categorize the detected secondary metabolites as either from untreated or treated samples, effectively distinguishing between the two groups (Fig 4A). Authors have demonstrated the score plot to discriminated the secondary metabolites of mycelia of Ganoderma boninense into monokaryon and dikaryon [43]. Furthermore, the plot also showed an orthogonal T score of 14.1% indicating that 14.1% of the variations do not contribute to the separation of the groups, which reduced the “noise” in the model, thereby improving the accuracy of group separation (Fig 4A). The group classification via PCA and OPLS-DA is indicative of the distinction between the metabolite profile of untreated (RAW) and treated (ROS) of S. schimperi seed extracts.

thumbnail
Fig 4. A. The multivariate data analyses of the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) showing their orthogonal partial least-squares discriminant analysis (OPLS-DA) classification using different number of components. B. The multivariate data analyses of the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) showing their cross-validation test. C. The multivariate data analyses of the seed extracts of the S. schimperi raw (untreated) (RAW) and roasted (treated) (ROS) showing their permutation test.

https://doi.org/10.1371/journal.pone.0335929.g004

Univariate analysis was used where the volcano plot was made using fold change (FC) values (>2.0) and false discovery rate (FDR) corrected p-values (q-values) < 0.05, as the selection criteria for identifying the degree of metabolic molecule difference, as shown in Fig 4. A total of 1535 metabolites were not significant, while 572 were downregulated and 649 were upregulated (Fig 4). Among the 15 metabolites (Fig 5) with a VIP greater than 1.2, include M569T9 (isoleucine), M693T8 (3,5,3’-Triiodothyronine), M411T1 (analyte), M368T9 (himbacine), M217T1 (metameconine), M442T6 (L-aspartate), M362T1 (funtumine), M148T1 (glutamate), M608T8 (glutamine), M369T9_2 (analyte), M715T13 (patuletin 3-(6’‘-(E)-feruloylglucoside), M568T12 (analyte), M611T8 (metrizoic acid), M369T1 (octadecanedioic acid) and M412T1 ((S)-Autumnaline). Isoleucine, which was identified as a significant compound in the analyzed extracts, is one of the branched-chain amino acids that is an essential building block of protein synthesis in living organisms [44]. In this study among other compounds (3,5,3’-Triiodothyronine, L-aspartate, glutamine and metrizoic acid) it was also found to be more abundant in the roasted than in raw of S. schimperi. The compounds that were more abundant in the raw seeds of S. schimperi are himbacine, metameconine, funtumine, glutamate, patuletin 3-(6’‘-(E)-feruloylglucoside, octadecanedioic acid and (S)-Autumnaline. The top 15 downregulated (Fig 6) include M569T9: isoleucine, M704T9 (coumaroylspermin), M222T7 ((6E)-3,7-Dimethyl-6-octene-1,2,3,8-tetrol), M400T6 (analyte) and M432T3 (tryptophan) while the upregulated include M365T1 (Analytes), M453T8 (Serine), M534T1 (Analytes), M369T1 (Miraxanthin-V), M474T1 (Analyte), M399T7 (Analyte), M400T7 (Analyte), M188T3 (L-Phenylalanine), M339T5 (Phosphophosphinate) and M217T21 (7,10-hexadecadienoic acid). The downregulation of isoleucine observed in the roasted S. schimperi seed indicates that the raw seeds of the plant have more nutritional value than the roasted.

thumbnail
Fig 5. Volcano plots analysis of top 15 up-regulated and down-regulated metabolites of the untreated and treated extracts of S. schimperi ranked.

Key: M365T1: Analyte, M453T8: Serine, M534T1: Analyte, M369T1: Miraxanthin-V, M474T1: Analyte, M399T7: Analytes, M400T7: Analyte, M188T3: L-Phenylalanine, M339T5: Phosphophosphinate, M217T21: 7,10-hexadecadienoic acid; M569T9: isoleucine, M704T9: coumaroylspermin, M222T7: (6E)-3,7-Dimethyl-6-octene-1,2,3,8-tetrol, M400T6: analyte, M432T3: tryptophan.

https://doi.org/10.1371/journal.pone.0335929.g005

thumbnail
Fig 6. Distribution patterns of untreated and treated extracts of S. schimperi compounds.

The scatter plot of VIP coordinates showing top 15 significant compounds (VIP > 1.2). Key: M569T9: isoleucine, M693T8: 3,5,3’-Triiodothyronine, M411T1: analyte, M368T9: himbacine, M217T1: metameconine, M442T6: L-aspartate, M362T1: funtumine, M148T1: glutamate, M608T8: glutamine, M369T9_2: analyte, M715T13: patuletin 3-(6’‘-(E)-feruloylglucoside, M568T12: analyte, M611T8: metrizoic acid, M369T1: octadecanedioic acid and M412T1: (S)-Autumnaline.

https://doi.org/10.1371/journal.pone.0335929.g006

Conclusion

This study revealed that S. schimperi seeds contain significant amounts of various chemical compounds, including flavonoids, saponins, alkaloids, small peptides, polyamines, amino acids, and organic acids. Notably, two alkaloids which are actinonin and indole acrylate, and the amino acid isoleucine were identified using SIRIUS and METLIN. Isoleucine, an essential amino acid, is known for its high nutritional value, while actinonin has been demonstrated to possess potent antibacterial properties. These findings have highlighted the potential value of S. schimperi, and further research into its toxicity and antimicrobial activities is recommended to fully harness its benefits.

Supporting information

Acknowledgments

We would like to thank Sefako Makgatho Health Sciences University and University of Venda for the technical and facility support throughout the study.

References

  1. 1. Wachtel-Galor S, Benzie IFF. Herbal Medicine: An Introduction to Its History, Usage, Regulation, Current Trends, and Research Needs. In: Benzie IFF, Wachtel-Galor S, editors. Herbal Medicine: Biomolecular and Clinical Aspects. 2nd edition. Boca Raton (FL): CRC Press/Taylor & Francis; 2011.
  2. 2. Will M, Claßen-Bockhoff R. Why Africa matters: evolution of Old World Salvia (Lamiaceae) in Africa. Ann Bot. 2014;114(1):61–83. pmid:24966353
  3. 3. Endeshaw MM, Gautun OR, Asfaw N, Aasen AJ. Volatile oil constituents of the Ethiopian plantSalvia schimperi Benth. Flavour Fragr J. 2000;15(1):27–30.
  4. 4. Umer S, Andualem G, Getnet F, Alemayehu H, Tekewe A, Kebede N. Antidiarrheal and antibacterial activities of hydroalcoholic extracts of Salvia schimperi Benth from Ethiopia. International Journal of Pharmacognosy. 2015;2(6):290–5.
  5. 5. Masresha B, Makonnen E, Debella A. In vivo anti-inflammatory activities of Ocimum suave in mice. J Ethnopharmacol. 2012;142(1):201–5. pmid:22561892
  6. 6. Asres K, Ahfaha H, Mazumder A, Bucar F. Leaf Essential oils of <i>Salvia nilotica</i> and <i>Salvia schimperi</i>: Their Antimicrobial and Antioxidant Activities. Eth Pharm J. 2008;26(1).
  7. 7. Ya’ni AA, Eldahshan OA, Hassan SA, Elwan ZA, Ibrahim HM. Antidiabetic effects of essential oils of some selected medicinal Lamiaceae plants from Yemen against a-glucosidase enzyme. J Phytochem Biochem. 2018;2(106):2.
  8. 8. Liu F-J, Jiang Y, Li P, Liu Y-D, Xin G-Z, Yao Z-P, et al. Diagnostic fragmentation-assisted mass spectral networking coupled with in silico dereplication for deep annotation of steroidal alkaloids in medicinal Fritillariae Bulbus. J Mass Spectrom. 2020;55(9):e4528. pmid:32559823
  9. 9. Yonekura-Sakakibara K, Saito K. Functional genomics for plant natural product biosynthesis. Nat Prod Rep. 2009;26(11):1466–87. pmid:19844641
  10. 10. Bourgaud F, Gravot A, Milesi S, Gontier E. Production of plant secondary metabolites: a historical perspective. Plant Science. 2001;161(5):839–51.
  11. 11. Bernardini S, Tiezzi A, Laghezza Masci V, Ovidi E. Natural products for human health: an historical overview of the drug discovery approaches. Nat Prod Res. 2018;32(16):1926–50. pmid:28748726
  12. 12. David B, Wolfender JL, Dias DA. The pharmaceutical industry and natural products: historical status and new trends. Phytochem Rev. 2015;14(2):299–315.
  13. 13. Sukor NS, Zakri ZH, Rasol NE, Salim F. Annotation and identification of phytochemicals from Eleusine indica using high-performance liquid chromatography tandem mass spectrometry: databases-driven approach. Molecules. 2023;28(7):3111.
  14. 14. Wolfender JL, Rodriguez S, Hostettmann K. Liquid chromatography coupled to mass spectrometry and nuclear magnetic resonance spectroscopy for the screening of plant constituents. J Chromatogr A. 1998;794(1–2):299–316.
  15. 15. Zhou B, Xiao JF, Tuli L, Ressom HW. LC-MS-based metabolomics. Mol Biosyst. 2012;8(2):470–81. pmid:22041788
  16. 16. Covington BC, McLean JA, Bachmann BO. Comparative mass spectrometry-based metabolomics strategies for the investigation of microbial secondary metabolites. Nat Prod Rep. 2017;34(1):6–24. pmid:27604382
  17. 17. Fox Ramos AE, Evanno L, Poupon E, Champy P, Beniddir MA. Natural products targeting strategies involving molecular networking: different manners, one goal. Nat Prod Rep. 2019;36(7):960–80. pmid:31140509
  18. 18. Alvarez-Rivera G, Ballesteros-Vivas D, Parada-Alfonso F, Ibañez E, Cifuentes A. Recent applications of high resolution mass spectrometry for the characterization of plant natural products. TrAC Trends in Analytical Chemistry. 2019;112:87–101.
  19. 19. Quinn RA, Nothias L-F, Vining O, Meehan M, Esquenazi E, Dorrestein PC. Molecular Networking As a Drug Discovery, Drug Metabolism, and Precision Medicine Strategy. Trends Pharmacol Sci. 2017;38(2):143–54. pmid:27842887
  20. 20. Allard P-M, Péresse T, Bisson J, Gindro K, Marcourt L, Pham VC, et al. Integration of Molecular Networking and In-Silico MS/MS Fragmentation for Natural Products Dereplication. Anal Chem. 2016;88(6):3317–23. pmid:26882108
  21. 21. Dührkop K, Shen H, Meusel M, Rousu J, Böcker S. Searching molecular structure databases with tandem mass spectra using CSI:FingerID. Proc Natl Acad Sci U S A. 2015;112(41):12580–5. pmid:26392543
  22. 22. Dührkop K, Fleischauer M, Ludwig M, Aksenov AA, Melnik AV, Meusel M, et al. SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information. Nat Methods. 2019;16(4):299–302. pmid:30886413
  23. 23. Hatamian M, Noshad M, Abdanan-Mehdizadeh S, Barzegar H. Effect of roasting treatment on functional and antioxidant properties of chia seed flours. NFS Journal. 2020;21:1–8.
  24. 24. Schmid R, Heuckeroth S, Korf A, Smirnov A, Myers O, Dyrlund TS, et al. Integrative analysis of multimodal mass spectrometry data in MZmine 3. Nat Biotechnol. 2023;41(4):447–9. pmid:36859716
  25. 25. Rutz A, Dounoue-Kubo M, Ollivier S, Bisson J, Bagheri M, Saesong T. Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation. Front Plant Sci. 2019;10:1329.
  26. 26. Tautenhahn R, Patti GJ, Rinehart D, Siuzdak G. XCMS Online: a web-based platform to process untargeted metabolomic data. Anal Chem. 2012;84(11):5035–9. pmid:22533540
  27. 27. Smith CA, Want EJ, O’Maille G, Abagyan R, Siuzdak G. XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal Chem. 2006;78(3):779–87. pmid:16448051
  28. 28. Smith CA, O’Maille G, Want EJ, Qin C, Trauger SA, Brandon TR, et al. METLIN: a metabolite mass spectral database. Ther Drug Monit. 2005;27(6):747–51. pmid:16404815
  29. 29. Guijas C, Montenegro-Burke JR, Domingo-Almenara X, Palermo A, Warth B, Hermann G, et al. METLIN: A Technology Platform for Identifying Knowns and Unknowns. Anal Chem. 2018;90(5):3156–64. pmid:29381867
  30. 30. Trivedi DK, Hollywood KA, Goodacre R. Metabolomics for the masses: The future of metabolomics in a personalized world. New Horiz Transl Med. 2017;3(6):294–305. pmid:29094062
  31. 31. Kell DB. Metabolomic biomarkers: search, discovery and validation. Expert Rev Mol Diagn. 2007;7(4):329–33. pmid:17620040
  32. 32. Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J. BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res. 2016;44(D1):D1045–53. pmid:26481362
  33. 33. Li YH, Yu CY, Li XX, Zhang P, Tang J, Yang Q, et al. Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic Acids Res. 2018;46(D1):D1121–7. pmid:29140520
  34. 34. Chong J, Wishart DS, Xia J. Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis. Curr Protoc Bioinformatics. 2019;68(1):e86. pmid:31756036
  35. 35. Hoffmann MA, Nothias L-F, Ludwig M, Fleischauer M, Gentry EC, Witting M, et al. High-confidence structural annotation of metabolites absent from spectral libraries. Nat Biotechnol. 2022;40(3):411–21. pmid:34650271
  36. 36. Blatt-Janmaat K, Neumann S, Schmidt F, Ziegler J, Qu Y, Peters K. Impact of in vitro phytohormone treatments on the metabolome of the leafy liverwort Radula complanata (L.) Dumort. Metabolomics. 2023;19(3):17. pmid:36892716
  37. 37. Peters K, Blatt-Janmaat KL, Tkach N, van Dam NM, Neumann S. Untargeted Metabolomics for Integrative Taxonomy: Metabolomics, DNA Marker-Based Sequencing, and Phenotype Bioimaging. Plants (Basel). 2023;12(4):881. pmid:36840229
  38. 38. Smith E, Lewis A, Narine SS, Emery RJN. Unlocking Potentially Therapeutic Phytochemicals in Capadulla (Doliocarpus dentatus) from Guyana Using Untargeted Mass Spectrometry-Based Metabolomics. Metabolites. 2023;13(10):1050. pmid:37887375
  39. 39. Wang D, Fu Z, Xing Y, Tan Y, Han L, Yu H, et al. Rapid identification of chemical composition and metabolites of Pingxiao Capsule in vivo using molecular networking and untargeted data-dependent tandem mass spectrometry. Biomed Chromatogr. 2020;34(9):e4882. pmid:32396262
  40. 40. Gowda H, Ivanisevic J, Johnson CH, Kurczy ME, Benton HP, Rinehart D, et al. Interactive XCMS Online: simplifying advanced metabolomic data processing and subsequent statistical analyses. Anal Chem. 2014;86(14):6931–9. pmid:24934772
  41. 41. Domingo-Almenara X, Montenegro-Burke JR, Guijas C, Majumder EL-W, Benton HP, Siuzdak G. Autonomous METLIN-Guided In-source Fragment Annotation for Untargeted Metabolomics. Anal Chem. 2019;91(5):3246–53. pmid:30681830
  42. 42. Snow Setzer M, Sharifi-Rad J, Setzer WN. The Search for Herbal Antibiotics: An In-Silico Investigation of Antibacterial Phytochemicals. Antibiotics (Basel). 2016;5(3):30. pmid:27626453
  43. 43. Santiago KAA, Wong WC, Goh YK, Tey SH, Ting ASY. Pathogenicity of monokaryotic and dikaryotic mycelia of Ganoderma boninense revealed via LC-MS-based metabolomics. Sci Rep. 2024;14(1):5330. pmid:38438519
  44. 44. Yu H, Zhang F, Wang G, Liu Y, Liu D. Partial deficiency of isoleucine impairs root development and alters transcript levels of the genes involved in branched-chain amino acid and glucosinolate metabolism in Arabidopsis. J Exp Bot. 2013;64(2):599–612. pmid:23230023