Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Utilisation of QSPR ODT modelling and odour vector modelling to predict Cannabis sativa odour

  • Kimber Wise,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations School of Science, RMIT University, Bundoora, Victoria, Australia, Nutrifield, Sunshine West, Victoria, Australia

  • Nicholas Phan,

    Roles Data curation, Formal analysis, Investigation, Methodology, Validation, Writing – review & editing

    Affiliation Faculty of Science, Monash University, Clayton, Victoria, Australia

  • Jamie Selby-Pham ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    jamie@nutrifield.com

    Affiliations School of Science, RMIT University, Bundoora, Victoria, Australia, Nutrifield, Sunshine West, Victoria, Australia

  • Tomer Simovich,

    Roles Conceptualization, Project administration, Supervision, Validation, Writing – review & editing

    Affiliations School of Engineering, RMIT University, Melbourne, Victoria, Australia, PerkinElmer Inc., Glen Waverley, Victoria, Australia

  • Harsharn Gill

    Roles Conceptualization, Project administration, Supervision, Validation, Writing – review & editing

    Affiliation School of Science, RMIT University, Bundoora, Victoria, Australia

Abstract

Cannabis flower odour is an important aspect of product quality as it impacts the sensory experience when administered, which can affect therapeutic outcomes in paediatric patient populations who may reject unpalatable products. However, the cannabis industry has a reputation for having products with inconsistent odour descriptions and misattributed strain names due to the costly and laborious nature of sensory testing. Herein, we evaluate the potential of using odour vector modelling for predicting the odour intensity of cannabis products. Odour vector modelling is proposed as a process for transforming routinely produced volatile profiles into odour intensity (OI) profiles which are hypothesised to be more informative to the overall product odour (sensory descriptor; SD). However, the calculation of OI requires compound odour detection thresholds (ODT), which are not available for many of the compounds present in natural volatile profiles. Accordingly, to apply the odour vector modelling process to cannabis, a QSPR statistical model was first produced to predict ODT from physicochemical properties. The model presented herein was produced by polynomial regression with 10-fold cross-validation from 1,274 median ODT values to produce a model with R2 = 0.6892 and a 10-fold R2 = 0.6484. This model was then applied to terpenes which lacked experimentally determined ODT values to facilitate vector modelling of cannabis OI profiles. Logistic regression and k-means unsupervised cluster analysis was applied to both the raw terpene data and the transformed OI profiles to predict the SD of 265 cannabis samples and the accuracy of the predictions across the two datasets was compared. Out of the 13 SD categories modelled, OI profiles performed equally well or better than the volatile profiles for 11 of the SD, and across all SD the OI data was on average 21.9% more accurate (p = 0.031). The work herein is the first example of the application of odour vector modelling to complex volatile profiles of natural products and demonstrates the utility of OI profiles for the prediction of cannabis odour. These findings advance both the understanding of the odour modelling process which has previously only been applied to simple mixtures, and the cannabis industry which can utilise this process for more accurate prediction of cannabis odour and thereby reduce unpleasant patient experiences.

1. Introduction

Cannabis sativa is an herbaceous plant that originated in Central Asia and has been utilised for its industrial [1], ornamental [2], nutritional [3], medicinal [4], and recreational [5] potentials. Odour is conferred by the detection of volatile compounds within the nasal cavity (olfactory system) during inhalation of air, or consumption of food or liquid, via the ortho- or retro-nasal routes, respectively [6, 7]. The detection and recognition of odours by the brain can illicit physiological and psychological responses, due to the close association between odour processing and memory [8, 9]. For Cannabis sativa—a medicinal flower with a strong odour often administered through inhalation—odour is closely associated with quality perception and can impact the pleasantness of the medicinal experience, which in turn can impact the likelihood of redosing [10]. Accurate descriptions and consistency of product odours are therefore important for medicinal cannabis patients to promote the likelihood of achieving their therapeutic goals.

Within the cannabis industry, strain diversity is expanding, which is creating new phenotypes that require the determination of odour prior to market presentation. New phenotypes can also be created accidentally, when seed-propagated (cross-fertilised) plants are presumed to maintain their parent strain traits without proper assessment, resulting in the mis-naming of strains and associated misattribution of odour characteristics [11, 12]. Accordingly, all new strains and seed-propagated plants should be phenotype-tested to avoid inaccurate descriptions of medicinal products in the market. Whilst chemical composition (including volatiles) can be determined relatively easily through established gas chromatography (G-C) analytical methods [13, 14], these are only partially informative to odour prediction [15], and therefore current methods for determination of sensory properties relies on a slow, costly, and subjective sensory assessment by panellists [16]. Accordingly, there is a need for additional tools to measure cannabis odour and better understand the connection between volatile profiles and odour perception.

Various measures have been developed to aid in the characterisation of odours, including odour detection threshold (ODT), which is the lowest concentration that 50% of participants will detect a compound [17, 18], and odour descriptors (OD), which are common odours (items) recalled from memory such as ‘floral’ or ‘balsamic’ when a compound is detected [19]. These measures are useful for describing odour perception for samples with a single compound, however, they fail to inform the perception of mixtures. Whilst olfactory signals are produced following the detection of an odorant individually and within a mixture, the processing of the olfactory signals from a mixture occurs on a batch/ profile level, which results in the perception of a single odour rather than multiple individual odours [20]. Even though individual odorants and their OD’s cannot be distinguished as part of a mixture, it is understood that an odorant’s presence contributes to the overall odour [21]. Even low concentrations of compounds—below their ODT—have been known to influence the perception of odorant mixtures [22]. Whilst natural products tend to produce highly complex volatile profiles that are uniquely perceived, volatile mixture complexity is not necessarily associated with increased odour complexity, with some highly complex mixtures exhibiting similarity known as ‘olfactory white’ [23]. Accordingly, the relationship between volatile profiles and their perception is highly complex and there is an interest to better understand how odorants interact/ add together to produce an overall single odour. Such developments would push odour-chemistry knowledge towards facilitating prediction and targeted creation of volatile mixture odours without the need for sensory analysis.

Vector addition has been proposed as a strategy for representing odorant addition within mixtures [24]. The odour activity value (OAV) describes the relative strength of an odorant in a mixture and is calculated as the ratio between its concentration and its ODT, which is then used to calculate the odour intensity (OI) of an OD for that odorant [25]. The vector model hypothesises that the OIs (per OD per compound within a mixture) are vectors in n-dimensional odour space (for n ODs), which can be added together using vector addition [24, 26]. The resulting estimate from the vector model is a profile of OI values, which represent the combination and relative strengths of ODs imparted by the mixture. The odour vector model has been successfully applied to binary [2629], ternary [30], quaternary [31, 32], and quinary mixtures [24]. However, noting that natural product odours—such as those from flowers or fruits—are mixtures of larger numbers of volatile compounds, there is an interest in expanding this modelling process to assess its utility within the context of representative profile complexity.

One of the limitations to the application of the odour vector model to high-complexity volatile mixtures, is the need for concentration and odour property data (ODT and ODs) for each chemical, as this is used for the calculation of OI. Whilst measurement of volatile concentration is well established [33], identification of ODT and ODs for uncharacterised chemicals generally requires sensory analysis, which is slow and costly. Chemical ODs have been identified for many volatiles due to their importance within the perfumery industry, and statistical modelling has recently been applied for the prediction of uncharacterised chemical ODs [34, 35]. However, ODT data is less comprehensive, and no large-scale generalised statistical models have been published. Therefore, there is a need for additional tools to measure or predict the ODT of uncharacterised chemicals in order to facilitate a vector modelling approach to complex mixtures containing these compounds.

Statistical modelling can be used to predict chemical activities or properties from their physicochemical data, which is known as quantitative structure–activity/property relationship (QSAR/QSPR) modelling. This process and relationship has been demonstrated for a range of properties including time of maximal phytochemical concentration in circulation following ingestion and inhalation [36, 37], blood-to-liver partition coefficients of volatile compounds [38], intestinal bioavailability and antioxidant activity [39], and perception of chemical odour following inhalation [34]. Generation of these QSPR models is useful, particularly for measures such as ODT which are laborious to measure and are required for OI calculation which is an input for vector modelling [40]. Previous studies have identified that there is a relationship between ODT and physicochemical properties, including carbon chain length and functional group presence [41, 42]. Furthermore, statistical modelling of ODT from physicochemical measures has been done for small data sets of chemical families including alkanes, esters, aldehydes, mercaptans, and aliphatic alcohols [43, 44]. However, no generalised model for the prediction of uncharacterised compound ODT has been presented.

The aim of this study was to develop a generalised QSPR model for the prediction of ODT, to enable OI calculation for uncharacterised compounds. Furthermore, this study explored the application of the odour vector model to the complex volatile profiles of cannabis flowers to produce OI profiles which could inform the overall sensory descriptor (SD) of the sample. The findings from this work provide researchers and the odour industry with a tool for approximating ODT without the need for sensory testing and additionally, strengthen the understanding of the utility of the odour vector model. The demonstration of the predictive capability of OI profiles over terpene profiles to cannabis SD is a step towards fast and reproducible prediction of flower odour which has implications for both the recreational and medicinal cannabis markets.

2. Materials and methods

An overview of the methods utilised herein are presented in Fig 1, with further details on each method presented in their respective subsections below and detailed further in the supplemental file.

thumbnail
Fig 1. Overview of methods workflow.

Cannabis flowers volatile profiles were converted into sample odour intensity (OI) profiles via vector modelling, which utilised odour descriptor (OD) data, odour detection thresholds (ODT), and compound concentrations as calculation inputs prior to vector addition. Sample OI profiles were compared to sensory data which allowed for logistic regression and k-means clustering to predict sensory descriptor assignment.

https://doi.org/10.1371/journal.pone.0284842.g001

2.1. Sourcing of Cannabis headspace profiles

A set of 265 Cannabis sativa volatile terpene profiles (containing data for 18 compounds, Table S1 in S1 File) for named commercial cannabis verities were sourced from a commercial compliance testing facility (CB Labs, Novato, CA). The online database, Leafly (https://www.leafly.com/, accessed August 2022), was used to source the SDs for each named commercial variety, which were the top 3 categories listed under the ‘strain flavour’ section of each named variety’s Leafly entry. The samples were grouped into 13 SD categories (Table S2 in S1 File: Citrus, Tropical, Berry, Tree Fruit, Earthy, Mineral, Animal, Roasted, Mellow, Warming, Herbal, Floral, and Sweet), based on the location of the Leafly descriptors within a cannabis odour wheel [45]. The ODs associated with each terpene were extracted from The Good Scents Company (http://www.thegoodscentscompany.com/, accessed August 2022), as the odours listed under the ‘odor’ subsection of the ‘organoleptic properties’ section of each compound entry (Table S4 in S1 File). The ODs associated with each compound were given a numerical ranking based on the ranking of the organoleptic property for that chemical, wherein primary ranking was assigned as rank 1, secondary rank was assigned as rank 2, and so on to octonary, which was assigned as rank 8.

2.2. Sourcing of ODT and physicochemical measures

A set of 2,698 ODT measures of 1,274 chemicals (in water) were sourced from Van Gemert [46]. Corresponding physicochemical measures for each chemical were sourced from ChemMine Tools (https://chemminetools.ucr.edu/, accessed August 2022), which included the JoeLib descriptors: Fraction_of_rotatable_bonds, Kier_shape_2, Number_of_HBA_2, Number_of_Br_atoms, Number_of_basic_groups, Geometrical_radius, Zagreb_group_index_2, Geometrical_diameter, and Number_of_N_atoms, the OpenBabel descriptors: abonds, dbonds, MW, TPSA, and Log_P (Log_P_1), and the Smarts Search results: [#16X2H] (thiol), [$([CX3] = [CX3])] (vinylic carbon), [#6][CX3] (= O)[OX2H0][#6] (ester), [#16X2H0] (sulfide), [CX3] (= O)[OX1H0-,OX2H1] (carboxylic acid), [CX3H1] (= O)[#6] (aldehyde), and [#6][CX3] (= O)[#6] (ketone). The physicochemical measures Log P (Log_P_2) and nOHNH were sourced for each compound from Molinspiration’s online property tool kit (http://www.molinspiration.com, accessed August 2022).

2.3. Statistical modelling of ODT

Polynomial regression was performed in Minitab 19 statistical software package (Minitab Inc., State College, PA) to predict the natural log of the median ODT (ln ODT) for each of the 1,274 chemicals from their physicochemical properties listed in 2.2. Automated term inclusion was performed using the step-wise method with an alpha value of 0.05 to enter and remove, beginning with all combinations of terms up to and including 3rd order. Further manual term removal was performed to achieve a model with only significant (P < 0.05) terms. Model validation was performed using the k-fold cross-validation technique, with k = 10. The accuracy of the model predictions was compared with the variations in ODT measures used to calculate the median ODT. This was done via paired t-test between the ratio of the standard deviation of the natural log of the individual ODT values for each compound with the natural log of the median ODT of each compound (ODT data variance) and the ratio of the model residual with the natural log of the median ODT (model variance), for all chemicals in the modelling-data set which were produced as a median of multiple values and had a non-zero ln ODT value (411 out of 1,274 chemicals).

2.4. Vector modelling to produce OI profiles

A vector modelling process, presented in Appendix 1 in S1 File, was adapted from Berglund [24], Yan, Liu [31], Yan, Liu [30], and Niu, Liu [26], and was applied to each cannabis terpene profile to produce a corresponding odour intensity (OI) profile. In short, the OI profile of the mixture of compounds (OIM) was generated as a 33-term vector (1 term per OD) representing the vector sum of individual compound OIs for that OD (Equation S8, Appendix 1 in S1 File). This utilised calculation of compound OAV (Equation S1, Appendix 1 in S1 File), defined as per Yan, Liu [30], wherein terpene median ODT (in water) was sourced from Van Gemert [46], except for alpha bisabolol and alpha terpinene, which were estimated using the predictive ODT model presented herein. Creation of individual compound odour profile vectors (Equation S3A, Appendix 1 in S1 File)—defined following the linear OI-lnOAV relation of individual odorants presented in Yan, Liu [31]—utilised the OD rankings presented in the GoodScents database such that OI was defined as inversely proportionate to ranking when assigned, or 0 when not assigned. The angle between compound odour profile vectors (α) was defined as the angle between vectors in 33-dimensional space due to the data covering measures across 33 ODs. For each OD, the OI of the mixture was calculated as per Berglund [24] and expanded for summation across the 18 terpenes explored per mixture (Equation S5A, Appendix 1 in S1 File).

2.5. Statistical modelling of cannabis sensory descriptors

Cannabis sample terpene and OI profiles were utilised separately as inputs for binary logistic regression (LR) with the logit function in Minitab 19 statistical software package (Minitab Inc., State College, PA), to predict cannabis sensory descriptors. Automated step-wise term selection was utilised with an alpha value of 0.2 to enter and remove. An initial preliminary model was created using all factors up to and including their second order terms, followed by further development into a final model with an automated selection from the same terms plus all combinations of third order terms from the factors which were selected in the initial model. The event probability (EP) outputs from LR were grouped into 2 groups using the k-means unsupervised clustering method in Minitab 19, and the group with the higher centroid was assigned the associated SD. The proportions of correct SD assignments were compared between terpene and OI profiles for an SD using 2-proportions testing with a 95% confidence interval. The overall performance of the modelling strategies using either terpene or OI data was compared by paired t-test with 95% confidence level, on the proportions of correct assignments.

3. Results and discussion

Whilst cannabis is known for having a strong and distinct odour, there is considerable diversity in the language used to describe cannabis odour [10]. With the recent acceptance of cannabis as a recreational drug and medicinal product, plant breeding has enabled the development of many new varieties, which often have exotic odour-inspired names and rich odour descriptions. However, cannabis naming is unregulated [47], and there is significant chemical and genetic diversity between strains of the same name [48, 49], which has resulted in a reputation within the industry of products having inaccurate descriptions or misleadingly suggestive names [5052].

A significant demographic of medicinal cannabis users are the paediatric and geriatric patient populations, which have specific considerations for drug design and usability [5356]. Paediatrics in particular, are sensitive to organoleptic properties such as taste and odour, with rejection occurring from unpalatable medications [5760]. Cannabis oils are reported to have unpleasant organoleptic properties for both adults and children, which has resulted in cannabis capsule product developments [60, 61], however swallowing can be difficult in paediatrics making this unavailable to those patients [62]. For liquid oral formulations such as Epidiolex, strawberry flavouring is added [63], however no such alternative is available for raw flower products. With the inconsistencies in product labelling and descriptions, there is a risk of these special patient groups receiving medicinal products with different sensory properties to what they expect, which may result in rejection and thereby negatively affect their therapeutic outcomes. Accordingly, there is a need to increase the consistency and accuracy of cannabis labelling and descriptions to improve patient experiences and therapeutic outcomes.

The simplest measure of odour strength is OAV which is the ratio of the concentration of a compound in a sample relative to its ODT. However, OAV doesn’t inform on how that chemical will be perceived by an individual or, if present in a mixture, how other compounds are contributing to the overall odour. Odour intensity and vector addition are proposed as improved methods for determining odour perception, particularly for mixtures. As OAV is an input to OI calculation, ODT for each compound present in a mixture is required in order to perform odour vector addition of OI. Accordingly, as cannabis and other natural products contain many volatiles which have not been tested by sensory analysis to determine their ODT, there is a need to predict ODT from in silico measures to enable vector modelling of profiles containing these compounds.

Polynomial regression was utilised to generate a model for the prediction of ln ODT from physicochemical measures, which had an R2 = 0.6892 and a 10-fold R2 = 0.6484 (Fig 2, Table S5 in S1 File). The similarity between the two R2 measures indicates comparable accuracy for both the training and the test sets across the fold splits, supporting that the model is not overfit [64]. The model includes 69 total 1st, 2nd, and 3rd order terms (Table S5 in S1 File), which are all significant (P < 0.05). Of these 69 terms, 49 include terms relating to the presence of functional groups, and 23 contain terms relating to molecular size. The inclusion of these properties for the prediction of ODT is consistent with previous studies which identified that ODT is impacted by the related measures of carbon chain length and functional group presence [41, 42, 65]. The model also included terms with measures relating to molecular properties (e.g. Log P), and types of bonds present (e.g. rotatable bonds), which may indicate their association with ODT perception and could therefore be the targets for further studies on physicochemical properties impacting ODT.

thumbnail
Fig 2. Prediction of ln ODT.

Polynomial regression analysis with 10-fold cross-validation to predict ln ODT from physicochemical measures utilising median ODT values from 1,274 volatile chemicals.

https://doi.org/10.1371/journal.pone.0284842.g002

Previous studies have presented predictive models for ODT, however, these tend to utilise small data sets. Polster and Schieberle [65] present an ODT model from 44 compounds with a R2 of 0.89, Rodríguez, Teixeira [66] identified a model from 121 compounds with R2 of 0.769, Anker, Jurs [67] and Junkes, Arruda [68] present models from 49 compounds with R2 of 0.863 and 0.765, respectively, whilst Pal, Mitra [44] utilised 42 compounds to develop a model with R2 of 0.809 and an R2 of 0.813 when applied to a test set of 11 compounds. Abraham, Sánchez-Moreno [43] also modelled a log transformation of ODT against two datasets containing 206 and 353 compounds to produce models with R2 of 0.748 and 0.759, respectively. These works support the use of modelling of ODT from chemical properties, however, the data sets utilised in these previous models are small compared to the 10,000 identified volatile odour chemicals [69]. Therefore, as statistical models are limited in their application to compounds with properties similar to those within the modelling dataset, and the relatively small size of these models means they only account for a small proportion of the diversity of properties across all volatile compounds, these models are expected to have low generalisability for predicting ODT in new compounds. By contrast, the model presented herein utilised data from 1,274 chemicals, which is a substantial expansion in the breadth of chemical properties modelled (Table S6 in S1 File) and therefore has greater generalisability to new compounds. Furthermore, the accuracy of the model herein was found to be relatively similar to the accuracy of empirical determination of ODT through human participant sensory testing. The ratios for the ODT data variance and the model variance (defined in method 2.3 herein) were found to be on average 2.36 and 2.21, respectively (Table S7 and Fig S1 in S1 File), and were not significantly different (p = 0.771), indicating that the predicted values fall within a similar range of the reported ODT values to their median. Human olfactory sensitivity is known to be highly variable [70], and for ODT specifically, variations of up to 4 or 5 magnitudes are reported between studies of the same chemicals due to variations in sensory methodology [71]. Accordingly, the model herein which has achieved prediction of comparable accuracy could be considered as a suitable substitute for ODT in the absence of sensory testing.

It was hypothesised that odour vector modelling could be utilised to generate OI profiles for the highly complex volatile mixtures present in cannabis flowers, and that these profiles would be more informative to the overall odour than their associated terpene profiles. Additionally, whilst current odour vector modelling approaches utilise empirically solved values for the angles between compound OI vectors, it was hypothesised herein that a calculated estimation could be used in place of these values. Finally, the incorporation of OD ranking into OI calculation was hypothesised to account for the relative strengths of ODs and thereby improve the predictive power of the resulting OI profile.

Cannabis terpene and OI profiles were explored for the prediction of cannabis SD by LR modelling. Of the 13 SD categories modelled, terpene data was able to produce models for 10 SDs (Equations S10–S19 in S1 File) whilst OI profiles could produce models for 12 SDs (Equations S20–S31 in S1 File). The performance of each model was assessed for its ability to accurately assign SDs to each sample (Fig 3). Six SD categories were able to be modelled by OI significantly (p < 0.05) better than terpene (herbal, sweet, berry, tree fruit, roasted, and mellow), 5 SD categories were modelled equally successfully for both OI and terpene data (mineral, earthy, tropical, warming, and floral), 1 SD category was modelled by terpene significantly better than OI data (citrus), and 1 SD could not be modelled by either terpene or OI data (animal). When comparing the overall performance of the two modelling strategies, it was identified that OI data produced models with an average of 70.33% accuracy and models produced from volatile data had an average of 48.39% accuracy, which was a significant (p = 0.031) difference of 21.94%.

thumbnail
Fig 3. Prediction of cannabis sensory descriptors (SD) from volatile data, and OI data produced through odour vector modelling.

Logistic regression analysis and k-means unsupervised cluster analysis (k = 2) to predict the SD of 265 cannabis samples. Data represent the percentage of strains correctly assigned for each SD (for each data type), with error bars representing the 95% confidence interval. A) the set of 6 SDs which were predicted significantly more accurately with OI than terpene data, B) the set of 5 SDs which were predicted with equal accuracy for both OI and terpene data, and C) the single SD which was predicted significantly more accurately with terpene than OI data. Significant differences between volatile and OI data for an individual SD are indicated by * for p < 0.05, and *** for p ≤ 0.001 calculated by Student’s t-test.

https://doi.org/10.1371/journal.pone.0284842.g003

The results indicate that odour vector modelling produces profiles which are more informative for predicting overall cannabis odour (SD) than profiles of terpene data produced by routine testing within the Californian cannabis industry. The transformation of these relatively easy-to-produce terpene profiles, into OI profiles which are more informative for odour prediction is a promising development in achieving more consistent labelling and product descriptions within the cannabis industry. Noting that these routine profiles contain only 18 volatiles of the more than 200 which contribute to the cannabis odour [72], increasing the predictive capability of these (if possible) may require considerable expansion of the breadth of chemical profiling. This increased volatile profiling would result in greater costs to manufacturers and possibly consumers and patients. Improving on the current practices is therefore the easiest and cheapest way to confer benefits to the industry and the patients. However, as some of the SDs were not able to be modelled well by either the terpene or the OI data sets (floral and animal), expansion of the volatile profiling or cannabis sample data may still be desirable to achieve more accurate modelling of these SDs. As the expansion of the volatile profiles also adds data to the corresponding OI profiles, and expansion of cannabis sample data provides more data points for the modelling, these strategies may increase the accuracy of either the terpene or the OI modelling strategies. Furthermore, the application of this strategy to explore statistical relationships between volatile and sensory profiles is not limited to the cannabis industry. Industries wherein volatile chemical profiles and sensory data are both readily collected—such as perfume or food industries—could potentially utilise this strategy to improve predictive capabilities and eventually guide alterations to volatile profiles to achieve targeted changes to sensory profiles.

4. Conclusion

The medicinal cannabis industry has a reputation for having inconsistent labelling and product descriptions, which contributes to a misalignment between patient sensory expectations and experiences. For special patient populations such as geriatrics and particularly paediatrics, this is of concern as they are sensitive to unpleasant sensory experiences, and this can affect their likelihood of redosing and thus achieving their therapeutic outcomes. Current routine testing procedures involve the production of volatile profiles; however, these have limited utility for the accurate determination of product odour. Accordingly, the study herein demonstrates how these routinely produced volatile profiles can be transformed through odour vector modelling to produce OI profiles which predicted cannabis SD equally well or better than terpene profiles for 11 out of 13 SD categories, equating to a significant (p = 0.031) average improvement of 21.9% accuracy. This process utilised ODT inputs which were in part produced from a novel QSPR predictive model, that was developed using an extensive dataset of 1,274 median ODT values and their physicochemical properties to achieve a regression model with a R2 = 0.6892 and a 10-fold R2 = 0.6484. Of note, these models utilised data from 18 volatile compounds, whilst cannabis odour is comprised of hundreds of compounds. As such, progressions on the work presented herein may benefit from utilisation of more comprehensive volatile profiles. Nonetheless, these findings are significant to both the cannabis industry wherein improved accuracy of odour prediction will benefit patients, as well as broader odour industries wherein expansion of the odour vector modelling process to highly complex natural odour mixtures is a substantial progression in this field with potential further applications to food products and perfumes.

Supporting information

S1 File.

Appendix 1. Methods and equations for odour vector modelling containing Equations S1-S8, Appendix 2. Table S1-S8 and Fig S1, and Appendix 3. Equations for prediction of SD containing Equations S9-S31.

https://doi.org/10.1371/journal.pone.0284842.s001

(DOCX)

References

  1. 1. Karche T. The application of hemp (Cannabis sativa L.) for a green economy: A review. Turkish Journal of Botany. 2019;43(6):710–23.
  2. 2. Hesami M, Pepe M, Baiton A, Salami SA, Jones AMP. New insight into ornamental applications of cannabis: Perspectives and challenges. Plants. 2022;11(18):2383. pmid:36145783
  3. 3. Krüger M, van Eeden T, Beswa D. Cannabis sativa Cannabinoids as Functional Ingredients in Snack Foods—Historical and Developmental Aspects. Plants. 2022;11(23):3330. pmid:36501366
  4. 4. Odieka AE, Obuzor GU, Oyedeji OO, Gondwe M, Hosu YS, Oyedeji AO. The medicinal natural products of Cannabis sativa Linn.: A review. Molecules. 2022;27(5):1689.
  5. 5. Gabri AC, Galanti MR, Orsini N, Magnusson C. Changes in cannabis policy and prevalence of recreational cannabis use among adolescents and young adults in Europe—An interrupted time-series analysis. PLoS One. 2022;17(1):e0261885. pmid:35020763
  6. 6. Shepherd GM. The human sense of smell: are we better than we think? PLoS biology. 2004;2(5). pmid:15138509
  7. 7. Dattatreya BS, Kamath A, Bhat KK. Developments and challenges in flavor perception and measurement-a review. Food Reviews International. 2002;18(2–3):223–41.
  8. 8. Brattoli M, De Gennaro G, De Pinto V, Demarinis Loiotile A, Lovascio S, Penza M. Odour detection methods: Olfactometry and chemical sensors. Sensors. 2011;11(5):5290–322. pmid:22163901
  9. 9. Wang L, Walker VE, Sardi H, Fraser C, Jacob TJC. The correlation between physiological and psychological responses to odour stimulation in human subjects. Clinical Neurophysiology. 2002;113(4):542–51. pmid:11955999
  10. 10. Gilbert AN, DiVerdi JA. Consumer perceptions of strain differences in Cannabis aroma. PLoS One. 2018;13(2):e0192247. pmid:29401526
  11. 11. Smith T. Why Seeds of the same Strain produce Different looking plants Smokey Okie’s Cannabis2022 [Available from: https://oklahoma-cannabis-company.com/seeds-same-strain-different-pheno/.
  12. 12. Karpik K. Nova scotia scientist discovers cannabis strains being mislabeled on shelves cannabislifenetwork.com2021 [Available from: https://cannabislifenetwork.com/nova-scotia-scientist-discovers-cannabis-strains-being-mislabeled-on-shelves/.
  13. 13. Rocha ED, Silva VE, Pereira F, Jean VM, Souza FLC, Baratto LC, et al. Qualitative terpene profiling of Cannabis varieties cultivated for medical purposes. Rodriguésia. 2020;71.
  14. 14. Shapira A, Berman P, Futoran K, Guberman O, Meiri D. Tandem mass spectrometric quantification of 93 terpenoids in cannabis using static headspace injections. Analytical chemistry. 2019;91(17):11425–32. pmid:31369251
  15. 15. Luo J-q, Selby Pham J, Wise K, Wu Y-h, Sun J-c, Qu Y-m, et al. Early prediction of Shiraz wine quality based on small volatile compounds in grapes: CABI International agriRxiv; 2022. 21 p.
  16. 16. Lewkowska P, Dymerski T, NamieśNik J. Use of sensory analysis methods to evaluate the odor of food and outside air. Critical Reviews in Environmental Science and Technology. 2015;45(20):2208–44.
  17. 17. Amoore JE, Hautala E. Odor as an ald to chemical safety: odor thresholds compared with threshold limit values and volatilities for 214 industrial chemicals in air and water dilution. Journal of applied toxicology. 1983;3(6):272–90.
  18. 18. Nagata Y, Takeuchi N. Measurement of odor threshold by triangle odor bag method. Odor measurement review. 2003;118:118–27.
  19. 19. Debnath T, Nakamoto T. Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures. PloS one. 2020;15(6):e0234688. pmid:32559255
  20. 20. Boyle JA, Djordjevic J, Olsson MJ, Lundström JN, Jones-Gotman M. The human brain distinguishes between single odorants and binary mixtures. Cerebral Cortex. 2009;19(1):66–71. pmid:18445603
  21. 21. Wilson DA, Fleming G, Vervoordt SM, Coureaud G. Cortical processing of configurally perceived odor mixtures. Brain research. 2020;1729:146617. pmid:31866364
  22. 22. Thomas-Danguin T, Sinding C, Romagny S, El Mountassir F, Atanasova B, Le Berre E, et al. The perception of odor objects in everyday life: a review on the processing of odor mixtures. Frontiers in Psychology. 2014;5(504). pmid:24917831
  23. 23. Weiss T, Snitz K, Yablonka A, Khan RM, Gafsou D, Schneidman E, et al. Perceptual convergence of multi-component mixtures in olfaction implies an olfactory white. Proceedings of the National Academy of Sciences. 2012;109(49):19959–64. pmid:23169632
  24. 24. Quantitative Berglund B. and qualitative analysis of industrial odors with human observers. Annals of the New York Academy of Sciences. 1974;237(1):35–51.
  25. 25. Niu Y, Zhu Q, Xiao Z. Characterization of perceptual interactions among ester aroma compounds found in Chinese Moutai Baijiu by gas chromatography-olfactometry, odor Intensity, olfactory threshold and odor activity value. Food Research International. 2020;131:108986. pmid:32247502
  26. 26. Niu Y, Liu Y, Xiao Z. Evaluation of Perceptual Interactions between Ester Aroma Components in Langjiu by GC-MS, GC-O, Sensory Analysis, and Vector Model. Foods. 2020;9(2). pmid:32069959
  27. 27. Yan L, Wu C, Liu J. Visual analysis of odor interaction based on support vector regression method. Sensors. 2020;20(6):1707. pmid:32204317
  28. 28. Xiao Z, Xiang P, Zhu J, Zhu Q, Liu Y, Niu Y. Evaluation of the perceptual interaction among sulfur compounds in mango by Feller’s additive model, odor activity value, and vector model. Journal of agricultural and food chemistry. 2019;67(32):8926–37. pmid:31352781
  29. 29. Szulczyński B, Gębicki J. Determination of odor intensity of binary gas mixtures using perceptual models and an electronic nose combined with fuzzy logic. Sensors. 2019;19(16):3473. pmid:31398955
  30. 30. Yan L, Liu J, Jiang S, Wu C, Gao K. The regular interaction pattern among odorants of the same type and its application in odor intensity assessment. Sensors. 2017;17(7):1624. pmid:28703760
  31. 31. Yan L, Liu J, Fang D. Use of a modified vector model for odor intensity prediction of odorant mixtures. Sensors. 2015;15(3):5697–709. pmid:25760055
  32. 32. Laffort P, Dravnieks A. Several models of suprathreshold quantitative olfactory interaction in humans applied to binary, ternary and quaternary mixtures. Chemical Senses. 1982;7(2):153–74.
  33. 33. Tholl D, Hossain O, Weinhold A, Röse US, Wei Q. Trends and applications in plant volatile sampling and analysis. The Plant Journal. 2021;106(2):314–25. pmid:33506558
  34. 34. Lee BK, Mayhew EE, Sanchez-Lengeling B, Wei JN, Qian WW, Little K, et al. A Principal Odor Map Unifies Diverse Tasks in Human Olfactory Perception. bioRxiv. 2022.
  35. 35. Chacko R, Jain D, Patwardhan M, Puri A, Karande S, Rai B. Data based predictive models for odor perception. Sci Rep. 2020;10(1):1–13.
  36. 36. Selby-Pham SN, Howell KS, Dunshea FR, Ludbey J, Lutz A, Bennett L. Statistical modelling coupled with LC-MS analysis to predict human upper intestinal absorption of phytochemical mixtures. Food chemistry. 2018;245:353–63. pmid:29287381
  37. 37. Wise K, Gill H, Selby-Pham J. Inhalation Absorption Prediction (IAP) Model for Predicting Medicinal Cannabis Phytochemical Pharmacokinetics. Pharmacognosy Communications. 2019;9(3):85–90.
  38. 38. Palomba D, Martínez MJ, Ponzoni I, Díaz MF, Vazquez GE, Soto AJ. QSPR models for predicting log Pliver values for volatile organic compounds combining statistical methods and domain knowledge. Molecules. 2012;17(12):14937–53.
  39. 39. Wise K, Selby-Pham SNB, Selby-Pham J, Gill H. Development of intestinal bioavailability prediction (IBP) and phytochemical relative antioxidant potential prediction (PRAPP) models for optimizing functional food value of Cannabis sativa (hemp). International Journal of Food Properties. 2020;23(1):1287–95.
  40. 40. Ruijten M, Van Doorn R, van Harreveld AP. Assessment of odour annoyance in chemical emergency management. Dutch National Institute for Public Health and the Environment (RIVM); 2009.
  41. 41. Cometto-Muñiz JE, Abraham MH. Structure–activity relationships on the odor detectability of homologous carboxylic acids by humans. Experimental brain research. 2010;207(1):75–84. pmid:20931179
  42. 42. Zarzo M. Effect of functional group and carbon chain length on the odor detection threshold of aliphatic compounds. Sensors. 2012;12(4):4105–12. pmid:22666021
  43. 43. Abraham MH, Sánchez-Moreno R, Cometto-Muñiz JE, Cain WS. An algorithm for 353 odor detection thresholds in humans. Chemical senses. 2012;37(3):207–18. pmid:21976369
  44. 44. Pal P, Mitra I, Roy K. QSPR modeling of odor threshold of aliphatic alcohols using extended topochemical atom (ETA) indices. Croatica Chemica Acta. 2014;87(1):29–37.
  45. 45. Jess Hen. Things Budtenders Should Know—The Cannabis Flavor/Scent Wheel http://www.jesshen.com/budtender-resources/cannabis-tasting-wheel [Accessed 19/10/2022]2022.
  46. 46. Van Gemert L. Odour thresholds. Compilations of odour threshold values in air, water and other media. 2011;2.
  47. 47. Reimann-Philipp U, Speck M, Orser C, Johnson S, Hilyard A, Turner H, et al. Cannabis chemovar nomenclature misrepresents chemical and genetic diversity; survey of variations in chemical profiles and genetic markers in Nevada medical cannabis samples. Cannabis and Cannabinoid Research. 2020;5(3):215–30. pmid:32923659
  48. 48. Smith CJ, Vergara D, Keegan B, Jikomes N. The phytochemical diversity of commercial Cannabis in the United States. PLOS ONE. 2022;17(5):e0267498. pmid:35588111
  49. 49. Sawler J, Stout JM, Gardner KM, Hudson D, Vidmar J, Butler L, et al. The Genetic Structure of Marijuana and Hemp. PLOS ONE. 2015;10(8):e0133292. pmid:26308334
  50. 50. Stockton N. Sorry, But the Names for Weed Strains Are Kinda Meaningless Wired2015 [Available from: https://www.wired.com/2015/08/sorry-names-weed-strains-kinda-meaningless/.
  51. 51. Wilhelm J. How Does a Cannabis Strain Get Its Name? Leafly2017 [Available from: https://www.leafly.com/news/strains-products/cannabis-strain-names.
  52. 52. Verilife. Cannabis Strain Names: How Are Strains Named? Verilife2022 [Available from: https://www.verilife.com/blog/cannabis-strain-names.
  53. 53. MacCallum CA, Russo EB. Practical considerations in medical cannabis administration and dosing. European journal of internal medicine. 2018;49:12–9. pmid:29307505
  54. 54. Chapman S, Protudjer J, Bourne C, Kelly LE, Oberoi S, Vanan MI. Medical cannabis in pediatric oncology: a survey of patients and caregivers. Supportive Care in Cancer. 2021;29(11):6589–94. pmid:33932156
  55. 55. Abuhasira R, Schleider LB-L, Mechoulam R, Novack V. Epidemiological characteristics, safety and efficacy of medical cannabis in the elderly. European journal of internal medicine. 2018;49:44–50. pmid:29398248
  56. 56. Aran A, Cayam-Rand D. Medical cannabis in children. Rambam Maimonides Medical Journal. 2020;11(1). pmid:32017680
  57. 57. Walsh J, Cram A, Woertz K, Breitkreutz J, Winzenburg G, Turner R, et al. Playing hide and seek with poorly tasting paediatric medicines: do not forget the excipients. Advanced drug delivery reviews. 2014;73:14–33. pmid:24614069
  58. 58. Mennella JA, Beauchamp GK. Optimizing oral medications for children. Clinical Therapeutics. 2008;30(11):2120–32. pmid:19108800
  59. 59. Boateng J. Drug delivery innovations to address global health challenges for pediatric and geriatric populations (through improvements in patient compliance). Journal of Pharmaceutical Sciences. 2017;106(11):3188–98. pmid:28734784
  60. 60. Bar-Lev Schleider L, Mechoulam R, Saban N, Meiri G, Novack V. Real life experience of medical cannabis treatment in autism: analysis of safety and efficacy. Sci Rep. 2019;9(1):1–7.
  61. 61. Stella B, Baratta F, Della Pepa C, Arpicco S, Gastaldi D, Dosio F. Cannabinoid formulations and delivery systems: Current and future options to treat pain. Drugs. 2021;81(13):1513–57. pmid:34480749
  62. 62. Mennella JA, Spector AC, Reed DR, Coldwell SE. The bad taste of medicines: overview of basic research on bitter taste. Clinical therapeutics. 2013;35(8):1225–46. pmid:23886820
  63. 63. Abu-Sawwa R, Stehling C. Epidiolex (cannabidiol) primer: frequently asked questions for patients and caregivers. The Journal of Pediatric Pharmacology and Therapeutics. 2020;25(1):75–7. pmid:31897080
  64. 64. Yiu T. Understanding Cross Validation Towards Data Science2020 [Available from: https://towardsdatascience.com/understanding-cross-validation-419dbd47e9bd.
  65. 65. Polster J, Schieberle P. Structure-odor correlations in homologous series of alkanethiols and attempts to predict odor thresholds by 3D-QSAR studies. Journal of Agricultural and Food Chemistry. 2015;63(5):1419–32. pmid:25608797
  66. 66. Rodríguez O, Teixeira MA, Rodrigues AE. Prediction of odour detection thresholds using partition coefficients. Flavour and fragrance journal. 2011;26(6):421–8.
  67. 67. Anker LS, Jurs PC, Edwards PA. Quantitative structure-retention relationship studies of odor-active aliphatic compounds with oxygen-containing functional groups. Analytical chemistry. 1990;62(24):2676–84. pmid:2096733
  68. 68. Junkes BdS Arruda ACS, Yunes RA Porto LC, Heinzen VEF. Semi-empirical topological index: a tool for QSPR/QSAR studies. Journal of Molecular Modeling. 2005;11(2):128–34.
  69. 69. Volatiles in Food Products. [Internet]. 2021 [cited 09/01/2023]. Available from: https://encyclopedia.pub/entry/14580.
  70. 70. Menashe I, Lancet D. Variations in the human olfactory receptor pathway. Cellular and Molecular Life Sciences. 2006;63(13):1485–93. pmid:16732427
  71. 71. Schmidt R, Cain WS. Making scents: dynamic olfactometry for threshold measurement. Chemical senses. 2010;35(2):109–20. pmid:19965900
  72. 72. Oswald IW, Ojeda MA, Pobanz RJ, Koby KA, Buchanan AJ, Del Rosso J, et al. Identification of a New Family of Prenylated Volatile Sulfur Compounds in Cannabis Revealed by Comprehensive Two-Dimensional Gas Chromatography. ACS omega. 2021;6(47):31667–76. pmid:34869990