Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The effect of detoxification on acoustic features of Mandarin speech in male heroin users

  • Puyang Geng ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    gengpy@ssfjd.cn

    Affiliations Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China

  • Ningxue Fan,

    Roles Conceptualization, Writing – review & editing

    Affiliation Information Security and Social Management Innovation Lab, Shanghai Open University, Shanghai, China

  • Zhijun Li,

    Roles Formal analysis, Resources, Software

    Affiliations Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China

  • Rong Ling,

    Roles Data curation

    Affiliations Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China

  • Kai Yang,

    Roles Software, Visualization

    Affiliations Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China

  • Xiao Mao,

    Roles Data curation, Software

    Affiliations Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China

  • Hong Guo

    Roles Resources, Supervision, Writing – review & editing

    Affiliations Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China

Abstract

This study aims to investigate the effect of detoxification on acoustic features of Mandarin speech. Speech recordings were collected from 66 male abstinent heroin users with different durations of drug detoxification, specifically early abstinent users with a detoxification duration of less than 2 years, sustained abstinent users with 2 years of detoxification, and long-term abstinent users with a detoxification duration of more than 2 years. The results of the acoustic analyses showed that early abstinent users exhibited lower loudness, relative energies of F1, F2, and F3, higher H1–A3, and fewer loudness peaks per second, as well as a longer average duration of unvoiced segments, compared to the sustained and long-term abstinent users. The findings suggest that detoxification may lead to a rehabilitation process in the speech production of abstinent heroin users (e.g., less vocal hoarseness). This study not only provides valuable insights into the effect of detoxification on speech production but also provides a theoretical basis for the speech rehabilitation and detoxification treatment of heroin users.

1. Introduction

Drug (e.g., heroin, cocaine, marijuana, and methamphetamine) abuse can result in significant organ damage, thereby exerting a detrimental impact on the health status of drug users. In the process of detoxification, it is commonly anticipated that drug users would be able to reduce or cease drug consumption and restore their health. Nevertheless, research has revealed that detoxification may not necessarily engender a change in the health condition of drug users; in fact, it may even precipitate more severe complications [1, 2].

At present, there is a growing focus on the rehabilitation and reintegration of drug users into normal social life. As a fundamental instrument for social interaction, speech production holds significant importance in facilitating successful interpersonal communication and fostering positive social relationships [3]. However, a considerable body of research has reported significant differences in acoustic features of speech between drug users and non-drug users.

Previous studies on American English speakers have reported that both heroin and cocaine use would lead to decreased vocal control and dysphonia [46]. Besides, some evidence from acoustic analyses has also demonstrated that 3,4-methylamphetamine (MDMA; also known as ecstasy) users showed a higher second formant (i.e., F2), and lower F2 variability and fundamental frequency (henceforth, F0) [7]. Further, a classification analysis of MDMA, oxytocin, and placebo users indicates that a promising accuracy (i.e., 92%) could be achieved based on acoustic parameters, such as F0, pause, and MFCCs (Mel-frequency cepstral coefficients) [8]. Additionally, Agurto and her colleagues applied similar research paradigm on cocaine users, the results showed no significant acoustic difference (e.g., pitch variation, vowel space feature, voice quality, and MFCC) between the abstinence and current cocaine users [9]. However, they observed a relatively good performance in classification tasks, with an accuracy of over 80% in predicting cocaine abstinence using linguistic features (e.g., acoustic and semantic features) [9, 10]. Another automatic classification analysis on cannabis-intoxication speech, consistent with previous studies, reported encouraging model performance (around 69%) in predicting cannabis intoxication using mel-spectrograms from sustained vowels [11]. Previous study conducted on Mandarin Chinese speakers exhibited that the users of certain drugs (i.e., heroin, ketamine, and methamphetamine) showed smaller F0 standard deviation, reduced loudness, and cepstral peak prominence, as well as higher H1–A3, longer unvoiced segments, and fewer voiced segments per second [12]. Therefore, some scholars have proposed that these differences indicate that the acoustic characteristics of drug users deviate from the norm [12, 13].

Additionally, the impact of alcohol on speech production has been widely studied, and previous research has yielded varying results. Early studies suggested that abstinent alcoholics showed no significant incongruencies in speech patterns [14]. However, studies on reading speech in alcoholics during sober and intoxicated states have found that intoxicated individuals tend to exhibit more speech errors, such as longer reading time, increased word omission, and revision [15]. In recent years, research on alcohol intoxication speech has found significant differences in the mean and range of F0 between abstinent alcoholics and healthy controls during the expression of emotional speech [16]. From the perspective of automatic detection, Schuller et al. [17] reviewed previous alcohol intoxication state classification challenges at the INTERSPEECH conference, outlining common features (e.g., low-level descriptors extracted by OpenSMILE, including energy, F0, and MFCC) and algorithms (e.g., SVM, KNN, and GMM) used in these challenges. Their research reported the detection accuracy for alcohol intoxication can reach approximately 70% and emphasized the necessity of speaker-independent splitting when performing speaker state detection tasks. More recent research using acoustic features and various classifiers (e.g., SVM, Random Forest, KNN) has achieved recognition accuracies of around 80% for distinguishing between alcohol and non-alcohol states [18]. Similarly, support vector machine model exhibited high accuracy (i.e., 98%) in predicting alcohol intoxication using spectrogram features (e.g., MFCC) [19].

Scholars have engaged in discussions regarding the reasons behind the occurrence of acoustic changes in drug users. It has been reported that medicinal users of cannabis exhibit a significant increase in the voice onset time (VOT) during the production of bilabial sounds (e.g., /b/, /p/) and alveolar sounds (e.g., /d/, /t/), indicating an extended control time over the vocal tract and lips. Moreover, there is a larger shimmer, suggesting a decline in the control force of the vocal folds [6]. Indeed, numerous studies on drug users have consistently documented that drug use can cause significant damage to the speech organs. Consensus has been reached among researchers that marijuana, methamphetamine, ketamine, and opioid (e.g., cocaine, heroin) drugs can lead to respiratory depression in drug users [2022]. In addition, studies have found that heroin and cocaine can cause palatal perforation and nasal septal perforation [2325]. There have been case reports suggesting that marijuana and opioid drugs may lead to vocal nodules or vocal cord paralysis [20, 26]. Aides et al. [27] and Marco et al. [28] discovered that ketamine can result in tongue movement disorders, characterized by sustained lingual contraction.

It is then reasonable to ask whether the speech production of drug users tend to normalize with detoxification. However, there is a paucity of research on the acoustic changes in drug users after detoxification. Some scholars have hypothesized that cognitive dysfunction in drug users gradually normalizes along with the detoxification process [29]. Similarly, other scholars have suggested that the cognitive dysfunctions in drug users would gradually recover after detoxification [30]. On the contrary, most studies have held opposite opinions that those dysfunctions do not alleviate with increasing abstinence duration, and significant differences still exist between drug users and healthy controls [31, 32]. Therefore, whether the acoustic changes of drug users can gradually recover during the detoxification process is also a question worthy of attention.

The current study was then set out to investigate the effect of drug detoxification on acoustic features of speech in drug users. Considering that heroin is one of the most prevalent drugs of abuse in China [33], the abstinent heroin users with different durations of detoxification (i.e., the last drug use was within a period of less than 2 years, 2 years, and longer than 2 years) were involved in the current study. Speech recordings were then collected from sixty-six male abstinent heroin users, twenty-two of whom have been in drug detoxification for less than 2 years, thirty-seven have been in drug detoxification for 2 years, and seven have been in drug detoxification for longer than 2 years. Acoustic analysis was performed to reveal the difference of speech production across the three groups of abstinent heroin users. This paper seeks to understand the effect of detoxification on speech production and provide theoretical foundation for the speech rehabilitation and detoxification treatment of drug users.

2. Methods

This research was approved by the Committee for the Protection of Human Subjects (CPHS) at the Academy of Forensic Science (Shanghai, China). All subjects involved in the current study were recruited to participate in this experiment between July and September 2022 and had indicated their awareness of the research purpose and voluntarily participated in the experiment. The written consent forms have obtained from all participants.

2.1 Speech recording

Sixty-six male heroin users aged from 26–53 were recruited from a compulsory drug rehabilitation centre in China, among which twenty-two have been in drug abstinence for less than two years (nineteen were in the abstinence for less than two month), thirty-seven have been in abstinence for two years, and seven have been in abstinence for longer than 2 years (a criterion suggested by Huang et al. [29]). In terms of narrative convenience, the first group of subjects will be referred to as early abstinent heroin users (< 2 years), the second group as sustained abstinent heroin users (2 years), and the third group as long-term abstinent heroin users (> 2 years). The average durations of previous heroin usage were 9.0 years, 4.4 years, and 7.4 years for the early, sustained, and long-term abstinent users, respectively. All heroin users were native speaker of Mandarin Chinese and spoke fluent standard Mandarin.

As shown in S1 Appendix, twenty-six phonetically balanced target sentences were designed for the current study. Before the experiment, all participants were first asked to familiarize the speech materials. Speech recording experiment was conducted in a sound-attenuated room. The unidirectional lavalier microphone was placed about 25 cm away from the speaker’s mouth. The audio was recorded using a portable high-quality digital recorder (Zoom H5n) with a sample rate was set to 44.1 kHz and 16 bits resolution. All participants were instructed to read aloud the target sentences as per the sequence prescribed in S1 Appendix using their normal speech style. In the event of mispronunciation, the sentence is reiterated until correctly delivered. Altogether, 26 (target sentences) * 66 (speakers) = 1716 recordings were collected for the current research. The average duration of each speech recording is around 1.7 seconds, with a range of 0.6 to 7.2 seconds. The total duration of all recordings is around 0.8 hour. The spectrograms of an example speech produced by of the early, sustained, and long-term abstinent heroin users were shown in Fig 1.

thumbnail
Fig 1. The spectrogram (from 0 to 5000 Hz) of an example speech (i.e., Sentence 26: /ʈʂɤ4/ /ɕja4/ /uɔ3/ /kʰɤ3/ /i3/ /xwan4/ /ʂoʊ3/ /tɕi1/ /lɤ0/) produced by of the early (bottom), sustained (middle), and long-term (top) abstinent heroin users.

https://doi.org/10.1371/journal.pone.0304399.g001

2.2 Data extraction

The Geneva Minimalistic Parameter Set (GeMAPS) [34, 35] is a well-established and standard acoustic feature set, which has been widely used in previous studies on pathological speech (e.g., [36, 37]). Hence, the GeMAPS acoustic features were automatically extracted for all collected speech using openSmile (i.e., a standard audio feature extraction tool) in Python 3.8.10. A detailed description and implementation of these feature set is given in [35], The acoustic features analysed in the current study are classified into four categories, viz., frequency-related feature, amplitude-related feature, spectral-related feature, and temporal-related feature.

Frequency-related feature.

The mean, standard deviation, and range of logarithmic fundamental frequency (F0) on a semitone scale, starting at 27.5 Hz (semitone 0; please note that as 0 is reserved for unvoiced frames, every value below semitone 1 [29.136 Hz] is clipped to 1); jitter (i.e., the cycle-to-cycle variation of F0; for the formula used to calculate jitter see Eyben et al. [35]); mean of the first, second and third formants’ (i.e., F1, F2, F3) centre frequency in Hertz.

Amplitude-related feature.

Shimmer (i.e., the measures of the difference in amplitude from cycle to cycle; for the formula used to calculate shimmer see Eyben et al. [35]), loudness (i.e., estimate of perceived signal intensity from an auditory spectrum), and Harmonic-to-Noise Ratio (i.e., HNR; relation of energy in harmonic components to energy in noise-like components).

Spectral-related feature.

The relative energies of F1, F2, and F3 (The ratio of the energy of the spectral harmonic peak at the formant’s centre frequency to the energy of the spectral peak at F0), H1-H2 (i.e., the difference between the amplitudes of the first harmonic and the second in the Fourier spectrum), and H1-A3 (i.e., the difference between the amplitudes of H1 and the amplitude of F3).

Temporal-related feature.

Loudness-peaks per second (i.e., number of loudness peaks per second), voiced-segments mean (i.e., average duration of voiced segments), unvoiced-segments mean (i.e., average duration of unvoiced segments), and voiced-segments per second (i.e., number of continuous voiced regions per second).

2.3 Statistical analysis

All nineteen features (i.e., frequency-related feature, amplitude-related feature, spectral-related feature, and temporal-related feature) were statistically analysed to investigate the acoustic difference across the three groups of speakers (i.e., the early, sustained, and long-term abstinent heroin users). Linear-mixed effect models were built for the nineteen acoustic features using lme4 and lmerTest packages in R software [38, 39]. In each model, the duration of drug detoxification (i.e., the early, sustained, and long-term abstinent heroin users) was fixed effect, and the intercepts for speaker and target sentence were random effects. Tukey HSD post-hoc tests were then conducted on all significant effects for more detailed analyses using lsmeans package [40].

3. Results

The average values and standard deviations of the nineteen acoustic features of the three groups of abstinent heroin users (i.e., the early, sustained, and long-term abstinent users) are presented in Table 1. The mean values (standard deviations as error bars) of the four categories of acoustic features (i.e., frequency-related feature, amplitude-related feature, spectral-related feature, and temporal-related feature) for the three groups of abstinent heroin users (i.e., the early, sustained, and long-term abstinent users) were plotted in Figs 25.

thumbnail
Fig 2. The mean value (standard deviation as error bars) of the frequency-related acoustic parameters of the early (i.e., < 2 years), sustained (i.e., 2 years), and long-term abstinent heroin users (i.e., > 2 years).

https://doi.org/10.1371/journal.pone.0304399.g002

thumbnail
Fig 3. The mean value (standard deviation as error bars) of the amplitude-related acoustic parameters of the early (i.e., < 2 years), sustained (i.e., 2 years), and long-term abstinent heroin users (i.e., > 2 years).

https://doi.org/10.1371/journal.pone.0304399.g003

thumbnail
Fig 4. The mean value (standard deviation as error bars) of the spectral-related acoustic parameters of the early (i.e., < 2 years), sustained (i.e., 2 years), and long-term abstinent heroin users (i.e., > 2 years).

https://doi.org/10.1371/journal.pone.0304399.g004

thumbnail
Fig 5. The mean value (standard deviation as error bars) of the temporal-related acoustic parameters of the early (i.e., < 2 years), sustained (i.e., 2 years), and long-term abstinent heroin users (i.e., > 2 years).

https://doi.org/10.1371/journal.pone.0304399.g005

thumbnail
Table 1. The mean values (standard deviations) of the nineteen acoustic features of the three groups of abstinent heroin users (i.e., the early, sustained, and long-term abstinent users).

https://doi.org/10.1371/journal.pone.0304399.t001

The results of linear-mixed effect models on the nineteen acoustic features were presented in Table 2. For frequency-related features, significant main effects of “the duration of drug detoxification” were observed for F0 mean and jitter. Tukey HSD post hoc tests were then conducted, and the results showed that early abstinent users exhibited lower jitter as compared to the other two groups of abstinent users (Early-Sustained: β = -0.98, SE = 0.25, z = -3.86, p < 0.001; Early-Long-term: β = -0.99, SE = 0.41, z = -2.42, p = 0.048).

thumbnail
Table 2. The results of linear-mixed effect models on the nineteen acoustic features.

In each model, the duration of drug detoxification (i.e., the early [< 2 years], sustained [= 2 years], and long-term [> 2 years] abstinent heroin users) was fixed effect.

https://doi.org/10.1371/journal.pone.0304399.t002

For amplitude-related features, significant main effects of “the duration of drug detoxification” were observed for shimmer and loudness. The results of post hoc test showed that early abstinent users exhibited lower loudness (Early-Sustained: β = -0.76, SE = 0.07, z = -11.58, p < 0.001; Early-Long-term: β = -0.81, SE = 0.11, z = -7.62, p < 0.001) and shimmer (Early-Sustained: β = -0.21, SE = 0.04, z = -5.86, p < 0.001; Early-Long-term: β = -0.14, SE = 0.06, z = -2.46, p = 0.04) than the sustained and long-term abstinent users.

Significant main effects of “the duration of drug detoxification” were observed for all spectral-related feature except for H1-H2. Tukey HSD post hoc analyses showed that early abstinent heroin users exhibited lower relative energies of F1 (Early-Sustained: β = -53.05, SE = 2.48, z = -21.39, p < 0.001; Early-Long-term: β = -57.98, SE = 4.00, z = -14.50, p < 0.001), F2 (Early-Sustained: β = -53.58, SE = 2.31, z = -23.21, p < 0.001; Early-Long-term: β = -57.11, SE = 3.72, z = -15.35, p < 0.001), and F3 (Early-Sustained: β = -51.68, SE = 2.15, z = -24.04, p < 0.001; Early-Long-term: β = -54.14, SE = 3.47, z = -15.62, p < 0.001) than the other two groups of abstinent users; and (2) higher H1-A3 than the sustained abstinent users (Early-Sustained: β = 5.29, SE = 1.07, z = 4.95, p < 0.001).

Significant main effects of “the duration of drug detoxification” were found for all temporal-related features, with the only exception of voiced-segments mean. The results of post hoc analysis showed that early abstinent users exhibited fewer loudness peaks per second (Early-Sustained: β = -0.92, SE = 0.12, z = -7.72, p < 0.001; Early-Long-term: β = -0.77, SE = 0.19, z = -4.01, p < 0.001) and longer average duration of unvoiced segments (Early-Sustained: β = 0.21, SE = 0.01, z = 24.77, p < 0.001; Early-Long-term: β = 0.21, SE = 0.01, z = 15.76, p < 0.001) than the sustained and long-term abstinent users. Besides, early abstinent users showed fewer voiced segments per second as compared to the sustained abstinent users (Early-Sustained: β = -0.47, SE = 0.09, z = -5.23, p < 0.001).

4. Discussion

The current study investigated the speech pattern of 66 heroin users with different durations of drug detoxification (i.e., 22 early abstinent users [< 2 years], 37 sustained abstinent users [= 2 years], and 7 long-term abstinent users [> 2 years]). The results of acoustic analysis showed significant effects of detoxification on the amplitude-related (i.e., shimmer and loudness), spectral-related (i.e., relative energies of F1, F2, and F3, and H1-A3), and temporal-related features (i.e., loudness peaks and voiced segments per second, and average duration of unvoiced segments).

Previous research has established that higher H1-A3 indicate the presence of hyper-functional voice disorders (i.e., vocal hoarseness) in speakers. It has found that the previous heroin, ketamine, and methamphetamine users exhibit higher H1-A3 compared to healthy control groups, indicating the presence of vocal hoarseness in drug users [12]. Therefore, the most significant finding observed in the present study is that the sustained and long-term abstinent heroin users demonstrated reduced H1-A3, indicating an alleviation of vocal hoarseness in the two groups of abstinent users. It is difficult to explain this result without further medical examinations on these drug users, but there are several possible explanations for this result. As the duration of detoxification increases, the respiratory function of drug users may improve, leading to more stable airflow during speech production. Alternatively, the tension in the vocal fold muscles of drug users may increase, enhancing control over the vocal folds. These explanations require future specialized physiological research to validate.

In addition, the increasing vocal intensity (e.g., loudness, relative energies of F1, F2, and F3, voiced segments, and loudness peaks per second) observed for the sustained and long-term abstinent heroin users may suggest an enhancement in the spectral energy of their speech spectrum, as well as an increased prevalence of spectral harmonic components. Comparing with previous acoustic measurements on healthy non-drug users (i.e., H1-A3 = 23.12, loudness = 1.32, F1 relative energy = -31.62, F2 relative energy = -40.48, F3 relative energy = -45.64, and voiced segments per second = 2.23) [12], the above findings in the current paper collectively support the hypothesis that detoxification may lead to a gradual recovery in the speech production of abstinent heroin users.

From Table 1, a further comparison among the three groups of abstinent heroin users shows an ascending order of F0 measurements (i.e., mean, standard deviation, and range), i.e., early < sustained < long-term. Although it is not rigorous here to make a direct comparison across groups of abstinent users, we can still see the distinction roughly. In specific, the longer the duration of detoxification, the greater the variation in fundamental frequency observed in heroin users. In other words, as abstinent heroin users undergo detoxification, their control over the vocal folds tends to approach that of healthy controls (the measurements of the healthy non-drug users from Geng et al. [12] are taken as a reference: F0 mean = 25.29, F0 sd = 0.18, and F0 range = 4.77).

5. Limitations

Several limitations of the present study should also be noted. In the first place, only seven long-term abstinent users (i.e., > 2 years) were recruited in the current study. The small sample size of the long-term abstinent users may not be representative of the larger population of long-term abstinent users. Therefore, the results should be interpreted with caution and may not be generalizable to all long-term abstinent users. The predominance of short-term abstinent users may also introduce limitations in result interpretation. Our findings may primarily reflect the effects of detoxification on this specific stage, rather than providing a comprehensive understanding of the entire recovery process. To improve future studies, it is crucial to increase the sample size of the long-term abstinent users and balance the sample size of each group to enhance the reliability and validity of the findings. Additionally, the lack of female participants in our study should be acknowledged as a limitation. By not including female participants, our findings may not be representative of the entire population and may overlook potential gender-related differences in the effects of detoxification. Further studies involving female participants will enable a more comprehensive understanding of the effects of detoxification on the speech production. Thirdly, a longitudinal study on all abstinent drug users is of utmost necessity to further corroborate the findings of this research. Lastly, an investigation of the effect of different drug types (e.g., ketamine, Methamphetamine, etc.) will be necessary to draw a full picture of the effect of detoxification on speech production.

6. Conclusion

This study set out to investigate the effect of detoxification on speech production. The current study revealed that, taken heroin as an example, significant changes were observed for amplitude-related, spectral-related, and temporal related feature, which correlated with the extended duration of drug detoxification. In general, it seems drug detoxification might result in a speech rehabilitation (e.g., less vocal hoarseness) of the abstinent heroin user. Not only does this study enhance the understanding of the impact of drug use on speech production, but it also provides a theoretical basis for the speech rehabilitation and detoxification treatment of drug users.

References

  1. 1. Liu N, Lu Z, Xie Y. Tracking study on the relapse and aftercare effect of drug patients released from a compulsory isolated detoxification center. Front Psychiatry. 2022;12: 699074. pmid:35111083
  2. 2. Zhu H, Wu L-T. National trends and characteristics of inpatient detoxification for drug use disorders in the United States. BMC Public Health. 2018;18: 1–14. pmid:30157815
  3. 3. Gamble TK, Gamble MW. Interpersonal communication: Building connections together. SAGE publications; 2013. https://thuvienso.hoasen.edu.vn/bitstream/handle/123456789/9794/Contents.pdf
  4. 4. Nassif Filho A, Bettega S, Lunedo S, Maestri J, Gortz F. Repercussões otorrinolaringológicas do abuso de cocaína e/ou crack em dependentes de drogas. Rev Assoc Médica Bras. 1999;45: 237–241.
  5. 5. Sataloff RT. Voice science. San Diego/Oxford: Plural Pub Incorporated; 2005.
  6. 6. Shamei A, Bird S. An acoustic analysis of cannabis-intoxicated speech. Can Acoust. 2019;47: 108–109.
  7. 7. Agurto C, Norel R, Ostrand R, Bedi G, De Wit H, Baggott MJ, et al. Phonological Markers of Oxytocin and MDMA Ingestion. Interspeech. 2017. pp. 3142–3146.
  8. 8. Agurto C, Cecchi GA, Norel R, Ostrand R, Kirkpatrick M, Baggott MJ, et al. Detection of acute 3, 4-methylenedioxymethamphetamine (MDMA) effects across protocols using automated natural language processing. Neuropsychopharmacology. 2020;45: 823–832. pmid:31978933
  9. 9. Agurto C, Norel R, Pietrowicz M, Parvaz M, Kinreich S, Bachi K, et al. Speech markers for clinical assessment of cocaine users. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2019. pp. 6391–6394.
  10. 10. Agurto C, Cecchi G, King S, Eyigoz EK, Parvaz MA, Alia-Klein N, et al. Speak and you shall predict: speech at initial cocaine abstinence as a biomarker of long-term drug use behavior. bioRxiv. 2023. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370100/pdf/nihpp-2023.07.18.549548v1.pdf pmid:37503140
  11. 11. Shamei A, Sullivan PR, Liu Y, Abdul-Mageed M, Gick B. Automated detection of cannabis intoxication from speech. Can Acoust. 2021;49.
  12. 12. Geng P, Fan N, Ling R, Li Z, Guo H, Lu Q, et al. Acoustic Characteristics of Mandarin Speech in Male Drug Users. J Voice. 2023. Available: http://dx.doi.org/10.1016/j.jvoice.2023.08.022 pmid:37827893
  13. 13. Mirkov V, Mitrović SM. Vocal assessment of addicts on methadone therapy via the RBH scale and objective acoustic parameters. Braz J Otorhinolaryngol. 2019;85: 50–54. pmid:29174584
  14. 14. Stein DR. Linguistic analysis of the speech of a selected group of former alcoholics: A research note. Q J Stud Alcohol. 1966;27: 106–109. pmid:5907517
  15. 15. Sobell LC, Sobell MB. Effects of alcohol on the speech of alcoholics. J Speech Hear Res. 1972;15: 861–868. pmid:4652408
  16. 16. Harmsworth C. Emotional Prosody Communication In Long-Term Abstained Alcoholics. University of Essex. 2015. https://repository.essex.ac.uk/16876/1/MSD_Final.pdf
  17. 17. Schuller B, Steidl S, Batliner A, Schiel F, Krajewski J, Weninger F, et al. Medium-term speaker states—A review on intoxication, sleepiness and the first challenge. Comput Speech Lang. 2014;28: 346–374.
  18. 18. Shenoi VV, Kuchibhotla S, Kotturu P. An efficient state detection of a person by fusion of acoustic and alcoholic features using various classification algorithms. Int J Speech Technol. 2020;23: 625–632.
  19. 19. Suffoletto B, Anwar A, Glaister S, Sejdic E. Detection of alcohol intoxication using voice features: a controlled laboratory study. J Stud Alcohol Drugs. 2023;84: 808–813. pmid:37306378
  20. 20. Miner NB, Schutzer WE, Zarnegarnia Y, Janowsky A, Torralva R. Fentanyl causes naloxone-resistant vocal cord closure: A platform for testing opioid overdose treatments. Drug Alcohol Depend. 2021;227: 108974. pmid:34492557
  21. 21. Tashkin DP, Roth MD. Pulmonary effects of inhaled cannabis smoke. Am J Drug Alcohol Abuse. 2019;45: 596–609. pmid:31298945
  22. 22. Vearrier D, Greenberg MI, Miller SN, Okaneku JT, Haggerty DA. Methamphetamine: history, pathophysiology, adverse health effects, current trends, and hazards associated with the clandestine manufacture of methamphetamine. Dis Mon. 2012;58: 38–89. pmid:22251899
  23. 23. Gold M, Boyack I, Caputo N, Pearlman A. Imaging prevalence of nasal septal perforation in an urban population. Clin Imaging. 2017;43: 80–82. pmid:28242555
  24. 24. Silvestre FJ, Perez-Herbera A, Puente-Sandoval A, Bagán JV. Hard palate perforation in cocaine abusers: a systematic review. Clin Oral Investig. 2010;14: 621–628. pmid:20063023
  25. 25. Zhang D, Patel KB, Cass LM, Foster AE, Guntupalli L, Brunworth JD. Heroin-induced nasal necrosis and septal perforation. Acta Oto-Laryngol Case Rep. 2017;2: 145–149.
  26. 26. Meehan-Atrash J, Korzun T, Ziegler A. Cannabis inhalation and voice disorders: a systematic review. JAMA Otolaryngol Neck Surg. 2019;145: 956–964. pmid:31393535
  27. 27. Aides LD, Prater SR, Chronister RB, Marco LA. Ketamine as a pharmacological model for tongue dyskinesia. Biol Psychiatry. 1988;23: 159–168. pmid:3334884
  28. 28. Marco LA, Reed TF, Aides LD, Chronister RB. An electrographic characterization of ketamine-induced linguopharyngeal motor activity. Clin Neuropharmacol. 1988;11: 141–150. pmid:3378223
  29. 29. Huang X, Chen Y-Y, Shen Y, Cao X, Li A, Liu Q, et al. Methamphetamine abuse impairs motor cortical plasticity and function. Mol Psychiatry. 2017;22: 1274–1281. pmid:28831198
  30. 30. Lu L, Yang W, Zhang X, Tang F, Du Y, Fan L, et al. Potential brain recovery of frontostriatal circuits in heroin users after prolonged abstinence: A preliminary study. J Psychiatr Res. 2022;152: 326–334. pmid:35785575
  31. 31. Wang X, Li B, Zhou X, Liao Y, Tang J, Liu T, et al. Changes in brain gray matter in abstinent heroin addicts. Drug Alcohol Depend. 2012;126: 304–308. pmid:22717339
  32. 32. Zhuang W, Tang Y, Zhong N, Jiang H, Du J, Wang J, et al. Persistent microstructural deficits of internal capsule in one-year abstinent male methamphetamine users: a longitudinal diffusion tensor imaging study. J Neuroimmune Pharmacol. 2016;11: 523–530. pmid:27115910
  33. 33. Wang X, Li Y, Li J, Hao W. Emerging patterns of substance abuse and related treatment in China. Curr Opin Psychiatry. 2023;36: 277–282. pmid:37191649
  34. 34. Eyben F, Wöllmer M, Schuller B. Opensmile: the munich versatile and fast open-source audio feature extractor. Proceedings of the 18th ACM international conference on Multimedia. 2010. pp. 1459–1462.
  35. 35. Eyben F, Scherer KR, Schuller BW, Sundberg J, André E, Busso C, et al. The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing. IEEE Trans Affect Comput. 2015;7: 190–202.
  36. 36. Chen J, Ye J, Tang F, Zhou J. Automatic detection of alzheimer’s disease using spontaneous speech only. Interspeech. NIH Public Access; 2021. p. 3830.
  37. 37. Tracy JM, Özkanca Y, Atkins DC, H Ghomi R. Investigating voice as a biomarker: deep phenotyping methods for early detection of Parkinson’s disease. J Biomed Inform. 2020;104: 103362. pmid:31866434
  38. 38. Bates D, Mächler M, Bolker B, Walker S. Fitting Linear Mixed-Effects Models Using lme4. J Stat Softw. 2015;67: 1–48.
  39. 39. Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest package: tests in linear mixed effects models. J Stat Softw. 2017;82.
  40. 40. Lenth R. Package ‘lsmeans.’ Am Stat. 2018;34: 216–221.