
Withdrawal ruptures in adolescents with borderline personality disorder psychotherapy are marked by increased speech pauses–can minimal responses be automatically detected?

  • Christophe Künsch,

    Roles Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Faculty of Psychology, University of Basel, Basel, Switzerland

  • Lukas Fürer,

    Roles Data curation, Formal analysis, Software, Writing – review & editing

    Affiliation Child and Adolescent Psychiatric Research Department, Psychiatric University Hospital Basel, Basel, Switzerland

  • Martin Steppan,

    Roles Supervision, Visualization, Writing – review & editing

    Affiliation Child and Adolescent Psychiatric Research Department, Psychiatric University Hospital Basel, Basel, Switzerland

  • Nathalie Schenk,

    Roles Data curation, Writing – review & editing

    Affiliation Child and Adolescent Psychiatric Research Department, Psychiatric University Hospital Basel, Basel, Switzerland

  • Kathrin Blum,

    Roles Writing – review & editing

    Affiliation Faculty of Psychology, University of Basel, Basel, Switzerland

  • Michael Kaess,

    Roles Project administration, Resources, Supervision, Writing – review & editing

    Affiliations Department of Child and Adolescent Psychiatry, Centre of Psychosocial Medicine, University of Heidelberg, Heidelberg, Germany, University Hospital of Child and Adolescent Psychiatry and Psychotherapy, University of Bern, Bern, Switzerland

  • Julian Koenig,

    Roles Writing – review & editing

    Affiliations University Hospital of Child and Adolescent Psychiatry and Psychotherapy, University of Bern, Bern, Switzerland, Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany

  • Klaus Schmeck,

    Roles Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Child and Adolescent Psychiatric Research Department, Psychiatric University Hospital Basel, Basel, Switzerland

  • Ronan Zimmermann

    Roles Conceptualization, Formal analysis, Methodology, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    ronan.zimmermann@unibas.ch

    Affiliations Faculty of Psychology, University of Basel, Basel, Switzerland, Child and Adolescent Psychiatric Research Department, Psychiatric University Hospital Basel, Basel, Switzerland

Abstract

Alliance ruptures of the withdrawal type are prevalent in adolescents with borderline personality disorder (BPD). Longer speech pauses are negatively perceived by these patients. Safran and Muran’s rupture model is promising, but its application is very work-intensive. This workload makes research costly and limits clinical usage. We hypothesised that pauses can be used to automatically detect one of the markers of the rupture model, i.e., the minimal response marker. Additionally, the association of withdrawal ruptures with pauses was investigated. A total of 516 ruptures occurring in 242 psychotherapy sessions collected in 22 psychotherapies of adolescent patients with BPD and subthreshold BPD were investigated. Trained observers detected ruptures based on video and audio recordings. In contrast, pauses were automatically marked in the audio recordings of the psychotherapy sessions, and automatic speaker diarisation was used to determine the speaker-switching patterns in which the pauses occur. A random forest classifier detected time frames in which ruptures with the minimal response marker occurred based on the quantity of pauses. Performance was very good, with an area under the ROC curve of 0.89. Pauses which were both preceded and followed by therapist speech were the most important predictors for minimal response ruptures. Research costs can be reduced by using machine learning techniques instead of manual rating for rupture detection. In combination with other video- and audio-derived features like movement analysis or automatic facial emotion detection, more complete rupture detection might be possible in the future. These innovative machine learning techniques help to narrow down the mechanisms of change of psychotherapy, here specifically of the therapeutic alliance. They might also be used to technologically augment psychotherapy training and supervision.

1. Introduction

Therapeutic alliance has been identified as one of the most robust predictors for therapy outcome [1, 2]. Alliance-outcome correlations are of moderate effect size [3–5]. According to the rupture and resolution model [6], the therapeutic alliance is dynamic and it is constantly negotiated between the patient and the therapist. "An alliance rupture is [defined as] a deterioration in the alliance, manifested by a lack of collaboration between patient and therapist on tasks or goals, or a strain in the emotional bond.” [7, p. 2]. Ruptures are conceptualised as inevitable in psychotherapy and offer windows of opportunity for the therapeutic process [8]. Two types of ruptures are distinguished by the model: ‘withdrawal ruptures’, in which the patient is “moving away” from the therapist, and ‘confrontation ruptures’, in which the patient “moves against” the therapist [9–12]. According to this model, the therapist can use rupture resolution strategies to repair ruptures, which maintains and strengthens the therapist-patient bond. Examples of resolution strategies are the reconceptualization of the patient’s task or disclosing the therapist’s perception of the rupture. The resolution process is thought to be therapeutic in itself [6].

Borderline personality disorder (BPD) is characterized by persistent patterns of, among other features, unstable and intense interpersonal relationships and fear of abandonment [13]. The disorder is known to impact the therapeutic alliance [14, 15]. Consequently, good management of the therapeutic relationship is of high importance in the psychotherapy of these patients [15]. A dynamic conceptualisation of alliance, as offered by the rupture and resolution model, is of theoretical and, likely, practical interest for the treatment of BPD. Research data support the idea that personality disorders, in general, are associated with a higher number of ruptures. However, the evidence base is somewhat inconclusive [16]. It has been hypothesised that specific rupture training can improve therapy outcome; however, this association was not statistically confirmed [17].

A growing body of research shows the importance of early detection of and intervention in BPD already in youth [18] and, as a consequence, the age threshold for diagnosing personality disorders has been omitted in the ICD-11 [19]. Our research group has studied the effectiveness of Adolescent Identity Treatment (AIT) and Dialectic Behaviour Therapy in adolescent patients with BPD. The results of this study showed that both treatments are effective at improving psychosocial functioning and personality functioning in these patients [20]. In the same project [21], we also studied ruptures and resolutions occurring in the psychotherapy with AIT. This research was based on video recordings of the psychotherapeutic sessions, which were reviewed by trained observers and rated with the ‘Rupture and Resolution Rating System’ (3RS) [7]. The resulting paper by Schenk et al. [22] focused on the trajectories of ruptures over entire psychotherapies. Withdrawal ruptures occurred more often than confrontation ruptures. Most ruptures occurred in the middle of the treatment, and, additionally, alliance struggle peaks could be identified, mostly after an impactful rupture [22]. The most frequent rupture marker was “minimal response” (please see below for a definition). Confrontation ruptures seemed to have a stronger impact than withdrawal ruptures [22]. A major limitation of the study by Schenk et al. [22] was that it only contained 10 fully analysed psychotherapies while our total sample amounted to 23 cases [20]. This was due to a lack of 3RS-trained personnel since the involved PhD and master students had finished their projects. After the initial 10 cases, the decision was made to rate only the first five sessions of each psychotherapy to answer questions on early alliance.

While clinically insightful and increasingly popular over the last years, the rupture and resolution model still leaves many open questions regarding ruptures and resolutions and their usefulness in the treatment of personality disorders. As the model entails the careful observation of psychotherapeutic sessions on a moment-to-moment basis [22–25], its application is very resource intensive. In addition, the observers need to be trained and inter-rater reliability needs to be ensured to allow for replicability. This can be difficult to maintain over time due to changes in personnel. The resource intensity is a barrier for research; e.g., producing adequately powered studies while observing the dynamics of the alliance over full therapies will be very expensive. Additionally, the translation of the model into clinical practice, where resources are even scarcer, is highly problematic, if not impossible. With this in mind, while not part of our initial research protocols [21, 26], our research group has gained interest in automated evaluations of psychotherapy sessions based on audio and video recordings (e.g., dyadic speech pattern analysis or facial emotion recognition) and used these techniques in the sample that will be investigated in the current study [27–30]. These techniques allow for the standardised processing of entire psychotherapies within hours, minutes or even in real time. From a feasibility perspective, these automated methods have a much greater potential to translate into clinical practice, if they are proven clinically useful. While real-time detection during the psychotherapy might not be desirable, some researchers see potential in digital tools that can support quality assurance of psychotherapy for training and supervision. As an example, a web-based platform which organises session recordings and, according to the creators of the platform, provides clinically relevant markers based on audio recordings, has already been developed [31].

Consequently, such technology-based procedures might potentially be used to automatically detect ruptures, rendering research with this model much more affordable and offering a perspective for clinical implementations. Such a detection has been piloted [32]: Dolev-Amit et al. argue that the detection of withdrawal ruptures is critical as these ruptures often go unnoticed. They hypothesised that acoustic data could serve as a marker for withdrawal ruptures. Dolev-Amit et al. were able to show, in a case study, that a higher F0 span, speech pause proportion and shimmer, as well as a lower articulation rate than neutral speech, can be used to mark withdrawal ruptures. The in-depth case study allowed them to draw up a scenario in which previously missed ruptures are identified during supervision. They also discussed the future potential of such markers to be used in real-time computer-assisted psychotherapy.

In the current study, we were specifically interested in using speech pauses as a predictor and in investigating their correlation with ruptures. This interest was motivated by our previous study on speech pauses (silence) conducted on the sample used in the current article [27]. That study investigated the correlation of speech pauses with post-session evaluations of the “smoothness” and “goodness” of the overall session (measured with the Session Evaluation Questionnaire [33, 34]). Sessions with more pauses were perceived as worse and less smooth by the adolescent patients with BPD, making pauses potentially problematic. Additionally, we had used automatic speaker diarisation (analysing who speaks when) to investigate the effect of the four possible speaker-switching patterns in which pauses can be located (e.g., the pause could be located between two therapist speaking turns or, alternatively, between a patient speaking turn and a therapist speaking turn, …). In this session-level analysis, the amounts of pauses in the different speaker-switching patterns were highly intercorrelated and were therefore aggregated for the analysis. However, we concluded that future research should correlate pauses with “significant therapeutic events such as ruptures” [27, p. 167]. One of the aims of the current study is to follow up on this research suggestion. First, we hypothesised that ruptures in general would contain more pauses than non-ruptures (H1). Second, we hypothesised that withdrawal ruptures would be characterised by more pauses than confrontation ruptures (H2).

In addition, we hypothesised that automatically detected pauses would, to a certain extent, allow for an automatic detection of rupture markers. Considering the results of the explorative case study by Dolev-Amit et al. [32], it makes sense to target withdrawal ruptures. However, the concept of withdrawal ruptures is an amalgam of multiple rupture markers (denial, minimal response, abstract communication, avoidant storytelling and shifting topic, deferential and appeasing) [7], making withdrawal ruptures too complex and abstract a construct to be targeted with automatic procedures. Instead, we targeted the “minimal response” marker. This rupture marker is defined as “withdrawal from the therapist by going silent or by giving minimal responses to questions or statements that are intended to initiate or continue discussion” [7]. The predictive counterpart in our hypothesised model, the speech pause, has a more technical definition and is simply the absence of speech in a verbal interaction. The concepts of withdrawal rupture and speech pause are, thus, similar but not congruent. A major difference is that, in the current context, minimal response markers are detected by a 3RS-trained human who observes video material of psychotherapies and considers the overall therapeutic situation based on video and audio input. Pauses, in contrast, are, in the current context, automatically detected based on audio recordings. Based on the strong conceptual link, and the fact that minimal response is the most frequent rupture marker in adolescents with BPD, we aimed at automatically detecting minimal responses. We hypothesised that automatically detecting ruptures with “minimal response” markers based on speech pauses in different speaker-switching patterns would be feasible (H3). For a clearer understanding of the study, it is noteworthy that the 3RS rating system acknowledges that the same sentence can be labelled as a confrontation rupture and a withdrawal rupture at the same time [7]. However, in these cases, we selected the dominant type to make the rupture types (confrontation and withdrawal) mutually exclusive. Additionally, a rupture can have multiple rupture markers (e.g., minimal response, denial, complaints about the psychotherapist), which are not mutually exclusive. Minimal response markers, which are associated with withdrawal ruptures, could thus occur in segments with the dominant type ‘confrontation rupture’.

2. Materials and methods

The present study is part of the multi-centre study “Evaluation of Adolescent Identity Treatment” [20, 21, 35], registered at clinicaltrials.gov (NCT02518906). The results of the clinical trial have been published by Schmeck et al. [20]. The following analysis uses the available data collected at a single participating centre (Psychiatric University Hospitals of Basel). Ethical approval has been obtained from the local ethics committee (Ethikkommission Nordwest- und Zentralschweiz: Nr.: 2015–230). Written consent has been signed by participating adolescents, parents and therapists. The current study is a secondary analysis which combines data from two previous studies by Zimmermann et al. [27] and Schenk et al. [22].

2.1. Sample

The inclusion criteria for patients were: age between 13 and 19 years; at least three BPD criteria according to the Structured Clinical Interview for DSM-IV Axis II Personality Disorders (SCID-II); and identity diffusion (total t-score > 60), evaluated with the Assessment of Identity Development in Adolescence [36, 37]. The overall sample of 23 patients is described in Table 1 of the paper by Schmeck et al. [20]. Only the patients in the AIT study arm were included here. Since one patient withdrew permission to use the video material, the final sample comprised N = 22 patients. The mean age of the participants was 16.3 years (SD = 1.6). Six patients dropped out of treatment, but their recorded sessions were included in the analysis. Fig 1 shows the available and missing sessions for each of the 23 patients, along with the reason each missing session was unavailable.

Table 1. Speech pause percentage according to rupture type and minimal response markers.

https://doi.org/10.1371/journal.pone.0280329.t001

The rupture and resolution ratings used in the current study are re-used from a previous study by Schenk et al. [22]. As described in the introduction section, only the initial 10 patients had all psychotherapy sessions rated with the 3RS. Due to a lack of personnel, only the first five sessions were rated for the subsequent 13 patients. Additionally, five recordings of therapeutic sessions could not be evaluated due to technical difficulties or human error (i.e., the data was not saved). The analysed sample consisted of 242 psychotherapeutic sessions with a planned duration of 50 minutes per session. Eight psychotherapists were involved.

2.2. Setting and data acquisition

The included patients were treated with up to 25 sessions of AIT [38]. AIT is an integrative manualised psychotherapy approach for the treatment of adolescent personality disorder. It uses psychodynamic elements, psychoeducation and a cognitive-behaviourally oriented home plan. Within this clinical trial, AIT was shown to be as effective as the more established treatment approach Dialectic Behaviour Therapy in adolescent patients [20]. The sessions were video recorded with two cameras mounted to observe both the patient and the therapist from the front. Audio was recorded with a boundary microphone attached to an adjacent wall between patient and therapist.

2.3. Speech pause detection

The audio recordings were cut to start at the beginning of the actual psychotherapeutic process (patient and therapist in sitting position and the therapist invites the patient to start the session) and to end when the therapist formally ended the session. Silence detection was performed with the Matlab code available in the repository ‘https://github.com/com-psy-lab/Silence-Detection’.

The method for silence detection is based on the idea that speech will yield a variable signal, while the absence of speech will result in a non-variable signal. First, cut-off parameters for a specific recording environment need to be determined. This was done for each session by calculating the absolute difference between signal maximum and signal minimum in small windows of 0.01 s. Windows with low signal variability (small signal range) are associated with silence. Looking at the distribution of all maximum-to-minimum distances of these windows in a histogram, one notices its positive skewness. This derives from the fact that windows containing silence will always yield the same (or highly similar) maximum-to-minimum distances and therefore accumulate at the left of the histogram. Scott’s Rule was used to determine an adequate number of histogram bins from which to select a cut-off in terms of maximum-to-minimum variability; the selection of the appropriate bin was based on auditory probes. In a next step, this cut-off is applied to larger non-overlapping windows of 0.1 s. We found that this procedure was superior to any other available method that we tested on our dataset. The method is described in detail in the readme file and the manual [39]. The method has been used in preceding peer-reviewed publications [27, 28]. Based on the result of the procedure, start and stop times of the pauses relative to the start of the recording were extracted.
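
The reference implementation is the Matlab code in the repository cited above. For illustration only, the following R sketch outlines the same windowing-and-histogram idea; the helper name, window lengths and the simple "second bin boundary" cut-off are our assumptions and stand in for the auditory-probe calibration described above.

```r
# Illustrative R sketch of the silence-detection idea (not the authors' Matlab code).
library(tuneR)

detect_pauses <- function(wav_path, probe_win = 0.01, label_win = 0.1) {
  wav <- tuneR::readWave(wav_path)
  x   <- wav@left / 2^(wav@bit - 1)                 # normalised mono signal
  sr  <- wav@samp.rate

  # 1) max-to-min signal range in small 0.01 s probe windows
  probe_id <- ceiling(seq_along(x) / (probe_win * sr))
  ranges   <- tapply(x, probe_id, function(w) diff(range(w)))

  # 2) histogram with Scott's rule; silent windows pile up in the left-most bins,
  #    so an early bin boundary serves as the variability cut-off (placeholder for
  #    the auditory-probe based choice described in the text)
  h      <- hist(ranges, breaks = "Scott", plot = FALSE)
  cutoff <- h$breaks[2]

  # 3) apply the cut-off in larger non-overlapping 0.1 s labelling windows
  label_id <- ceiling(seq_along(x) / (label_win * sr))
  silent   <- tapply(x, label_id, function(w) diff(range(w)) < cutoff)

  # 4) collapse consecutive silent windows into pause start/stop times (seconds)
  r      <- rle(as.vector(silent))
  ends   <- cumsum(r$lengths) * label_win
  starts <- ends - r$lengths * label_win
  data.frame(start = starts[r$values], stop = ends[r$values])
}
```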

2.4. Classification of the speech pauses according to the speaker-switching patterns

Speaker diarisation is the process of determining who speaks when in recorded speech [40]. We used a supervised machine learning algorithm for this task. The following description of the procedure is slightly modified from the study [27, p. 162] in which the method was used on the current data set: A human scientific assistant extracted learning material, which was then used to train a machine learning algorithm to perform diarisation of the complete material. If possible, the learning set was extracted for each dyad from two initial, two middle, and two final sessions. After this procedure, the learning set comprised a list of start and stop time stamps of samples of “patient speech” or “therapist speech”, set with the Audacity software [41]. We did not use transcripts. The learning set amounted to a minimum of 5 min of voice recordings per speaker. The features for machine learning were calculated in non-overlapping 0.2-s windows using a Matlab audio library [42]. This library computes 35 audio features for each window (e.g., mean fundamental frequency or Mel-frequency cepstral coefficients). The features and learning set were then used in a random forest classifier. This decision-tree-based method learned to classify 0.2-s windows as patient or therapist speech based on the extracted features. The source code for this method can be retrieved from GitHub [43]; please consider the readme file for details. Furthermore, the procedure was described and validated on a speech corpus in a study by Fürer et al. [28], showing low error rates compared to unsupervised methods. The outcome of the procedure is the attribution of each detected speech utterance to either the psychotherapist or the patient in each dyad. In the current study, this information was used to classify each pause as belonging to one of four possible speaker switching patterns:

Patient speaks–Pause–Patient continues speaking (P_P); Patient speaks–Pause–Therapist speaks after the pause (P_T); Therapist speaks–Pause–Patient speaks (T_P); Therapist speaks–Pause–Therapist continues speaking (T_T).

For each of the patterns a pause variable was created, coding the proportion of the specific type of pauses.
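
As an illustration of this classification step (not the authors’ published code), the sketch below labels each pause with the speaker before and after it and then derives the four per-session pause variables. The data frames, column names (start, stop, speaker coded as "P" or "T") and helper names are assumptions.

```r
# Illustrative sketch: assign each pause to P_P, P_T, T_P or T_T based on the
# diarised utterances surrounding it (column names are assumptions).
label_switch_pattern <- function(pauses, utterances) {
  utterances <- utterances[order(utterances$start), ]
  pauses$pattern <- vapply(seq_len(nrow(pauses)), function(i) {
    before <- utterances$speaker[utterances$stop  <= pauses$start[i]]
    after  <- utterances$speaker[utterances$start >= pauses$stop[i]]
    if (length(before) == 0 || length(after) == 0) return(NA_character_)
    paste(tail(before, 1), head(after, 1), sep = "_")   # e.g. "T_T" or "P_T"
  }, character(1))
  pauses
}

# Per-session proportion of pause time in each pattern (the four pause variables)
pause_proportions <- function(pauses, session_length) {
  dur <- tapply(pauses$stop - pauses$start, pauses$pattern, sum)
  100 * dur / session_length
}
```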

2.5. Rupture coding

The “Rupture Resolution Rating System”, 3RS [7], was used to code ruptures and resolutions. The 3RS is an observer-based coding system to assess alliance rupture and resolution markers in psychotherapy. The 3RS differentiates between two types of ruptures: withdrawal and confrontation. It includes seven withdrawal markers, seven confrontation markers and ten resolution markers. A detailed definition of the rupture types and the markers can be found in the manual of the 3RS [7]. In addition, the 3RS provides a rupture significance rating using a five-point Likert scale ranging from no significance to high significance. It assesses the immediate impact that rupture markers inflict on the therapeutic alliance with respect to the impairment in collaboration regarding goals, tasks and the affective bond. The 3RS has demonstrated good interrater reliability (ICCs = .85 to .98) [44].

The rupture and resolution data for the current study is re-used from a study by Schenk et al. [22], who analysed 10 full psychotherapies. Additionally, the same team of trained observers rated the first five sessions of 13 additional patients using the same method. The rupture coding procedure has been described in the study by Schenk et al. [22]. Parts of the description below are re-used from this paper: The rupture and resolution detection and rating was done by two independent observers based on video recordings of the therapy sessions. The observers’ training involved reading the 3RS manual, training with a 3RS-experienced research team from the Millennium Institute for Research in Depression and Personality (Santiago, Chile), and rating and discussing exercise material. The observers were blind to the study hypotheses and patients’ diagnoses. The complete data collection is based on consensual coding according to a three-step qualitative procedure: i) Independent coding phase: each therapy session was rated independently by each observer; ii) Intersubjective consensus meeting: the two observers compared and re-evaluated their coding. If the observers did not achieve agreement, an observed event was marked for supervision; iii) Supervisor meeting: data collection was supervised by N. Schenk in monthly meetings in which unclear events were re-evaluated.

While the 3RS manual allows for episodes to belong to both rupture types, in the current study, the observers selected a dominant rupture type. Thus, rupture types were mutually exclusive in the current study. Additionally, all rupture markers were used. For withdrawal ruptures, the markers were: Denial, Minimal response, Abstract communication, Avoidant storytelling and/or shifting topic, Deferential and appeasing, Content/affect split, Self-criticism and/or hopelessness. For confrontation ruptures, the markers were: Complaints/concerns about the therapist, Patient rejects therapist intervention, Complaints/concerns about the activities of therapy, Complaints/concerns about the parameters of therapy, Complaints/concerns about progress in therapy, Patient defends self against therapist, Efforts to control/pressure therapist. The rupture markers are not mutually exclusive, and as an episode can be coded as a confrontation and a withdrawal rupture at the same time, it is possible that the selected dominant type ended up with a marker belonging to the other type (e.g., a minimal response marker could appear in a rupture with the dominant type ‘confrontation’). In Schenk et al. [22], all markers were used. In the current study, we only used the “minimal response” marker. Additionally, the rupture intensity was coded, which is not relevant for the current study [7, 22]. While the 3RS mentions multiple techniques for delimiting markers (e.g., using a priori fixed 5-minute windows), for the current data, the observers marked the exact beginning and end of rupture and resolution episodes and rated the observed rupture and resolution markers within these episodes.

2.6. Software and hardware

Data processing was done with R (v4.0.2) [45] for statistical computing. RStudio was used as the integrated development environment. We used the R packages lme4 (v1.1–28) [46], lmerTest (v3.1–3) [47] and sjPlot (v2.8.10) [48] for random effect models and their presentation; tidyverse (v1.3.1) [49] for data handling; ggplot2 (v3.3.5) [50] and RColorBrewer (v1.1–5) [51] for visualisations; data.table (v1.14.2) [52] for data writing and retrieving; and randomForest (v4.7–1) [53] and ROCR (v1.0–11) [54] for machine learning classification and performance evaluation. Calculations were performed at the sciCORE (http://scicore.unibas.ch/) scientific computing centre at the University of Basel.

2.7. Aggregation of pause and rupture data

Video time was divided into windows of 10 seconds length. For each window, the proportion of pause time relative to the total window length was calculated (pauses stretching over multiple windows were split proportionally across the windows they covered). Additionally, for each window, we determined whether it was part of a rupture episode or not. For each window marked as rupture, we also retained whether the rupture contained the ‘minimal response’ marker and its dominant type (withdrawal or confrontation).
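
A minimal sketch of this aggregation step (our own illustration; variable and column names are assumptions): pause time is apportioned to each 10 s window it overlaps, and the per-window percentage is computed.

```r
# Illustrative aggregation into 10 s windows; pauses spanning several windows
# contribute proportionally to each one.
aggregate_pauses <- function(pauses, session_length, win = 10) {
  breaks    <- seq(0, ceiling(session_length / win) * win, by = win)
  pause_sec <- numeric(length(breaks) - 1)
  for (i in seq_along(pause_sec)) {
    overlap      <- pmin(pauses$stop, breaks[i + 1]) - pmax(pauses$start, breaks[i])
    pause_sec[i] <- sum(pmax(overlap, 0))       # seconds of pause falling in window i
  }
  data.frame(window_start = head(breaks, -1),
             pause_pct    = 100 * pause_sec / win)
}
```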

2.8. Statistical analysis

Two linear mixed-effects models with random intercept by dyad were used to investigate differences in pauses according to rupture parameters. Percent of pauses in the 10 s windows was used as dependent variable in both models.

To test hypothesis H1 that ruptures are correlated with more speech pauses, Model A used ruptures vs. non-ruptures as predictor. Please see the formula of Model A in S1 File.

To test hypothesis H2 that withdrawal ruptures are characterised by more pauses than confrontation ruptures, Model B used rupture type as predictor. This model was exclusively based on rupture windows (excluding non-rupture windows). In order to additionally estimate the effect of minimal response ruptures, the presence of this marker was added as a further fixed-effect predictor. P-values < 0.05 were considered statistically significant. As two linear mixed-effects models were computed, we adjusted the p-values accordingly with the Bonferroni method. Please see the formula of Model B in S1 File.
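
The authoritative model formulas are given in S1 File; the lme4 calls below are only a plausible rendering of Models A and B as described above, with variable names assumed.

```r
# Plausible lme4 specifications for Models A and B (variable names are assumptions).
library(lme4)
library(lmerTest)   # adds p-values for the fixed effects

# Model A (H1): pause percentage per 10 s window, rupture vs. non-rupture,
# random intercept by dyad
model_a <- lmer(pause_pct ~ rupture + (1 | dyad), data = windows)

# Model B (H2): rupture windows only; dominant rupture type plus presence of
# the minimal response marker as fixed effects
model_b <- lmer(pause_pct ~ rupture_type + minimal_response + (1 | dyad),
                data = subset(windows, rupture == TRUE))

summary(model_a)
summary(model_b)
```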

For hypothesis H3 that minimal response ruptures can be predicted based on pauses and their speaker-switching patterns, a random forest algorithm (RF) was used. RF is a machine learning classifier that uses decision trees [55, 56]. As an ensemble learner, the algorithm combines a certain number of decision trees and uses them in a single prediction model [56]. RF is known to easily obtain very good classification results, while other approaches like deep learning might in some cases achieve better results when fine-tuned [57]. For a proof-of-principle study, using RF seemed a sensible choice as it is known to perform well for many supervised classification problems while not requiring much fine-tuning. We used the guide by De Oliveira [58] for setting up the machine learning model. Faced with the problem of an imbalanced data set (only about 3% of the ten-second windows contained minimal response marked ruptures), the RF was trained on rupture episodes only (discarding non-rupture windows). About 52% of the ruptures had a minimal response marker. We randomly selected 2/3 of the ruptures as a training set, leaving 1/3 for validation. This selection was stratified according to the presence of a minimal response marker and dyad and, additionally, weighted by the length of the ruptures in seconds. The predicted variable was minimal response (yes or no, in 10 s windows). The predicting features consisted of the z-transformed percentages (per dyad) of pauses according to the different speaker-switching patterns (four variables: T_T, T_P, P_T, P_P) and, additionally, those four variables lagged in both directions by 1–5 windows. For each window, the model therefore ‘knew’ the pause percentages 10, 20, 30, 40 and 50 seconds before and 10, 20, 30, 40 and 50 seconds after the 10 s window in question. The approach was inspired by a blog post on time series forecasting with random forests [59]. These shifts were selected based on the information in Fig 2, which shows the timing of pauses related to ruptures. After training, we evaluated the obtained model on the validation set and, additionally, on the validation set with all the non-rupture windows (which were excluded during training) added. We used the default settings provided by the ‘randomForest’ function in the randomForest R package (v4.7–1) [53]. We report the out-of-bag error as well as the importance measure for the features. Additionally, we calculated a receiver operating characteristic (ROC) curve, i.e., a plot of the true positive rate against the false positive rate [60, 61]. Finally, the classifier’s predictive accuracy was evaluated using the area under the curve (AUC), representing the probability that a random positive observation is ranked higher than a random negative observation [62]. Since the AUC considers the complete ROC curve with all possible classification thresholds, it is considered a robust overall measure [63]. We omitted calculating cut-off scores as the analyses are meant as a proof of principle.
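
The following sketch illustrates the classification and evaluation steps with randomForest and ROCR. The data frame, its column names, the factor levels and the plain random split are assumptions on our part; in particular, the simple split stands in for the stratified, length-weighted split described above.

```r
# Illustrative sketch of the H3 classifier. Assumed input: a data frame
# 'rupture_windows' with the z-transformed pause percentages per pattern
# (T_T, T_P, P_T, P_P) plus their +/- 1-5 window lags and leads as columns,
# a factor 'minimal_response' with levels "no"/"yes", and a 'dyad' id.
library(randomForest)
library(ROCR)

set.seed(1)
idx   <- sample(nrow(rupture_windows), size = round(2 / 3 * nrow(rupture_windows)))
train <- rupture_windows[idx, ]
valid <- rupture_windows[-idx, ]

features <- setdiff(names(rupture_windows), c("minimal_response", "dyad"))
rf <- randomForest(x = train[, features], y = train$minimal_response,
                   ntree = 500, importance = TRUE)   # default mtry ~ sqrt(no. of features)

print(rf)                                   # out-of-bag (OOB) error estimate
varImpPlot(rf, type = 1)                    # mean decrease accuracy (cf. Fig 3)

prob <- predict(rf, newdata = valid[, features], type = "prob")[, "yes"]
pred <- prediction(prob, valid$minimal_response)
plot(performance(pred, "tpr", "fpr"))       # ROC curve (cf. Fig 4)
performance(pred, "auc")@y.values[[1]]      # area under the curve
```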

Fig 2. Pauses in the time course of ruptures–withdrawal vs. confrontation.

Fig 2 shows the proportion of pauses during ruptures. To create this figure, the ruptures were aligned at their starting point. The figure includes 20 windows (200 s) before and 30 windows (300 s) after the beginning of a rupture, and the windows are overlapped relative to their distance from the starting point of the ruptures. The black vertical line at x = 0 indicates the starting point of the observed ruptures. Negative values on the x-axis represent the time before the ruptures. The boxplots at the bottom of the figure represent the length of the observed ruptures, with the notch indicating the median end point of the ruptures, to provide an impression of the rupture lengths in this figure. Each point (triangles for withdrawals, dots for confrontations) indicates the average of pauses in a 10 s window. Error bars indicate the standard error of the mean for each window. The blue dotted line represents withdrawal ruptures and the red solid line confrontation ruptures. The black horizontal line shows the average of pauses across all windows.

https://doi.org/10.1371/journal.pone.0280329.g002

3. Results

In the 242 sessions stemming from the 22 psychotherapies, 221 confrontation ruptures and 295 withdrawal ruptures were identified by the observers. The average length of a confrontation rupture episode was 133.4 seconds (SD = 215.8). Withdrawal rupture episodes had an average length of 116.1 seconds (SD = 189.9). A total of 186 ruptures were marked with the ‘minimal response’ rupture marker, of which 27 were found in confrontation ruptures and 159 in withdrawal ruptures.

The data was split into a total of 72’926 windows of 10 seconds. For each window, we know the percentage of pauses and whether it is a rupture or not. N = 6’870 of the windows were ruptures. Of those rupture windows, 3’183 were of the confrontation and 3’687 of the withdrawal dominant type and 3’564 included minimal response markers.

3.1. Pauses in ruptures (H1)

In non-ruptures, pauses made up on average 30.6% of the time (SD = 20%). In ruptures (confrontation and withdrawal taken together), pauses made up on average 41.5% of the time (SD = 24.8%). This difference (ruptures vs. non-ruptures) was statistically significant (p < 0.001) (Model A in the methods section). As the percent-of-pause variable presented with a slight positive skewness, we re-ran the analysis with a square-root-transformed variable. The p-values remained highly significant.

3.2. Time course of pauses in relation to withdrawal vs. confrontation ruptures

Fig 2 shows the average time course of pauses in relation to the observer-marked beginning of the 516 rupture episodes. The figure depicts a clear difference between the time course of pauses in withdrawal compared to confrontation ruptures. In confrontation ruptures, pauses appear to be only slightly increased compared to before the beginning of the ruptures. In withdrawal ruptures, a sharp increase in the proportion of pauses can be observed. Interestingly, this sharp increase begins 25 seconds before the observers marked the beginning of the ruptures. Even before this sharp increase, pauses are increased compared to the average (black broken horizontal line). After a rupture ends, the proportion of pauses returns to the average (due to the different lengths of the ruptures, the timing of this normalisation is not exactly discernible in this summative figure).

A statistical test of the difference of withdrawal and confrontation ruptures is provided in the next section because it is a combined analysis of rupture type and the minimal response marker.

3.3. Speech pauses according to rupture type (H2) and presence of minimal response markers

Table 1 shows speech pause percentage according to the rupture type and minimal response marker.

Table 2 shows the results of the random effect model analysis (described as Model B in the methods section). The percentage of pauses per 10 s window was predicted by the rupture type of the window and whether it contained a minimal response rupture marker.

Table 2. Percent of pauses predicted by rupture type and minimal response (Model B).

https://doi.org/10.1371/journal.pone.0280329.t002

According to the estimated effects of this model, windows belonging to confrontation ruptures without minimal response markers contain 32.39% pauses. Compared to this value, withdrawal rupture windows present with 3.22 percentage points more pauses. Rupture windows with a minimal response marker have 11.67 percentage points more pauses compared to confrontation ruptures without a minimal response marker. The effect of rupture type is statistically significant (p < 0.001), with withdrawal ruptures being correlated with a greater amount of pauses. This confirms hypothesis H2. Additionally, minimal response markers are statistically significantly associated with a greater amount of pauses (p < 0.001).

As the percent-of-pause variable presented with a slight positive skewness, we re-ran the analysis with a square-root-transformed variable. The p-values remained highly significant.

3.4. Prediction of minimal response marked ruptures based on pauses (H3)

This section will use pauses in their respective speaker-switching patterns. S1 Table shows the percentages of pauses in these patterns in ruptures, non-ruptures, in withdrawal ruptures compared to confrontation ruptures and in minimal response marked ruptures compared to ruptures without this marker.

Training the random forest classifier to predict ruptures with minimal response markers, with n = 500 trees and 6 variables tried at each split, yielded an out-of-bag (OOB) error rate estimate of 17.38%. Fig 3 presents the importance of the employed features (time-lagged and lead pauses in the different speaker-switching patterns). The trained model was published on the Open Science Framework [64].

Fig 3. Pause-feature importance for the prediction of minimal response markers.

Fig 3 shows the variable importance measure ‘Mean decrease accuracy’ for the predictive model. It indicates the loss of the model’s accuracy in percent when the values of the variable in question are permuted. For interpretation, please consider the order in which the variables are ranked; the more important variables are listed at the top.

https://doi.org/10.1371/journal.pone.0280329.g003

The feature importance estimation (Fig 3) showed that pauses belonging to the T_T speaker-switching pattern and its lagged and lead versions have the highest importance for the RF model. Thus, pauses both preceded and followed by therapist speech contain the most important information to identify minimal response marked ruptures. Using the trained RF model to predict ruptures with minimal response in the validation sets yielded 1) an AUC of 0.91 for the validation set with only ruptures and 2) an AUC of 0.89 in the validation set enriched with all 10-second windows that were not within a rupture. Fig 4 shows the ROC for both these predictions. These high AUC values confirm our hypothesis H3, which states that minimal response marked ruptures can be detected based on pauses.

Fig 4. ROC for the prediction of minimal response markers in the validation data sets.

Fig 4 shows the ROC for the minimal response marked rupture prediction in the validation sets, constructed by plotting the true positive rate against the false positive rate. The red line presents the performance in the validation set which only included ruptures. The dashed blue line presents the performance in a validation set to which non-rupture data (by definition containing no minimal response markers) were added.

https://doi.org/10.1371/journal.pone.0280329.g004

3.5. Supplemental information regarding silence

Psychotherapy research has often used a cut-off of ≥ 3 seconds on pause data to address “silence” [23, 27]. The S2 and S3 Tables and S1–S3 Figs present the results as “silence” instead of “pauses”.
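
Operationally, this simply restricts the detected pauses to those lasting at least 3 seconds before re-running the same analyses; a minimal illustration (column names assumed):

```r
# Keep only pauses of at least 3 s ("silence"); column names are assumptions
silences <- subset(pauses, stop - start >= 3)
```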

4. Discussion

As hypothesised (H1), ruptures (confrontation and withdrawal taken together) are correlated with more speech pauses. Further, we hypothesised that withdrawal ruptures would be characterised by more pauses than confrontation ruptures (H2). Our results clearly confirmed this hypothesis in adolescent patients with BPD. This link between withdrawal ruptures and pauses exists independently of minimal response markers. Minimal response markers (which are conceptually part of withdrawal ruptures) have an even stronger association with pauses, increasing the percentage of pauses by more than 10 percentage points (see Tables 1 and 2).

In an explorative analysis, the time course of pauses before, during and after ruptures was investigated (section 3.2, Fig 2). We found that pauses increase during withdrawal ruptures and that this increase is observable even before the onset of the ruptures (see Fig 2). This is an interesting phenomenon with potential clinical applications (e.g., earlier use of repair strategies to prevent minimal response ruptures, or training therapists to avoid them). At the end of the withdrawal ruptures, the proportion of pauses returns to an average level.

Hypothesis H3 (section 3.4) of the current paper stated that it would be possible to detect ruptures with the minimal response marker based on automatically extracted pause data and its location in the different speaker-switching patterns (T_T, P_T, T_P, P_P, see section 2.4). We found that the prediction of minimal response marked ruptures performs at a level which makes this type of rupture localisation worth serious consideration for facilitating psychotherapy research and enabling psychotherapy quality management. The AUC of the ROC curve was 0.89. The variable importance of the machine learning model showed that pauses in the speaker-switching pattern T_T (pause preceded and followed by therapist speech) were most predictive of minimal responses. This likely indicates that the psychotherapists are the ones carrying the dialogue during these episodes (the patient skips his/her speaking turns).

As discussed, methods for alliance rupture identification are currently mostly conducted manually and are thus time- and resource-consuming and, debatably, lack reproducibility across laboratories [32, 65]. To overcome these limitations, the use of audio-derived markers has been successfully attempted before [32]; however, this attempt was limited to a case study. Here, we show that such detections are indeed feasible with a more general model across multiple dyads. However, it needs to be considered that the sample of patients in the current study was small (N = 22). A replication of the results in an independent sample is required, and other disorders, age groups and psychotherapy approaches should be investigated to confirm the generalisability of our results. Additionally, we only predicted a specific rupture marker (minimal response). Aiming at minimal response marked ruptures made the target of the prediction more homogeneous. Another major difference from the study by Dolev-Amit et al. [32] was the usage of only one audio feature (pauses) as predictor. However, pauses have been exploited more deeply in the current study by using speaker-switching patterns [43] and the time course of the feature (see Fig 2). In the previous study [32], withdrawal ruptures were, in addition to a higher pause proportion, associated with a higher F0 span, higher shimmer, and a lower articulation rate than neutral speech. Based on these results, it is possible that the performance of rupture detection can be improved and generalised to other rupture markers beyond what was shown in the current study. More emphasis on feature engineering will likely result in better performance regarding rupture localisation. The current study points towards the predictive importance of speaker-switching patterns as well as the time dynamics of the features. Additionally, video-based features such as facial emotion recognition [29] or motion energy analysis [30] have the potential to improve the predictive performance beyond what can be achieved with audio features alone. Finally, predictive features can be engineered to represent the interaction of the patient and the therapist (e.g., synchrony measures) [66–68], which might potentially enable the detection of confrontation and withdrawal ruptures or other significant episodes, e.g., moments of psychotherapeutic change [69].

Supporting information

S1 File. Formulas of the random mixed effect models (Model A and Model B).

https://doi.org/10.1371/journal.pone.0280329.s001

(PDF)

S1 Table. Description of pause percentage in speaker-switching patterns.

This table describes the percentage of pauses in the different speaker switching patterns for rupture and non-rupture events, withdrawal and confrontation ruptures, as well as for minimal response marked ruptures and ruptures without this marker. Mdn = Median; Q1 = 1st quartile; Q3 = 3rd quartile; M = arithmetic mean; SD = standard deviation.

https://doi.org/10.1371/journal.pone.0280329.s002

(DOCX)

S2 Table. 3 Seconds filter–percent of silence in rupture and non-rupture events.

This table shows the percent of silence in rupture and non-rupture events, confrontation and withdrawal ruptures, as well as minimal response marked ruptures and ruptures without a minimal response marker, when the 3 s filter for silence is applied. Mdn = Median; Q1 = 1st quartile; Q3 = 3rd quartile; M = arithmetic mean; SD = standard deviation.

https://doi.org/10.1371/journal.pone.0280329.s003

(DOCX)

S3 Table. 3 Seconds filter—description of silence percentage in speaker-switching patterns.

This table describes the percentage of silence in the different speaker switching patterns for rupture and non-rupture events, withdrawal and confrontation ruptures, as well as for minimal response marked ruptures and ruptures without this marker, when the 3 s filter for silence episodes is applied. Mdn = Median; Q1 = 1st quartile; Q3 = 3rd quartile; M = arithmetic mean; SD = standard deviation.

https://doi.org/10.1371/journal.pone.0280329.s004

(DOCX)

S1 Fig. 3 Seconds filter–silence in the time course of ruptures.

This figure shows the proportion of silence during ruptures when the 3 s filter is applied. Please consider the legend of Fig 2 for further information on the figure’s creation.

https://doi.org/10.1371/journal.pone.0280329.s005

(PDF)

S2 Fig. 3 Seconds filter–silence-feature importance for the prediction of minimal response markers.

This figure shows the variable importance measure ‘Mean decrease accuracy’ for the predictive model when the 3 s silence filter is applied. It indicates the loss of the model’s accuracy in percent when the values of the variable in question are permuted. For interpretation, please consider the order in which the variables are ranked; the more important variables are listed at the top.

https://doi.org/10.1371/journal.pone.0280329.s006

(PDF)

S3 Fig. 3 Seconds filter–ROC for the prediction of minimal response markers in the validation data sets.

This figure shows the ROC for the minimal response marked rupture prediction in the validation sets, constructed by plotting the true positive rate against the false positive rate, when the 3 s silence filter is applied. The red line presents the performance in the validation set which only included ruptures. The dashed blue line presents the performance in a validation set to which non-rupture data (by definition containing no minimal response markers) were added.

https://doi.org/10.1371/journal.pone.0280329.s007

(PDF)

Acknowledgments

We would like to thank the participating therapists and patients. Calculations were performed at sciCORE (http://scicore.unibas.ch/) scientific computing centre at University of Basel.

References

  1. Shirk S, Karver M, Brown Hangartner R. The Alliance in Child and Adolescent Psychotherapy. Psychotherapy (Chic). 2011;48(1):17–24. pmid:21401270
  2. Flückiger C, Del Re AC, Wampold BE, Horvath AO. The alliance in adult psychotherapy: A meta-analytic synthesis. Psychotherapy (Chic). 2018;55(4):316–40. pmid:29792475
  3. Horvath A, Re ACD, Flückiger C, Symonds D. Alliance in individual psychotherapy. 2011; Available from: https://semanticscholar.org/paper/f231abd213b5813109d24b0e938316073068c511
  4. Martin DJ, Garske J, Davis M. Relation of the therapeutic alliance with outcome and other variables: a meta-analytic review. J Consult Clin Psychol. 2000;68(3):438–50. pmid:10883561
  5. Soto A. A Meta-Analytic Review of the Association of Therapeutic Alliance, Therapist Empathy, Client Attachment Style, and Client Expectations with Client Outcome [PhD]. David O. McKay School of Education; 2017.
  6. Safran JD. Breaches in the therapeutic alliance: An arena for negotiating authentic relatedness. Psychotherapy: Theory, Research, Practice, Training. 1993;30(1):11–24.
  7. Eubanks C, Muran JC, Safran JD. Rupture Resolution Rating System (3RS): Manual. 2015 [cited 2021 Mar 8]; Available from: http://rgdoi.net/10.13140/2.1.1666.8488
  8. Safran JD, Crocker P, McMain S, Murray P. Therapeutic alliance rupture as a therapy event for empirical investigation. Psychol Psychother. 1990;27(2):154–65.
  9. Harper H. Coding Guide I: Identification of confrontation challenges in exploratory therapy. 1st ed. Sheffield (GB): University of Sheffield; 1989.
  10. Harper H. Coding Guide II: Identification of withdrawal challenges in exploratory therapy. 1st ed. Sheffield (GB): University of Sheffield; 1989.
  11. Horney K. Neurosis and human growth: the struggle toward self-realization. 1st ed. New York (USA): W. W. Norton & Co., Inc.; 1950.
  12. Eubanks C, Burckell LA, Goldfried MR. Clinical consensus strategies to repair ruptures in the therapeutic alliance. J Psychother Integr. 2018;28(1):60–76. pmid:29805243
  13. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders. 5th ed. Arlington (USA): American Psychiatric Association; 2013.
  14. Sansone RA, Sansone LA. Responses of mental health clinicians to patients with borderline personality disorder. Innov Clin Neurosci. 2013;10(5–6):39–43. pmid:23882440
  15. McMain SF, Boritz TZ, Leybman MJ. Common strategies for cultivating a positive therapy relationship in the treatment of borderline personality disorder. Journal of Psychotherapy Integration. 2015;25(1):20–9.
  16. Schenk N, Fürer L, Zimmermann R, Steppan M, Schmeck K. Alliance Ruptures and Resolutions in Personality Disorders. Curr Psychiatry Rep. 2020 Dec 11;23(1):1. pmid:33305340
  17. Eubanks C, Muran J, Safran J. Alliance rupture repair: A meta-analysis. Psychotherapy. 2018;55(4):508–19. pmid:30335462
  18. Chanen AM, Sharp C, Hoffman P, Global Alliance for Prevention and Early Intervention for Borderline Personality Disorder. Prevention and early intervention for borderline personality disorder: A novel public health priority. World Psychiatry. 2017 Jun 1;16(2):215–6.
  19. World Health Organization. International Statistical Classification of Diseases and Related Health Problems. 11th ed. 2019.
  20. Schmeck K, Weise S, Schlüter-Müller S, Birkhölzer M, Fürer L, Koenig J, et al. Effectiveness of Adolescent Identity Treatment (AIT) vs. DBT-A for the Treatment of Adolescent Borderline Personality Disorder. Pers Disord: Theory Res Treat. Forthcoming 2022 May 19. pmid:35587408
  21. Zimmermann R, Krause M, Weise S, Schenk N, Fürer L, Schrobildgen C, et al. A design for process-outcome psychotherapy research in adolescents with Borderline Personality Pathology. Contemp Clin Trials Commun. 2018;12:182–91. pmid:30511027
  22. Schenk N, Zimmermann R, Fürer L, Krause M, Weise S, Kaess M, et al. Trajectories of alliance ruptures in the psychotherapy of adolescents with borderline personality pathology: timing, typology and significance. Res Psychother [Internet]. 2019 [cited 2021 Mar 8];22(2). Available from: https://www.researchinpsychotherapy.org/index.php/rpsy/article/view/348 pmid:32913792
  23. Daniel SIF, Folke S, Lunn S, Gondan M, Poulsen S. Mind the gap: In-session silences are associated with client attachment insecurity, therapeutic alliance, and treatment outcome. Psychother Res. 2018;28(2):203–16. pmid:27148916
  24. Frankel Z, Levitt H, Murray D, Greenberg L, Angus L. Assessing silent processes in psychotherapy: An empirically derived categorization system and sampling strategy. Psychother Res. 2006;16(5):627–38.
  25. Soma CS, Baucom BRW, Xiao B, Butner JE, Hilpert P, Narayanan S, et al. Coregulation of therapist and client emotion during psychotherapy. Psychother Res. 2020;30(5):591–603. pmid:32400306
  26. Schmeck K. Process-Outcome Study on Psychotherapeutic Interventions in Adolescents With Personality Disorders [Internet]. clinicaltrials.gov; 2021 Apr [cited 2022 Sep 5]. Report No.: NCT02518906. Available from: https://clinicaltrials.gov/ct2/show/NCT02518906
  27. Zimmermann R, Fürer L, Schenk N, Koenig J, Roth V, Schlüter-Müller S, et al. Silence in the psychotherapy of adolescents with borderline personality pathology. Personal Disord. 2020;12(2):160–70. pmid:32324008
  28. Fürer L, Schenk N, Roth V, Steppan M, Schmeck K, Zimmermann R. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research. Front Psychol. 2020;11:1726. pmid:32849033
  29. Steppan M, Zimmermann R, Fürer L, Schenk N, Schmeck K. Machine Learning Facial Emotion Recognition in Psychotherapy Research. A useful approach? [Internet]. PsyArXiv; 2020 [cited 2022 Mar 14]. Available from: https://psyarxiv.com/wpa5e/
  30. Zimmermann R, Fürer L, Kleinbub JR, Ramseyer FT, Hütten R, Steppan M, et al. Movement Synchrony in the Psychotherapy of Adolescents With Borderline Personality Pathology–A Dyadic Trait Marker for Resilience? Front Psychol. 2021;12:660516. pmid:34276484
  31. Flemotomos N, Martinez VR, Chen Z, Singla K, Ardulov V, Peri R, et al. Automated evaluation of psychotherapy skills using speech and language technologies. Behav Res [Internet]. 2021 Aug 3 [cited 2022 Mar 3]; Available from: pmid:34346043
  32. Dolev-Amit T, Nof A, Asaad A, Tchizick A, Zilcha-Mano S. The melody of ruptures: identifying ruptures through acoustic markers. Couns Psychol Q. 2020 Dec;1–20.
  33. Stiles WB, Reynolds S, Hardy GE, Rees A, Barkham M, Shapiro DA. Evaluation and description of psychotherapy sessions by clients using the Session Evaluation Questionnaire and the Session Impacts Scale. Journal of Counseling Psychology. 1994;41(2):175–85.
  34. Session Evaluation Questionnaire [Internet]. [cited 2019 Apr 11]. Available from: http://www.users.miamioh.edu/stileswb/session_evaluation_questionnaire.htm
  35. Schmeck K, Pick OG, Milidou M, Schenk N, Schlüter-Müller S, Zimmermann R. Früherkennung von Persönlichkeitsstörungen. PTT: Persönlichkeitsstörungen Theorie und Therapie. 2018;22(3):179–85.
  36. Goth K, Foelsch P, Schlüter-Müller S, Birkhölzer M, Jung E, Pick O, et al. Assessment of identity development and identity diffusion in adolescence—Theoretical basis and psychometric properties of the self-report questionnaire AIDA. Child Adolesc Psychiatry Ment Health. 2012;6(1):27. pmid:22812911
  37. Lind M, Vanwoerden S, Penner F, Sharp C. Inpatient adolescents with borderline personality disorder features: Identity diffusion and narrative incoherence. Personality Disorders: Theory, Research, and Treatment. 2019;10(4):389–93. pmid:31033329
  38. Foelsch PA, Schlüter-Müller S, Odom AE, Arena HT, Borzutzky H. A, Schmeck K. Adolescent Identity Treatment: An Integrative Approach for Personality Pathology [Internet]. Basel: Springer International Publishing; 2014 [cited 2019 Jan 10]. Available from: http://link.springer.com/10.1007/978-3-319-06868-8
  39. Silence-Detection [Internet]. com-psy-lab; 2021. Available from: https://github.com/com-psy-lab/Silence-Detection
  40. Anguera X, Bozonnet S, Evans N, Fredouille C, Friedland G, Vinyals O. Speaker diarization: A review of recent research. IEEE. 2012;20(2):356–70.
  41. Audacity Team. Audacity®. Free, open source, cross-platform audio software for multi-track recording and editing. [Internet]. 2018 [cited 2018 Jul 24]. Available from: https://www.audacityteam.org/
  42. Matlab Audio Analysis Library—File Exchange, MATLAB Central [Internet]. [cited 2018 Jul 24]. Available from: https://www.mathworks.com/matlabcentral/fileexchange/45831
  43. Fürer L, Zimmermann R. Supervised-Dyadic-Speaker-Diarization [Internet]. 2020. Available from: https://github.com/lukasfuerer/Supervised-Dyadic-Speaker-Diarization
  44. Eubanks CF, Lubitz J, Muran JC, Safran JD. Rupture Resolution Rating System (3RS): Development and validation. Psychotherapy Research. 2019 May 4;29(3):306–19. pmid:30526383
  45. R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna (AU): R Foundation for Statistical Computing; 2020. Available from: https://www.R-project.org/
  46. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. Journal of Statistical Software. 2015;67(1):1–48.
  47. Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest Package: Tests in Linear Mixed Effects Models. Journal of Statistical Software. 2017;82(13):1–26.
  48. Lüdecke D. sjPlot: Data Visualization for Statistics in Social Science [Internet]. 2021. Available from: https://CRAN.R-project.org/package=sjPlot
  49. Wickham H, Averik M, Bryan J, Chang W, D’Agostino McGowan L, François R, et al. Welcome to the tidyverse. Journal of Open Source Software. 2019;4(43):1686.
  50. Wickham H. ggplot2: Elegant Graphics for Data Analysis [Internet]. Springer-Verlag New York; 2016. Available from: https://ggplot2.tidyverse.org
  51. Neuwirth E. RColorBrewer: ColorBrewer Palettes. 2015.
  52. Dowle M, Srinivasan A. data.table: Extension of `data.frame` [Internet]. 2018. Available from: https://CRAN.R-project.org/package=data.table
  53. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3):18–22.
  54. Sing T, Sander O, Beerenwinkel N, Lengauer T. ROCR: visualizing classifier performance in R. Bioinformatics. 2005;21(20):7881. pmid:16096348
  55. Breiman L. Random forests. Machine Learning. 2001;45(1):5–32.
  56. Kotsiantis SB. Decision trees: a recent overview. Artif Intell Rev. 2013;39(4):261–83.
  57. Chollet F. Deep Learning with Python. 2nd ed. New York (USA): Manning Publications Co.; 2018.
  58. De Oliveira S. A very basic introduction to Random Forests using R | Oxford Protein Informatics Group [Internet]. Oxford Protein Informatics Group. 2017 [cited 2022 Jan 22]. Available from: https://www.blopig.com/blog/2017/04/a-very-basic-introduction-to-random-forests-using-r/
  59. Time series forecasting with random forest [Internet]. STATWORX. 2019 [cited 2022 Jan 22]. Available from: https://www.statworx.com/at/blog/time-series-forecasting-with-random-forest/
  60. Fawcett T. ROC Graphs: Notes and Practical Considerations for Researchers. ReCALL. 2004 Jan 1;31(8):1–38.
  61. Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–5. pmid:15405679
  62. Hanley JA, McNeil B. The Meaning and Use of the Area Under a Receiver Operating Characteristic (ROC) Curve. Radiology. 1982;143:29–36. pmid:7063747
  63. Melo F. Area under the ROC Curve. In: Dubitzky W, Wolkenhauer O, Cho KH, Yokota H, editors. Encyclopedia of Systems Biology [Internet]. New York (USA): Springer; 2013 [cited 2022 Jan 4]. p. 38–9. Available from: https://doi.org/10.1007/978-1-4419-9863-7_209
  64. Ronan. Full title: Withdrawal ruptures in adolescents with borderline personality disorder psychotherapy are marked by increased speech pauses–can minimal responses be automatically detected? Short title: Pauses and ruptures in adolescent borderline personality disorder psychotherapy. 2022 Sep 12 [cited 2022 Sep 12]; Available from: https://osf.io/2v9x6/
  65. Safran JD, Muran JC. Resolving therapeutic alliance ruptures: Diversity and integration. J Clin Psychol. 2000;56(2):233–43. pmid:10718606
  66. Kleinbub JR. State of the Art of Interpersonal Physiology in Psychotherapy: A Systematic Review. Front Psychol. 2017;8.
  67. Palumbo RV, Marraccini ME, Weyandt LL, Wilder-Smith O, McGee HA, Liu S, et al. Interpersonal Autonomic Physiology: A Systematic Review of the Literature. Pers Soc Psychol Rev. 2017 May 1;21(2):99–141. pmid:26921410
  68. Koole SL, Tschacher W. Synchrony in Psychotherapy: A Review and an Integrative Framework for the Therapeutic Alliance. Front Psychol [Internet]. 2016 Jun 14 [cited 2016 Nov 28];7. Available from: http://journal.frontiersin.org/Article/10.3389/fpsyg.2016.00862/abstract pmid:27378968
  69. Krause M, Altimir C, Pérez JC, De la Parra G. Generic change indicators in therapeutic processes with different outcomes. Psychother Res. 2015;25(5):1–13. pmid:25017369