Figures
Abstract
Perception-action coordination (also known as sensorimotor synchronization, SMS) is often studied by analyzing motor coordination with auditory rhythms. The current study assesses phasing—a compositional technique in which two people tap the same rhythm at varying phases by adjusting tempi—to explore how SMS is impacted by individual and situational factors. After practice trials, participants engaged in the experimental phasing task with a metronome at tempi ranging from 80–140 beats per minute (bpm). Multidimensional recurrence quantification analysis (MdRQA) was used to compare nonlinear dynamics of phasing performance. Varying coupling patterns emerged and were significantly predicted by tempo and linguistic experience. Participants who successfully phased replicated findings from an original case study, demonstrating stable tapping patterns near in-phase and antiphase, while those unsuccessful at phasing showed weaker attraction to in-phase and antiphase.
Citation: Hall C, Kim JC, Paxton A (2023) Multidimensional recurrence quantification analysis of human-metronome phasing. PLoS ONE 18(2): e0279987. https://doi.org/10.1371/journal.pone.0279987
Editor: Sebastian Wallot, Leuphana University of Lüneburg: Leuphana Universitat Luneburg, GERMANY
Received: August 18, 2021; Accepted: December 19, 2022; Published: February 23, 2023
Copyright: © 2023 Hall et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All raw data, code, protocols, and other methods materials are available from the Open Science Framework database (https://osf.io/4uhtb/?view_only=f8b5a974b1b54bff95abbc41a8342f5e).
Funding: This work was financially supported by the Peter and Carmen Lucia Buck (PCLB) Foundation Undergraduate Research Grant from the Connecticut Institute for Cognitive and Behavioral Sciences, which was awarded to C. Hall. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Perception-action coordination (also called sensorimotor synchronization, SMS) occurs as people coordinate overt movements with a rhythmic stimulus [1–3]. This is typically studied using tasks in which participants are instructed to coordinate with auditory rhythms, which requires explicit intention to synchronize. We aim to explore coordination when participants are instructed to execute controlled desynchronization with a rhythmic stimulus. This task, called phasing, is inspired by a musical technique investigated in a case study of professional percussionists [4]. Our controlled examination of phasing provides a conceptual replication and extension of the case study.
The simplest synchronization task is coordinating rhythmic finger movements with a metronome. Within such contexts, phase dynamics are largely involuntary, while tempo change is under voluntary control [5–7]. People display two stable phase relationships: in-phase and anti-phase [5,8]. Individuals also display substantial tempo flexibility, coordinating from 17 bpm to 150–170 bpm for anti-phase tapping [8–10] and 300–400 bpm for in-phase tapping [10–13]. Prior research suggests it becomes difficult to coordinate with a metronome beyond these rate limits.
Rhythmic coordination is also impacted by individual factors, such as musical and linguistic experience. Previous findings suggest musicians are better able to synchronize with auditory stimuli than nonmusicians [14,15]. Additionally, an association between increased inhibitory control and SMS abilities in bilinguals [16–18] indicates multilingualism may improve rhythmic coordination. Taken together, perception-action coupling is shaped by individual differences and task demands.
Phasing
Phasing involves musicians shifting in and out of synchrony with one another while performing the same rhythm at different tempi. For example, in the musical composition “Drumming” [19], two percussionists begin by drumming synchronously. One drummer then attempts to maintain the original tempo while the other accelerates, gradually desynchronizing. Eventually, after tapping the same rhythm at different tempi, the partners approach synchrony again and return to unison by tapping at identical tempi one cycle apart.
During “Drumming,” the professional percussionists reported feeling pulled toward synchronization despite intentions to play at independent tempi [20]. To understand this, Schutz [4] conducted a case study of the percussionists performing “Drumming” (Fig 1). In theory, the accelerating and steady drummers should perform independently. In reality, however, they exhibit a push-and-pull pattern, unable to overcome the tendency toward interpersonal coupling. Inspired by Schutz’s case study, the current work investigates the evolution of coordinative behaviors during phasing with a metronome among non-expert populations using recurrence quantification analysis (or auto-recurrence quantification analysis; RQA).
Left: How phasing would occur, in theory, if partners were uninfluenced by each other’s performance. The static part (blue) remains constant, while the moving part (green) steadily shifts its metrical position. Right: Actual performance of “Drumming” by Becker and Hartenberger. The “static” part actually varies along with the moving part during phasing. In other words, the drummer who intended to maintain the original tempo was unable to do so; instead, this drummer unintentionally increased and decreased their tempo along with the phasing drummer. (Figure reproduced with permission from Schutz [4]).
Quantifying phasing with a novel application of RQA
RQA is a nonlinear time series analysis [21] that has been applied to coordination research [22,23] to quantify a single dynamical system over time [24]. Conceptually, RQA uncovers the structure of a system that changes over time through examining the patterns of repetition of a single variable of interest over time. RQA uses only one observable to describe the dynamics of multiple interdependent variables of a system, as the interacting components of a dynamical system can be uncovered from a single variable [24,25]. The idea is to reconstruct the phase space of the dynamical system, recover the trajectory of the system through that phase space, and then plot the system’s trajectory against itself to identify patterns of recurrent (or repeating) discrete states or regions of the state space. This creates a recurrence plot (RP), from which various metrics may be derived to quantify the dynamics of the system as a whole [25].
RQA has also been adapted to analyze systems with more than one measured variable and to examine the dynamics of coupled systems, including cross-recurrence quantification analysis (CRQA) and multidimensional recurrence quantification analysis (MdRQA). CRQA quantifies the coevolution of two distinct but interacting systems; in other words, CRQA captures the shared trajectories of two separate univariate systems. On the other hand, MdRQA analyzes a single system captured by two or more measured variables; that is, instead of the variables belonging to distinct time series, the variables are different dimensions of the same time series. Thus, MdRQA quantifies the auto-recurrence properties of a single multidimensional or multivariate system [25]. Readers interested in more detailed conceptual and mathematical explanations of RQA and its extensions are encouraged to consult Carello and Moreno [24] and Wallot et al. [25].
We quantify phasing by analyzing relative phase (ψ), which measures the angle between the two phasing signals in degrees. However, to date, the extensions of RQA cannot adequately handle such circular data. Because RQA-based methods operationalize similarity through the revisiting of a similar state within a given radius, RQA detects an apparent discontinuity between 359° and 0°; it calculates an angular difference of 359° even though going from 359° to 0° is a difference of only 1° in terms of ψ. Our conceptual framework necessitates that the shift from 359° to 0° be interpreted as the same ψ change as that when moving from 0° to 1°.
For the current work, we therefore created a novel circular extension of MdRQA by decomposing the relative phase signal into its x- and y-coordinates and using MdRQA to analyze them together as a multidimensional signal of the same system. Because relative phase is inherently a relational measure—in our case, a measure that necessarily accounts for the positions and relations of two phasing signals—this means that the x-- and y-coordinate values must be considered coequal and non-separable parts of a single multidimensional system. As a result, CRQA would be unsuitable for the current study, as it would treat the x- and y-coordinates as two separate but interacting systems. MdRQA, on the other hand, treats both coordinates as part of the same system, allowing us to accurately capture the system revisitations (or approximate revisitations) through specific x,y regions of the relative phase. Importantly, the choice to create and implement this novel extension of MdRQA to analyze these data came after the data were collected, as it provided for a more complete accounting of the nonlinear dynamics of the system. More information on our motives and implementation are available in the “Data Analysis” section.
In the present study, we use MdRQA to characterize multiple dimensions of a singular relative phase variable. We also use region-based MdRQA, which is the same as general MdRQA mathematically and methodologically except that it parses the RP into subsections and analyzes each subsection independently. Thus, region-based MdRQA allows us to quantify different relative phase relations separately. Like RQA, both general and region-based MdRQA yield metrics that quantify the structure, patterns, and stability of a system’s evolution. Here, we target percent recurrence (%REC), percent determinism (%DET), and maxline (MAXL). %REC is the density of recurrent states of the system across time and is proportional to the inverse of the system’s noise. %DET captures recurrent patterns of states across time, quantifying the system’s predictability. MAXL is the length of the longest run of consecutive recurrent states, corresponding to the system’s attractor strength [25–27].
Present study
Inspired by Schutz [4], the present study evaluates phasing among non-expert participants with more varied musical and linguistic backgrounds. To maximize feasibility, we simplify Schutz’s task in two ways: Participants phase with a metronome instead of a partner, and the metronome produces an isochronous rhythm. Furthermore, we narrow the tempo range known to permit successful antiphase tapping to 80–140 bpm. We measure phasing by calculating ψ between participants’ taps and metronome ticks [28–29]. Using MdRQA to analyze ψ, we quantify the human-metronome system’s repetitiveness (%REC), predictability (%DET), and attractor strength (MAXL). Consistent with Schutz [4] and coordination dynamics literature [5,8], we hypothesize that participants will demonstrate stable tapping near in-phase and antiphase, indicated by higher metrics during those periods.
Successfully phasing requires attending to the metronome while voluntarily adopting a different tempo than the metronome [7]. Ignoring the metronome would result in not knowing when to stop phasing, while an inability to voluntarily adopt a different tempo would cause failure to escape the in-phase attractor—a ψ near 0 throughout the task. Based on evidence of bilingual speakers’ increased selective attention [30–32], we hypothesize that multilingual participants may be able to simultaneously attend to the metronome while adopting a different tempo. Monolingual participants may experience stronger coupling with the metronome. This would result in greater metrics for monolingual speakers. Furthermore, considering tapping rate limits, we expect the middle range of our selected tempi (100–120 bpm) to yield the most structured phasing performance for all participants, resulting in higher metrics. This prediction aligns with the preferred tempo range [33,34], which is the zone at which tempo perception is optimal—not so fast that individual pulses appear to fuse together and not too slow that individual pulses sound isolated.
Hypotheses
- H1. Middle range tempi (100–120 bpm) will yield the most structured phasing for all participants. Operationalization: This will lead to higher %REC, %DET, and MAXL for trials between 100–120 bpm than trials above or below that range.
- H2. Participants will demonstrate more stable tapping near antiphase when compared to other nonsynchronous relative phases, due to the increased general stability of antiphase relations when tapping. Operationalization: This will be reflected by higher %REC, %DET, and MAXL during antiphase.
- H3. Monolingual participants will demonstrate stronger coupling with the metronome than multilingual speakers, due to multilinguals’ improved selective attention abilities compared with monolinguals. Operationalization: This would result in higher %REC, %DET, and MAXL for monolingual speakers overall.
As noted above, we chose to use MdRQA for the analysis after designing the study and collecting the data. This necessarily means that the operationalization of each hypothesis in terms of MdRQA metrics came later, but these operationalizations nevertheless reflected the a priori hypotheses that the study was designed to test.
Materials and methods
Participants
Twenty-five undergraduates (17 females, 8 males; 18–21 years, M = 18.7 years) at the University of Connecticut were recruited via the Psychological Sciences Participant Pool and received course credit for completing the experiment. All subjects reported normal hearing abilities and no neurological health complications. Thirteen (9 females, 4 males) reported being monolingual, and 12 (8 females, 4 males) reported being multilingual. Six (5 females, 1 male) reported experience playing one or more instruments for at least one year, while 19 (12 females, 7 males) reported no experience playing an instrument.
Sample size.
As is customary in the field, results with p < 0.05 are considered significant. Experimental studies typically require 15–30 participants for adequate statistical power, which led us to recruit 25 non-expert participants for an exploratory study. We did not recruit based on a planned group comparison.
Procedure
The procedure lasted approximately one hour. First, participants completed a demographics survey. This information was collected for IRB purposes. We did not have a priori plans to analyze anything from the survey except for musical and linguistic experience. Next, participants were introduced to the task with audio and video demonstrations (created using custom MATLAB code [35]). Tapping data were collected with a Roland HandSonic HPD-20 Digital Hand Percussion Controller [36]. After two practice sessions (see S1 Appendix for description), participants advanced to the experiment. One experimental trial consisted of the following human-metronome phasing method: The metronome’s tempo was programmed to tick at one of the seven tempi ranging from 80–140 bpm in 10-bpm increments. The metronome maintained its original tempo throughout the duration of each trial, which lasted a maximum of two minutes.
Participants were instructed to begin by tapping in synchrony with the metronome for several beats. This allowed participants to adjust to the tempo of the current trial. At the sound of a warning signal (i.e., a short, high-pitched beep), participants began phasing (see section “Phasing”) with the metronome. Here, the phasing process entailed participants increasing their tapping rate slightly while the metronome continued ticking at its original tempo. Participants were instructed to maintain their new tempo until they resynchronized with the metronome. Because the participant and metronome played at different (ideally, constant) tempi, resynchronization would happen naturally as the participant “lapped” the metronome. This can be thought of as two people running around a track at different speeds: The faster runner will gradually shift farther ahead of the slower runner until, eventually, the faster runner is one whole lap ahead of the slower runner. At that moment, both runners would be instantaneously resynchronized.
When participants resynchronized with the metronome, they were instructed to revert to the original tempo of the metronome and tap in synchrony for several beats before stopping. The period between participants’ initial desynchronization with the metronome and their resynchronization after increasing their tempo was considered one round of phasing (also referred to as one phasing lap). Participants were informed that the goal was to complete exactly one round of phasing per trial. After the drum pad detected several seconds of no tapping input, the experiment automatically continued to the next trial. Each of the seven tempi was presented three times per participant in randomized order, totaling 21 trials per participant; this allowed for a more robust sampling of the per-tempi variability than presenting it only once. The monitor displayed the total number of trials remaining.
Data inclusion criteria
Overall, our dataset included a total of 524 trials: 25 participants each completed 21 trials, three at each of the seven tempi, minus one trial that terminated early because the participant did not tap a single time. This trial was removed from our analyses.
Due to the challenging nature of the task, only 38% of trials successfully completed one round of phasing (“Successful Trials”). A further 41% completed more than one round, lapping the metronome multiple times (“Unsuccessful Trials”). The remaining 21% did not complete any rounds, resulting in zero laps around the metronome (“Incomplete Trials”). Incomplete Trials still contained tapping data and thus were able to be analyzed using MdRQA; participants simply never resynchronized with the metronome during Incomplete Trials and thus failed to execute the phasing task as instructed. Outliers were included in analyses.
All trial types were analyzed with general MdRQA, and only Successful and Unsuccessful Trials were additionally analyzed with region-based MdRQA. Analyses of Unsuccessful and Incomplete Trials are exploratory because—given the simplified task and earlier piloting—we did not anticipate such a high level of noncompliance. Music experience was excluded as a predictor because we did not achieve sufficient variability in our sample. However, music experience data is provided in participants’ data files and in S2 Appendix.
Data analysis
Open materials, data, and code.
In keeping with the principles of open science, we have made our materials, data, and code for the current project fully available. Study protocols, raw tapping data (stored as MATLAB files), MdRQA analysis code (in MATLAB) [25], and descriptive and inferential statistics (in R) are available at our OSF project page: https://osf.io/4uhtb/?view_only=f8b5a974b1b54bff95abbc41a8342f5e. We used a variety of R libraries for data manipulation, visualization, and analysis, with specific implementation of each included in the open project code: ggplot2, viridis, emmeans, dplyr, tidyverse, rstatix, qwraps2, lme4, GGally, lmerTest, pander, sjPlot, and ggpubr [37–50].
Relative phase (ψ) calculation.
Relative phase (ψ) measures the difference between the time at which a participant taps and the nearest metronome tick, yielding the phasing relationship between participant and metronome. This was calculated as follows:
dt = participant’s tap time–nearest metronome tick time
T = metronome’s interbeat interval where dt is the difference between the participant’s tap time and the nearest metronome tick time and T is the metronome’s interbeat interval. The negative sign yields positive ψ when the participant taps ahead of the metronome and negative values when the participant lags behind. Thus, ψ increases when phasing is performed by tapping ahead of metronome ticks.
General MdRQA.
While ψ was the only independent variable measured in the human-metronome system, these circular data were not well suited for RQA because of the apparent discontinuity in ψ when shifting from 359° to 0°; while ψ changes by only 1° during this transition, RQA calculates an angular difference of 359°, which is conceptually incorrect for our purposes. We resolved this by creating a novel circular application of RQA thanks to MdRQA.
For the circular application of MdRQA, we convert ψ into a 2-dimensional variable by wrapping ψ values to range from zero to 2Ψ; we then decompose each tap into its x- and y-coordinates by taking the cosine and sine, respectively. This converts the one-dimensional circular operationalization of relative phase into a two-dimensional representation of the same data—the x- and y-coordinates of circular relative phase as multivariate measurements of a single human-metronome system. These data are therefore more consistent with the principles of MdRQA (which captures the dynamics of a single multidimensional system) rather than those of CRQA (which captures the shared trajectories of two univariate systems).
While categorical CRQA would have been appropriate for comparing ticks of the metronome with taps of the participant (i.e., two univariate discrete time series), the decomposition of relative phase into two component parts creates two signals from the same system. However, simply conducting categorical CRQA on the metronome ticks and participant taps would not uncover the richness of the relative phase data because of the rigidity of categorical RQA techniques and the relatively short time series.
To capture the dynamics of the continuous relative phase variable, continuous MdRQA was conducted on the two-dimensional vector comprising the x- and y-coordinates of each circular relative phase value. From there, conducting MdRQA generates a recurrence plot (RP) that describes repetitions of the system’s values within a given tolerance across the phase space (Fig 2A). Here, a point on the recurrence plot indicates that a given ψ is repeated within a given radius in the time series. Recurrence metrics are then calculated from the RP. For phasing, higher %REC indicates repetition of the same ψ throughout the trial. Higher %DET means the ψ trajectory is more predictable, and higher MAXL corresponds to stronger attraction to a particular ψ.
(A) How subsections were created from the MdRP to conduct region-based analysis. Each sample in the time series contains a vector of x- and y-coordinates of ψ. Recurrences or revisitations of the same multidimensional space (within a given tolerance) between the two time series (i.e., the same time series represented on the x-axis and y-axis) are plotted as points on the MdRP. (B) How participants’ taps were assigned to regions 1, 2, and 3 based on circular ψ values. As described in “Region-Based MdRQA,” synchronous taps were excluded from region-based MdRQA to effectively compare Successful and Unsuccessful Trials”.
MdRQA was performed in MATLAB [25,35]. As specified in the MATLAB files within our linked OSF page, there are five parameters that must be specified when running MdRQA: number of embedding dimensions (EMB), delay (DEL), type of norm by which the phase space is normalized (NORM), radius size (RAD), and whether the data should be z-scored (ZSCORE). We used only a single embedding dimension (EMB = 1) because we did not need to reconstruct the phase space for the current data; both dimensions of the relative phase data (i.e., the x- and y-coordinates) are represented within the dataset. As a result, we used the default delay value for unembedded data (DEL = 1). We did not normalize the phase space (NORM = ‘non’) or the data (ZSCORE = 0) because both time series exist naturally within the same scale. A small radius (RAD = 1) was chosen because the data is not highly deterministic and thus requires a radius not too close to zero to capture recurrence.
Region-based MdRQA.
Using region-based MdRQA allowed us to compare %REC, %DET, and MAXL for the human-metronome system at different relative phase regions. Specifically, we were able to assess differences in recurrence, predictability, and attractor strength during nonsynchronous regions between the initial desynchronization and final resynchronization with the metronome. In other words, we focused on the system’s trajectory after it moved from synchrony, passed through antiphase, and approached synchrony again, rather than focusing on in-phase dynamics. Region-based MdRQA was conducted on both Successful and Unsuccessful Trials. We parsed taps into three non-overlapping regions based on circular ψ values: 45° ≤ ψ < 135° were region 1, 135° ≤ ψ < 225° were region 2, and 225° ≤ ψ < 315° were region 3 (Fig 2B). MdRQA metrics were calculated for each region separately using the same parameter values that were used for general MdRQA. Synchronous taps were excluded from region-based MdRQA to effectively compare Successful and Unsuccessful Trials: Although Unsuccessful Trials passed through multiple synchronous points while phasing, there were insufficient points within the synchronous region to calculate metrics.
Tempo.
Because our hypotheses included predicted nonlinearities in performance across tempi, we binned tempo to create a categorical variable with three levels: lower- (80 and 90 bpm), middle- (100, 110, and 120 bpm), and upper- (130 and 140 bpm) range tempi.
Model specifications.
Using the lme4 library in R [37], we created two classes of linear mixed-effects models: one to assess general MdRQA and another for region-based MdRQA. We created a separate equation for each of the three dependent variables (i.e., %REC, %DET, and MAXL) for general and region-based analyses, totaling six models. For general MdRQA, we used Incomplete Trials, monolingual language experience, and middle-range tempi as reference categories; for region-based MdRQA analyses, we used Successful Trials, monolingual language experience, middle-range tempi, and region 1 as the reference categories. We used deviation coding for all categorical variables (see S3 Appendix) [51]. For all models, participant identity was included as a random effect to control for multiple trials per participant.
Results and discussion
As described in the “Model Specifications” section (above), we analyzed data with two classes of linear mixed effects models for each MdRQA metric. Statistically significant and nonsignificant results are presented in tables; only statistically significant results (p < 0.05) are discussed in the text. For readability and clarity within the text, all statistics—including effect sizes—are presented only in the tables. We present results of our a priori hypotheses before turning to our exploratory analyses and considering future directions.
Hypothesis 1
In H1, we predicted that tempi from 100 to 120 bpm would yield the most structured tapping data. Because H1 does not consider phasing regions (e.g., synchrony, antiphase), we use general MdRQA results to assess H1. Summary statistics for general MdRQA are presented in Table 1. Results of the statistical analyses for H1 are available in Table 2.
Each MdRQA metric was predicted using a separate model. Marginal and conditional R2 for each model included below each model label. Effect sizes provided as standardized estimates (ß) for statistically significant predictors. As noted in the Model Specifications section, the use of deviation coding yields k-1 variables that test the difference between a given level of the categorical variable and the reference level. General MdRQA models used the incomplete trials, monolingual language experience, and middle tempi as reference categories; for more on mathematical interpretation of deviation-coded interaction terms, see Barr et al. (2013) [51].
Overall, our general MdRQA findings failed to support H1. Neither %REC (Fig 3A) nor %DET (Fig 3B) were significantly greater for middle tempi than for lower and upper tempi. MAXL (Fig 3C) did not significantly differ for middle versus lower tempi, but MAXL was significantly greater for upper tempi than for middle tempi. The attractor strength of the human-metronome system grew as tempo increased from 100–140 bpm, making it more difficult for participants to decouple from the metronome and successfully achieve phasing.
The metronome ranged from 80–140 bpm. In our linear mixed effects model, we binned tempo into the following categories: lower (80–90 bpm), middle (100–120 bpm), and upper (130–140 bpm). The violin plots depict the probability density of the binned tempo data at different %REC, %DET, and MAXL values. The mean value is indicated by a diamond near the center of each violin. Results of statistical analyses are located in Table 2.
Our findings might imply that our tempo selection was too narrow. Based on previous research, participants tapping at a comfortable tempo should experience a stronger pull toward attractors, resulting in more structured, repetitive tapping as reflected in greater MdRQA metrics. Our finding that participants produced the most structured data (i.e., MAXL) at upper range tempi suggested that this may have been a more comfortable tempo for phasing than lower and middle range tempi. According to previous literature, the average natural preferred tempo for people falls around 120 bpm [33,34]. To account for the difficulties of phasing (versus tapping in-phase with a metronome), we chose to center our tempo range at 110 bpm to make the task more manageable for participants. Our results, however, suggest that future phasing studies should test faster tempi.
Hypothesis 2
H2 predicted that, overall, participants would demonstrate more stable tapping during antiphase (i.e., region 2) than during other relative phases (i.e., regions 1 and 3). This would have been supported by higher %REC, %DET, and MAXL during region 2. Because H2 pertains to phasing regions, we use region-based MdRQA results to assess our prediction. Summary statistics for region-based MdRQA are presented in Table 3. Results of the statistical analyses for H2 are available in Table 4. The significant findings related to H2 include the main effect of region on %REC, %DET, and MAXL (Fig 4); interactions between trial type and region on both %DET (Fig 4B) and MAXL (Fig 4C); and interactions among language experience, tempo, and region on both %DET (Fig 5B) and MAXL (Fig 5C).
The metronome ranged from 80–140 bpm. In our linear mixed effects model, we binned tempo into the following categories: lower (80–90 bpm), middle (100–120 bpm), and upper (130–140 bpm). The violin plots depict the probability density of the binned tempo data at different %REC, %DET, and MAXL values. The mean value is indicated by a diamond near the center of each violin. Results of statistical analyses are located in Table 4.
The metronome ranged from 80–140 bpm. In our linear mixed effects model, we binned tempo into the following categories: lower (80–90 bpm), middle (100–120 bpm), and upper (130–140 bpm). The violin plots depict the probability density of the binned tempo data at different %REC, %DET, and MAXL values. The mean value is indicated by a diamond near the center of each violin. Results of statistical analyses are located in Table 4.
Each MdRQA metric was predicted using a separate model. Marginal and conditional R2 for each model included below each model label. Effect sizes provided as standardized estimates (ß) for statistically significant predictors. As noted in the Model Specifications section, the use of deviation coding yields k-1 variables that test the difference between a given level of the categorical variable and the reference level. Region-based MdRQA models used the successful trials, monolingual language experience, middle tempi, and region 1 as reference categories; for more on mathematical interpretation of deviation-coded interaction terms, see Barr et al. (2013) [51].
The main effect of region on %REC revealed that region 2 had significantly lower %REC than region 1, suggesting that antiphase was noisier than desynchronization and failing to support H2. Instead, region 1 had significantly greater %DET and MAXL values than region 3. In other words, the relative phase region just before returning to synchrony was the least predictable and exhibited the weakest coupling between human and metronome.
The interaction between trial type and region on %DET supported H2: Both Successful and Unsuccessful Trials yielded the highest %DET during region 2 and smallest during region 3. This meant that the human-metronome system exhibited the most predictable tapping pattern during region 2, as anticipated. Successful Trials were generally more predictable than Unsuccessful Trials.
The interaction between trial type and region on MAXL revealed findings similar to those for %DET. MAXL also peaked during region 2 for Unsuccessful Trials, which supports H2. However, MAXL peaked during region 1 for Successful Trials, opposing H2. Both trial types produced the smallest MAXL values during region 3. Together, these results suggested that those who could not successfully phase experienced the greatest attractor strength during antiphase, while those who successfully phased experienced the greatest attractor strength during their initial desynchronization from the metronome.
Interestingly, regardless of level of success with the phasing task, participants experienced the weakest coupling with the metronome when moving from antiphase to in-phase synchrony. To the contrary, human-metronome coupling was relatively stronger when shifting from synchrony to antiphase. This was demonstrated by the relative stability of taps (i.e., increased number of consecutive recurrent states) in region 1 compared to region 3, indicating differential pulls of in- and anti-phase attractors. Since in-phase synchrony is known to be a stronger attractor than antiphase, it may be more difficult to escape from synchrony toward antiphase; thus, participants passed more slowly through region 1 than through region 3. Participants resynchronized more quickly, possibly because they were moving from the weaker attractor of antiphase toward the stronger attractor of in-phase synchrony.
The significant interactions among language experience, tempo, and region on both %DET (Fig 5B) and MAXL (Fig 5C) partially supported H2. We expected participants to reach peak %DET and MAXL values during region 2 due to increased stability near antiphase. This held true across tempi for monolingual speakers, but multilinguals exhibited more variability across tempi and regions. This result suggested that tapping predictability and attractor strength varied with task parameters. The relationship between selective attention and pull toward the metronome at various relative phases is expanded upon during our discussion of H3.
Hypothesis 3
In H3, we predicted that monolingual speakers would experience stronger coupling with the metronome and therefore produce greater %REC, %DET, and MAXL than multilingual speakers would. In other words, because multilingual speakers have been shown to have greater inhibitory control compared with monolinguals, we hypothesized that multilingual participants would be better able to intentionally decouple from the metronome in order to successfully phase, as compared to monolingual participants’ anticipated difficulty overcoming the pull toward in- and antiphase tapping. We use general MdRQA to assess the effects of language experience irrespective of phasing region, and then we use region-based MdRQA to compare how monolinguals and multilinguals differed during specific regions (i.e., regions 1–3). The significant outcomes related to hypothesis 3 include an interaction effect between language experience and tempo on %REC for general MdRQA (Fig 3A), as well as interaction effects among language experience, tempo, and region on both %DET (Fig 5B) and MAXL (Fig 5C) for region-based MdRQA. Results of the statistical analyses for H3 are available in Tables 2 and 4.
As measured by %REC, tapping data became less noisy as tempo increased. Multilinguals demonstrated a sharper and greater increase than monolinguals. These findings failed to support H3, perhaps indicating that %REC during intentional decoupling is not tied to inhibitory control—a connection that had been the foundation for H3.
The significant interactions among language experience, tempo, and region on %DET and MAXL showed a pattern of results similar to those of %REC. Both %DET and MAXL increased with tempo across regions 1, 2, and 3 for multilingual speakers. Thus, the predictability (indexed by %DET) and attractor strength (indexed by MAXL) of multilinguals’ taps grew as tempo increased during all regions. Again, in contrast with H3, these findings suggested that multilingual speakers had more difficulty desynchronizing and resynchronizing with the metronome at faster tempi.
Monolingual speakers exhibited a more complex pattern: %DET and MAXL increased with tempo during region 1, remained stable across tempi during region 2, and decreased with tempo during region 3. This meant that—for monolinguals—the predictability and attractor strength of the human-metronome system were greater at faster tempi when desynchronizing and greater at slower tempi when resynchronizing. Monolingual participants had more difficulty decoupling from synchrony at faster tempi and more difficulty returning to synchrony at slower tempi.
Overall, these interactions for %DET and MAXL neither fully supported nor contradicted our predictions regarding language experience, providing a complex picture of the impacts of intentional decoupling across task complexity. Task parameters affected monolingual and multilingual participants differently. For example, multilinguals and monolinguals had opposite relationships with tempo during region 3. Suggestions for how to disentangle these findings are provided in Limitations and Future Directions.
Post hoc analyses
We did not hypothesize about how success in the phasing task would be reflected by MdRQA metrics, as we did not anticipate such a high percentage of Unsuccessful Trials. As such, we conducted exploratory analyses to identify how Successful and Unsuccessful Trials differed in their dynamics.
Post hoc analyses revealed that Successful and Unsuccessful trials exhibited significantly different dynamics and metrics (see Fig 4 and Table 4). %REC, %DET, and MAXL are all significantly greater for Successful Trials than for Unsuccessful Trials. Furthermore, Successful Trials generally supported our prediction that region 2 should yield the most structured tapping data because of the antiphase attractor. %REC, %DET, and MAXL peak during region 2 and are the lowest during region 3 for Successful Trials. Unsuccessful Trials showed a different pattern of results: Region 2 was the least noisy (as indicated by %REC) and most structured (as indicated by %DET), but attractor strength (as indicated by MAXL) waned from region 1 to region 3. The absence of a clear pattern demonstrated by Unsuccessful Trials supports the interpretation that these trials were characterized by substantially different dynamics than Successful Trials.
While many participants faced difficulty phasing, the dynamics of Successful Trials generally replicated the dynamics identified in Kim’s [52] analysis, in which expert percussionists in Schutz’s case study [4] were found to dwell near in-phase and antiphase attractors but to move between them rather quickly. One notable difference in our results is the absence of quick transitions from initial synchrony to antiphase (demonstrated by relatively high metrics for region 1); however, this difference may have resulted from the difference between phasing with an adaptive human partner versus a rigid metronome. Similarities between the Successful Trials in the current work and Schutz’s study, however, held across music experience, tempo, and task demands, suggesting that perception-action coordination dynamics during successful phasing may emerge from general principles of motor behavior and intentional decoupling.
Limitations and future directions
The present study compared Schutz’s [4] expert study to a broader population of participants using a simplified version of the original task. We replicated the phasing dynamics in non-experts during Successful Trials. Participants in Unsuccessful Trials were unable to detect ψ, meaning they failed to complete one round of phasing; these trials did not replicate patterns in Schutz’s case study. This raises the question of what shapes phasing ability. Previous literature suggests attentional flexibility [53] and neuromuscular-skeletal constraints [54] predict high-level motor skill. Although we were able to identify distinct dynamics, our study did not permit us to investigate potential reasons for differences. Future work should explore the constraints that shape the distribution of Incomplete and Unsuccessful Trials relative to Successful Trials.
While phasing dynamics of Successful Trials were similar to those observed between expert musicians [4], we do not claim that nonmusicians and non-professional musicians should be identical to expert musicians in their phasing abilities: Critical differences in the dynamics may emerge when we examine interactive phasing context. To that end, future research should utilize a dyadic phasing task with a wider tempo range and again evaluate musical and linguistic experience to investigate whether our observations extend to dyadic conditions.
Finally, future work should develop a dynamical model that captures the observed behavior and provides novel testable predictions. Such models have been influential in the study of coordination dynamics [55]. A model of phasing should account for the observed changes in attractor landscape determined by task demands and individual experience from the present work and the original case study [4] and should provide an account of dyadic phasing.
Conclusion
Inspired by Schutz’s [4] data-driven case study, we here introduced a novel phasing task that requires intentional decoupling from an auditory metronome. Our complementary approach allowed us to study perception-action coordination through traditional in-phase and antiphase tapping paradigms: Despite key differences between the two paradigms (i.e., partnered expert performance versus isochronous human-metronome phasing), we conceptually replicated the original findings [4]—that is, dwelling near in-phase and antiphase and quickly transitioning between these attractors—among participants who were able to successfully phase, regardless of individual experiences or task demands. Given these findings, similar sensorimotor coordination processes may underlie successful phasing, even for different populations performing at different rhythmic complexities (e.g., isochronous versus non-isochronous). Parallel findings from two very different populations completing similar tasks of varying difficulty provides converging evidence about the general dynamics of phasing and perception-action coordination.
Supporting information
S1 Appendix. Descriptions of practice sessions.
https://doi.org/10.1371/journal.pone.0279987.s001
(DOCX)
Acknowledgments
The authors would like to thank Edward Large (University of Connecticut) for advice, feedback, and guidance and Matthew B. Jané (University of Connecticut) for his assistance designing the figures. This work was completed in part as coursework done by lead author C. Hall in the “Applications of Nonlinear Time Series Analyses” graduate course at the University of Connecticut, taught by coauthor A. Paxton and Steven J. Harrison.
References
- 1. Kelso JAS, Zanone P-G. Coordination dynamics of learning and transfer across different effector systems. Journal of Experimental Psychology. Human Perception and Performance. 2002; 28(4): 776–97. https://doi.org/10.1037/0096-1523.28.4.776. pmid:12190250
- 2. Large EW. On synchronizing movements with music. Human Movement Science. 2000; 19: 527–66.
- 3. Repp BH. Sensorimotor synchronization: A review of the tapping literature. Psychonomic Bulletin & Review. 2005; 12: 969–92. pmid:16615317
- 4. Schutz M. What really happens in Steve Reich’s “Drumming”? Percussive Notes. 2019; 57(4): 86–89.
- 5.
Kelso JAS, Del Colle JD, Schöner G. In: Jeannerod M, editor. Attention and performance 13: Motor representation and control. Hillsdale, NJ: Lawrence Erlbaum Associates, Inc; 1990. p. 139–69.
- 6. Repp BH. Compensation for subliminal timing perturbations in perceptual-motor synchronization. Psychological Research. 2000; 63(2): 106–28. https://doi-org.ezproxy.lib.uconn.edu/10.1007/PL00008170. pmid:10946585
- 7. Repp BH. Processes underlying adaptation to tempo changes in sensorimotor synchronization. Human Movement Science. 2001; 20(3): 277–312. https://doi-org.ezproxy.lib.uconn.edu/10.1016/S0167-9457(01)00049-5 pmid:11517673
- 8. Repp BH. Rate limits of on-beat and off-beat tapping with simple auditory rhythms: 2. The roles of different kinds of accent. Music Perception. 2005; 23: 165–87.
- 9. Fraisse P, Ehrlich S. Note sur la possibilité de syncoper enfonction du tempo d’une cadence [Note on the possibility of syncopation as a function of sequence tempo]. L’Année Psychologique. 1955; 55: 61–65.
- 10. Repp BH, Doggett R. Tapping to a very slow beat: A comparison of musicians and nonmusicians. Music Perception. 2007; 24(4): 367–76. https://doi.org/10.1525/mp.2007.24.4.367.
- 11. Mates J, Radil T, Müller U, Pöppel E. Temporal integration in sensorimotor synchronization. Journal of Cognitive Neuroscience. 1994; 6: 332–340. pmid:23961729
- 12. Peters M. Why the preferred hand taps more quickly than the non-preferred hand: Three experiments on handedness. Canadian Journal of Psychology. 1980; 34: 62–71.
- 13. Peters M. Constraints in the performance of bimanual tasks and their expression in unskilled and skilled subjects. Quarterly Journal of Experimental Psychology. 1985; 37A: 171–196.
- 14. Chen J, Penhune V, Zatorre R. Moving on time: brain network for auditory-motor synchronization is modulated by rhythm complexity and musical training. Journal of Cognitive Neuroscience. 2008; 20: 226–39. pmid:18275331
- 15. Repp BH. Control of expressive and metronomic timing in pianists. Journal of Motor Behavior. 1999; 31: 145–64. pmid:11177628
- 16. Bialystok E. Bilingualism and the development of executive function: the role of attention. Child Development Perspectives. 2015; 9: 117–21. pmid:26019718
- 17. Rigoli D, Piek JP, Kane R, Oosterlaan J. An examination of the relationship between motor coordination and executive functions in adolescents. Developmental Medicine & Child Neurology. 2012; 54: 1025–31. pmid:22845862
- 18. Slater J, Ashley R, Tierney A, Kraus N. Got rhythm? Better inhibitory control is linked with more consistent drumming and enhanced neural tracking of the musical beat in adult percussionists and nonpercussionists. Journal of Cognitive Neuroscience. 2018; 30(1): 14–24. https://doi-org.ezproxy.lib.uconn.edu/10.1162/jocn_a_01189. pmid:28949825
- 19. Reich S. Drumming [Song]. On Drumming. Nonesuch Records. 1971.
- 20.
Hartenberger R. (Ed.). The Cambridge Companion to Percussion (Cambridge Companions to Music). Cambridge: Cambridge University Press. 2016.
- 21. Webber C, Zbilut J. Dynamical assessment of physiological systems and states using recurrence plot strategies. Journal of Applied Physiology. 1994; 76: 965–973. pmid:8175612
- 22. Richardson D, Dale R. Looking to understand: The coupling between speakers’ and listeners’ eye movements and its relationship to discourse comprehension. Cognitive Science. 2005; 29: 1045–60. pmid:21702802
- 23. Shockley K, Santana MV, Fowler CA. Mutual interpersonal postural constraints are involved in cooperative conversation. Journal of Experimental Psychology: Human Perception and Performance. 2003; 29: 326–332. pmid:12760618
- 24. Carello C, Moreno M. Why nonlinear methods? In Riley M. A.& Van Orden G. C.(Eds.), Tutorials in contemporary nonlinear methods for the behavioral sciences. 2005. (pp. 1–25). www.nsf.gov/sbe/bcs/pac/nmbs/nmbs.jsp.
- 25. Wallot S, Roepstorff A, Mønster D. Multidimensional Recurrence Quantification Analysis (MdRQA) for the Analysis of Multidimensional Time-Series: A Software Implementation in MATLAB and Its Application to Group-Level Data in Joint Action. Frontiers in Psychology. 2016; 7: 1835. pmid:27920748
- 26. Coco MI, Dale R. Cross-recurrence quantification analysis of categorical and continuous time series: an R package. Frontiers in Quantitative Psychology and Measurement. 2014; 5: 510. pmid:25018736
- 27.
Pellecchia GL, Shockley K. Cross recurrence quantification of interpersonal postural activity. In Riley M. A.& Van Orden G. C.(Eds.), Tutorials in contemporary nonlinear methods for the behavioral sciences. 2005. (pp. 95–141). Washington, DC: National Science Foundation, Directorate for Social, Behavioral & Economic Sciences.
- 28. Kelso JAS. Phase transitions and critical behavior in human bimanual coordination. The American Journal of Physiology. 1984; 246: 1000–04. pmid:6742155
- 29. Zanone PG, Kelso JAS. Evolution of behavioral attractors with learning: Nonequilibrium phase transitions. Journal of Experimental Psychology: Human Perception and Performance. 1992; 18(2): 403–21. pmid:1593227
- 30.
Bialystok E. Selective attention in cognitive processing: the bilingual edge. In Harris R. J. (Ed.), Advance in Psychology, 83. Cognitive Processing in Bilinguals. 1992. (pp. 501–513). North-Holland.
- 31.
Bialystok E. Bilingualism in development: Language, literacy, and cognition. 2001. Cambridge University Press.
- 32. Philipp AM, Koch I. Inhibition in language switching: What is inhibited when switching between languages in naming tasks? Journal of Experimental Psychology: Learning Memory and Cognition. 2009; 35(5): 1187–95. pmid:19686014
- 33.
Moelants D. Preferred tempo reconsidered. In: Proceedings of the 7th International Conference on Music Perception and Cognition / C Stevens, Burnham D, McPherson G, Schubert E, Renwick J(eds)—Sydney, Adelaide, Causal Productions, 2002. 2002. p. 580–3.
- 34. Fraisse Paul. “6 –Rhythm and Tempo.” Psychology of Music (1982): 149–180.
- 35.
MATLAB. version 10.15.7 (R2020b). 2020. Natick, Massachusetts: The MathWorks Inc.
- 36.
Roland. Roland HandSonic HPD-20. [Apparatus]. 2014. https://www.roland.com/us/products/handsonic_hpd-20/downloads/.
- 37. Bates D, Maechler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. Journal of Statistical Software. 2015; 67(1): 1–48.
- 38. Daróczi G, Tsegelskyi R. pander: An R ’pandoc’ writer (R package version 0.6.3.). 2018. https://CRAN.R-project.org/package=pander.
- 39. deWitt P. qwraps2: Quick wraps 2 (R package version 0.5.2.). 2021. https://CRAN.R-project.org/package=qwraps2.
- 40. Garnier S. viridis: Default color maps from ’matplotlib’ (R package version 0.5.1.). 2018. https://CRAN.R-project.org/package=viridis.
- 41. Kassambara A. ggpubr: ’ggplot2’ Based Publication Ready Plots (R package version 0.4.0.). 2020. https://CRAN.R-project.org/package=ggpubr.
- 42. Kassambara A. rstatix: Pipe-friendly framework for basic statistical tests (R package version 0.6.0.). 2020. https://CRAN.R-project.org/package=rstatix.
- 43. Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest Package: Tests in linear mixed effects models. Journal of Statistical Software. 2017; 82(13): 1–26.
- 44. Lenth R. emmeans: Estimated marginal means, aka least-squares means (R package version 1.5.4.). 2021. https://CRAN.R-project.org/package=emmeans.
- 45. Lüdecke D. sjPlot: Data visualization for statistics in social science (R package version 2.8.7.). 2021. https://CRAN.R-project.org/package=sjPlot.
- 46.
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2020. https://www.R-project.org/.
- 47. Schloerke B, Cook D, Larmarange J, Briatte F, Marbach M, Thoen E, et al. GGally: Extension to ’ggplot2’ (R package version 2.1.1.). 2021. https://CRAN.R-project.org/package=GGally;l.
- 48.
Wickham H. ggplot2: Elegant graphics for data analysis. Springer-Verlag New York. 2016. https://ggplot2.tidyverse.org.
- 49. Wickham H, François R, Henry L, Müller K. dplyr: A grammar of data manipulation (R package version 1.0.2.). 2020. https://CRAN.R-project.org/package=dplyr.
- 50. Wickham et al. Welcome to the tidyverse. Journal of Open Source Software. 2019; 4(43): 1686, https://doi.org/10.21105/joss.01686.
- 51. Barr DJ, Levy R, Scheepers C, Tily HJ. Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language. 2013; 68(3): pmid:24403724
- 52. Kim J-C. What happens when musicians attempt to desynchronize gradually? Dynamics of interpersonal coordination in the performance of Steve Reich’s Drumming. New England Sequencing and Timing. 2019, Storrs, CT.
- 53. Keele SW, Hawkins HL. Explorations of individual differences relevant to high level skill. Journal of Motor Behavior. 1982; 14: 3–23. pmid:15151886
- 54. de Rugy A, Riek S, Carson RG. Neuromuscular-Skeletal Origins of Predominant Patterns of Coordination in a Rhythmic Two-Joint Arm Movement. Journal of Motor Behavior. 2006; 38(1): 7–14. https://doi-org.ezproxy.lib.uconn.edu/10.3200/JMBR.38.1.7-14.
- 55. Haken H, Kelso JAS, Bunz H. A theoretical model of phase transitions in human hand movements. Biological cybernetics. 1985; 51(5): 347–56. pmid:3978150