Goal-related feedback guides motor exploration and redundancy resolution in human motor skill acquisition

doi:10.1371/journal.pcbi.1006676

Fig 1.

The three DoF planar arm reaching task.

A: Both humans and simulated agents learned to move a three DoF planar arm with joints q₁, q₂ and q₃ (three-dimensional motor space) to reach to targets in an x-y plane (two-dimensional goal space). B: Top: The home postures H₁ and H₂ are different but reach to the same point in goal space. Note that in home posture H₁, the q₂ joint is folded backwards (q₂ = π) so the reach-endpoint is located on top of the q₂ joint. Bottom: The colour map depicts the redundancy, that is, the relative number of joint configurations (q₁, q₂, q₃) that reach to a given point in the goal space (lighter colours indicate more joint configurations that reach this point). The task is redundant, especially for targets close to the origin of the goal space. Participants were trained and tested on targets in the top quadrant of the goal space (green disks).
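
The mapping from the three-dimensional motor space to the two-dimensional goal space can be sketched as the forward kinematics of a planar chain. The snippet below is a minimal illustration, not the implementation used in the study: it assumes unit link lengths and joint angles measured relative to the previous link, neither of which is stated in this caption. The function name `forward_kinematics` and the two example postures are hypothetical and are not the home postures H₁ and H₂.

```python
import math


def forward_kinematics(q, link_lengths=(1.0, 1.0, 1.0)):
    """Return the (x, y) reach-endpoint of a planar arm with joint angles q.

    Assumption: each angle is relative to the previous link and all links
    have unit length; the caption specifies neither.
    """
    angle = 0.0
    x = 0.0
    y = 0.0
    for joint_angle, length in zip(q, link_lengths):
        angle += joint_angle  # absolute orientation of the current link
        x += length * math.cos(angle)
        y += length * math.sin(angle)
    return x, y


# Two different, hypothetical postures in motor space.
posture_a = (math.pi / 2, 0.0, math.pi)      # third link folded back onto the second
posture_b = (math.pi / 2, math.pi, math.pi)  # second link folded back onto the first

endpoint_a = forward_kinematics(posture_a)
endpoint_b = forward_kinematics(posture_b)
print(endpoint_a, endpoint_b)
```

Under these assumptions both postures end at the same point in goal space although they differ in motor space, which is the kind of redundancy the colour map summarises.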

Fig 2.

Average performance across blocks.

Mean and standard error of the mean in Condition H₂ (red) and in Condition H₁ (blue). The black line indicates baseline performance if the hand is not moved at all (slightly lower during training as targets are spaced differently). A: Artificial agents, test. B: Human participants, test. C: Human participants, training.

Fig 3.

Synergy formation during motor skill acquisition.

The percentage of total variance explained by each PC in artificial agents (A) and in human participants during the test blocks (B) and the training blocks (C). Both agents and human participants need two PCs to explain >90% of the total variance in reach postures (q₁, q₂, q₃) at the end of training. In human participants, this involves a dimensionality reduction: the amount of variance explained by PC1 goes up with training (sign test, beginning vs. end: p = 0.041 in test, p = 0.263 in training), whereas the amount of variance explained by PC3 goes down (sign test, beginning vs. end: p < 0.001 in test, p = 0.041 in training). The lines in panels A, B and C depict the across-participant median; the error bars depict the interquartile range.
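
The percentage of variance explained by each PC can be computed from the eigenvalues of the covariance matrix of the reach postures. The following is a minimal sketch on synthetic data, not the analysis code of the study: the helper names are hypothetical, the eigenvalues come from the standard trigonometric closed form for symmetric 3×3 matrices so that only the standard library is needed, and the example postures are constructed to lie exactly on a plane in motor space. In practice a library routine such as `numpy.linalg.eigh` would normally replace the hand-written eigenvalue step.

```python
import math


def covariance_3d(postures):
    """Sample covariance matrix (3x3) of a list of (q1, q2, q3) postures."""
    n = len(postures)
    mean = [sum(p[i] for p in postures) / n for i in range(3)]
    cov = [[0.0] * 3 for _ in range(3)]
    for p in postures:
        d = [p[i] - mean[i] for i in range(3)]
        for i in range(3):
            for j in range(3):
                cov[i][j] += d[i] * d[j] / (n - 1)
    return cov


def det3(m):
    """Determinant of a 3x3 matrix."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def symmetric_eigenvalues_3x3(a):
    """Eigenvalues of a symmetric 3x3 matrix, largest first."""
    off = a[0][1] ** 2 + a[0][2] ** 2 + a[1][2] ** 2
    if off == 0.0:  # already diagonal
        return sorted((a[0][0], a[1][1], a[2][2]), reverse=True)
    q = (a[0][0] + a[1][1] + a[2][2]) / 3.0
    p2 = sum((a[i][i] - q) ** 2 for i in range(3)) + 2.0 * off
    p = math.sqrt(p2 / 6.0)
    b = [[(a[i][j] - (q if i == j else 0.0)) / p for j in range(3)]
         for i in range(3)]
    r = max(-1.0, min(1.0, det3(b) / 2.0))  # clamp against rounding error
    phi = math.acos(r) / 3.0
    e1 = q + 2.0 * p * math.cos(phi)
    e3 = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    e2 = 3.0 * q - e1 - e3  # the trace equals the sum of the eigenvalues
    return [e1, e2, e3]


def variance_explained(postures):
    """Percentage of total variance carried by PC1, PC2 and PC3."""
    eigenvalues = symmetric_eigenvalues_3x3(covariance_3d(postures))
    total = sum(eigenvalues)
    return [100.0 * e / total for e in eigenvalues]


# Synthetic postures on the plane q3 = q1 + q2 (illustrative data only).
postures = [(a, b, a + b) for a in (-1.0, 0.0, 1.0) for b in (-1.0, 0.0, 1.0)]
percentages = variance_explained(postures)
print(percentages)
```

Because the synthetic postures lie on a plane, the first two PCs carry essentially all of the variance and the third carries none, which is the limiting case of the two-PC solutions described in the caption.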

Fig 4.

Examples of synergies learned.

Two example solutions from different participants in the test phase for Condition H₁ (A+B) and Condition H₂ (C+D) respectively (first two synergies PC1 and PC2). Panels A and C depict the PC1/PC2 solution plane in the three-dimensional motor space. The plane extends 2σ (standard deviations) away from the central posture in the directions of PC1 and PC2 respectively. The pale ‘shadows’ are projections of the results onto the planes defined by the coordinate axes for a better impression of 3D shape. The empty disks depict the two home postures H₁ and H₂. Panels B and D depict the same result in the goal space (colour code is identical to A and C). The thick lines depict the arm configurations that correspond to the points of the same colour in motor space. The thin lines depict the reach-endpoints along the grid that connects the dots. The inlays depict the respective home postures H₁ and H₂.

Fig 5.

Location of learned solutions in motor and goal space at the end of training.

A-C: Artificial Agents. D-G: Human Participants. Panels A and D depict the central postures (mid-point of learned solutions) in motor space for all 100 agents and 20 participants (Cond H₁: blue, Cond H₂: red). Panels B and E depict the corresponding arm configurations in goal space, which shows that the reach-endpoints of the central postures cluster around the mid-point of the array of test targets. Panels C, F and G depict the average location of solutions relative to H₁ and H₂ across time (population mean and standard error of the projection on the line connecting H₁ and H₂). Both agents (C) and humans (test F, training G) are biased towards the starting home posture, but this bias is weaker in humans.
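
The projection used in panels C, F and G can be illustrated with a scalar projection of a central posture onto the line through H₁ and H₂. This is a sketch under assumptions: the normalisation (0 at H₁, 1 at H₂) is chosen here for readability and may differ from the scaling in the figure, and the function name and the numerical values of the home postures below are hypothetical placeholders.

```python
def projection_on_home_line(posture, h1, h2):
    """Position of a posture along the line from h1 to h2.

    Returns 0.0 at h1 and 1.0 at h2 (normalisation assumed for illustration).
    """
    direction = [b - a for a, b in zip(h1, h2)]
    offset = [p - a for a, p in zip(h1, posture)]
    length_sq = sum(d * d for d in direction)
    return sum(o * d for o, d in zip(offset, direction)) / length_sq


# Hypothetical home postures in motor space (not the values from the study).
h1 = (0.5, 3.0, 1.0)
h2 = (1.5, 1.0, 2.0)
midpoint = tuple((a + b) / 2.0 for a, b in zip(h1, h2))

print(projection_on_home_line(h1, h1, h2))
print(projection_on_home_line(midpoint, h1, h2))
print(projection_on_home_line(h2, h1, h2))
```

With this convention, a population mean below 0.5 would indicate solutions closer to H₁ and a mean above 0.5 solutions closer to H₂.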

Fig 6.

The relative use of DoFs in learned synergies.

A+C: Population median of the absolute values of q₁, q₂ and q₃ in PC1, PC2 and PC3 (unit length vectors) in artificial agents (A) and human participants during training (C). Brighter colours indicate a higher absolute value of the respective DoF. B: Illustration of the home postures H₁ (blue) and H₂ (red) in goal space. Note that the q₃ joint is contracted in H₁ and extended in H₂. D: Median and interquartile range of absolute q₂ values in PC1 across agents (top) and participants during training (bottom).

Fig 7.

Absolute variability and influence of finger mapping on motor organization.

The summed variance of all three synergies for artificial agents (left) and human participants (right) across time (median and interquartile range).

Fig 8.

Percentage of variance in the across-participant reach-endpoint space that is explained by the first, second and third PC across time.

More variance explained by the first synergies indicates more organization of motor control. Left: PCA on human morphology (left index, right index, right middle fingers). Right: PCA on the task motor space (q₁, q₂, q₃). Top: test blocks, bottom: training blocks.

Fig 9.

Setup and Procedure for the Experiment with Human Participants.

A: Participants experience the task as deforming a black ellipse to track the shape of a white ellipse by moving their left index (LI), right index (RI) and right middle (RM) fingers up and down (left). Internally, finger positions are mapped to the joint angles q₁, q₂ and q₃ of the 3 DoF planar arm (bottom right, Method Sect Task). The size and elongation of the ellipses represent the positions of the target and the reach-endpoint in goal space (top right). B: When a participant keeps the three fingers LI, RI and RM level (bottom left), this corresponds to taking the Baseline Posture q* (top left), which lies midway between the two home postures (top middle and top right) in motor space. To take one of the home postures, the fingers have to be moved away from the midline in a manner that depends on the random mapping between fingers and joints (example mapping: (-2, 3, -1)). C: Each session consisted of 24 training and test blocks (right). A training block (top left) was started by taking the home posture. The deforming white target ellipse then had to be tracked continuously with the black ellipse (reach-endpoint) for the next 80 s. In test blocks, only the white target ellipse was displayed; participants moved their fingers until they reached a configuration they thought corresponded to the target and then submitted their response by pressing a foot pedal.
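
One way to read the example mapping (-2, 3, -1) is as a signed permutation between fingers and joints. The sketch below encodes that reading and should be treated as an assumption, since the caption does not define the notation: here the entry at position i names the finger (1 = LI, 2 = RI, 3 = RM) that drives joint qᵢ, and a negative sign inverts the direction of movement. The function name, the `gain` parameter and the baseline posture values are hypothetical.

```python
def fingers_to_joints(finger_positions, mapping, baseline_posture, gain=1.0):
    """Map finger displacements (LI, RI, RM) to joint angles (q1, q2, q3).

    Assumed reading of the mapping: entry k at position i means joint i is
    driven by finger |k|, with the direction inverted when k is negative.
    Level fingers (all zeros) yield the baseline posture.
    """
    joints = []
    for joint_index, entry in enumerate(mapping):
        finger_index = abs(entry) - 1
        sign = 1.0 if entry > 0 else -1.0
        displacement = gain * sign * finger_positions[finger_index]
        joints.append(baseline_posture[joint_index] + displacement)
    return tuple(joints)


example_mapping = (-2, 3, -1)      # example mapping quoted in the caption
baseline = (1.0, 2.0, 1.5)         # hypothetical baseline posture q*

level = fingers_to_joints((0.0, 0.0, 0.0), example_mapping, baseline)
moved = fingers_to_joints((0.1, 0.2, 0.3), example_mapping, baseline)
print(level, moved)
```

Under this reading, keeping all three fingers level returns the baseline posture, and reaching a home posture requires a finger displacement pattern that depends on the participant's mapping.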
