Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A surgical instrument motion measurement system for skill evaluation in practical laparoscopic surgery training

  • Koki Ebina,

    Roles Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Visualization, Writing – original draft

    Affiliation Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan

  • Takashige Abe,

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Writing – review & editing

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Lingbo Yan,

    Roles Investigation

    Affiliation Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan

  • Kiyohiko Hotta,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Toshiaki Shichinohe,

    Roles Funding acquisition, Resources

    Affiliation Department of Gastroenterological Surgery II, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Madoka Higuchi,

    Roles Investigation, Methodology

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Naoya Iwahara,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Yukino Hosaka,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Shigeru Harada,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Hiroshi Kikuchi,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Haruka Miyata,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Ryuji Matsumoto,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Takahiro Osawa,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Yo Kurashima,

    Roles Funding acquisition, Resources

    Affiliation Clinical Simulation Center, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Masahiko Watanabe,

    Roles Funding acquisition, Resources

    Affiliation Department of Anatomy, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Masafumi Kon,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Sachiyo Murai,

    Roles Investigation

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  • Shunsuke Komizunai,

    Roles Supervision

    Affiliation Faculty of Engineering and Design, Kagawa University, Takamatsu, Japan

  • Teppei Tsujita,

    Roles Supervision

    Affiliation Department of Mechanical Engineering, National Defense Academy of Japan, Yokosuka, Japan

  • Kazuya Sase,

    Roles Supervision

    Affiliation Department of Mechanical Engineering and Intelligent Systems, Tohoku Gakuin University, Sendai, Japan

  • Xiaoshuai Chen,

    Roles Supervision

    Affiliation Graduate School of Science and Technology, Hirosaki University, Hirosaki, Japan

  • Taku Senoo,

    Roles Supervision

    Affiliation Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan

  • Nobuo Shinohara,

    Roles Supervision

    Affiliation Department of Renal and Genitourinary Surgery, Faculty of Medicine, Hokkaido University, Sapporo, Japan

  •  [ ... ],
  • Atsushi Konno

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    konno@ssi.ist.hokudai.ac.jp

    Affiliation Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan

  • [ view all ]
  • [ view less ]

Abstract

This study developed and validated a surgical instrument motion measurement system for skill evaluation during practical laparoscopic surgery training. Owing to the various advantages of laparoscopic surgery including minimal invasiveness, this technique has been widely used. However, expert surgeons have insufficient time for providing training to beginners due to the shortage of surgeons and limited working hours. Skill transfer efficiency has to be improved for which there is an urgent need to develop objective surgical skill evaluation methods. Therefore, a simple motion capture–based surgical instrument motion measurement system that could be easily installed in an operating room for skill assessment during practical surgical training was developed. The tip positions and orientations of the instruments were calculated based on the marker positions attached to the root of the instrument. Because the patterns of these markers are individual, this system can track multiple instruments simultaneously and detect exchanges. However due to the many obstacles in the operating room, the measurement data included noise and outliers. In this study, the effect of this decrease in measurement accuracy on feature calculation was determined. Accuracy verification experiments were conducted during wet-lab training to demonstrate the capability of this system to measure the motion of surgical instruments with practical accuracy. A surgical training experiment on a cadaver was conducted, and the motions of six surgical instruments were measured in 36 cases of laparoscopic radical nephrectomy. Outlier removal and smoothing methods were also developed and applied to remove the noise and outliers in the obtained data. The questionnaire survey conducted during the experiment confirmed that the measurement system did not interfere with the surgical operation. Thus, the proposed system was capable of making reliable measurements with minimal impact on surgery. The system will facilitate surgical education by enabling the evaluation of skill transfer of surgical skills.

1 Introduction

In the field of abdominal or pelvic surgery, laparoscopic techniques are commonly used to avoid making large incisions on patients. In a laparoscopic operation, the surgeon inserts the endoscope and surgical instruments into the patient’s abdomen via a cannula and performs the procedure by referring to images from a camera. The smaller number of incisions made in laparoscopic surgery than in an open surgery results in lesser postoperative pain, smaller scars, and faster recovery. Additionally, the magnified field of view of the laparoscope enables precise surgical manipulation. These advantages have led to the wide use of laparoscopic surgery.

However, owing to the difficulties associated with laparoscopic surgery, surgeons must possess advanced skills. First, surgeons need to develop depth perception, which is the ability to transform a two-dimensional (2D) video image into a three-dimensional (3D) working area, because the operation is performed by referring to a 2D laparoscopic monitor. The narrower field of view also makes the operation more difficult compared with open surgery. Second, there are discrepancies in hand–eye coordination because the affected area cannot be visualized directly and needs to be referred to through a laparoscopic monitor. Third, surgeons must be accustomed to using specific surgical instruments that are located at a long distance from the handle to the tip. Because the surgical instrument moves around the cannula, which is the insertion point from the abdomen, the instrument’s operation is reversed between the handle and the tip owing to the fulcrum effect. Improper manipulation of the forceps can lead to complications such as organ damage and bleeding [1]. Hence, effective laparoscopic surgical training is required for beginners.

Several types of surgical training methods, such as dry labs, wet labs, and virtual reality (VR) surgical simulators, have been developed to impart and enhance surgical skills. Dry and wet labs provide surgical simulation training on human organ models made of plastic or rubber or cadaveric animal organs such as pigs. Although the operational feel is close to that of an actual operation in these training methods, the skills acquired are variable, and inappropriate habits may develop without proper guidance from a skilled surgeon [2]. VR surgical simulators have the advantage of providing training in a variety of situations, ranging from basic to practical. However, these surgical simulators are expensive and result in low-fidelity visual and haptic feedback [3]. Surgical training using live animals, human cadavers, and on-the-job training in clinical settings are also conducted as more practical surgery training. However, there are issues related to animal welfare, training opportunities, and medical safety. Because of the aforementioned technical proficiency challenges, the traditional apprenticeship model of “see one, do one, teach one” is inadequate for training surgeons in laparoscopic surgery. In addition, owing to the shortage of surgeons and limited working hours, expert surgeons have insufficient time for providing training and guidance to beginners. Therefore, there is an urgent need to develop an objective surgical skill evaluation method to improve the skill transfer efficiency.

To meet these demands, several studies on motion analysis and the assessment of surgeons’ psychomotor skills have been conducted in various training situations. Oropesa et al. [4] and Chmarra et al. [5] measured and analyzed the motion of surgical instruments for handling tasks such as pipe cleaners and peg transfer in a dry lab, respectively. Kowalewski et al. [6] and Franco-González et al. [7] performed motion analysis of needle drivers in the suturing/knotting task conducted in a dry lab. In a wet lab, Hofstad et al. [8] conducted cholecystectomy surgical measurements and an analysis of a porcine liver box model. In a cholecystectomy performed in a clinical setting, measurements of the surgeon’s hand motion using electromagnetic sensors were reported by Dosis et al. [9] and Rasmus et al. [10]. Hwang et al. [11] conducted forceps motion measurements using an optical MoCap system. A vision-based forceps motion measurement system was also developed and utilized to analyze the skills of surgeons [12, 13]. However, the surgical skill analysis in dry-lab training [47] focused only on fundamental laparoscopic procedures such as handling forceps and suturing/knotting. Although forceps motion measurements conducted during wet-lab training using a porcine cadaver organ [8] is more practical than that during dry-lab training, the training task did not involve forceps exchange, and the skills that could be measured were limited. The cholecystectomy surgical measurements in clinical settings reported by Dosis et al. [9] and Rasmus et al. [10] focused on the surgeon’s hand motions, and the movement of the surgical instruments was not measured. In the forceps motion measurements conducted by Hwang et al., the forceps’ tip position and operational force of grasping forceps were measured. However, this study measured only the grasping forceps, and the movements of other surgical instruments were not recorded. Furthermore, the motion of the forceps was restricted because they were connected to the measurement unit by cables; moreover, the measurement success rate and accuracy were not provided in this study. The cholecystectomy measurement conducted by Ganni et al. [12] measured only the 2D position of the instrument’s tip. Although a vision-based measurement system that can obtain 3D position of the instrument’s tip has been developed [4, 13] based on the principle of measurements, position measurements cannot be performed when the forceps are not visible in the operative field. In addition, because the positional data obtained using these systems are based on the local coordinate system of the laparoscope, the position and orientation of the scope must be recorded to transform the tip position into the world coordinate system. Accordingly, a surgical instrument motion measurement system has been developed by the authors in a previous study [14]. This system consists of six MoCap cameras that can track multiple surgical instruments and support instrument exchanges during surgery. Through the measurement experiment, 70 cases of lymphadenectomy and renal parenchyma suturing/knotting on a porcine cadaver organ were recorded, and the skill evaluation model was established [1517]. However, the skill analysis during wet-lab training using pig cadavers was focused only on a simple procedure, which is insufficient for surgical education. Therefore, a simple MoCap-based measurement system that can be easily installed in an operating room was developed for skill assessment during practical surgical training [18]. Nevertheless, there are several obstacles in the operating room, and the measurement data include noise and outliers. Due to the simplification of the measurement system, the measurements were obtained only from the front of the surgeon; hence, the measurement accuracy was slightly inferior to that of the previous system. The effect of this decrease in measurement accuracy on feature calculation has to be determined.

Therefore, in this study, the post processing method for measurement data was newly developed, and the proposed system was further validated through the measurement experiment using animal organs and human cadavers. The results of the accuracy verification test and a comparison of the kinetic indices calculated from the measurement data of both our earlier systems are discussed. The development of outlier removal and smoothing methods are also reported. In addition, the measurement experiment results during cadaver surgical training and the questionnaire survey results regarding the operational feel while using the surgical instruments are presented. The validated system would help evaluate surgical skill and improve the skill transfer efficiency.

2 System configuration

2.1 Ethical statement

Approval was obtained from the Ethical Review Board for Life Science and Medical Research, Hokkaido University Hospital, for cadaveric anatomy and surgical training (CAST) for urologic surgery education and research (i16–043 issued on January 24, 2017), for skill evaluation in basic laparoscopic surgical procedures (ji018–0257 issued on March 12, 2019), and for observational study on measurement of forceps dynamics during laparoscopic surgery (i20–027) issued on December 16, 2020. Written informed consent was obtained from all participants for the use of data in the study. The cadaveric porcine tissues used in our measurement experiment were purchased from a commercial butcher. All procedures performed in studies involving human participants were in accordance with the ethical standards of the Institutional Research Committee and the Declaration of Helsinki of 1964 and its subsequent amendments or comparable ethical standards.

2.2 Measurement device

In this study, a trinocular MoCap camera unit (OptiTrack V120: Trio, NaturalPoint Inc.) was utilized as the measurement device [18]. The measurement system used in a previous study by the authors [14] consisted of six measurement cameras and required a large installation space, whereas the present system consists of only a trinocular camera unit and requires a smaller installation space. Because the three infrared (IR) cameras and IR light-emitting diodes (LEDs) were fixed in the unit, the positional relationship between the cameras does not change and no dynamic calibration is required. In this study, the measurement device was installed on top of the laparoscopic monitor using a dedicated installation bracket. Because the line of sight from the surgeon to the laparoscopic monitor is always guaranteed in laparoscopic surgery, stable measurement of surgical instruments is possible. These features make this system suitable for measurements in operating rooms with several obstacles.

2.3 Surgical instruments with markers

In laparoscopic surgeries, the tips of surgical instruments cannot be measured directly because they are inserted into the abdomen of the patient during the operation. Therefore, the proposed system calculates the tip position and orientation based on the positional relationship between the tip and the marker set attached to the root of the forceps. The marker set was attached to the surgical instrument using a dedicated jig made of ABS (acrylonitrile butadiene styrene)-like resin produced using a 3D printer. As each surgical instrument has a different marker pattern, the proposed system can recognize and track multiple surgical instruments and support intraoperative exchange. All surgical instruments used in the measurement experiments were equipped with a marker set. Fig 1 shows an overview of the grasping forceps with markers as representative examples.

2.4 Measurement data

The developed system measures the tip position and orientation of the surgical instruments. OptiTrack Motive 2.1.1 Final was used as control software for the MoCap system. By registering a marker set as a rigid body in the software, the arbitrary position and orientation of the rigid body can be obtained using its positional relationship with the barycenter of the marker set. Therefore, the tip position was defined by attaching a marker to the instrument tip; subsequently, the tip position can be calculated even if the tip marker is removed. The registration process of the tip position and orientation of the surgical instrument as follows:

  1. (a) Temporary markers were attached to the sheath and tip of the instrument via dedicated marker jigs, and these markers were registered as a rigid body in MoCap software (see Fig 2a). Next, the centroid position of these markers was set as the pivot point of the rigid body, and the local coordinate system of the rigid body ΣGL was defined with the same orientation as the global coordinate system. Notably, the marker set attached to the handle of the instrument was not registered. In the registration process, the surgical instruments were attached to the fixture such that the handles were vertical. This ensured that the y-axis of ΣGL matched the orientation of the instrument’s handle.
  2. (b) The sheath axis vector was obtained from the tip marker position in the local coordinate system of the rigid body. As the structure of the temporary jig in which the two beams had the same length, the centroid position of the rigid body was located on the sheath axis; therefore, the following equality holds: (1)
  3. (c) The rotation matrix was calculated such that the sheath axis and the x-axis of the rigid body coincided based on Rodrigues’ rotation formula as follows: (2) where I is the identity matrix; u is the axis of rotation, θ is the rotation angle; and [u]× is the skew-symmetric cross-product matrix of u. These values are calculated as follows: (3) (4) (5) where , u = (ux, uy, uz). After the calculation, was transformed to Z-Y-X Euler angles and inputted to the MoCap software. Once the rotation angles were inputted, these angles were recorded and the MoCap software automatically applied a rotation such that the sheath axis and x-axis of the rigid body were coincident (Fig 2b).
  4. (d) The pivot point of the rigid body ΣL was translated to the tip marker position in the MoCap software, and the original markers were added to the rigid body (Fig 2c).
  5. (e) The temporary markers were removed (Fig 2d). Subsequently, the tip position and orientation of the instrument were calculated only with the handle markers.
thumbnail
Fig 2. Registration process of the surgical instrument in the MoCap system.

https://doi.org/10.1371/journal.pone.0305693.g002

In this system, the tip position was obtained from the global coordinate system ΣG (see Fig 1), and the orientation was obtained in a quaternion form.

3 Accuracy verification test

As the developed system calculates the tip position of the instrument by measuring the markers attached to the root, the positional error is amplified by the leverage effect. Therefore, an accuracy verification test was conducted in the previous study [18] by comparing the measurement data obtained from this system with those obtained using the former system [14]. The experiment was conducted during wet-lab training using a cadaveric porcine organ. The participants performed a dissection and suturing/knotting task.

3.1 Participants and tasks

The experimental task involved the dissection of lymph nodes and suturing/knotting of the renal parenchyma in a cadaveric porcine organ. In the dissection task, the participants were asked to detach the lymph node from the aorta using three surgical instruments: grasping forceps, scissor forceps, and a clip applier (Weck Hem-o-lok Polymer Locking Ligation System Applier; size L). In the suturing/knotting task, the participants were asked to suture the incised kidney at two sites using two needle holders. Both tasks have been reported to have high construct validity [19] and have been performed in previous studies [1517]. An overview of the cadaveric porcine organs used in each task is presented in Fig 3. Fifteen surgeons (13 urologists and 2 junior residents) participated in the experiment. As one urologist participated twice, 16 measurement data were obtained. The background of the participant’s is presented in S1 Table. As presented in the table, a left-handed participant was included in the experiment. However, the data of this participant were not excluded from the analysis in our previous studies [18] because the participant performed the surgery right-handedly. The study was approved on March 12, 2019 (ji018–0257), and the experiments were conducted from August 7 to August 18, 2020.

thumbnail
Fig 3. Overview of the porcine cadaver organ.

(a),(b) An aorta of porcine cadaver before (a) and after (b) the dissection task (c),(d) Incised kidney of porcine cadaver before (c) and after (d) the suturing/knotting task.

https://doi.org/10.1371/journal.pone.0305693.g003

3.2 Experimental environment

To verify the accuracy of the measurement system, the optical MoCap system OptiTrack Prime41 was used. In the experiment, six OptiTrack Prime41 cameras were set to surround the participant, and the OptiTrack V120: Trio was placed above the laparoscopic monitor. Fig 4 shows the experimental environment, and Fig 5 illustrates the camera configuration. Each system independently measured the movement of the surgical instruments. The measurement success rate and accuracy of the measurement data obtained using OptiTrack V120: Trio were validated by comparing them with those obtained using OptiTrack Prime41 as true values. The measurement frequency was 120 Hz for both systems and the measured data were smoothed using a Savitzky–Golay filter (window size:121, polynomial order:3) [20]. Missing position data owing to occlusion and marker interference were interpolated by linear interpolation, and the orientation data were interpolated in a quaternion form by spherical linear interpolation (Slerp).

3.3 Synchronization

An IR LED signal unit (Fig 4) was used to synchronize the two measurement systems; synchronization was performed manually after the experiment based on the timing of the detection of the LED signal. In some cases, LED signals could not be detected; thus, manual synchronization was performed based on the instrument’s movement measurement data. The global coordinate systems of the two measurement systems were matched using the same calibration square when the origin of the coordinate system was set.

3.4 Results

Table 1 presents the experimental results for each surgical instrument. Aprime41 and Atrio are the measurement success rates of OptiTrack Prime41 and OptiTrack V120: Trio, respectively. and are the positional and attitude angle errors of the developed system. In the dissection task, the measurement data of the scissor forceps were excluded from verification because of a defect in the marker jig of the forceps. In the validation, the measurement data were analyzed while the tip of the instrument was moving inside the training box; the data when the tip was moving outside the box were excluded. The measurement success rate A was calculated as follows: (6) where Ncaptured is the number of frames successfully captured by OptiTrack Prime41 or V120: Trio, and N is the total number of frames in the task. As noted above, the frames in which surgical instruments were not used were excluded. The Wilcoxon signed rank sum test, a nonparametric paired statistical test, was also used to evaluate the difference in the measurement success rates between the two systems. The test result is presented in Table 1, and the box plot of the success rate of the measurement is shown in S1 Fig. The mean position error of OptiTrack V120: Trio was calculated as follows: (7) where pref(i) and pobserved(i) are the instrument tip positions in frame i measured using OptiTrack Prime41 and OptiTrack V120: Trio, respectively. In this study, the mean attitude angle error was calculated using the rotation angle of the differential quaternion. The mean attitude angle error was calculated as follows: (8) where qw denotes the element of the difference quaternion ΔQ = (qw, qx, qy, qz). The difference quaternion ΔQ is expressed as follows: (9) where Qobserved and Qref are the quaternions measured by OptiTrack V120: Trio and OptiTrack Prime41, respectively.

thumbnail
Table 1. Measurement success rate and average error of each surgical instrument.

https://doi.org/10.1371/journal.pone.0305693.t001

3.5 Comparison of the kinematic indices between the systems

In this study, as the developed system is intended for evaluating the skill of the surgeon, the kinematic indices calculated from the measurement data of the developed system were also verified. The summary of these indices as follows:

  1. (a) Path length: PL (m)
  2. (b) Depth path length: DPL (m)
  3. (c) Depth velocity: DV (cm/s)
  4. (d) Percentage of PL in close zone (0 ≤ to < 2.0 (cm) from target object): Close (%)
  5. (e) Average velocity:
  6. (f) Average acceleration:
  7. (g) Average jerk:
  8. (h) Time percentage of forceps moving at certain velocity range
    1. (i) Idle (%) [0 ≤ to < 0.5 (cm/s)]
    2. (ii) Low (%) [0.5 ≤ to < 2.0 (cm/s)]
    3. (iii) Middle (%) [2.0 ≤ to < 5.0 (cm/s)]
    4. (iv) High (%) [5.0 ≤ to < 12.0 (cm/s)]
  9. (i) Angular length along roll: AL-Roll (°)
  10. (j) Angular length along pitch and yaw: AL-PitchYaw (°)

The calculated indices were defined and utilized for skill evaluation in the previous studies by the authors [1517]. These indices were calculated from the measured data of both systems, and Spearman’s rank correlation coefficients were calculated to compare the values. Table 2 lists the results of the verification of kinematic indices. The indices not used in skill evaluation in the previous studies were excluded from the calculation.

thumbnail
Table 2. Spearman’s rank correlation coefficients of kinematic indices between both systems.

https://doi.org/10.1371/journal.pone.0305693.t002

3.6 Questionnaires regarding the operational feel of the instruments

After conducting the experiment, a questionnaire survey regarding the operational feel of the surgical instruments was conducted to investigate whether the markers attached to the forceps interfered with the surgical procedure. The participants were asked to respond regarding the operating feel of the five surgical instruments used in the training task on a 5-point scale (1: I do not think that it was disturbing at all; 2: I do not think that it was disturbing; 3: Neutral; 4: I agree that it was disturbing; and 5: I strongly agree that it was disturbing). The questions were as follows: Did the markers attached to the instruments interfere with the operation?

  1. (a) Grasping forceps
  2. (b) Scissor forceps
  3. (c) Clip applier
  4. (d) Needle holder (R)
  5. (e) Needle holder (L)

Responses were obtained from 15 subjects. For the subjects who participated twice, only the first response was used in the analysis. A summary of the questionnaire is presented in Table 3.

thumbnail
Table 3. Averages of user evaluation of surgical instruments with markers.

https://doi.org/10.1371/journal.pone.0305693.t003

4 Measurement experiments during surgical training on cadavers

Measurement experiments were then conducted during surgical training on cadavers to investigate the measurement stability and establish a measurement system in the operation room of training. A donated human cadaver is generally used for surgical training, by which surgeons can improve their skills on actual anatomical structures. This form of surgical training is regularly conducted twice a year by the Department of Renal and Genitourinary Surgery, Graduate School of Medicine, Hokkaido University, to improve the practical surgical skills of surgeons. For this training, the donated cadaver body is fixed using the Thiel method [21] such that the properties of the cadaver body, such as the color and soft tissues, are almost identical to those of a living body. Therefore, a high educational level can be expected because the surgery can generally be performed as in a real case, except for some factors such as the absence of bleeding. In these experiments, 31 urologists performed laparoscopic radical nephrectomy and the movement of the surgical instrument was recorded using the proposed system.

4.1 Participants and task

The CAST was approved on January 24, 2017 (i16–043), and the study was approved on December 16, 2020 (i20–027). A total of 31 urologists were included in the study. Because five urologists participated in the experiment twice, 36 cases of measurement data were obtained. The background of the participants is presented in S2 Table. Although a left-handed participant was enrolled in the experiment, the corresponding data were not excluded from the analysis because the subject performed the actual surgery right handedly. The experiments were conducted in five terms from January 11, 2021 to January 10, 2023 (1st: January 11 to January 22, 2021, 2nd: July 27 to August 19, 2021, 3rd: November 4, 2021 to January 8, 2022, 4th: September 6 to September 16, 2022, 5th: January 5 to January 10, 2023). The training task in this experiment was laparoscopic radical left/right nephrectomy via a transperitoneal approach. During the training, the surgeon inserted the surgical instruments through three to five locations in the abdomen using cannulas. The surgeon performed only one left or right nephrectomy in each experiment. The cadaver was placed in the left or right lateral position, depending on the kidney to be removed. After inserting the cannula to be used as the laparoscopic camera port, carbon dioxide insufflation was performed. The task began with the insertion of two cannulas into the abdomen and was completed when the kidneys were dissected from the surrounding tissue, except for the ureters. In addition to the grasping forceps, scissors forceps, and clip appliers, Ligasure (Covidien LigaSure Maryland37NC), right-angle dissecting forceps (manufactured by OLYMPUS), and suction instruments (Covidien SurgiwandII) were used in the task.

4.2 Experimental setup

The experimental environment and experimental device setup are shown in Figs 6 and 7, respectively. The entire training period was recorded using a video camera, as shown in Fig 7, as well as using a laparoscopic camera. The measurement settings and parameters were the same as those used in the accuracy verification tests. Because the measurement data contained considerable noise and outliers due to the various obstacles in the room, these were removed by performing data processing described in the next section. After insufflation was completed and all cannulas were inserted, the vinyl portion of the surgical drape over the cadaver was closed because the measurements were strongly affected by reflective objects. IR markers were attached to three anatomical features, namely, the shoulder, anterior superior iliac spine, and knee, to determine the relationship between the surgical instruments and the cadaver body (see Fig 8). The port positions at which the cannula was inserted into the abdomen were recorded using IR markers after the experiment. In this study, a region of interest (ROI) was defined based on the cannula positions, and the frames during the tip of the instrument not in the ROI on the abdomen were excluded from the analysis. Because the cannula positions were not recorded in the five experiments, the ROI based on the anatomical marker positions shown in Fig 8b was used in the experiments. Both ROIs were of the same size and their validity was visually confirmed from the plots of the surgical instrument tip positions for all data.

thumbnail
Fig 8. Definition of ROI.

(a) ROI based on cannula position. (b) ROI based on anatomical marker position.

https://doi.org/10.1371/journal.pone.0305693.g008

4.3 Results

Among the 36 cases in the experiment, 9 failed owing to malfunctions of the hardware of the measurement equipment, and 2 were excluded from the analysis owing to inadequate cadaver body conditions. Therefore, measurement data from 25 cases were obtained for skill analysis. The malfunction of the measurement system was resolved by replacing the PC. Therefore, this was not considered a major challenge in the system. Table 4 and S2 Fig lists the measurement success rates for each surgical instrument used in the experiment. Note that the measurement success rate was calculated using Eq 6.

thumbnail
Table 4. Summary of the measurement success rate (%) in cadaver surgical training.

https://doi.org/10.1371/journal.pone.0305693.t004

4.4 Questionnaires regarding the operational feel of the instruments

The influence of the marker set attached to the surgical instruments was investigated using a questionnaire on the feel of the operation. The participants were asked to respond on the operational feel of the six surgical instruments used in the training task on a 5-point scale (1: I do not think that it was disturbing at all; 2: I do not think that it was disturbing; 3: Average; 4: I agree that it was disturbing; and 5: I strongly agree that it was disturbing). The questions were as follows: Whether the markers attached to the instruments interfered with the operation.

  1. (a) Grasping forceps
  2. (b) Scissor forceps
  3. (c) Clip applier
  4. (d) Suction instrument
  5. (e) Ligasure
  6. (f) Right angle forceps

Responses were obtained from 31 subjects. Five subjects participated twice, but only the first response was used in the analysis. A summary of the results of this survey is shown in Table 5. In addition to the 25 successfully conducted experiments, questionnaires were collected from 11 experiments that failed or were excluded from the analysis because of the poor conditions of the cadaver body. Responses were collected regarding the surgical instruments used in the task, and unused surgical instruments were excluded.

thumbnail
Table 5. Averages of user evaluation of surgical instruments with markers in cadaver surgical training.

https://doi.org/10.1371/journal.pone.0305693.t005

5 Data correction process for noise and outliers

As described in the previous section, a high success rate was achieved in the measurement experiment. However, because this system only measures instrument motion from one direction, the measurement data often include noise and outliers owing to misrecognition by the MoCap system. In this study, the data processing method for removing noise and outliers was evaluated. The smoothing and interpolation processes for the missing values are described in this section.

5.1 Outlier removal method

5.1.1 Outlier removal based on the partial measurement success rate.

Measurement data often include burst errors or missing data of several seconds owing to marker occlusion or overlapping of the markers. The measurement data obtained under such unstable conditions contain several outliers and should be excluded for a reliable skill analysis. Therefore, a threshold-based outlier removal method for measuring the success rate was implemented. In this method, the partial measurement success rate (PMR) was calculated using a moving window and data were removed when the PMR fell below a threshold value. The moving window size was set to 600 frames (5 s), the reference point of the window was set at the center, and the threshold of PMR was set to 25%. Therefore, in this study, the PMR was calculated based on the frames 5 s before and after the frame to be evaluated; if the rate was less than 25%, the frame was eliminated.

5.1.2 Outlier removal based on the ROI.

When a misrecognition occurs in the MoCap system, the tip position of the surgical instrument traverses the ROI defined in Fig 8 for a short period. Hence, the data removal process for the data for which measurement was not conducted for a certain period after moving from the inside to the outside of the ROI was performed. The threshold for the continuous measurement was set to 5s. The missing values with a duration of less than 0.1s were ignored in this process for noise reduction. By applying this process, outliers that were not outside the ROI owing to misrecognition by the MoCap system could be excluded.

5.1.3 Outlier removal based on the estimated values of the Kalman filter.

In this study, the outlier removal method based on the estimated value of the Kalman filter was implemented. The Kalman filter is a type of sequential Bayesian filter that models the target system and estimates its state from measurement data. In this process, the measured data are compared with the estimated value of the Kalman filter, and, if the discrepancy between the two values exceeds the threshold, the measured data is considered an outlier and removed. The second-order Kalman filter was generated for the tip position of the surgical instrument by setting the linear discrete-time state equation and the observation equation. Let for the state variables and for the observed variables. The state and observation equations are defined as follows: (10) (11) F in Eq 10 is a state transition matrix of size 9×9 and Γ is the noise gain of the system. The matrices are defined as follows: (12) (13) where Δt is the sampling period of the MoCap system (Δt = 1/120(s)). H in Eq 11 is the observation matrix, defined as follows: (14) v(k) and w(k) in Eqs 10 and 11 are normal white noise with a mean of 0 and variances of and , which are called system noise and observation noise, respectively. Notably, v(k) and w(k) are independent of each other. In this study, and were set to and , respectively. Based on these parameters, the state estimation was conducted as follows:

  1. Estimation step (15) (16) where is the prior state estimate and P(k) is the prior error covariance matrix. The state estimate and the posterior error covariance matrix P(k − 1) at frame k − 1 are calculated using the following update step.
  2. Update step (17) (18) (19) where S(k) = IG(k)H. G(k) is the Kalman gain, is the state estimate, and P(k) is the posterior error covariance matrix. The posterior error covariance matrix P(k) is generally obtained by P(k) = IG(k)HP(k); however, rounding errors in numerical calculations on a computer can cause asymmetry, which may lead to problems such as divergence of the Kalman filter. Therefore, Joseph’s equation [22] expressed in Eq 19 was used in this study.

In this study, the outliers were detected by comparing the prior state estimates obtained using Eq 15 with the measured value z(k) in the prediction step. First, a(k) is obtained from the prior state estimate as the estimated value of the tip position of the surgical instrument using the Kalman filter at frame k as follows: (20) where is the ith element of . From the prior error covariance matrix P(k) in Eq 16, let the i and j elements of P(k) be denoted as , and the variances of the state variables , and can be obtained by , and ,respectively. Thus, the standard deviations corresponding to the state variables σx, σy, and σz are expressed as follows: (21) Therefore, when a(k) satisfies the following equation, the observed value z(k) is handled as an outlier and the update step is not performed: (22) where rth is the ratio for setting the threshold and the standard deviation multiplied by rth is the threshold. In this study, rth was set to 5. Therefore, the measured data in frame k were excluded as outliers when the norm of the difference vector between z(k) and a(k), which was the estimated value of the Kalman filter, was more than five times the norm of the standard deviation vector of the tip position estimated by the Kalman filter. In this method, as the value of the right-hand side is variable, it has the advantage that an appropriate threshold can always be set according to the movement of the tip position. In principle, once an outlier is identified, if it does not return to the normal value based on Eq 22, the estimated value of the tip position a(k) exhibits an iso-accelerated linear motion. Therefore, the filter was reset by initializing all state variables to zero when an outlier or missing value was continuously measured for more than 5 s.

5.2 Interpolation and smoothing method

5.2.1 Interpolation process.

In this study, interpolation was performed on the missing values, and the data were excluded using the aforementioned outlier removal process. Linear interpolation was used for instrument tip positions, and Slerp was used for the orientation obtained in the quaternion form.

5.2.2 Smoothing process.

A Savitzky–Golay filter [20] was used to calculate the tip position, smoothed velocity, acceleration, and jerk. The filter window size was set to 121 and the polynomial order was set to 3. The orientation was converted from quaternions to Z-Y-X Euler angles before the filter was applied.

5.2.3 Data removal process for long-term/long-distance interpolated values.

Although the missing values were interpolated using the aforementioned process, the interpolated values may deviate from the actual motion of the surgical instrument, particularly when the interpolation is performed over a long period. Such data may lead to incorrect results when skill analysis is performed. Therefore, a thresholding process was used to exclude unreliable data with long interpolated values or distances. The threshold of the interpolated period was set to 5 s and that of the interpolated distance was set to 0.1 m. If the interpolated values exceeded these thresholds, they were excluded as missing values and were not used in the skill analysis. Fig 9 shows the flow of the measurement data correction process. Fig 10 shows the 3D plot of the tip positions of the surgical instruments obtained through these processes, and Fig 11 shows an example of the attitude angle of instruments handled by the surgeon’s right hand. The blue rectangular box in Fig 10 indicates the ROI and the switching of line colors in Figs 10 and 11 show the intraoperative exchange of instruments.

thumbnail
Fig 10. Trajectory of the tip position of all surgical instruments.

https://doi.org/10.1371/journal.pone.0305693.g010

thumbnail
Fig 11. Attitude angle of the instruments handled by right hand.

https://doi.org/10.1371/journal.pone.0305693.g011

5.3 Feature extraction

The kinematic indices that reflected the surgeons’ skills in the training were calculated preliminarily from the measurement data of the 25 participants that were successfully measured. The indices were calculated for the grasping forceps, scissor forceps, and Ligasure, which were primarily used in the training task. The calculated indices were as follows:

  1. (a) Average velocity:
  2. (b) Average acceleration:
  3. (c) Average jerk:
  4. (d) Path length: PL (m)
  5. (e) Depth path length: DPL (m)
  6. (f) Average roll: (°)
  7. (g) Average pitch: (°)
  8. (h) Average yaw: (°)

(a)–(c) show the averages of the values calculated using the Savitzky–Golay filter. (d) shows the sum of the movement of the tip position of the instrument, (e) shows the movement along the depth direction, and (f)–(h) show the average attitude angles. These indices were calculated in the authors’ previous study for skill evaluation [1517]. All missing frames were excluded from the calculations. Table 6 lists the results of feature extraction.

6 Discussion

This study developed a surgical instrument motion measurement system for use in practical laparoscopic surgery training, such as cadaver surgical training. As mentioned before, the measurement system for wet-lab laparoscopic training has been developed, and a skill evaluation model has been established by calculating the kinematic indices of surgical instruments. Although the developed system can perform measurements with high accuracy, it requires a large installation space and cannot be used in practical surgical training, presenting several obstacles. Therefore, a simple motion-capture-based measurement system was developed, and the measurement accuracy was validated via a measurement experiment using animal organs and human cadavers in this study.

In the accuracy verification test described in Section 3, the measurement success rate, positional error, and attitude angle error were verified by comparing the measurement data of the proposed system (OptiTrack V120: Trio) with those of the previously developed 6-camera system (OptiTrack Prime41) in wet-lab training. As presented in Table 1, the measurement success rate of OptiTrack V120: Trio was approximately 90%, except for the right needle holder. However, the statistical analysis results show that the measurement success rate of the proposed system was significantly lower than that of the 6-camera system in all instruments. This is because of the simple configuration of OptiTrack V120: Trio, which measures the instrument with only one unit, whereas the 6-camera system performs the measurement using six cameras. The missing value of the motion data of the instruments was linearly interpolated, the median of the positional error of the task was approximately 2 mm to 4 mm, and that of the attitude angles was approximately 0.8 to 1.4°. In addition, the kinematic indices for evaluating the skills of the surgeons were calculated using the measurement data of both systems and verified by calculating Spearman’s rank correlation coefficients. As presented in Table 2, the absolute values of the correlation coefficients of all the kinematic indices exceed 0.7, and the coefficients of most of the indices exceed 0.9. This result indicates that the developed system can evaluate the skills of surgeons to the same extent as the 6-camera system. Furthermore, as presented in Table 3, regarding the operational feel of the surgical instruments, the mean evaluation values of all surgical instruments were less than 2, indicating that the surgical instruments with markers developed in this study slightly affect the operation of the instruments. These results show that although the accuracy of the developed system was slightly lower than that of the 6-camera measurement system, practical surgical measurements can be realized with the former.

In the measurement experiment in cadaver surgical training discussed in Section 4, six surgical instruments in motion were recorded for 25 cases of laparoscopic radical left/right nephrectomy. As presented in Table 4, the measurement success rate of all the surgical instruments used in the experiments was approximately 90%. Furthermore, as presented in Table 5, the mean evaluation values of the operational feel of the surgical instruments range from 1.31 to 2.03, suggesting that the instruments with markers developed in this study slightly affect the surgeries. This result suggests that adequate measurements can be performed for skill analysis during practical surgical training. As mentioned before, measurements with 6-camera systems could not be performed because of the presence of obstacles. In addition, because of the different measurement environments, the measurement data could not be compared with those acquired in the wet-lab training discussed in Section 3. Therefore, the measurement success rate of cadaver surgical training could not be evaluated in comparison with 6-camera systems. This is considered as a limitation of this study.

As mentioned earlier, because this system only measures the instrument motion from one direction, the measurement data of cadaver surgical training often include noise and outliers because of misrecognition by the MoCap system. Therefore, a data processing method for removing noise and outliers was developed. Through post-processing of the measurement data, the trajectory of the tip position (Fig 10) and attitude angle (Fig 11) were obtained. As shown in these figures, this system can identify each surgical instrument and detect intraoperative instrument changes. Therefore, the proposed system was not compared with 6-camera systems. However, these figures confirm that outliers were eliminated and reasonable measurements were performed. Table 6 presents the calculated kinematic indices of the surgical instruments for skill evaluation. The skill analysis results based on these indices will be investigated in future studies.

Other limitations of this study include a small sample size and heterogeneity, including the differing dominant hands of the subjects. As noted earlier, the measurement accuracy of the proposed system was not evaluated in cadaver surgical training because measurements using 6-camera systems could not be performed due to presence of obstacles. The effect of the low measurement success rate on skill evaluation was acceptable by the accuracy verification test in the wet-lab training. However, as presented in Table 4, approximately 10% of the data were missing in this system. This is a major challenge in using this system for other applications such as the surgical assistant system. A possible solution to this challenge is to install an inertial measurement unit (IMU) on the surgical instrument and interpolate the missing data using sensor fusion. When used alone, an IMU produces considerable errors due to the yaw angle drift. However, when employed with the MoCap system, their mutual limitations can be offset, and robust measurement can be performed. Such a measurement system is currently under development by our research group and will be reported in the future. Future studies will include skill analysis by applying other statistical analysis methods and calculating new kinematic indices of surgical instruments based on the collected data, as well as the establishment of practical surgical skill assessment methods and their application to surgical education.

7 Conclusion

In this study, the development and accuracy validation of a surgical instrument motion measurement system for skill evaluation during practical laparoscopic surgery training were described. The results of the accuracy verification test showed that the proposed system can measure the movement of instruments with practical accuracy and can be used for skill evaluation. By conducting measurement experiments during surgical training using cadavers, it was confirmed that the proposed system was capable of making reliable measurements with minimal impact on surgery. A limitation of this study is that the data of about 10% with long-term deficits are still not available for skill analysis. One way to address this problem is considered to compensate for missing values by sensor fusion using measurement methods other than motion capture such as IMU.

Future research involving skill analyses using the collected data is warranted. The establishment of a practical surgical skill evaluation method and its application in surgical education for the efficient acquisition of surgical skills will be considered in future studies.

Supporting information

S1 Fig. Box plots of measurement success rate in accuracy verification test.

(a) Grasping forceps (b) Clip applier (c) Right angle forceps (d) Left needle holder.

https://doi.org/10.1371/journal.pone.0305693.s001

(PDF)

S2 Fig. Box plot of measurement success rate in cadaver surgical training.

https://doi.org/10.1371/journal.pone.0305693.s002

(PDF)

S1 Table. Participants background in wet-lab training using porcine cadaver organ.

https://doi.org/10.1371/journal.pone.0305693.s003

(PDF)

S2 Table. Participants background in cadaver surgical training.

https://doi.org/10.1371/journal.pone.0305693.s004

(PDF)

References

  1. 1. Inoue T, Kinoshita H, Satou M, Oguchi N, Kawa G, Muguruma K, et al. Complications of Urologic Laparoscopic Surgery: A Single Institute Experience of 1017 Procedures. Journal of Endourology. 2010;24(2):253–260. pmid:20064000
  2. 2. Tanoue K, Ieiri S, Konishi K, Yasunaga T, Okazaki K, Yamaguchi S, et al. Effectiveness of endoscopic surgery training for medical students using a virtual reality simulator versus a box trainer: a randomized controlled trial. Surgical Endoscopy. 2008;22(4):985–990. pmid:17710487
  3. 3. Chellali A, Mentis H, Miller A, Ahn W, Arikatla VS, Sankaranarayanan G, et al. Achieving interface and environment fidelity in the Virtual Basic Laparoscopic Surgical Trainer. International Journal of Human-Computer Studies. 2016;96:22–37. pmid:30393449
  4. 4. Oropesa I, Sánchez-González P, Chmarra MK, Lamata P, Fernández Á, Sánchez-Margallo JA, et al. EVA: Laparoscopic Instrument Tracking Based on Endoscopic Video Analysis for Psychomotor Skills Assessment. Surgical Endoscopy. 2013;27(3):1029–1039. pmid:23052495
  5. 5. Chmarra MK, Klein S, de Winter JCF, Jansen FW, Dankelman J. Objective classification of residents based on their psychomotor laparoscopic skills. Surgical Endoscopy. 2010;24(5):1031–1039. pmid:19915915
  6. 6. Kowalewski KF, Hendrie JD, Schmidt MW, Garrow CR, Bruckner T, Proctor T, et al. Development and validation of a sensor- and expert model-based training system for laparoscopic surgery: the iSurgeon. Surgical Endoscopy. 2017;31(5):2155–2165. pmid:27604368
  7. 7. Franco-González IT, Minor-Martínez A, Ordorica-Flores RM, Sossa-Azuela JH, Pérez-Escamirosa F. Objective psychomotor laparoscopic skills evaluation using a low-cost wearable device based on accelerometry: construct and concurrent validity study. Surgical Endoscopy. 2023;37(4):3280–3290. pmid:36890413
  8. 8. Hofstad EF, Våpenstad C, Bø LE, Langø T, Kuhry E, Mårvik R. Psychomotor skills assessment by motion analysis in minimally invasive surgery on an animal organ. Minimally Invasive Therapy & Allied Technologies. 2017;26(4):240–248.
  9. 9. Dosis A, Aggarwal R, Bello F, Moorthy K, Munz Y, Gillies D, et al. Synchronized Video and Motion Analysis for the Assessment of Procedures in the Operating Theater. Archives of Surgery. 2005;140(3):293–299. pmid:15781796
  10. 10. Rasmus M, Riener R, Reiter S, Schneider A, Feussner H. In vivo kinematic measurement during laparoscopic cholecystectomy. Surgical Endoscopy And Other Interventional Techniques. 2004;18(11):1649–1656. pmid:15931490
  11. 11. Hwang H, Lim J, Kinnaird C, Nagy AG, Panton ONM, Hodgson AJ, et al. Correlating motor performance with surgical error in laparoscopic cholecystectomy. Surgical Endoscopy And Other Interventional Techniques. 2006;20(4):651–655. pmid:16391955
  12. 12. Ganni S, Botden SMBI, Chmarra M, Goossens RHM, Jakimowicz JJ. A software-based tool for video motion tracking in the surgical skills assessment landscape. Surgical Endoscopy. 2018;32(6):2994–2999. pmid:29340824
  13. 13. Zhao Z, Chen Z, Voros S, Cheng X. Real-time tracking of surgical instruments based on spatio-temporal context and deep learning. Computer Assisted Surgery. 2019;24(sup1):20–29. pmid:30760050
  14. 14. Ebina K, Abe T, Komizunai S, Tsujita T, Sase K, Xiaoshuai C, et al. Development and Validation of a Measurement System for Laparoscopic Surgical Procedures. SICE Journal of Control, Measurement, and System Integration. 2020;13:191–200.
  15. 15. Ebina K, Abe T, Higuchi M, Furumido J, Iwahara N, Kon M, et al. Motion analysis for better understanding of psychomotor skills in laparoscopy: objective assessment-based simulation training using animal organs. Surgical Endoscopy. 2021;35:1–19.
  16. 16. Ebina K, Abe T, Hotta K, Higuchi M, Furumido J, Iwahara N, et al. Objective evaluation of laparoscopic surgical skills in wet lab training based on motion analysis and machine learning. Langenbeck’s Archives of Surgery. 2022;407. pmid:35394212
  17. 17. Ebina K, Abe T, Hotta K, Higuchi M, Furumido J, Iwahara N, et al. Automatic assessment of laparoscopic surgical skill competence based on motion metrics. PLOS ONE. 2022;17:e0277105. pmid:36322585
  18. 18. Ebina K, Abe T, Hotta K, Higuchi M, Furumido J, Iwahara N, et al. Development and validation of a measurement system for laparoscopic surgical procedures in practical surgery training. In: 2023 IEEE/SICE International Symposium on System Integration (SII); 2023. p. 1–6.
  19. 19. Higuchi M, Abe T, Hotta K, Morita K, Miyata H, Furumido J, et al. Development and validation of a porcine organ model for training in essential laparoscopic surgical skills. International Journal of Urology. 2020;27(10):929–938. pmid:32743896
  20. 20. Savitzky A, Golay MJE. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Analytical Chemistry. 1964;36(8):1627–1639.
  21. 21. Thiel W. The preservation of the whole corpse with natural color. Ann Anat. 1992;174(3):185–195. pmid:1503236
  22. 22. Bucy RS, Joseph PD. Filtering for Stochastic Processes with Applications to Guidance. AMS Chelsea Publishing Series. AMS Chelsea Pub.; 2005.