The adoption of minimal invasive surgery as a frequently used surgical approach has had a significant impact on surgical practice. Traditionally, an important part of surgical training follows a master–apprentice model. This model is time-consuming and costly, and there is the risk of secondary harm to the patient due to trainee inexperience [1,2,3,4]. Also, the introduction of minimally invasive techniques requires a specific set of skills which creates a new teaching dynamic [4, 5]. Therefore, an alternative for this teaching model is the training of laparoscopic surgical skills in a simulated setting. Inanimate simulators are incorporated increasingly as a training module and a strong development of these systems is observed. However, this results in increasing costs with the advancement of the technology used [6]. Although, these expensive high-fidelity simulators are an interesting development they are not affordable to all and often lack mobility [7]. Therefore, there is still a need for more affordable simulators with the ability to give direct assessment to the trainee, to ensure a training, independent of the availability of expert observers. EoSurgical ltd. (Edinburgh, Scotland, United Kingdom) has recently developed the eoSim laparoscopic simulator, which is a relatively low-cost simulator ranging from £699 to £1298 at the time of writing, this includes the instrument tracking capability [8,9,10,11]. The main advantage of this system is that it is low in weight and easily transportable, to practice laparoscopic skills in any setting. It also shows performance parameters after each task while working with real instruments. Therefore, this simulator is an augmented reality simulator that allows parametric feedback independently of the task that is performed. This objective parametric feedback differentiates the eoSim from other affordable laparoscopic simulators without the instrument tracking capabilities.

Recent studies demonstrate a positive effect of pre-operative warm-up exercises on performance of the surgeon in terms of time and psychomotor abilities [12,13,14,15,16]. The eoSim could be an attractive tool to train laparoscopic psychomotor skills as a pre-operative warm up tool. Especially when used for more complex surgical procedures, such as suturing in small areas or complex angles (e.g. oesophageal anastomoses, jejunostomy placement). However, for a training system to be implemented in training programs, it has to be validated. Validation of a system in terms of face, content and construct validity is required to assess the systems value for training [17,18,19]. The aim of this study is to validate the eoSim laparoscopic simulator in terms of face, content and construct validity regarding complex suturing and needle transfer tasks.

Method

Participants

The participants were tested in a period of 5 months (February–June 2017) at the Radboud university medical centre Nijmegen, Rijnstate hospital Arnhem and Catharina hospital Eindhoven in the Netherlands. Additional participants were included during the ‘National Endoscopic Surgery symposium’, Amsterdam and the ‘National Surgeon’s congress’, Veldhoven, the Netherlands. The subjects were divided in three groups based on their self-reported laparoscopic experience: ‘novices’ without clinical experience but with understanding of the concept of laparoscopy, consisting of medical interns and 1st year residents. The ‘target’ group with more than 10 basic laparoscopic procedures performed but less than 20 advanced laparoscopic procedures. And the ‘expert’ group with extensive laparoscopic experience, including more than 20 advanced laparoscopic procedures performed, therefore consisting of residential surgeons in staff. The group setup was based on the results of the study by Botden et al. [20] showing assessment scores of novice laparoscopic intracorporeal suturing training that indicate a proficiency curve plateau observed from 15 sutures. Due to the non-medical intervention setup, no medical ethical approval was required.

Equipment

In this study, the eoSim augmented reality laparoscopic simulator by Eosurgical ltd., Edinburgh, Scotland, United Kingdom was used in a standard setup (Fig. 1). This setup consisted of the eoSim laparoscopic case with an internal mounted high-definition camera and standard supplied equipment that consists of laparoscopic instruments, needle holders, suturing pad, thread transfer platform and a box with standard exercise equipment. A 15-in. laptop with the required specification as recommended by eoSurgical with the eoSurgical Surgtrac software installed. The tracking camera that is mounted in the case was connected to the laptop via USB 2.0. The Surgtrac software allowed for instrument tracking by means of the coloured markings on the instruments (right = red, left = blue). Placement of the laptop screen was adjusted to the proper height for every participant with the laparoscopic box being placed on a standard height table. The 30-mm curved needle braided thread sutures were used to perform the tasks.

Fig. 1
figure 1

Set up of the eoSim as used in this study

Tasks

Task 1 commenced with a video instruction, provided by the eoSurgical software. Two surgical knots had to be placed on the standard suturing pad in a horizontal plane (double wrap followed by two single wraps to create a secure surgeon’s knot). A standard length of 20 cm thread was used for each participant. If the thread of a suture was too short to reuse after being cut by the research assistant, a new suture would be placed on the suture pad.

Task 2 developed by the researchers to simulate abdominal wall suturing. This was simulated by using a cardboard panel to create an 80° plane on which a suture pad (Edustitch®, Breda, The Netherlands) was attached with the camera tilted upwards to simulate the difficult working angle (e.g. fixation of the stomach to the abdominal wall in gastrostomy placement). Each participant was given a total of three needles which had to be placed in the marked areas on the suturing pad with the tip of the needle being visible coming out of the suture pad again. In the case a needle was dropped, it was considered lost and the participant was required to grasp a new needle.

Task 3 simulated a running end-to-end anastomosis of an oesophagus or bowel anastomosis. This was performed with two cut out balloons as shown in Fig. 4, with eight dots around the edges on each side. Participants were required to transfer a needle through all the dots as if they were placing an anastomosis without knot tying, resulting in 16 transfers to complete the task. During the study, a high difficulty of the task was noticed for each experience group to complete the task. Therefore, a simplified version of the task was constructed as shown in Fig. 4. This simplified version consisted of four dots around the edge of each balloon to decrease the duration of the task. To take these changes in the task during the study into account, a subanalysis has been performed.

Questionnaire

The questionnaire used in this study was previously used in other validation studies [21, 22]. The questionnaire consisted of two parts, in which the first part contained the informed consent, demographics including age, dexterity, previous simulator experience and the participant’s current laparoscopic experience. The laparoscopic experience was elaborated as current profession, years in surgical/gynaecologic/urologic training and number of basic and advanced laparoscopic procedures performed. Basic laparoscopic procedures were defined as non-suturing procedures, such as cholecystectomy and appendectomy. Advanced laparoscopic procedures were defined as procedures with intracorporeal suturing, such as gastric fundoplication, bariatric surgery or colon surgery. The second part of the questionnaire consisted of questions on the realism (regarding realistic response, grasper use, force feedback, supposed simulated task and tissue behaviour), didactic value (value to train novice surgeons, experienced surgeons and ability to assess the skills of a trainee) and usability (attractiveness, user friendliness and possibility of instruments selection) of the system. These were rated for each separate task, on a five-point-Likert scale. With ‘1’ resulting in strong disagreement, ‘3’ being the neutral opinion and ‘5’ resembling a strong agreement [23]. There also was an option to react ‘No answer’.

Protocol

After completing the first part of the questionnaire, each participant received a short instruction on the functionality of the system and were allowed to practice each exercise once to get familiar with the system. Subsequently, the participants performed the intracorporeal suturing exercise (Task 1, Fig. 2), the upside down needle transfer task (Task 2, Fig. 3) and the anastomose needle transfer (Task 3, Fig. 4). All exercises had a maximum of 20 min to be completed. After completion of the exercises, participants completed the remainder of the questionnaire regarding their opinion on the realism, didactic value and usability of each performed task.

Fig. 2
figure 2

Screenshot of the vertical defect suturing (Task 1)

Fig. 3
figure 3

Screenshot of the up side down needle transfer (Task 2)

Fig. 4
figure 4

Screenshot of the anastomosis needle transfer (Task 3)

Outcomes

The recorded parameters of the simulator consisted of time to complete the task (seconds) and specific parameters based on the tracking of the instruments tips, which were recorded for each instrument separately (left and right). These parameters were path length (centimetres), instrument tip off-screen (percentage), average speed (millimetre per second) and the working area (average distance between instruments in centimetres). Acceleration (millimetre per square second) and motion smoothness (millimetre per second to the third exponentiation) were also presented. These parameters are derivatives from the ‘speed’ parameter.

Statistical analysis

Mean outcomes for face, content and construct validity were analysed for statistical differences between groups using a one-way ANOVA test. Regarding the face and content validity outcome ‘3’ represents ‘neutral’ on the questionnaire therefore, a mean of > 3.5 was considered valid. The analysis was performed using Statistical package for social sciences (SPSS) version 22 (IBM Corp., Armonk, NY). Outcome parameters from the eoSim and the results of the questionnaires were inserted in the database and compared using a one-way ANOVA test with Welch correction for unequal variance. Results with a p-value of < 0.05 considered a statistically significant difference. To specify the origin of a significant p-value in the ANOVA test, a post hoc analysis was performed.

Results

Demographics

As shown in Table 1, a total of 105 participants were included in this study, with 60 participants in the novice group, 31 intermediates and 14 experts. Due to the high demand of skill and available time, not all included participants were able to finish all Tasks or follow the same order. Therefore, the order of Tasks was switched to avoid bias of fatigue or lack of time. In order to prevent possible selection bias, all participants were included for analysis. From the total number of participants, 94 performed Task 1 (50 novices, 31 intermediates and 13 experts), 69 performed Task 2 (39 novices, 21 intermediates and 9 experts) and 45 performed Task 3 (31 novices, 6 intermediates and 8 experts). The novice groups consists of only interns and 10 junior postgraduate year (PGY) residents. The target group consisted of surgical residents (mainly PGY 4–5, n = 21. 68%). The expert groups consists of 13 specialized surgeons and one 6th year resident.

Table 1 Demographics of participants

Face and content validity

Task 1: intracorporeal suturing

The overall mean scores of the realism (3.9), didactic value (4.0) and usability (4.2) show positive opinion scores without any significant differences (Table 2). Analysis of difference in opinion scores between the groups for individual questions did not show any statistical differences. The realism of behaviour of the tissue (Novice 3.4 Target 3.2 Expert 3.0) was scored lowest. The didactic value of this module to train novice laparoscopic surgeons (Novice 4.2 Target 4.2 Expert 4.1) was scored the highest.

Table 2 Mean opinion scores (SD) on realism, didactic value and usability of the three performed tasks on the eoSim

Task 2: upside down needle transfer

The overall mean score of this self-made task on realism (3.6) didactic value (3.4) and usability (3.7) was slightly lower than Task 1 (Table 2). Analysis of the mean individual questions shows tissue realism (Novice 3.1 Target 3.0 Expert 2.0), and realism of the suture running through the tissue (Novice 3.2 Target 3.1 Expert 3.1) was scored the lowest. Realism of the on-screen response of the instruments received high scores (Novice 3.8 Target 3.5 Expert 4.1). Analysis of the difference in opinion scores between the groups for the individual questions, resulted a p-value of < 0.001 regarding the didactic value of the module to train novice laparoscopic surgeons (Novice 3.8 Target 3.3 Expert 2.0). Post hoc analysis shows a statistical difference between Experts versus Novices (p < 0.001) and Experts versus Target group (p = 0.033).

Task 3: anastomosis needle transfer

This task resulted in positive total mean scores on realism (3.7) didactic value (3.7) and usability (4.0 Table 2). The individual question analysis scored lowest means in realism of the tissue (Novice 3.2, Target 2.7, Expert 2.8). The highest scores were again found on the realism of the on-screen response (Novice 4.2, Target 4.0, Expert 4.4). Analysis of the difference in mean opinion scores resulted in a statistical difference on the didactic value of this module to train novice laparoscopic surgeons (Novice 4.1, Target 3.0, Expert 2.5. p = 0.002). Post hoc analysis results in a statistical difference only between Experts versus novices (p = 0.004).

Construct validity

As depicted in Table 3, there are significant differences between the groups in time (p < 0.001), distance (p < 0.001) and acceleration (p = 0.042) for Task 1. Task 2 shows significant differences in time (p < 0.001), distance (p = 0.003), instruments off-screen (p < 0.001) and working area (p < 0.001). Task 3, however, shows no significant differences in the system outcome parameters between the groups, although experts were faster compared to novices (764 versus 1123 s, p = 0.092) and had travelled a shorter distance with their instruments (13.0 cm versus 22.8 cm, p = 0.107). Due to the difficulty to complete the 16 stitches anastomosis task, a subanalysis was performed in which the anastomosis required only eight sutures. This subanalysis shows the difference in the completion of the task between novices and experts for the 16 sutures (38% versus 0%, p = 0.003) and eight sutures (100% versus 35%, p = 0.003). Other relevant factors were the number of completed sutures (13.6 versus 5.9, p < 0.001), time per suture (57.6 versus 100.5, p < 0.001) and instrument distance per suture (1.04 versus 4.29, p < 0.001) between the expert and novice group.

Table 3 Construct validity of the eoSim

To determine between which groups there was a significant difference, a post hoc analysis was performed as shown in Table 4. There was a significant difference between all the groups for ‘time’ and ‘distance’ in Task 1. However, for ‘acceleration’, the significant p-value of 0.042 of the initial analysis could not be proven significant between groups. Regarding Task 2, the ‘time’ to complete the task is significantly different between experts versus novices and novices versus target group (p < 0.001). It was not significantly different between novices and target although, the target group performed Task 2 in less time (519 s versus 897 s, p = 0.885). The parameter ‘distance’ is significantly different for intermediates versus novices (p < 0.001) indicating more instrument movement in the novice group. This could not be shown for the expert versus novice group. However, despite the shorter instrument path of the experts (18.8 m versus 26.6 m, p = 0.267), ‘working area’ and ‘off-screen’ show construct validity for experts versus novice (p-values 0.029 and 0.017, respectively) and experts versus target group (both p < 0.001). Task 3 shows significant differences between experts versus novices and novice versus target group for ‘number of stitches completed’ (p < 0.001 and 0.005, respectively), ‘time per stitch’ (both p < 0.001), ‘distance per stitch’ (p < 0.001 and 0.010, respectively) and ‘completion 16 stitches’ (p < 0.001 and 0.019, respectively). ‘Completion of eight stitches’ only shown significant differences for experts versus novices (p < 0.001).

Table 4 Post hoc ANOVA analysis of significant results

Discussion

With the growing need for accessible and affordable minimal invasive simulators for the training of surgeons, the eoSim laparoscopic simulator is able to fulfil this need. The eoSim laparoscopic simulator offers the instrument tracking capabilities of a virtual reality system, but is combined with the natural haptic feedback of traditional inanimate training boxes. This study demonstrates the fundamental validity of the eoSim regarding the construct, content and face validity. The system is capable of differentiating the between expert and novice users on multiple outcome parameters regarding advanced suturing tasks. Its versatility allows it to be a potential valid training tool to train basic tasks up to complex suturing tasks or even personal adjusted tasks as done in Task 2 in this study to simulate upside-down suturing.

With the mobile case setup only requiring a laptop it is accessible to train in almost every location, by anyone. This could be beneficial for the clinical setting in which surgeons can practice difficult tasks in a safe setting and refresh their skills before starting a complex surgical procedure. Subsequently, this can affect improvement of the quality of sutures and anastomoses in the patient and reduce complications.

Previous validation studies of the eoSim laparoscopic simulator have been performed [8,9,10], but none of those articles were based on the current version of the eoSim or regarding basic laparoscopy. Hennessey et al. concluded a significant construct and concurrent validity of the standard suturing task (Task 1 of this study). However, no face validity was demonstrated. Also, an overall scoring rate from 0 to 100% was used for the outcomes of the simulator, without a calculation of the separate outcomes, making a comparison with the outcomes from this study difficult (17). Retrosi et al. did validate the current version of the eoSim, regarding a similar suturing task, but was adjusted for paediatric surgery (16). They concluded a construct and concurrent validity of the eoSim for the suturing task in a paediatric simulation. However, construct validity for the parameters smoothness, acceleration and speed as demonstrated in that study was not observed in the current study. This could be due to multiple factors, including the difference in population size (28 participants versus 104 participants in the current study), newer software version with improved tracking or a difference in task protocol. We assessed both the face, content and construct validity of the suturing tasks on the eoSim with a larger population including the target group, in a multicentre setting, compared to previous studies.

Outliers in the results of this study regarding face validity are shown in Task 2. The didactic value of the upside-down task to train novice surgeons scored relatively low by the expert group (mean 2.6), which can be due to the high difficulty of the task. The upside down needle transfer was not designed for novice surgeons and therefore it is an expected outcome on this question. Also, the experienced surgeons struggled with this task as well, which could bias their opinion on this task in general. The construct validity has been determined for Task 2, because it has the capability to differentiate between experts and novices on the level of time, off-screen and working area. However, for the anastomosis needle transfer task these results were not reached. This could be due to the lower number of participants, because this task was a time-consuming exercise and demanded a very high level of laparoscopic skill. The parameters ‘time’ and ‘distance’, however, did have a relevant difference between the experts and novices (means 764 s and 13.0 m versus 1123 s and 22.8 m, p = 0.092 and p = 0.107).

To determine the differences between each group, a post hoc analysis was performed which shows significant differences between novices and experts for ‘time’ and ‘distance’ in Task 1, as well as ‘time’, ‘working area’ and ‘off-screen’ in Task 2. Therefore, the construct validity on these parameters has been established. Construct validity could not be established on all separate parameters produced by the simulator. It is, however, important to assess the clinical meaningful parameters for this construct validity. Time, for example, shows how fast a task is completed, but it does not provide any information on how a task has been performed. Regarding the technical performance and safety, the parameters ‘off-screen’, ‘working area’, ‘distance’, ‘acceleration’ and ‘speed’ are more relevant indication of what the participant has done with its instruments movements and in the surgical area where that happened. It is also important to notice that ‘acceleration’ (mm/s2) and ‘smoothness’ (mm/s3) are calculated from the parameters ‘speed’ (mm/s), therefore they should not be taken into account as primary parameters to base the construct validity on. However, it would be meaningful to have a construct validity established on multiple parameters to give a complete assessment and possible proficiency score of the performed task. Thus far these calculated parameters should not be used for assessment purposes.

Limitations

There were some limitations in this study that need to be considered. The tasks that were validated were difficult and complex, which led to participants getting frustrated. This could have a negative influence on the opinion of the eoSim, affecting the face validity. Intracorporeal suturing is a complex task to perform and often replaced by methods like self-locking sutures or staples to reduce operating time and workload [24]. Only a small group of surgeons were adequately trained to meet the criteria of > 20 advanced laparoscopic procedures with intracorporeal suturing and therefore qualified as experts, making it difficult to include a sufficient number of participants in this category. Due to the switch of a simpler anastomosis for Task 3, there could be a bias regarding the parameters of the system, because the participants who performed and completed the eight sutures anastomosis within the time limit would have lower parametric scores and therefore affect the average group score. To account for this possible bias, a subanalysis per suture was performed.

During the execution of the tasks, the participants also could notice the changing focus of the installed webcam which was on either the suture pad or the instruments but could not be manually adjusted to the user’s preference. This could cause frustration to the participants and affect their opinion and performance.

Conclusion

In this study, the eoSim was regarded as a useful tool to train complex suturing tasks by the expert, target and novice participants. Also, the eoSim was capable to determine the difference between expertise levels in terms of ‘time’ and ‘distance travelled by the instruments’, on these complex tasks, demonstrating the construct validity of these parameters. Moreover, the relative low costs, easy transferability of this simulator and option of introducing adapted tasks for the training of a specific type of surgery, make it an attractive training tool for any minimal invasive surgery specialism. With these results, the eoSim is a potential valid tool for the training and assessment of advanced laparoscopic suturing.