Introduction

In general, the preferred surgical approach to hysterectomy is evident [1]. In case vaginal hysterectomy (VH) is not regarded possible or in case of early-stage endometrial cancer, laparoscopic hysterectomy (LH) is associated with clear advantages over abdominal hysterectomy (AH) [15]. In obese patients (BMI 30.0–34.9 kg/m2), a similar approach to hysterectomy is considered to be best practice [6, 7]. However, no conclusive evidence exists regarding the preferred approach in the very obese and morbidly obese patients, i.e. a BMI ≥35 kg/m2 [810]. Only one of the 34 randomized controlled trials (RCT) included in the most recent Cochrane review on the surgical approach to hysterectomy, described patients with a BMI ≥35 kg/m2 [1, 11]. All other studies either excluded these patients from analysis or did not report the BMI.

Some non-randomized studies suggest that, compared to the AH, also this group of patients benefits most from the vaginal approach [1215]. In daily practice, however, the VH frequently seems to be a less favourable approach due to large uterine size, (early stage) malignancy and/or expected intraoperative difficulties regarding exposure [1618]. In more recent studies, LH was proven to be feasible and safe in these patients [2, 10, 19, 20]. Although, compared to the AH, fewer postoperative complications were found, an important point of concern is the report of a relatively high conversion rate and its suggested association with a higher postoperative morbidity [2, 8, 19, 2124]. In contrast to these presumed better outcomes, research on the implementation and the preference of gynaecologists show that that the rate of AH increases as the BMI increases [7, 25, 26].

These dilemmas have almost become daily practice due to rising prevalence of obesity over the past decades; in Europe fluctuating between 6 and 37 % among its countries [27]. In the United States, the prevalence of BMI ≥35 kg/m2 remained relatively stable around 15 % [28]. Due to an increased unopposed oestrogen effect in hormonally responsive tissues, obesity can promote a number of gynaecological diseases, such as abnormal uterine bleeding and endometrial hyperplasia [29]. As a result, a higher prevalence of enlarged uteri and especially a higher incidence of endometrial carcinoma is observed among these patients [2932]. Inherently, the number for which hysterectomy is indicated, is likely to rise over time.

Current practice shows that these controversies in literature cause diffusion in the approach to hysterectomy in these patients. To provide also the raising amount of these patients with optimal counselling and subsequent route of hysterectomy, it is necessary that conclusive evidence on this subject is obtained.

The objective of this study was to evaluate the outcomes of abdominal, laparoscopic and VH in very obese and morbidly obese patients (BMI ≥35 kg/m2) by means of a systematic review with cumulative analysis.

Methods

The PubMed and EMBASE databases were systematically searched for records (last update October 9, 2014). We aimed to identify all studies on AH, LH and VH in patients with a BMI ≥35 kg/m2. A clinical librarian was consulted, who assisted in composing a search string including the terms (and synonyms for) body mass index, obesity, laparoscopy, abdominal, laparotomy, vaginal and hysterectomy (“Appendix 1”). No limitations regarding publication date and language were applied. All titles and subsequently the abstracts of all relevant titles were screened on relevance by two authors individually (MB and ES). Exclusion criteria during the title and abstract screening were: conference abstracts, studies without abstract, non-clinical studies (e.g. review, case report, cadaver study), a mean/median BMI <35 kg/m2 and studies involving extensive combined procedures (e.g. radical hysterectomy in combination with panniculectomy). Articles likely to be relevant were read in full text. Excluded were studies in which the BMI was not specified, the minimum BMI of the range was <35 kg/m2 (or a mean BMI <40 kg/m2 in case the range was not specified), multiple publications based on an overlapping cohort, studies that were not available in full text, and series of radical hysterectomies for cervical carcinoma. If the two independent reviewers did not achieve consensus on the inclusion or exclusion, a third reviewer (FWJ) was consulted.

Study selection

From each study that was included, a predefined set of data was extracted. This consisted of study design, inclusion period (years) and indication (malignant, benign or both). In case of malignancy, it was specified if the hysterectomy was performed with or without lymph node dissection (LND). Per approach (AH, LH and VH), the number of patients and in case of LH, the type of LH [laparoscopic-assisted vaginal hysterectomy (LAVH) or total laparoscopic hysterectomy (TLH; conventional, robotic(-assisted) or both)], along with the patient and procedure characteristics, were extracted. Patient characteristics included age, BMI and uterine weight. Procedure characteristics included operating time (in minutes, skin-to-skin), blood loss (in millilitres), length of hospital stay (in days, from day of procedure), complications and conversion to laparotomy. If possible, postoperative complications were separately labelled as wound problems, dehiscence (abdominal or vaginal cuff) or wound infection. Conversion to laparotomy was defined as an intraoperative switch from a laparoscopic to an open abdominal approach. Strategic conversion (e.g. due to inadequate visibility, adhesions or additional pathology) was distinguished from reactive conversion (i.e. because of a complication) [33].

Assessment of risk of bias

All studies were graded on the level of evidence (according to the Oxford Centre of Evidence-Based Medicine) [34]. From the highest to the lowest level, an adequately sampled (RCT) (level 1b), is followed by a low-quality RCT or observational/prospective cohort study (level 2b), an individual case–control study (3b) and a case series (and poor quality cohort or case–control study) (level 4).

Statistical analysis

A cumulative analysis (i.e. a meta-analysis on all types of comparative studies) was conducted due to the lack of randomized evidence [35, 36]. This analysis was based on the results of all comparative studies that were included in our systematic review and was conducted using Review Manager 5.3 (Cochrane Collaboration, Copenhagen, Denmark). The pooled results of these comparative studies were expressed as risk ratios (RR) with 95 % confidence interval (CI) for dichotomous outcomes and as mean difference (MD) with 95 % CI for continuous outcomes. Regarding the latter, only results that are presented as mean with standard deviation can be included in such an analysis. Since statistical heterogeneity between the studies was expected, random effects models were used. This resulted in de most ‘conservative’ estimation of the intervention effect. Only if two or more studies could be used to estimate the effect of the pooled outcome, this outcome was reported in the Results section. The guidelines for reporting of Meta-analysis Of Observational Studies in Epidemiology (MOOSE) were followed [37].

Hysterectomy in very obese and morbidly obese patients in our centre

All patients with a BMI ≥35.0 kg/m2 who underwent an elective AH, LH or VH at the Leiden University Medical Centre between January 2005 and September 2014 were also included in this study. All laparoscopic procedures were performed by two gynaecologists with extensive experience in advanced laparoscopic surgery (>200 procedures). Patients who underwent radical hysterectomy or a combined procedure (such as prolapse surgery) were excluded. All above-mentioned patient and procedure characteristics were derived by retrospective chart review. Uterine weight was derived from the pathology report. In case an actual weight was missing, the uterine volume was calculated from the pathology report or preoperative ultrasound measurements and transformed to weight by a validated model [38]. Adverse events were registered for type of complication, severity (i.e. requiring re-operation or not) and moment of onset, up to 6 weeks after discharge (i.e. marking the legitimate adverse event reporting period), according to the definitions and regulations as determined by the Guideline Adverse Events of the Dutch Society of Obstetricians and Gynaecologists [39].

The data were analysed using SPSS 20.0 statistical software (Chicago, IL, USA). A Pearson Chi square test was used to compare proportions and a student’s T test was used for continuous variables. To describe non-normally distributed data (kurtosis between −1 and +2) or in case Levene’s test showed no homogeneity of variance, the median and interquartile range (IQR, 25th and 75th percentiles) were used and a Mann–Whitney test was performed. A P < 0.05 was considered statistically significant.

Results

The initial search yielded 3207 articles. After exclusion of conference abstracts (n = 1073), duplicates (n = 540), and irrelevant titles (n = 1052), the abstracts of 542 potentially relevant titles were screened. Based on the predefined exclusion criteria, 439 articles were excluded because no abstract was present (n = 30), the articles represented reviews, case reports, or cadaver studies (n = 104), the reported mean or median BMI of the study population was not ≥35 kg/m2 (n = 296), or the studies involved combined procedures (such as hysterectomy and panniculectomy, n = 9). Of the remaining 103 articles that were subjected to a full-text review, another 81 studies were excluded because the minimum BMI of the range was <35 kg/m2 or—in case the range was not reported—the mean BMI was <40 kg/m2 (n = 44), the BMI was not specified (n = 24), overlap between study populations existed (n = 3), no full text was available (n = 9), or it concerned a study on the outcomes after hysterectomy for cervical carcinoma (n = 1). A total of 22 articles met all inclusion criteria. Figure 1 illustrates the search and exclusion algorithm.

Fig. 1
figure 1

Flowchart of the search and exclusion algorithm. Asterisk i.e. review, case report, cadaver studies. Double dagger e.g. panniculectomy. Yen sign including mean BMI <40 kg/m2 if range not specified

Hysterectomy in very obese and morbidly obese patients in our centre

During the study period, in our centre a total of 27 AHs, 48 LHs, and five VHs were performed in patients with a BMI ≥35 kg/m2. In 22 % of AHs (n = 6) and 42 % of LHs (n = 20) the BMI was ≥40 kg/m2. Due to the low number of VHs, these procedures could not be used for further analysis.

Conversion to laparotomy was required in 12.5 % of LHs (n = 6). Of these, five (83 %) were for strategic considerations. The reactive conversion was performed in a patient with a BMI of 60 kg/m2 because of inadequate visibility during the colpotomy combined with inability to maintain the Trendelenburg position because of hypercapnia.

Patient characteristics between the groups were comparable (Table 1). Compared to AH, LH is associated with less blood loss (mean 204 ± 181 vs. 575 ± 528 mL; P = 0.001) and a shorter length of hospital stay (mean 3.7 ± 1.7 vs. 6.0 ± 1.8 days; P = 0.000). No difference in operating time was detected (mean 138 ± 38 vs. 131 ± 47 min; P = 0.522).

Table 1 Patient characteristics of all AHs and LHs performed in patients with a BMI ≥35 kg/m2 in our hospital from 2005 until 2014

All adverse events are listed in Table 2. In 18.5 % of AHs (n = 5), intraoperative blood loss of >1 L was observed; all other adverse events were noted in the postoperative course. Two adverse events after LH required a re-operation (4.2 %). Compared to LH, the overall complication rate after AH was higher (40.7 vs. 16.7 %; P = 0.029). Among the six LHs that were converted to laparotomy, no complications were observed.

Table 2 Adverse events of all AHs and LHs

Summary of included studies

Including the data of our hysterectomies in patients with a BMI ≥5 kg/m2, these 23 studies resulted in a total of 2232 hysterectomies, of which 1058 were AH (14 studies), 959 LH (18 studies), and 215 VH (3 studies) [8, 14, 15, 1922, 4054]. Of all LHs, 952 were TLH (of which 513 were performed robotically) and 7 were LAVH. The designs of the studies were 2 RCTs, 7 prospective studies, 1 case–control study, and 13 case series or retrospective studies. In 2 studies the level of evidence was graded as 2b, in 1 study as 3b and in the remaining 20 studies as 4.

All extracted data regarding AH, LH, and VH are summarized in Tables 3, 4, 5, 6, 7 and 8, respectively (see “Appendix 2”). The pooled conversion rate was 10.6 % (95 out of 900). We calculated that 82 % of conversions (18 out of 22) could be regarded as strategic. Except for one study [52], the outcomes of all converted cases were included in the LH group (intention-to-treat analysis).

Table 3 Characteristics of the included studies concerning AHs (part 1 of 2)
Table 4 Characteristics of the included studies concerning AHs (part 2 of 2)
Table 5 Characteristics of the included studies concerning LHs (part 1 of 2)
Table 6 Characteristics of the included studies concerning LHs (part 2 of 2)
Table 7 Characteristics of the included studies concerning VHs (part 1 of 2)
Table 8 Characteristics of the included studies concerning VHs (part 2 of 2)

Given the fact that only 2 RCTs were found, we performed a cumulative analysis based on the included studies that were performed in a comparative design (11 out of the 22 included studies) (Tables 3, 4, 5, 6, 7 and 8, “Appendix 2”). Among these, 10 compared AH with LH, 1 compared AH with VH and none compared LH with VH.

AH vs. LH

Compared to LH, AH was associated with a higher overall complication rate (RR 2.28, 95 % CI 1.62–3.20; P = 0.000) (Fig. 2). Intraoperative complications were rare and no difference was observed (RR 1.43, 95 % CI 0.66–3.11; P = 0.36) (Fig. 3). Regarding the postoperative complications, wound problems (RR 3.05, 95 % CI 1.43–6.49; P = 0.004), wound dehiscence (RR 2.58 95 % CI 1.71–3.90; P = 0.000), and wound infection (RR 4.36, 95 % CI 2.79–6.80; P = 0.000) all favoured LH (Figs. 4, 5, 6). No difference in operating time and estimated blood loss between AH and LH was detected (MD −33 min, 95 % CI −72–7; P = 0.10 and MD 135 mL, 95 % CI −30–301; P = 0.11, respectively) (Figs. 7, 8). The length of hospital stay was longer after AH (MD 2.9 days, 95 % CI 2.0–3.7; P = 0.000) (Fig. 9). No separate analysis was performed to compare benign indication and malignancy. All studies included in this cumulative analysis were for malignancy, except for one study and the hysterectomies performed in our centre [21]. Excluding the studies on robotic hysterectomies [19, 21, 42, 52] from these analyses did not cause clinically relevant differences, except for wound dehiscence (RR 2.08, 95 % CI 0.69–6.25; P = 0.19) and operating time (MD −19 min, 95 % CI −28 to −10; P = 0.000) (not shown).

Fig. 2
figure 2

AH vs. LH, overall complication rate

Fig. 3
figure 3

AH vs. LH, intraoperative complication rate

Fig. 4
figure 4

AH vs. LH, wound problem

Fig. 5
figure 5

AH vs. LH, wound dehiscence (including vaginal cuff dehiscence)

Fig. 6
figure 6

AH vs. LH, wound infection

Fig. 7
figure 7

AH vs. LH, operating time

Fig. 8
figure 8

AH vs. LH, estimated blood loss

Fig. 9
figure 9

AH vs. LH, length of hospital stay

AH vs. VH

The results of one study showed more wound problems (18.0 vs. 0.0 %), more wound dehiscence (8.0 vs. 0.0 %) and a longer length of hospital stay after AH (5.3 vs. 2.6 days, Tables 3 4, 5 and 6, “Appendix 2”) [15].

Discussion

Compared to both laparoscopic and VH, the abdominal approach in patients with a BMI ≥35 kg/m2 is associated with more postoperative complications and longer length of hospital stay. The majority of LHs (89 %) were completed laparoscopically. Due to better clinical outcomes, the feasibility of LH and VH should be considered prior to the abdominal approach to hysterectomy in these patients.

Although especially in patients with a BMI ≥35 kg/m2 a restrictive policy to abdominal surgery is warranted, the rate of AH increases as the BMI increases [7, 25, 26]. This is also reflected by the VH rates that remain stable at around 20 %, despite the fact that, in general, the vaginal approach is considered to be the preferred route to hysterectomy [1, 18]. Reasons could be a lack of experience, but also factors such as large uterine size and malignancy [55]. Since obesity is accountable for a higher incidence of both disorders, especially in the very obese and morbidly obese patients the laparoscopic approach could be the best alternative to bypass these contraindications, as confirmed by present study. Nonetheless, during laparoscopic surgery in this group of patients special considerations have to be taken into account and three-dimensional vision systems could make adequate visualisation less difficult [13, 56].

Compared to AH, both the laparoscopic and vaginal approaches are associated with a significantly lower incidence of postoperative complications. This was mainly caused by the lower risk of wound problems, such as infection and dehiscence. However, not only the incidence, but also especially the severity of these complications is a matter of concern. Unfortunately, the identified studies did not provide sufficient data to assess the severity of these complications and also other studies on this subject (mainly regarding wound infections) did show contrasting results [5760].

Another important advantage of the laparoscopic and vaginal approach over AH is the significantly shorter length of hospital stay. Similar to the results from our centre, the cumulative analysis revealed a significant and clinically very relevant difference of approximately 3 days for the disadvantage of AH. Albeit differences in local recovery regimens and healthcare systems make comparison between studies difficult, this conclusion can be regarded valid. Firstly, it is based on differences that were found within multiple studies and secondly, they are also in line with the results of the non-comparative studies (“Appendix 2”).

Literature focusing on the outcomes of hysterectomy in patients with a BMI ≥35 kg/m2 proved to be scant. Instead of a meta-analysis, a cumulative analysis had to be performed on the results from prospective, non-randomized and retrospective studies [35, 36]. Since this introduced heterogeneity in our analysis, we used a random effects model to correct for the differences between studies, thereby providing the most conservative detection of differences between interventions. While these precautions have been taken into account, in our opinion, especially the major differences in complication rate and length of hospital stay cannot solely be explained by the limitations in the design of the included studies. Nonetheless, some precaution in the interpretation of our findings remains necessary. For example, the analyses on operating time, estimated blood loss and length of hospital stay are based on the results of three or four studies. Despite this, the results of these studies were similar to the outcomes of the non-comparative studies that could not be included in the cumulative analysis (“Appendix 2”).

The presumed higher conversion rate is most likely the main reason for the tendency to perform an AH instead of a LH in these patients. Conversion in general, and especially reactive conversion, is associated with more postoperative morbidity and a prolonged hospital stay [6163]. Especially among very obese and morbidly obese patients, it is observed that conversion can result in high postoperative morbidity which has a significant impact on the quality of life, thereby obscuring the cost-effectiveness of LH over AH [8, 22, 64, 65]. The present cumulative analysis revealed a pooled conversion rate of 10.6 % and although no cost-effectiveness analysis could be performed, in our opinion, this percentage is quite comparable to the 6.5 % found in the only study that assessed cost-effectiveness with respect to conversion rate (versus a conversion rate of 32.3 % that was found to be not cost-effective) [8]. This hypothesis is further supported by the fact that the far majority (82 %) were strategic conversions. Although the risk for additional postoperative morbidity is thereby inherently minimized, further research is needed to draw more definite conclusions.

To determine superiority of VH over LH or vice versa with regard to postoperative complications, too little evidence was found. Most likely this is mainly due to the fact that VH is frequently (relatively) contraindicated due to either large uterine size or malignancy [55]. Additionally, LH was originally introduced as an alternative to AH in 1989, but at first was not accepted as an alternative for hysterectomy in very obese patients [66]. Although nowadays with the widespread implementation of LH potentially an adequately powered RCT could provide the answer, it is questionable if conducting such a study is still feasible from a methodological and ethical perspective.

The results of our systematic review with cumulative analysis finally provide sufficient evidence that also with regard to very obese and morbidly obese patients both the LH and VH result in better clinical outcomes, compared to the abdominal approach to hysterectomy. In contrast to VH, LH is considered standard of care in case of early-stage malignancy and it is less challenging to obtain adequate visualisation. Therefore, in current perspectives, LH should become the most frequently performed approach to hysterectomy in the patients with a BMI ≥35 kg/m2. Although a reasonable rate of conversion to laparotomy (10.6 %) was observed, hypothetically, increased experience and clustering of LH in high-volume centres might enable further improvement in the outcomes of this procedure in these patients.