Introduction

Satisfaction after total knee arthroplasty ranges between 75 and 90% [8, 9, 13, 29]. Different contributing factors for dissatisfaction after TKA have been identified like socioeconomic status, mental well-being, fulfilment of expectation, and the postoperative general physical health of the patient [12]. From a surgical point of view, joint line elevation, among other factors, is considered to have a negative effect on postoperative outcome, since it alters the biomechanics of the knee. By changing the centre of rotation of the knee, the isometry of the medial collateral ligament (MCL) is changed, with mid-flexion instability as a result [15, 34]. In addition, as a consequence of the elevated joint line, the posterior condylar offset (PCO) is likely to be reduced, which negatively influences flexion angle and the extensor mechanism strength, and resolves in mid-flexion instability [6, 36, 37].

Mean joint line elevation after primary TKA varies between 1.1 up to 5.6 mm [44, 46]. For revision TKA, this number is even higher, up to a mean of 8 mm [40]. Some studies report a correlation between a raised joint line and patient-reported outcome measures (PROMs) [10, 17, 40], while others do not find this correlation [49]. Therefore, there is still debate if minor joint line elevation affects PROMs after TKA [44], and if so, what amount of joint line elevation is acceptable after TKA.

Therefore, the aim of the present study was (1) to examine the effect of joint line alteration on outcome measures after primary and revision TKA and (2) to investigate whether practical recommendations can be made concerning acceptable joint line alterations after (revision) TKA.

Materials and methods

Identification of studies

A systematic literature search was performed with the help of a clinical librarian. Search terms were: knee arthroplasty, knee replacement, knee surgery, TKA, joint line, outcome, joint instability, knee kinematics, and range of motion. The following databases were searched: PubMed/Medline, the Cochrane Clinical Trial Register, and Embase. The search was limited to English, German, Dutch, and Spanish languages without limitation of publication date. The search was performed August 2017. The reference lists of the included studies were searched for any studies that might potentially meet the inclusion criteria.

Inclusion and exclusion

PRISMA methodology was used for the analysis and reporting of the systematic review [38]. Titles and abstracts from potentially relevant studies were reviewed using a set of predefined inclusion and exclusion criteria. Studies were included when they reported both joint line alterations after TKA surgery and an outcome measurement. Both primary as well as revision TKA studies were included in our study. Studies reporting on (revision) uni-compartmental knee arthroplasty and knee arthroplasty after high tibial osteotomy (HTO) were excluded.

Two reviewers (WvL and KV) independently screened the titles and abstracts of the studies that could meet the inclusion criteria. A list of studies to be reviewed in full text was composed from the input of both reviewers. Finally, these selected full-text studies were judged on inclusion and exclusion criteria and a definite selection of studies was made. Disagreement in the selection process was debated on and, if necessary, resolved in a group discussion with the third author (KK).

Data extraction

The following data were extracted from the included studies: number of TKAs, patient demographics, primary or revision TKA, type of prosthesis [cruciate retaining (CR) and posterior stabilized (PS)], method of joint line assessment, mean joint line alteration, outcome scores, and follow-up time. The methodological quality of the included studies was assessed by assigning levels of evidence as defined by the Oxford Centre for Evidence-Based Medicine [22]. Levels of evidence were assigned by two reviewers (WvL and KV). Disagreement was resolved by consensus. A grade of recommendation was added to the present findings based on the methodological quality of the included studies in the systematic review [20].

Statistical analysis

Statistical analysis was done using IBM SPSS version 24.0. In the analysis, the joint line alteration and TKA outcome scores were weighted by the number of patients. Normal distribution was evaluated by frequency histograms. Degree of correlation [with 95% confidence intervals (CIs)] between mean postoperative joint line alteration and outcome scores was calculated. The preferred test for correlation analysis depends on the distribution of the data; for normally distributed data, Pearson’s correlation is used, for not normally distributed data, Spearman’s rho is used. Significance was set at the 1% (0.01). A small effect size is defined as ρ between 0.10 and 0.30, a medium effect size as ρ between 0.30 and 0.50, and a large effect size is operationally defined as one that yields ρ ≥ 0.50 [14]. An independent-sample t test (confidence interval 95%) was used to compare the outcome of revision TKA groups with the joint line maintained versus elevated.

Registration

The systematic review was registered in the Prospero Database CRD42017057320.

Results

After performing the search, 396 studies were identified. A total of 27 studies met the inclusion criteria and were included (Fig. 1) [1,2,3,4, 7, 10, 11, 17, 19, 21, 23, 25,26,27,28, 30, 32, 35, 39,40,41,42, 44,45,46,47, 49]. These studies were published between 1986 and 2017. A total of 18 studies reported on primary TKA and 9 on revision TKA. The level of evidence for the included studies was level II for 4 studies, level III for 11 studies, and the remaining 12 studies were classified as level IV.

Fig. 1
figure 1

Flowchart systematic review

A great diversity was found across papers in the way joint line deviation was measured (Table 1). Revision TKA joint line alteration is defined as the joint line position postoperative compared to either the preoperative or the healthy contralateral knee joint line position. Studies that reported absolute joint line changes were combined to establish a mean joint line change after (r)TKA (see Table 1 for eligible studies). For 1225 primary TKAs, the mean joint line was raised with 3.0 mm. For 355 revision TKAs, the mean joint line elevation was 3.6 mm.

Table 1 Study inclusion systematic review: study characteristic table

Functional assessments were done with the Oxford Knee Score, Short Form 12 (SF-12), Short Form 36 (SF-36), and Knee Society Score (KSS). The KSS is divided in two separate components (knee and function), and some studies reported these two components separately and some presented the score in total. One study used the Mayo Clinical Score, one the Bristol Knee score, and one used the patellar-score [16] to present their results. All studies reported an improvement in clinical scores after TKA. In most studies, the patients were assigned to subgroups with related cut-off values for joint line alterations (i.e., joint line alterations < 4 or > 4 mm). In the group of studies on primary TKA, in 6 out of 18 studies, statistically significant lower outcome scores were found when the joint line was elevated. In the group of studies on revision TKA, five out of nine studies reported statistically significant better outcome scores for patients with an adequately re-created joint line compared to those with an elevated joint line (i.e., joint line reconstruction within 4 mm of the preoperative or healthy contralateral knee joint line height).

Eight out of eighteen studies on primary TKA could be included in a correlation analysis of the postoperative KSS knee and function scores and postoperative joint line alteration (mm). The remaining ten studies did not report absolute joint line alteration values or used a different outcome measurement. Seven studies (664 patients) compared two surgical techniques, and in one study (32 patients), one treatment arm could be included in the correlation analysis (Table 2). The other arm contained patients with TKA after HTO which was an exclusion criterion. A statistically significant negative correlation was found between joint line elevation and postoperative KSS functional score (ρ = − 0.496, p < 0.001, Fig. 2) and KSS total score (ρ = − 0.425, p < 0.001, Fig. 3). No significant correlation was found between joint line and KSS Knee score (ρ = − 0.052, p = 0.17, Fig. 4). Other PROMs presented in the included studies were SF-36 and OKS. Only the study by Liow et al. reported a statistically significant result. In their subgroup analysis (< 5 or > 5 mm joint line shift), the elevated joint line group had a statistically significant lower physical function in the SF-36 questionnaire (45 versus 67) [32]. No significant differences between joint line position subgroups and OKS outcome were reported. The results of the remaining studies that could not be included in the correlation analysis are presented in Table 3. Regarding range of motion (ROM), a wide variety with regards to the notification was observed (i.e., absolute ROM, delta ROM, and only maximum flexion) which limited further analysis. Two studies reported ROM as their sole outcome measurement and, therefore, are not presented in Table 3 [27, 42].

Table 2 Studies included for data correlation analysis for primary TKA
Fig. 2
figure 2

Correlation analysis of KSS function versus joint line alteration in primary TKA

Fig. 3
figure 3

Correlation analysis of KSS total versus joint line alteration in primary TKA

Fig. 4
figure 4

Correlation analysis of KSS knee versus joint line alteration in primary TKA

Table 3 Studies not included for data correlation analysis for primary TKA

Nine studies reported TKA revision outcome data with joint line measurement (Table 4). Indications for revision were infection, aseptic loosening, instability, polyethylene wear, pain, and/or stiffness or peri-prosthetic fracture. No studies reported TKA revision for joint line elevation as its sole reason. Due to heterogeneity of joint line measurements and outcome data, no correlation analysis could be performed. In four studies, patients with a maintained joint line (i.e., < 4, < 5, or < 8 mm joint line elevation) were compared to patients with an elevated joint line after revision TKA. In this analysis, 187 pooled patients in the maintained joint line group were compared with 96 patients in the elevated joint line group [7, 10, 35, 40]. A statistically significant (p < 0.001) higher postoperative total KSS score was found for the maintained joint line group (149; SD 10) compared to the elevated joint line group (129; SD 5). Of the five studies that could not be included in the pooled analysis, two studies found better postoperative outcome scores for their maintained joint line group [21, 41] and three studies did not find a significant correlation [11, 26, 45].

Table 4 Revision TKA studies

Discussion

The most important finding of the present study was that joint line elevation has a negative effect on outcome after TKA. A statistically significant negative correlation between joint line elevation and the KSS functional and total score after primary TKA was established. For revision TKA, it was found that a correctly re-created joint line results in statistically significant better postoperative KSS total scores, compared to knees in which the joint line was not adequately re-created.

The statistically significant negative correlation between postoperative joint line elevation and KSS functional scores for primary TKAs is in accordance with several studies that could not be included in the correlation analysis [3, 17, 25, 46, 47]. Four studies did not find a correlation between joint line elevation and outcome scores [1, 2, 19, 49]. These four studies reported only minor mean joint line elevation postoperative (0–2.3 mm). Studies with a larger range in joint line alteration and those who chose a higher cut-off value for joint line analysis did find differences in outcome [17, 32, 46, 47]. In contrast, those with lower cut-off values (e.g., 3 mm) did not find any difference in outcome [3, 49].

Five out of nine studies reporting on revision TKAs found statistically significant better outcome scores for maintained compared to elevated joint line groups [7, 10, 21, 40, 41]. In the pooled analysis of four studies, a statistically significant higher postoperative total KSS score was found for the maintained joint line group compared to the elevated group. This finding was in accordance with two studies that could not be included in this pooled analysis [21, 41]. The studies of Kannan et al. and Clement et al. reported no effect of joint line position on outcome; however, the joint line alteration was not specified [11, 26]. Seon et al. compared two-stage revision TKAs for prosthetic joint infection versus one-stage revision for other reasons. Their subgroup analysis for elevated joint line (> 5 mm) versus the maintained joint line group did not show any differences in outcome scores. However, this result is most likely influenced by the distribution of septic patients amongst their groups since the septic group reported statistically significantly lower outcomes [45].

It is questionable how much joint line elevation can be accepted. Several studies in primary as well as revision TKA have shown statistically significantly lower outcome scores when a cut-off value of 4 mm joint line elevation is exceeded [10, 21, 47]. Babazadeh et al. and Yang et al. used respectively 2 and 3 mm as cut-off point, and they did not find any difference between their groups [3, 49]. Therefore, the aim should be to restore the joint line to its native position and not to accept more than 4 mm of joint line elevation.

Some surgical aspects should be taken into consideration with regards to joint line preservation. First, in case of distal femoral bone loss, the distal femoral bone cut should be reduced. Second, in case of a tight extension gap, it is advised to remove all posterior osteophytes before recutting the distal femur since posterior osteophytes tend to tighter up the posterior capsule and, consequently, reduce the extension gap. Furthermore, computer-assisted surgery and patient-specific instruments can be supportive and promising results with regards to joint line reconstruction are being published [1, 19, 31, 32]. In revision TKA, joint line restoration can be more challenging due to bone loss and the absence of landmarks to determine the original joint line height. Undersizing the femoral component should be avoided and distal and posterior bone loss should be accounted for with augments, thereby restoring the joint line position and the PCO [5]. The three-step technique described by Vince et al. can be a valuable tool to achieve these goals [48]. The medial adductor tubercle is identifiable in most cases, and can be used for preoperative planning and perioperative reference [43]. Preoperative planning, e.g., radiographs of the contralateral knee, can help the surgeon to determine the exact native joint line distance to this landmark before surgery [5].

A major limitation of the current study was the heterogeneity of the data. For both joint line assessment as well as outcome measures, a variety of measurements was reported. This impeded a correlation analysis of the TKA revision studies and of the primary TKA studies only 8 out of 18 primary TKA studies could be included. Furthermore, joint line assessment was performed with a variety of methods which could have influenced the results. However, all reported methods have acceptable inter and intra observer reliability [4, 24, 41], and only absolute changes were used in the analysis, which, in theory, should be interchangeable between the methods. Most of the included studies used the KSS to evaluated patient outcome. It is questionable, however, if this questionnaire is a valid tool to identify symptoms related to joint line alterations (i.e., mid-flexion instability; loss of flexion due to loss of PCO). Functional tests like a 6 min walk test and the stair climbing test have shown to correlate well with mid-flexion instability and might give better insight in functional outcome [18].

Although it was shown by Luyckx et al. that the use of ratio measurements (i.e., medial epicondyle ratio and adductor ratio) is more reliable and reproducible for joint line assessment and reconstruction in revision TKA [33], most commonly the tibial tubercle or the fibular head landmarks are used for evaluating joint line position. These measurements, however, are affected by tibial slope change, and if the joint line is only assessed from the tibial side, femoral sided joint line alterations are unrecognized. Therefore, the ideal joint line assessment would include a separate tibial and femoral referenced measurement. Since only one primary TKA [3] and one revision TKA study [7] reported separate data for the femoral and tibial sided joint line alterations, a separate tibial and femoral analysis could not be performed. Interestingly, both studies illustrated discrepancies between tibial and femoral based joint line measurements. However, correlations of these measurements with outcome scores showed conflicting results. Due to the relatively low level of evidence of the included studies, a weak recommendation can be assigned to the suggested cut-off value of 4 mm maximal acceptable joint line elevation.

Conclusion

In this systematic review, a negative correlation between joint line elevation and outcome was found, with an elevation of more than 4 mm resulting in statistically significant lower outcome scores. Hence, it is advised not to exceed 4 mm of joint line elevation in primary TKA. For revision TKAs, the aim should be to restore the joint line to its native height.