Introduction

Osteoarthritis (OA) is the most common joint disorder in the USA, estimated to affect over 30 million adults, and the prevalence is expected to increase for the foreseeable future [1, 2]. Joint pain is a common complaint among individuals with OA and a primary reason for them to seek medical care [2, 3]. As the disease is chronic and nonfatal, identifying effective measures to not only prevent but also treat it will have significant impacts at both the clinical and socioeconomic levels [1, 4]. Osteoarthritis may occur in any joint, but knee OA is one of the most common manifestations of this disease and, as the knee takes on a substantial amount of the weight-bearing load, can lead to significant disability if left untreated [1, 4].

While surgical intervention is typically reserved for the most severe OA, more conservative therapies are initiated earlier in the disease process in effort to alleviate symptoms and delay progression. Treatment with intra-articular hyaluronic acid (IA-HA) injections is one of these options, although there is inconsistency in its recommendations across knee OA guidelines [4,5,6,7]. HA is a naturally occurring substance present in synovial fluid and the quality and quantity of HA are reduced in arthritic knees [3, 8]. IA-HA injections are typically indicated for patients who are non-responders to nonpharmacological and pharmacological therapies or experience adverse effects from these treatments [3, 8, 9]. In addition to restoring the viscoelasticity of the synovial fluid, IA-HA may also have anti-inflammatory and antinociceptive properties, and stimulate in vivo high molecular weight (HMW) HA synthesis [8]. Both basic science studies and clinical trials have demonstrated the potential benefit of HA injections [4]. While a number of different HA products are available in various injection regimens, literature has demonstrated there are also significant differences in molecular and rheological properties [4, 8, 10, 11].

Radiographs play an important role in the diagnosis of OA [1, 2]. In clinical trials investigating knee OA, disease severity is most commonly assessed by the Kellgren–Lawrence (KL) criteria [1, 2]. This system grades OA into five categories of severity, from 0 to 4, with lower grades representing greater joint space and less disease severity [12]. Most IA-HA studies have focused on its use in patients with early to moderate knee OA (K–L grade ≤ 3), suggesting that HA injections should only be considered earlier in the disease process and that studies that have included patients with end-stage disease (K–L grade 4) have contributed to the current controversy surrounding their use [4, 13]. A number of prior studies have demonstrated a greater response to HA therapy when baseline radiographic presentation was less severe [14,15,16].

The purpose of this review was to determine if the inclusion of end-stage OA (KL grade 4) participants reduced the measured effectiveness of IA-HA in randomized controlled trials measuring its effect on knee OA pain.

Methods

Literature Search

A comprehensive literature search for relevant articles was conducted using a detailed search of the MEDLINE, EMBASE, and PubMed databases (Supplementary Material). The inclusion criteria were (1) blinded randomized controlled trial (RCT) comparing IA-HA with intra-articular saline injections; (2) knee pain or treatment-related adverse events (AEs) were a reported outcome; (3) described disease severity using the K–L grade criteria; and (4) articles that were published in English. After the list of eligible studies was finalized, a manual search of relevant reference lists was conducted to ensure that no potentially eligible trials were missed. This article is based on previously conducted studies and does not contain any studies with human participants or animals performed by any of the authors.

Search Results

The literature searched identified 2198 citations (Fig. 1). After titles and abstracts were screened, 166 were included for full-text review. Nineteen studies met the predefined inclusion criteria [14, 17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34] and an additional study was hand-selected [35], for a total of 20 trials; however, one of these studies was only included in the analysis of treatment-related adverse events [29]. The hand-selected study was retrieved from the reference list of previously published articles regarding IA-HA use for knee osteoarthritis.

Fig. 1
figure 1

Flow diagram of search results

Data Extraction and Outcome Measures

Data extraction consisted of study characteristics, patient demographics, and reported outcome and safety measures. The primary outcome measure was the mean change in knee pain score from baseline at two separate visit windows: (1) 4–13 weeks (earlier visit) and (2) 22–27 weeks (later visit). The Western Ontario and McMaster Universities Arthritis Index (WOMAC) pain scores were extracted whenever reported. If the WOMAC pain scores were not reported, an a priori hierarchy of outcomes was used to extract the next most relevant pain measure. The hierarchy used was taken from a previous meta-analysis, and was as follows: WOMAC pain, visual analog scale (VAS) pain on activity/walking, VAS pain on weight-bearing, VAS pain at rest, or other pain outcomes (Knee Injury and Osteoarthritis Outcome Score (KOOS), Musculoskeletal Outcomes Data Evaluation and Management System (MODEMS), Index of Severity for Osteoarthritis for the Knee (ISK) assessment, and WOMAC total score [11]. If data for these outcomes were not reported in a given study, it was not included in the primary outcome. Safety data was also extracted when possible on the total number of participants experiencing a treatment-related AE. If data for these outcomes were not reported in a given study, it was not included in the safety analysis. Data from the intent-to-treat population was used whenever possible. Data extraction was completed in duplicate by two independent reviewers.

Data Analysis

Standardized mean differences (SMD) and relative risks (RRs) were analyzed using the Cochrane Review Manager 5.3 software [36]. For continuous outcomes, a negative SMD represented a result favoring IA-HA, while a positive effect estimate represented a result favoring IA-saline. Missing standard deviations were estimated on the basis of the methods recommended in the Cochrane Handbook for Systematic Reviews of Interventions [37]. For binary outcomes, an RR less than 1 favored IA-HA, while a value greater than 1 favored IA-saline. Effect size estimates were analyzed using a generic inverse variance statistical method and a random-effects analysis model with a 95% confidence interval (CI). The number of participants who experienced a treatment-related AE was analyzed under a dichotomous outcome assessment using Mantel–Haenzel statistical method and a random-effects analysis model with a 95% CI. Heterogeneity between the included trials was measured using the I2 statistic.

Studies were stratified into two groups: (1) studies with early to moderate (early-moderate) knee OA participants (i.e., K–L grade ≤ 3 only), and (2) studies with early to late (late) knee OA patients (i.e., the authors enrolled patients with K–L grade 1–4 knee OA) based on the trial’s demographic characteristics. If a trial had less than 5% of the patients with a baseline K–L grade of 4, the trial was included in the early-moderate OA subgroup.

Sensitivity Analysis

A sensitivity analysis was conducted to determine if single-blinded studies and the non-blinded study had a significant impact on the total treatment effect of IA-HA on knee pain vs IA-saline at 13-week and 26-week follow-up periods. To accomplish this, single-blinded studies were removed from analyses to determine if they had a significant impact on treatment efficacy. Additionally, we conducted a sensitivity analysis to determine if the hand-selected study had a significant impact on the total treatment effect of IA-HA on knee pain vs IA-saline. To accomplish this, the hand-selected study was removed from analyses to determine if it had a significant impact on treatment efficacy.

Results

Study Characteristics and Demographics

The sample sizes of the included trials ranged from 12 to 588 patients (Table 1). Sixteen studies (80.0%) were double-blinded RCTs and four (20.0%) were single-blinded. The IA-HA formulations used across these trials were Adant, Durolane, Euflexxa, Fermathron Plus, Gel-One, Hyalgan, Orthovisc, Monovisc, and Synvisc; the specific brand was not reported in two studies.

Table 1 Study characteristics of the included trials

The average ages and BMIs ranged from 53 to 73 years and 25 to 33 kg/m [2], respectively, (Table 2). Of 19 studies, 16 were included in the early-moderate OA subgroup and the other three were included in the late OA subgroup. The other study, Henderson et al. presented their results for patients with K–L grade 2 and those with K–L grade 3–4; therefore this study was represented in both subgroups as Henderson 1994a (early-moderate OA subgroup) and Henderson 1994b (late OA subgroup) [25].

Table 2 Demographic characteristics of the included trials

Pain at 4–13 Weeks (Earlier Visit)

In the early-moderate OA subgroup (16 trials; n = 1578 for IA-HA, n = 1341 for saline), the mean change in pain scores from baseline within this visit window was statistically significant in favor of IA-HA [SMD = − 0.30, 95% CI − 0.44 to − 0.15, p < 0.0001; I2  = 68%] (Fig. 2). In the late OA subgroup (4 trials; n = 288 for IAHA, n = 278 for saline), there was no significant effect between IA-HA and saline [SMD = 0.28, 95% CI − 0.12 to 0.68, p = 0.17; I2 = 72%].

Fig. 2
figure 2

Pain at 4–13 weeks

The test for subgroup differences (early-moderate OA versus late OA) was statistically significant (p = 0.008). The funnel plot revealed no evidence of publication bias (Fig. 3).

Fig. 3
figure 3

Funnel plot for pain at 4–13 weeks

Pain at 22–27 Weeks (Later Visit)

In the early-moderate OA subgroup (9 trials; n = 1058 for IA-HA, n = 981 for saline), the mean change in pain scores from baseline within this visit window was statistically significant in favor of IA-HA [SMD = − 0.27, 95% CI − 0.39 to − 0.16, p < 0.00001; I2  = 38%] (Fig. 4). In the late OA subgroup (2 trials; n = 254 for IAHA, n = 254 for saline), there was no significant effect between IA-HA and saline [SMD = 0.03, 95% CI − 0.14 to 0.21, p = 0.72; I2 = 3%].

Fig. 4
figure 4

Pain at 22–27 weeks

The test for subgroup differences (early-moderate OA versus late OA) was statistically significant (p = 0.005). The funnel plot revealed no evidence of publication bias (Fig. 5).

Fig. 5
figure 5

Funnel plot for pain at 22–27 weeks

Treatment-Related AEs

In the early-moderate OA subgroup (9 trials; n = 1304 for IA-HA, n = 1104 for saline), there was no significant difference in the risk of treatment-related AEs between IA-HA and saline injections [RR = 1.03, 95% CI 0.89–1.20, p = 0.68; I2 = 36%] (Fig. 6). In the late OA subgroup (3 trials; n = 230 for IAHA, n = 232 for saline), IA-HA was associated with a significantly greater risk of treatment-related AEs relative to saline [RR = 1.76, 95% CI 1.16–2.67, p = 0.008; I2 = 0%].

Fig. 6
figure 6

Treatment-related AE

The test for subgroup differences (early-moderate OA versus late OA) was statistically significant (p = 0.02).

Sensitivity Analysis

Three single-blinded studies and one non-blinded were removed in the sensitivity analysis [23, 24, 29, 34]. The pooled effect size remained statistically significant with little change in total effect size at 13-week (SMD = − 0.16 [− 0.32, − 0.01], P = 0.004) and 26-week (SMD = − 0.19 [− 0.30, − 0.07], P = 0.001) follow-up periods.

At 13-week and 26-week follow-up periods, the pooled effect size remained statistically significant with little change in total effect size when the hand-selected study was removed from the analysis. Similar results were observed when single-blinded studies were removed from analyses; total treatment efficacy and subgroup differences remained statistically significant at 13 weeks and 26 weeks.

Discussion

The results of this meta-analysis suggest that IA-HA therapy is most efficacious in reducing pain in patients with early-moderate knee OA, but not in the late OA subgroup. This analysis also revealed that significant pain relief with HA injections can occur within 4–13 weeks (earlier visit) post-injection and remain beneficial up to approximately 6 months (later visit); the effect estimates were similar between these two time points (SMD = − 0.30, 95% CI − 0.44 to − 0.15 at the earlier visit and SMD = − 0.27, 95% CI − 0.39 to − 0.16 at the later visit). This suggests that the effect seen with HA plateaus and is maintained up to 6 months post-injection.

In 2015, Strand et al. published a meta-analysis evaluating the effects of IA-HA within subcategories of disease severity [16]. Similar to the current study, Strand et al. found that the effect size for pain between IA-HA and saline injections in the early-moderate OA subgroup (K–L grade ≤ 3) was statistically significant (SMD = − 0.35, 95% CI − 0.57 to − 0.14), but this outcome was not significant in the subgroup that also included K–L grade 4 patients (SMD = − 0.11, 95% CI − 0.46 to 0.24). Other SMDs reported in previous meta-analyses, without considering OA grade, have ranged from − 0.19 to − 0.43 [10, 16, 38,39,40,41,42]. The estimates that were calculated in the current analysis for the early-moderate OA subgroup (− 0.30 at 4–13 weeks and − 0.27 at 22–27 weeks) were within this range, but those for the late OA subgroup (0.28 at 4–13 weeks and 0.03 at 22–27 weeks) were not. Such an observation may be explained by the fact that most IA-HA studies have only included patients with early to moderate knee OA [4]. The current study provides further quantitative evidence that including patients with K–L grade 4 knee OA dilutes the benefit of HA injections when averaging treatment effects within a sample. These results also demonstrated the potential for HA to provide greater benefit to patients if it is provided earlier in the course of their disease, as opposed to being provided as a later treatment option once disease severity has already progressed to late stages. These results also demonstrate that the late OA subgroup experienced significantly more treatment-related AEs, suggesting that patients with KL grade 4 may be more susceptible to adverse events. These results may have been influenced by the inclusion of avian-derived HA and non-animal stabilized HA formulations, as well as the use of low molecular HA. Further insight into the product differences of IA-HA in KL grade 4 participants is required.

Several knee OA guidelines are available to practicing physicians; however, there is clearly inconsistency in the recommendations for IA-HA injections [4, 5, 43]. What many of these guidelines fail to address is how the efficacy of IA-HA may vary by select patient characteristics. The results of this systematic review suggest that future clinical practice guidelines base their treatment recommendations on the individual patient’s disease state and focus on the potential for greater benefit when IA-HA is provided in earlier stage knee OA.

A limitation of the current study is the heterogeneity within subgroups, suggesting that there may be other factors contributing to the difference in results between the subgroups. It has been previously established that IA-HA trials are variable in terms of patient eligibility criteria, HA molecular characteristics, injection schedules, and outcomes assessment [3], which must also be further investigated. Additionally, there is a small proportion of included patients who were in advanced stages of their knee OA. Although this proportion is small, the results of this study still provide valuable information regarding the potential effects seen in these late stage patients. Studies included in this analysis varied in terms of blinding, HA product (with different injection regimens and molecular weights), and used either the VAS for pain or WOMAC pain subscale, though there is even variability in the subscale for WOMAC pain (e.g., the 0–100 mm, 0–20 or 5–25 Likert, or 0–500 scale). The K–L grade is the most widely used OA severity scale; however, other measures exist, with their own definitions and scoring systems [1], and studies that reported relevant outcomes were excluded from this analysis for this reason. There is also controversy as to whether or not radiographic criteria always correlate with clinical symptoms [1]. It cannot be said, definitely, that patients with more severe OA based on radiographs are also more symptomatic at baseline. Non-English trials were excluded, which may limit the generalizability of these findings and confidence in the effect estimates. The included studies were predominantly representative of sites in North America and Europe, though two trials were conducted in Asia [26, 33] and demonstrated comparable treatment effects for HA. Lastly, the number of studied including participants with KL grade 4 knee OA was limited, suggesting that this cohort may not be actively recruited in IA-HA clinical trials.

A strength of this analysis was that only data from randomized, saline-controlled trials were included, ensuring that pooled effects estimates were from evidence of high-quality studies with a common comparator. There was also a large number patients in the analysis (n = 3485 in the assessment of pain at the earlier visit window), though a substantial proportion of this sample represented patients in the early-moderate OA subgroup. The outcome scales used to assess pain (i.e., the VAS and WOMAC) are validated measures and commonly evaluated in the knee OA literature. The funnel plots also revealed that publication bias is unlikely.

Conclusions

Treatment with IA-HA provides statistically significant pain relief compared to saline injections for patients with early-moderate knee OA, with no increase in the risk of treatment-related adverse effects, up to 6 months post-injection. IA-HA demonstrated no benefit over controls in the late OA subgroup and was associated with significantly greater treatment-related AEs. In this regard, some of the prior studies that demonstrated no significant benefit with HA injections may have confounded their results with the inclusion of a considerable proportion of patients with end-stage knee disease. Future investigations on the topic should take caution in completely rejecting the potential benefit of IA-HA when it may indeed be efficacious for a subset of the knee OA population.