Introduction

Modular components in primary THA are attractive because they provide the surgeon with options to correct leg length, improve implant stability, and decrease implant inventory [8]. Unfortunately, with modularity comes the potential for fretting and corrosion at the head-neck modular junction. The mechanism for mechanically assisted crevice corrosion has been studied since the 1980s with few reports of clinical complications with modular junctions until 2010 [13]. Recently, high failure rates of metal-on-metal (MoM) THAs were attributed to adverse local tissue reactions (ALTRs) from exposure to corrosion debris, and surgeons have returned to metal-on-polyethylene (MoP) bearings to avoid ALTR. This has refueled interest in the design, assembly, and performance of these junctions.

Fretting and corrosion at the modular junction may be greater with larger heads. Femoral head size has been associated with the severity of fretting and corrosion at the junction with ultralarge (> 36 mm) MoM THAs, which were used for the advantages of greater ROM and decreased dislocation risk [1, 3, 14, 20, 26, 29]. Larger heads produce increased rotational and bending moments at the junction and can increase micromotion between the junction contact surfaces [20]. This phenomenon was particularly evident with MoM THAs, where simulator data indicated that lack of lubrication between bearing surfaces caused increased friction moments at the head-neck junction [2].

Most surgeons have returned to MoP bearings because of the high rate of early failure, the increased cost, and lack of efficacy of MoM THAs [4, 5, 25] and because of the improved wear resistance of highly crosslinked polyethylene [17]. However, despite the risks of increased polyethylene wear and increased corrosion at the taper junction with larger heads, some surgeons are reluctant to return to smaller heads because of concerns about dislocation. Little is known about the association of head size on fretting and corrosion in modular junctions of MoP THAs. We therefore sought to determine: (1) Is femoral head size associated with increased fretting or corrosion at the head-neck junction in MoP total hips? (2) Is duration of implantation associated with increased fretting or corrosion?

Patients and Methods

Between January 1, 2007, and December 31, 2013, our implant retrieval program collected 3282 total hip devices, including 359 with MoM bearings, 107 with ceramic-on-ceramic bearings, and 2816 MoP bearings. For the current study, we included MoP THA where both the femoral head and stem had been revised simultaneously and, therefore, were available for examination. We excluded 145 implants with the stems that had a dual modular neck, because these implants were implicated in an increased incidence of modularity-related adverse events [16], and 50 implants with titanium alloy heads, because these heads experienced high wear and are no longer in use. That left 154 MoP THAs (from 145 patients) retrieved after a mean length of implantation of 6.0 ± 6.1 years (range, 0–27 years). There were 21 implants retrieved within 6 months, 39 retrieved between 0.5 and 2 years, 29 between 3 and 5 years, 31 between 5 and 10 years, 24 retrieved between 10 and 20 years, and six retrieved after more than 20 years in vivo.

Eighty-five implants were retrieved from 75 men, and 74 implants were retrieved from 70 women (Table 1). The mean age at time of implantation was 62.0 (SD 13.8), and the mean body mass index was 27.9 kg/m2 (SD 6.0). Reasons for revision included infection in 67 THAs (43.5%), aseptic loosening in 64 (41.6%), instability in eight (5.2%), periprosthetic fracture in seven (4.5%), dislocation in five (3.2%), leg length discrepancy in two (1.3%), and implant malpositioning in one (0.7%).

Table 1 The characteristics of the patients from whom the 154 implants had been retrieved

All heads had been fabricated from cobalt-chromium (CoCr) alloy. Heads were grouped by diameter (Table 2): 17% were ≤ 26 mm (n = 26; 20 22-mm and six 26-mm heads), 30% were 28 mm (n = 46), 34% were 32 mm (n = 52), and 20% ≥ 36 mm (n = 30; 22 36-mm heads, one 38-mm head, six 40-mm heads, and one 44-mm head). Implants were grouped into same-alloy (CoCr head with CoCr stem, n = 75) and mixed-alloy combinations (CoCr head with Ti-6Al-4V or Ti-12Mo-6Zr-2Fe stems, n = 79). The taper/trunnion designs included 12/14 (58 implants [38%]), C-tapers (30 [19%]), V40 (15 [10%]), 11/13 (15 [10%]), Type I (eight [5%]), and PCA tapers (six [4%]) (Fig. 1). The method of fixation was cemented for 65 stems and cementless for 89 stems.

Table 2 The characteristics of the 154 retrieved implants
Fig. 1
figure 1

The 154 retrieved components included stems with six different taper designs. The taper dimensions were measured on a subset of the retrievals to determine the proximal and distal diameters, the contact length, and the taper height and angle. Please note that measurements were performed on a subset of the 154 retrieved implants.

Before examination, retrieved components were disinfected in a 10% bleach solution for 20 minutes, washed with a mild detergent and tap water, rinsed in methanol, and allowed to air-dry overnight. The head taper and stem trunnion surfaces were examined by two independent observers (GKT, MEE) using an optical stereomicroscope (Wild Type 376788, Heerbruug, Switzerland) at magnifications from × 6 to × 12. We used “trunnion” to refer to the male component on the stem and “taper” to describe the female component on the femoral head [26]. The head taper was divided into proximal and distal regions. The stem trunnion was divided into eight quadrants with proximal and distal regions each subdivided into posterior, anterior, superior, and inferior regions (Fig. 2).

Fig. 2
figure 2

The head taper (green) and the stem trunnion (purple) were divided into regions, and each region was examined for fretting and corrosion.

Fretting damage included small scars perpendicular to and interrupting machining lines and irregularly shaped polished areas where machining lines had been worn away [19]. Fretting damage is as significant as corrosion because fretting leads to continuous repassivation of the oxide layer with ultimate consumption of available oxygen in the crevice, alteration of pH, and subsequent initiation of the corrosion process. Discoloration, pits, etching scars, and black debris were attributed to corrosion. Damage from tools used to remove components at surgery was ignored. Each region was graded for fretting and corrosion using the scoring system developed by Goldberg et al. [19] with a minimum score of 1 and a maximum score of 4 (Fig. 3). Two regions were scored for the taper and eight for the trunnion, so the maximum total score for fretting or corrosion was 8 (2 × 4) for the taper and 32 (8 × 4) for the trunnion. When scoring differences arose between the two investigators, a joint examination was conducted to reach consensus; the differences in scores between the two observers never exceeded one grade of the Goldberg scoring scale, and this occurred in 17% of the examined implants (26 of 154).

Fig. 3A–B
figure 3

Microscopic images of the head taper and the stem trunnion are shown with sequentially higher scores for fretting (A) and corrosion (B). The left column of each picture represents the scores of the scale [19] with 1 corresponding to the absence of any damage and 4 corresponding to excessive fretting/corrosion.

The numbers and proportions of head tapers and stem trunnions with fretting and corrosion (Goldberg scores ≥ 2) were calculated as were the numbers and proportions falling within each Goldberg class. Univariate analyses were conducted between the revision diagnosis and the related fretting and corrosion scores for the stem trunnions and head tapers. Taper fretting and corrosion scores were not related to the revision diagnosis (p = 0.832 and 0.616, respectively). Similarly, trunnion fretting and corrosion were not related to revision diagnosis (p = 0.104 and 0.587, respectively). Therefore, revision diagnosis was not included in examining associations with the other, nonclinical factors on fretting and corrosion. Repeated-measures generalized estimating equations (GEEs) were used to correlate head size, alloy combination, length of implantation, and location within the taper or trunnion (proximal/distal for tapers, proximal/distal and quadrant for trunnions) to the magnitude of head and stem fretting and corrosion. Separate models were constructed for head fretting, head corrosion, stem fretting, and stem corrosion. All variables and their two-way interactions were initially included; nonsignificant terms were subsequently removed in order of decreasing p value starting with interaction terms. Post hoc tests were adjusted for multiple comparisons with the Tukey-Kramer method. The number of implants obtained for this study provided 88% power to detect differences of 0.5 in fretting or corrosion scores in these analyses. A second set of analyses was performed including taper/trunnion design into the multivariate repeated measures GEEs; however, the number of implants provided only adequate power (80%) to detect a difference of 0.8 between taper groups. As a result of the limited numbers of some taper designs within some of the head size categories, the interaction between taper design and head size was not included in the statistical models. All statistical tests were performed with SAS 9.3 (Cary, NC, USA) with a level of significance of α = 0.05. Descriptive statistics are displayed as means (and 95% confidence intervals) for continuous variables and frequencies and percentages for categorical variables. Results presented are significant unless stated otherwise.

Results

Relationship of Femoral Head Size to Fretting and Corrosion

We found no association between head size and the degree of fretting and corrosion on the tapers or trunnions. For the GEE models, the significance values for fretting of the taper and trunnion and corrosion of the taper and trunnion were p = 0.247, p = 0.471, p = 0.837, and p = 0.868, respectively (Tables 3, 4). Fretting and corrosion were common findings. Taper fretting was observed in 73% of the 154 heads with 23% having moderate or severe (≥ 3) average fretting scores. Taper corrosion was noted in 93% of the 154 heads with 34% having moderate or severe (≥ 3) average corrosion scores. Trunnion fretting was observed in 86% of the 154 stems with 10% of the trunnions having moderate or severe (≥ 3) average fretting scores. Trunnion corrosion was noted in 72% of the stems with 14% having moderate to severe (≥ 3) average corrosion scores.

Table 3 Probability values from multivariable GEEs examining the relationship between damage and head size, taper, length of implantation, material combination, and location
Table 4 Taper and trunnion fretting and corrosion scores according to head size

Relationship Between Length of Implantation and Fretting and Corrosion

Length of implantation was associated with taper fretting (p < 0.001) and trunnion fretting (p = 0.044), but was not associated with corrosion (Table 3). For every 5 years of implantation, the taper fretting score increased by 0.2 points (0.1–0.4), and the trunnion fretting score increased by 0.1 points (0.0–0.2).

Other Factors Associated With Fretting and Corrosion

Taper design was associated with taper fretting (p = 0.005), taper corrosion (p = 0.031), and trunnion fretting (p = 0.028) (Table 3). PCA tapers had 0.8 (0.1–1.5) points less fretting than 12/14 (p = 0.010) and 1.1 (0.1–2.0) points less than V40 (p = 0.022) designs (Fig. 4). PCA trunnions had 0.8 (0.0–1.5) points less fretting than V40 trunnions (p = 0.018) and had 1.0 (0.0–1.9) points less fretting than Type I trunnions (p = 0.034). Additionally, 12/14 trunnions had 0.7 (0.2–1.1) points less fretting than V40 trunnions (p = 0.002) and 0.8 (0.0–1.6) points less fretting than Type I trunnions (p = 0.043). When looking at corrosion, PCA tapers had 1.2 (0.3–2.2) points less corrosion than 11/13 tapers (p = 0.004), 1.1 (0.2–2.1) points less than C-tapers (p = 0.009), and 1.1 (0.0–2.1) points less than V40 tapers (p = 0.040).

Fig. 4
figure 4

The graph depicts the average fretting and corrosion scores (with SDs) for both the taper and the trunnion. Taper fretting was significantly less for PCA tapers compared with 12/14 and V40 tapers. Moreover, PCA tapers had significantly less taper corrosion in comparison to C-tapers, V40 tapers, and 11/13 tapers. Finally, both PCA and 12/14 trunnions had significantly less fretting when compared with V40 or Type I trunnions.

Trunnion fretting was also associated with alloy combination. When different alloys were combined, trunnion fretting was greater than when the same alloy was used (1.0 [0.6–1.5] points greater, p < 0.001). Interestingly, comparing different alloy treatments (eg, forging versus casting) did not add any predictive value for fretting beyond whether the same or different alloys were used alone.

Trunnion fretting and corrosion were associated with location (Table 3). The stem trunnion fretting scores of the four quadrants differed between the proximal and distal ends of the trunnion. On the proximal end, the posterior quadrant had more fretting than the superior quadrant by 0.2 (0.1–0.4) points (p = 0.001). On the distal end, fretting scores did not differ among quadrants. Trunnion corrosion was greater on the distal compared with the proximal halves of the trunnions with the distal end having 0.2 (0.0–0.3) points more corrosion (p = 0.043). Trunnion corrosion did not differ among quadrants, except in 36-mm heads. In 36-mm heads, the superior quadrant of the trunnion had more corrosion than the posterior quadrant (0.3 [0.0–0.6] points greater, p = 0.036). With the numbers available, associations between increased trunnion corrosion and increased length of implantation (p = 0.075) or between corrosion and trunnion design (p = 0.097) did not reach significance.

Discussion

Fretting and corrosion are regular occurrences in modular connections in THA. Although the problem gained attention in large head MoM implants, in our study of MoP THAs, 93% of 154 femoral heads showed evidence of fretting and corrosion damage. The concern of corrosion and its relation to ALTR in large head sizes led us to question whether the MoM THA findings apply to MoP implants. A recent report [23] described corrosion debris-related ALTR in a small series of patients with MoP THAs, yet we know little about how modular junctions in MoP THAs perform in vivo. In our large collection of retrieved MoP THAs, we found that neither fretting nor corrosion of head tapers or stem trunnions was related to head size. Conversely, the design of the modular junction, the length of implantation, a mixed-alloy combination, and the location along the modular junction were all associated with the severity of fretting and corrosion.

None of the patients in this study were revised for ALTR, so whether a patient is predisposed to an ALTR cannot be determined from our data. Interestingly, revision diagnosis did not affect the fretting and corrosion observed in this large cohort of retrieved THA femoral components, despite the fact that the majority of cases (Table 1) were revised for infection (not necessarily associated with mechanical factors) and aseptic loosening (sometimes associated with an increased mechanical burden in the THA).

Our study had limitations. First, like with all retrieval studies, ours is a retrospective study of a select group of retrieved implants, which might not reflect well-functioning or nonrevised implants. Nonetheless, retrieval analysis is an accepted method for evaluating in vivo implant performance. Second, very small and very large heads were infrequent in our cohort, so we included them in ≤ 26-mm and ≥ 36-mm groups. This approach provided more homogenous group sizes for comparison. However, small but clinically relevant differences might have been missed as a result of insufficient power. Additionally, as a result of limited numbers of some taper designs within some of the head size categories, we were not able to test whether the head size differed among the different taper designs. We also used a subjective scoring system, although one that has been extensively used in evaluating corrosion and fretting; its validity was recently studied, and although substantial inter- and intraobserver reliabilities for corrosion were found, reliability for fretting scores was only slight to fair [21]. In our study, differences in scores between observers never exceeded one grade on the scoring scale and occurred in only 17% of the 154 implants that were examined. Another limitation of the system is that small differences might not be clinically relevant. Lastly, severe corrosion could mask underlying fretting, so fretting might not be assessed in its full extent. However, this is a well-known disadvantage of observing fretting and corrosion under optical microscopy and is a limitation in all similar studies.

We did not find an association between head size and fretting and corrosion. Studies with MoM bearings concluded that larger heads are associated with greater fretting and corrosion severity [3, 20]. In contrast, recent studies associating head size and corrosion in MoP THAs are conflicting. Huot Carlson et al. [22] studied 78 stems and 72 heads and found no correlation between head size and fretting and corrosion. Kurtz et al. [24] reported the same conclusion in their series of 50 implants. Conversely, Dyrkacz et al. [12] compared 15 28-mm and 59 32-mm-diameter heads of MoP THAs, concluding that larger heads had more corrosion on the taper and trunnion, although fretting was not different between the two groups. Early studies of MoP implants did not investigate the association of head diameter with fretting and corrosion [811, 15, 19, 27, 28]; it was not until 2010 that ALTR was regularly reported with taper junctions [13]. Whether this is the result of implant design changes, including larger heads and smaller trunnions, or increased recognition by the orthopaedic community remains unknown.

Length of implantation correlated with taper and trunnion fretting. Our cohort spans longer implantations than those of previous studies with 30 retrievals in vivo more than 10 years, six of which were in vivo for over 20 years. Greater length of implantation correlated with increased head and stem fretting but not with corrosion. Conversely, the rate of increase in head taper and stem trunnion fretting was low (0.2 and 0.12 in Goldberg scores after 5 years, respectively), which may explain why ALTR in MoP THAs is rare. Collier et al. [8], in a cohort of 411 modular prostheses, reported a positive correlation between corrosion and length of implantation, although the bearing combinations (eg, MoM and MoP) were not reported. Similar results were reported for MoM cohorts [14, 20]. Interestingly, Goldberg et al. [19] found that although corrosion of the head and neck increased with time, head fretting actually decreased, an effect attributed to etching or buildup of corrosion debris. Conversely, others showed no correlation between length of implantation and fretting or corrosion [11, 12, 24, 29]. The fretting corrosion process starts at the time of implantation, as evident from our < 6-month retrievals, with average head fretting and corrosion scores of 2.07 and 2.1, respectively, and the average stem fretting and corrosion scores of 2.03 and 1.4, respectively.

Taper design was previously shown to influence fretting behavior of the head-neck junction of MoM implants with thicker tapers and thus larger contacting surface areas being related to higher fretting scores [29]. These conclusions were not confirmed by our study of MoP bearings. Head fretting was greater with V40 and 12/14 modular connections as compared with PCA tapers, although the number of PCA tapers in our cohort was small. V40 tapers were designed to have a small diameter, small contact length, and small taper angle. The PCA taper has the largest taper contact length of any of the tapers studied, but what role this may play is unclear. Taper design is challenging, because a balance must be achieved between flexural rigidity of the trunnions, which increases with taper diameter, and impingement-free ROM, which decreases with diameter. A further investigation of the association between taper designs, including surface finish, is warranted to understand their relation to fretting and corrosion. Taper contact area and surface finish have been highlighted as important factors of the fretting corrosion process [30].

Combining different metallic alloys (ie, a CoCr femoral head with a titanium [Ti] alloy stem) has been associated with increased corrosion at the head-neck junction [8, 11, 19, 27]. In our cohort, when a CoCr head was combined with a Ti alloy stem, trunnion fretting was more severe than when the mating components were both CoCr alloy, but no differences were found in taper fretting and taper or trunnion corrosion between same- and mixed-alloy combinations. CoCr alloys have nearly twice the elastic modulus of Ti alloys and are thus more resistant to fretting. Mixed- and same-alloy couplings showed similar fretting corrosion behavior in vitro [33], implying that, even in the mixed-alloy combination, the CoCr alloy determines the outcome. As the environmental pH gets lower and corrosion ensues, dissolution rates become greater for CoCr than for Ti alloys [7]. Thus, despite less resistance to fretting, Ti alloys are less prone to corrosion. This might partly explain the discrepancy of our results regarding fretting and corrosion in same and mixed couples.

Other factors play a role in fretting corrosion, including the fretting regime (full slip versus partial slip or full stick) [32] and the applied [18] and assembly loads [31] to which the junctions are subjected. Microstructure can affect corrosion behavior and can be modified through thermomechanical treatments. For example, wrought alloys have smaller grains than cast alloys; generally, alloys with smaller grains are harder and more resistant to fretting and corrosion [6]. Goldberg et al. [19] reported that wrought CoCr heads had lower fretting and corrosion scores than cast CoCr heads, but for the necks, more corrosion was noted with wrought alloys. However, all differences in fretting and corrosion between alloys and treatments in our study were explained by whether the same or dissimilar alloys were used for the taper and trunnion.

In conclusion, our retrieval study of MoP implants suggests that head size does not influence either fretting or corrosion at the head-neck modular junction. This finding supports the use of larger diameter heads by surgeons reluctant to do so because of concerns for modular junction damage. However, taper/trunnion design and alloy combinations remain substantial concerns because small tapers and dissimilar alloys forming the junction are detrimental. Longer implantation times are also detrimental for head and stem fretting.