Introduction

The United States (US) has long had the most expensive healthcare system per capita and healthcare expenditures continue to rise, without a demonstrable corresponding improvement in health outcomes [1, 2]. Among the recognized drivers of the ballooning healthcare costs are fragmentation of care, variation in costs at regional and provider level and variation in clinical practice [1]. In order to control the ever increasing amount spent on healthcare, several value based payment models have been proposed, one such model was bundled payments [3, 4].

Bundled payment models, such as the Centers for Medicare and Medicaid Services (CMS) Bundled Payments for Care Improvement (BCPI), offer a standardized payment for all healthcare provided during a defined time period, termed an “episode of care” [5]. Surgical care has been a prime target for bundled payments, due to variable costs though all phases of care and an easy to identify start date [6, 7]. Several studies have shown that surgical bundles in Medicare have led to real or potential cost savings with no decrease in quality [8,9,10,11,12,13], including bundles for spine surgery [14]. Recent work by Wynn-Jones, et al., examined the potential impact of a bundled payment model on the cost of common surgical procedures among a working age adult population (age 18–64) including spinal surgery [15]. They found significant variation in overall costs and that spending patterns in the different phases of care for working age adults (including index hospitalization, readmissions, post-acute care, and physician fees) diverged from those observed in the older adult Medicare population.

Spine disorders are associated with high healthcare spending across the adult age spectrum in the US, making them a particularly attractive target for value based payment strategies, with one study finding that those with spine conditions had healthcare expenditures 73% higher than those without spine disorders [16]. Other work utilizing national secondary data has shown that more than 400,000 primary spinal fusion operations are performed annually in the US and that the number of spine operations performed annually has been consistently increasing, with approximately half of patients requiring these operations under age 65 [17,18,19].

Prior studies of bundled payment models in spine surgery have largely examined the 65-and-over Medicare population and focused on fixed 90-day episodes of care [14]. In our study, we evaluated variation in costs for spinal fusion surgery among working age adults to show the potential impact of implementing bundled payments in this population and identify drivers of variation in the cost of spine surgery. We also sought to identify variation in cost by indication for the procedure and to establish the optimal episode of care time period for spine surgery.

Methods

Study data and population

We employed data from the TRICARE claims in the Military Health System Data Repository. TRICARE is the US Department of Defense health insurance product that covers members of the armed forces, retirees, and their dependents; approximately 80% are not active duty servicemembers [20]. The population covered by TRICARE has been shown to be nationally and demographically representative of US population under 65 years of age [20, 21]. The US Military Health System (MHS) is a bifurcated care system, providing care at both military owned hospitals (direct care) and at civilian hospitals where TRICARE functions as a fee-for-service health insurance product (purchased care). Importantly, this system does not include care provided by the US Department of Veterans Affairs. This study examined data from TRICARE Prime, one of TRICARE’s constituent insurance plans.

Patients who underwent spinal fusion operations between 2012 and 2014 were identified by International Classification of Diseases ICD, Ninth Revision, Clinical Modification (ICD-9-CM) Procedure Codes and Medicare Severity-Diagnosis Related Group (MS-DRG) Codes. Patients were stratified by procedure category: cervical spinal fusion and non-cervical spinal fusion (Additional file 1). Procedures performed for trauma, fracture, or malignancy were excluded. Demographic information, including age, sex, race, and the sponsor beneficiary’s military rank, was collected. Sponsor rank has been previously used as a proxy for socio-economic status, with junior enlisted sponsor rank being considered indicative of lower socioeconomic status [22, 23]. For patients with missing data regarding race, their sponsor beneficiary’s race was used.

In order to maximize our ability to capture all costs, patients with dual coverage with another payer were excluded. Patients who were admitted directly from skilled nursing, rehabilitation, or hospice facilities or who died during the index hospitalization were excluded in order to create a more homogenous cohort where we could focus on regional cost variation. As in prior studies, we used Hospital Referral Regions (HRR) as our aggregated unit of analysis [15, 24]. HRRs where fewer than 5 cases were performed were excluded from this analysis.

Calculation of Tricare payments

We used payments made by TRICARE on behalf of each patient to commercial healthcare providers rather than submitted charges by the hospitals. We collected information on all payments made on behalf of each patient during the bundle period, including those for inpatient and outpatient services. We calculated payments made for all claims during episodes of care of varying duration (60-, 90-, and 180-day). All patient specific payment information for the 6 months prior to the index admission was used in calculation of the case mix index.

We applied a regional price standardization, similar to those used in Medicare, to control for claims cost variation resulting from variation in wage indices [25, 26]. Factors included in case mix adjustment were patients’ age, sex, race, sponsor rank, and admission acuity. We also included each patient’s Charlson Comorbidity Index to account for pre-existing comorbid conditions and TRICARE claims payments made in the 6 months prior to the index procedure.

Each payment was categorized into one of four payment buckets. The Index Hospitalization includes claims paid by TRICARE for the reference inpatient procedure. Readmission payments include claims paid for inpatient acute-care hospitalizations and procedures performed after discharge from the index hospitalization but within the bundle period. Post-acute care payments included those for acute or sub-acute rehabilitation facilities, skilled nursing facilities, outpatient services, home healthcare, hospice, durable medical equipment, as well as facility fees for outpatient care. Healthcare Professional Fees constituted the fourth category.

Statistical methods

Case mix adjustment was done using a generalized linear mixed model using a log link function. We included a random effect for the grouping variable HRR, to account for correlation of outcomes within an HRR, and assumed a gamma distribution based on prior literature on modelling cost data. From this, we obtained a case-mix and empirical Bayes reliability-adjusted estimate of the mean cost for each HRR. HRRs were sorted by the adjusted total payments made for the period from discharge from index hospitalization to the end of the episode of care duration. The payments were grouped into quintiles and the average per quintile reported with confidence intervals. Consistent with previous work, inter-quintile variation was measured by using the inter-quintile difference, defined as the absolute difference in dollars between the lowest and highest quintile, expressed as a percentage of the lowest quintile [15]. The HRRs were then resorted, grouped into quintiles and averaged, for each of the four payment buckets identified (i.e., Index Hospitalization payments, Readmission payments, Post-acute Care payments, and Healthcare Professional fees). This process was repeated each spinal fusion type and episode of care duration combination. (i.e., cervical vs noncervical at 60-, 90-, and 180-days).

Finally, we also followed each patient for 1 year after their index procedure to identify patients undergoing reoperation beyond what was captured by the 180-day bundle episode. This was done in order to test whether our longest episode duration (180-days) was sufficient to capture major sources of cost variation related to the index procedure in this population, or if delayed reoperations would mandate even longer episode durations to properly account for cost variability.

A p-value less than 0.05 was a priori determined to be statistically significant. All analyses were performed using SAS version 9.4. The study data were used under a data use agreement with the Uniformed Services University of the Health Sciences, and the study was deemed exempt from full review by the institutional review board of Uniformed Services University of the Health Sciences and the Partners Healthcare Human Research Committee.

Results

We identified 12,811 patients meeting inclusion criteria, of which 6977 underwent a cervical spine fusion and 5834 underwent a non-cervical spinal fusion. These procedures were performed for TRICARE beneficiaries in 168 HRRs during the study period and were included in this analysis.

Total costs for non-cervical spinal operations were higher than total costs for cervical spine operations (Table 1). There was significant variation in the price standardized and case mix adjusted (fully adjusted) amount paid by TRICARE for both cervical and non-cervical spinal fusions. The inter-quintile variation in the fully adjusted price for non-cervical spinal fusions was higher than for cervical spinal fusions across 60- (76% vs 63%), 90- (74% vs. 60%), and 180-day (69% vs 55%) episodes, although the difference narrowed with increasing episode duration.

Table 1 Actual, price-standardized, and fully adjusted TRICARE payments during 60-, 90- and 180-day episode lengths for cervical and non-cervical spinal fusion

The index hospitalization contributed the most to the overall cost of both cervical and non-cervical spinal operations (Table 2). At 90 days, readmissions did not contribute to the overall cost in the lowest quintiles for both types of operation and therefore had large variation. Post-acute care costs had the smallest contribution to the overall costs for both types of operation (3% in quintile 1 to 10% in quintile 5).

Table 2 TRICARE price standardized and case mix adjusted payments by payment bucket for simulated 90-day episodes of care after cervical and non-cervical spinal fusion

A total of 787 (6.1%) patients underwent reoperation within 1 year of their index operation, of whom 357 (45.4%) had their first reoperation within 180-days of their index operation. There were 386 cervical patients that underwent reoperation and 401 non-cervical. The mean time to first reoperation was 187 days (SD 109 days). A very small number (39; 0.30%) of patients underwent multiple reoperations within 1 year.

Discussion

In this study, we examined costs of variable length episodes for working patients undergoing spine surgery for in a working-age adult population across the United States and found significant variation in the cost of both cervical and non-cervical spine surgery. However, even though overall costs and the absolute inter-quintile difference increased as the episode duration increased, the relative inter-quintile variation decreased. We also demonstrated that for this younger population, the post-acute care spending component is only a small portion of the total spent during an episode of care, leading to relatively flat accumulation of costs in varying episode durations compared to the older adult populations.

As episode-based payment systems have evolved for patients over 65 insured by Medicare, interest has grown about the application to younger patients. Large companies, who are significant purchasers of healthcare in the US, are beginning to negotiate prospective bundled payments for various operations (including spine surgery) with specific health-systems and providers [27]. Early data from these programs have shown reduced cost, streamlined care, and presented opportunities for “hardwired” quality improvement metrics. Some commercial insurers are also beginning to design and implement bundled payment model for their beneficiaries [28]. While episode of care based bundled payments have been implemented in some specific markets, our study shows the potential impact on cost variation for spine surgery if bundled payments were implemented on a national level.

Our findings of high variation and low post-acute care spending are consistent with prior studies of episode of care-based bundles in working age adults. A recent study examining spine surgery in working age adults with commercial insurance also found significant cost variation within and between MS-DRGs, but little difference in spending over the duration of 30-, 60-, and 90-day simulated bundles [29]. The variation in cost within MS-DRGs may result from considerable differences previously demonstrated in both the underlying pathology and procedural complexity within MS-DRGs [30]. Non-cervical spinal fusions represent a more heterogenous mix of surgical approaches and complexity than do those of the cervical spine, which likely contributes to the higher variation seen in index admission costs. However, our findings vary considerably from studies of bundle payments in Medicare, in which post-acute care spending is a considerable driver of both total costs and variation in costs [31, 32]. This is consistent with prior research showing the likelihood of discharge to an inpatient rehabilitation center or skilled nursing facility significantly increases for older patients [33].

Our results show that there is wide variation in the readmission costs among patients and only a small number of patients underwent reoperation within 6 months of their index procedure. Prior studies have shown low overall rates of 30-day readmissions (5%) in spine surgery patients, with the majority being related to the index initial surgery, such as wound complications [34, 35]. Working age adults are generally healthier than the older adult Medicare population, which may account for relatively flat spending over time, as they are less likely to accrue medical costs due to unrelated conditions. While rates of reoperation and readmission are already low in this population, implementation of bundles has the ability to change behavior and care practices in ways that are not accounted for by our simulated bundles. When United Healthcare implemented its “Spine and Joint Solution,” a bundled payment for knee, hip, and spine surgeries, they found a 3.4% decrease in complications and a 10% decrease in readmissions for spine surgeries [28]. Similarly, other studies have shown that after implementation of bundled payments by Medicare, discharge to post-acute care also decreased [36, 37].

Limitations

First, as this study utilizes claims data, it may not fully capture the complete premorbid state of the patient, potentially affecting our adjustments for case mix. Similarly, patients may pay for some services out-of-pocket or in ways not captured by claims data, though it is unlikely that these expenses contribute greatly to overall costs or variation as patients in our study have a consistent insurance product. It is also unlikely for patients to pay for elements that drive costs (such as inpatient hospitalizations and long-term care) out of pocket. We also excluded patients with evidence of other insurance coverage, such as Medicare. Due to the constraints of our data use agreement, we report our results at the HRR level rather than the hospital/institution level, though facility level work presents an opportunity for future work. While MS-DRGs are a standard grouping system for conditions and procedures and are currently used for reimbursement purposes, there is varying heterogeneity within groups that limits comparisons between MS-DRGs. In this study, we create a case mix adjustment in an attempt to control for pre-operative patient characteristics, however the optimal way of risk adjusting that best considers the severity of condition, procedural difficulty, as well as comorbidities for spine surgery remains has not yet been established [38]. Additionally, our study utilizes ICD-9-CM coding, which may limit the direct applicability of our findings given the change to ICD-10-CM coding in the US, if care patterns have changed.

Conclusions and implications

The large variation in the cost of care for spinal surgery in working age adults indicates that there is a clear opportunity to implement a value-based payment model for these types of procedures. Unlike in studies of older adults, post-discharge spending following spine surgery among working age adults is low and does not increase substantially over the 6-months post discharge period. This suggests that increasing the episode length beyond to current standard of 90-days would do little to better control the variation in costs related to spine surgery among working age adults.