INTRODUCTION

Osteoporosis-related fractures affect one in five men aged 50 years and older globally,1 with the incidence of fractures increasing with age; the majority of these fractures occur in men aged 65 years and older.2 In 2010, an estimated 27.9% of hip fractures occurred in men, and this percentage is expected to rise.1 , 3 Given these data, several organizations4,5,6, 7 have recommended screening for osteoporosis using bone mineral density (BMD) testing in older men. However, there is a lack of data in men on both screening and treatment strategies, resulting in considerable uncertainty about which men to recommend for BMD testing.

Organizations such as the American College of Physicians, National Osteoporosis Foundation (NOF), and Endocrine Society have recommended BMD testing for all men ≥70 years of age.4,5,6, 7 Such a strategy of universal testing naturally identifies all men in this age group with osteoporosis based on World Health Organization (WHO) BMD T-score criteria,8 but at considerable cost and based on evidence rated as “low quality” by the Endocrine Society.7 There are many knowledge gaps regarding optimal screening strategies. These include 1) a lack of randomized trials demonstrating that screening reduces fractures or fracture-related morbidity or mortality for men or women; 2) limited data in men on pharmacologic therapy for osteoporosis, resulting in greater uncertainty regarding appropriate treatment indications and the efficacy of treatment; 3) controversies regarding the appropriate reference range for calculation of T-scores for men, with most advocating the use of a female reference range9 but others recommending a male reference range,5 with resulting discrepancies in classification of men as osteoporotic10; and 4) controversy regarding the use of estimated fracture risk to identify candidates for pharmacologic therapy.11

In 2011, the US Preventive Services Task Force (USPSTF) concluded that evidence was not sufficient to assess the balance of benefits and harms of osteoporosis screening in men (Grade I).4 However, the task force did suggest that men most likely to benefit from BMD testing would have a 10-year risk for major osteoporotic fracture (MOF) greater than or equal to that of 65-year-old white women, a group for whom screening is widely recommended. This risk is estimated at 9.3% using the Fracture Risk Assessment Tool (FRAX), a web-based instrument that uses clinical risk factors with and without femoral neck BMD to estimate the 10-year probability of hip and MOF fractures.12 Others have suggested a variety of other tools for identifying individuals for screening13; the most extensively studied is the Osteoporosis Self-Assessment Tool (OST), with a range of OST score cutoffs recommended to select men for screening.13,14,15,16,17, 18 Unlike FRAX, which was developed to predict fracture risk, the OST was developed to predict low BMD and is simple to use. The score is calculated by subtracting the age of the patient in years from the weight in kilograms and multiplying the result by 0.2.

BMD testing, usually with dual-energy x-ray absorptiometry (DXA), is clinically used to screen for two potential conditions. One is the identification of osteoporosis, defined as a T-score ≤ −2.5 at the femoral neck, total hip, or lumbar spine; clinical trial data have demonstrated fracture risk reduction with pharmacologic intervention in individuals with T-scores in this range. The other is identification of individuals with T-scores > −2.5 but ≤ −1 (often referred to as osteopenia) and an elevated estimated fracture risk. For individuals with T-scores in this range and with an elevated estimated fracture risk, some guidelines recommend consideration of pharmacologic treatment as well. The NOF and Endocrine Society have suggested that in addition to individuals with osteoporosis, those with osteopenia and a 10-year risk of a MOF of ≥20% or 10-year risk of hip fracture ≥3% be considered candidates for pharmacologic intervention.5 , 7 , 19 The current NOF guidelines recommend the use of a male-specific reference group for the calculation of T-scores in men.5

Using data from 4043 community-dwelling men aged ≥70 years enrolled in the Osteoporotic Fractures in Men (MrOS) study, we compared strategies for selecting older men for initial BMD testing to identify men with T-score-defined osteoporosis and to identify men with osteopenia and an elevated estimated fracture risk. The specified strategies were as follows: screen all men aged 70 and older; screen men with a FRAX (without BMD)-estimated 10-year MOF risk above a range of cutoffs, including 9.3% (USPSTF strategy); screen men whose OST scores are below a cutoff value ranging from <−1 to <3.

METHODS

Study Participants

From March 2000 through April 2002, 5994 men aged ≥65 years were recruited from population-based listings in six regions of the United States for participation in the baseline examination for MrOS.20 , 21 Men with a history of bilateral hip replacement or who were unable to walk without the assistance of another person were excluded. Men were eligible for this analysis if they were representative of the screened population (i.e., they had no history of non-traumatic hip or clinical vertebral fracture and reported no bisphosphonate or other anti-fracture treatment) and were ≥70 years of age (eFigure 1). The institutional review board at each participating institution approved the study protocol, and written informed consent was obtained from all participants.

Outcomes

For our primary analysis, we defined the outcome (osteoporosis) as having a BMD T-score of ≤ −2.5 at the femoral neck, total hip, or lumbar spine, with T-scores calculated using National Health and Nutrition Examination Study (NHANES) III BMD norms for white women aged 20–29, as recommended by the World Health Organization for the diagnosis of osteoporosis in clinical practice.22 , 23

For a secondary analysis, we defined the outcome as a T-score ≤ −2.5 at any of the three sites or a T-score ≤ −1.0 but > −2.5 and an elevated estimated fracture risk. We defined elevated estimated fracture risk as a 10-year risk of a MOF of ≥20% or 10-year risk of a hip fracture of ≥3% using the FRAX (with BMD) risk calculator. For this outcome, we used a male rather than a female reference group, consistent with the current NOF guidelines. We excluded men with a history of hip or clinical vertebral fracture, as these individuals would already be considered for treatment and would not be considered part of the routine BMD screening population.

Risk Assessment Strategies

Information regarding osteoporosis risk factors (age, race, rheumatoid arthritis, history of prior fracture, smoking, alcohol intake, and parental history of hip fracture) was obtained from baseline questionnaires. Body weight (balance beam scale at all sites except the Portland site, where a digital scale was used) and height (Harpenden stadiometer24) were used to calculate BMI in kg/m2. The FRAX tool was used without a BMD measurement (FRAX [w/o BMD]) to calculate a 10-year risk of a MOF (version 3.3). We categorized participants with a FRAX (w/o BMD)-predicted 10-year risk of MOF ≥9.3% as being recommended for BMD testing, as suggested by the USPSTF.

The OST was calculated as 0.2 × (weight [kg] − age [yr]), and the score was truncated to yield an integer.

Measurement of BMD

BMD at the total hip, femoral neck, and lumbar spine was measured at baseline using dual-energy x-ray absorptiometry (DXA) with Hologic QDR 4500 W scanners (Hologic, Inc., Bedford, MA). A central quality-control laboratory, certification of DXA technicians, and standardized procedures for scanning were implemented to ensure reproducibility of DXA measurements.21

Other Measurements

Participants attending the clinic examinations were asked to bring all current (any use within the last 2 weeks) prescription and nonprescription medications with them. Interviewers completed a medication history for each participant, including name of medication and frequency of use. Current use of bisphosphonates, other anti-fracture treatments, and oral corticosteroids was determined using a computerized dictionary to categorize type of medication from product brand and generic names obtained from containers.25

Statistical Analysis

We calculated the proportion of men who would be selected for BMD testing using a range of cutoffs for each pre-screening strategy (OST and FRAX [w/o BMD]). We also calculated the proportion of men with a BMD T-score ≤ −2.5 and the proportion of men who would be recommended for pharmacologic therapy based on NOF guidelines that would be identified by each of these strategies.

We calculated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (LR+), negative likelihood ratio (LR−), the number of missed cases of T-score-defined osteoporosis, (false negatives), and the number of DXAs performed on men without osteoporosis (false positives) for the identification of men with osteoporosis for a range of cutoffs for each of the strategies. We used a female reference group for calculation of T-scores for this outcome. We performed similar analyses for the identification of men with osteoporosis or osteopenia and an elevated estimated fracture risk, using a male reference group.26 In addition, we plotted the receiver operating characteristic (ROC) curves for the detection of these outcomes across a range of possible thresholds for the OST and the FRAX (w/o BMD) and calculated the area under the curve (AUC).27

RESULTS

Baseline Characteristics

Baseline characteristics of the study population are presented in Table 1. Using a female and male reference group for T-score ≤ −2.5 (at the femoral neck, total hip, or lumbar spine), 216 (5.3%) and 397 (9.8%) men, respectively, were classified as having osteoporosis. A total of 787 (19.5%) men had a T-score ≤ −1.0 but > −2.5 at one or more of the three sites and an estimated fracture risk at or above the NOF FRAX (with BMD) intervention thresholds. Based on NOF guidelines, 1184 (29.3%) men in the study population would be classified as eligible for drug treatment (Table 1).

Table 1 Baseline Characteristics of Men Aged 70 Years and Older, Non-Traumatic Hip or Spine Fracture Excluded

Comparison of Risk Assessment Strategies

The proportion of the study population who would be referred for BMD testing ranged from 23.5% for an OST cutoff of <−1 to 100% for the strategy of screening all men 70 years and older. The FRAX (w/o BMD) tool (10-year risk of MOF estimated at ≥9.3%) identified 42.2% of men for BMD testing. Of the 216 men with T-score-defined osteoporosis using a female reference group, 76.9% were identified using an OST cutoff of <1, 82.4% were identified using an OST cutoff of 2, and 59.3% were identified using a FRAX (w/o BMD) with a cutoff of ≥9.3% (Table 2). Results were similar when a male reference group was used for the calculation of T-scores.

Table 2 Proportion of Men Recommended for Screening by Proposed Strategies and T-Score* Category

The receiver operating characteristic (ROC) curves for the OST- and FRAX (w/o BMD)-estimated 10-year risk of a MOF over their full ranges for the prediction of osteoporosis are shown in Fig. 1a. The OST had a higher AUC (0.68 vs. 0.62) than the FRAX (w/o BMD) (p = 0.004).

Figure 1
figure 1

a ROC curves for the OST and FRAX (w/o BMD) over a range of values for the prediction of FN, TH, or LS T-score ≤ −2.5. b ROC curves for the OST and FRAX (w/o BMD) over a range of values for the prediction of men identified for treatment based on NOF guidelines. The black diagonal line indicates a reference AUC of 0.50 (no better than chance alone). ROC receiver operating characteristic, OST Osteoporosis Self-Assessment Tool, FRAX Fracture Risk Assessment Tool, FN femoral neck, TH total hip, LS lumbar spine, NOF National Osteoporosis Foundation, MOF major osteoporotic fracture, AUC area under the curve.

Table 3 shows the sensitivity, specificity, PPV, NPV, LR+, LR−, and the number of false negatives (missed cases) and false positives (unnecessary DXA tests) for a range of cutoffs of the OST- and FRAX (w/o BMD)-estimated risk for predicting osteoporosis based on a female reference group. The strategy of testing men with a FRAX (w/o BMD)-estimated 10-year risk of a MOF of ≥9.3% had sensitivity of 0.59 and specificity of 0.59; an OST score of <1 had sensitivity of 0.77 and specificity of 0.51, while an OST score < 2 had sensitivity of 0.83 and specificity of 0.36.

Table 3 Test Characteristics of Proposed Screening Strategies for the Detection of Men with Femoral Neck, Total Hip, or Lumbar Spine T-Score ≤ −2.5 (Female Reference)

The use of the OST cutoff of <2 to identify men for testing would result in missing 9 of 53 cases of osteoporosis per 1000 men, while reducing the number of men referred for BMD testing who would not have osteoporosis to 601/1000 (Table 3).

For the detection of men with osteoporosis or osteopenia with elevated fracture risk (using a male reference group), the ROC curves for the OST and FRAX (w/o BMD) for the prediction of this outcome are shown in Figure 1b. The FRAX (w/o BMD) had a higher AUC (0.79 vs. 0.68; p < 0.001) than the OST for this outcome. Sensitivity, specificity, PPV, NPV, LR+, LR−, and the number of false negatives (missed cases) and false positives (unnecessary DXA tests) for a range of cutoffs of the OST- and FRAX (w/o BMD)-estimated risk are shown in Table 4.

Table 4 Test Characteristics of Proposed Screening Strategies for the Detection of Men above NOF Treatment Threshold*

DISCUSSION

We found a low prevalence of T-score-based osteoporosis among men, consistent with data from NHANES 2005–2010.28 Using a female reference group to calculate T-scores, 5.3% of these community-dwelling men ≥70 years of age had a T-score ≤ −2.5 at one or more of the three anatomic sites. An additional 4.5% of the cohort would be classified as having osteoporosis if a male-specific reference group were used to calculate the T-scores. Because men on average have higher BMD than women, the use of a male reference group to calculate T-scores results in men being classified as having a T-score < −2.5 at a higher absolute BMD and thus at a lower risk of fracture. In 2013, the International Society for Clinical Densitometry formally recommended the use of a female reference group for the calculation of T-scores for men and women.29

Our data make explicit the trade-offs clinicians face when deciding whether to implement a universal testing strategy for men ≥70 years of age or to utilize a pre-screening tool to identify men for BMD testing. Use of the OST with a cutoff of 1 or 2 identified 77% and 83% of men with osteoporosis, respectively, while use of the FRAX (w/o BMD)-predicted 10-year risk of MOF ≥9.3%, as suggested by the USPSTF, identified 59% of men with BMD-defined osteoporosis.

The positive LRs for all cutoffs for the OST and FRAX (w/o BMD) studied for the prediction of T-score-defined osteoporosis are relatively small; however, the negative LRs for an OST cutoff of 1–3 are in a range (0.4–0.5) that may be clinically useful for identifying men who should not be referred for BMD testing.30 Although negative LRs in this range for a diagnostic test are relatively weak, these tools are not being used as a diagnostic test, but rather to screen men to determine whom to refer for testing with DXA; in this setting, negative LRs in this range may be reasonable.

The pre-screening strategy based on FRAX (w/o BMD) performed better than pre-screening based on OST for the detection of those who would be considered for pharmacologic therapy if the indications for treatment were extended to those with osteopenia and an elevated estimated fracture risk, as suggested by NOF guidelines. This result is to be expected given that the NOF guidelines include an elevated FRAX score with BMD as an indication for treatment. This highlights the unique position of FRAX: FRAX was developed as a treatment decision tool to identify individuals at higher risk of fracture; its use as a pre-screening tool to select individuals for BMD testing was not its original purpose, and may lead to more men being considered for treatment. In this cohort, nearly 30% of men aged 70 years and older would meet NOF criteria for pharmacologic treatment.

Our results are similar to those of Pang et al., who found that the simpler OST performed better than FRAX (w/o BMD) for identifying osteoporosis in a mixed-gender cohort of 626 adults aged 70 years and older.31 The OST, although most extensively evaluated in women, has been evaluated in several studies in men.13,14,15, 16 , 18 , 32 In a recent systematic review and meta-analysis of the performance of several clinical risk assessment instruments for osteoporosis screening, the OST performed as well as or better than more complex instruments in identifying individuals with BMD-defined osteoporosis.13

A previous study of the MrOS cohort that included all men aged ≥65 years used a male-specific reference for the calculation of BMD T-scores15 and reported sensitivity of 0.88 for an OST cutoff of <2 in identifying osteoporosis. In addition to the difference in reference groups, our work also differs from the earlier work in that we have extended it to examine the use of these pre-screening strategies in selecting men for DXA testing when the outcome of interest incorporates fracture risk estimates, as recommended by several organizations.

Cost-effectiveness analyses have not supported a universal BMD testing approach for all men ≥70 years of age,33 although one found that body weight could be used to identify older men without prior fracture for whom BMD testing would be cost-effective.34 Another analysis35 found that using the OST to select high-risk men for testing improved the cost-effectiveness of performing DXA in older men. None of the existing cost-effectiveness analyses examined the effect of expanding treatment indications to include men with osteopenia and an elevated estimated fracture risk.

The study has several strengths, including the large, well-characterized cohort and the measurement of BMD in all participants. However, the population studied was predominantly white, thus limiting generalizability; in addition, participants were recruited from the community and may be healthier than other populations.

Those guidelines that recommend pharmacologic treatment for individuals with osteopenia (T-score ≤ −1.0 but > −2.5) and elevated fracture risk do so based on findings from one cost-effectiveness study36; no randomized clinical trial utilizing these treatment thresholds has been conducted to date. Treatment recommendations that call for consideration of pharmacologic intervention in those with a T-score ≤ −2.5 at the femoral neck, total hip, or lumbar spine, on the other hand, are based on rigorous clinical trial data, albeit primarily in postmenopausal women.

Given the low prevalence of osteoporosis in community-dwelling older men, as well as the uncertainty surrounding the use of estimated fracture risk to guide treatment decisions, a universal screening approach results in BMD testing in many men who do not have osteoporosis and the potential over-identification of candidates for pharmacologic therapy for whom therapy is of uncertain benefit. While most older men who sustain a fracture do not have pre-existing osteoporosis,37 the evidence that existing pharmacologic treatments, all of which address skeletal health, reduce fracture risk in men without osteoporosis is limited.

Among the strategies examined, the use of an OST cutoff of <2 substantially reduced the proportion of men screened, reduced the number of false positives while missing few cases of T-score-defined osteoporosis, and performed better than the more complex FRAX (w/o BMD) strategy. Further research toward developing targeted screening of men and additional trials of pharmacologic therapy in men to better characterize appropriate candidates for treatment are critical for developing targeted screening strategies for men or for creating the evidence base needed to support universal BMD testing in men.