Introduction

Children with malignant solid tumors may present with skeletal metastases at diagnosis with a varying frequency according to the type of tumor. More than 50% of patients with neuroblastoma and up to 6% of patients with rhabdomyosarcoma have metastases in bone marrow and/or cortical bone at diagnosis [1,2,3]. Patients with Ewing sarcoma have bone metastases (with or without lung metastases) at diagnosis in up to 11.5% [4]. Osteosarcoma patients sometimes present with skip lesions, i.e. bone metastases within the same bone or across a neighboring joint or with bone metastases at diagnosis [5, 6] and a rare pediatric renal tumor, clear cell sarcoma of the kidney, is known for its propensity to metastasize to bone [7]. Accurately detecting bone and bone marrow metastases is essential for treatment assignment and prognosis. Conventional radiography, computed tomography (CT), and various scintigraphic techniques (bone scan, 123I meta-iodobenzylguanidine scintigraphy [MIBG], 18F–fluorodeoxyglucose positron emission tomography [FDG-PET]), are currently used for staging, yet all these techniques expose the patient to ionizing radiation and have a variable sensitivity for detecting osteomedullary metastases in different types of tumors [8,9,10,11,12,13]. Consequently, there is an increasing interest in alternative, preferably radiation-free, techniques with a track record in osteomedullary pathology, such as whole-body magnetic resonance imaging (WB-MRI) [14].

Several studies have shown WB-MRI to be useful in evaluating adult cancer patients [15,16,17,18,19,20,21] and MRI to be a sensitive technique for detecting osteomedullary lesions in children [22, 23]. Publications on the usefulness of WB-MRI in pediatric cancer patients are still limited; only a few studies have investigated the potential of WB-MRI for detecting skeletal metastases specifically in children [24,25,26,27,28]. The difference in tumor types and the complexity of bone marrow conversion from red to yellow marrow that takes place during childhood and adolescence call for evidence on the value of WB-MRI in pediatric oncology staging specifically.

The purpose of this study was to assess the diagnostic value of WB-MRI for detecting skeletal metastases in children and adolescents with primary solid tumors as judged from the available literature.

Materials and methods

Search strategy

Computerized searches in MEDLINE and EMBASE databases until 15 May 2017 were performed to identify studies reporting on the diagnostic value of WB-MRI in children and young people (ages <21 years) with primary solid tumors. The literature search consisted of terms related to the three main elements: 1. Whole-Body Magnetic Resonance Imaging and 2. Patient population (oncological patients) and 3. Childhood/Adolescence. Different terms were used to have a broad search to identify all relevant studies (Supplement 1).

Selection of relevant studies

The selection of relevant studies was performed by one author with experience in data extraction of 25 meta-analyses (SB, Clinical epidemiologist with 14 years expertise in evidence based radiology).

  1. Step 1:

    The title, article type and abstract of all retrieved articles were checked for potential relevance. Duplicates (i.e. identical articles found in both databases) and non relevant articles, such as letters, comments, editorials and conference papers, that were not disease related and other imaging were excluded and remaining articles were considered potentially relevant.

  2. Step 2:

    All potentially relevant articles were checked on full text by the same author for further selection. During this step, reviews, studies on other outcomes, case reports and studies on adults were excluded. After an interval of 2 weeks, the author double-checked this step to make sure all relevant papers were selected for further analyses.

Inclusion criteria

All remaining articles were considered relevant and full text was checked on inclusion criteria by two pediatric radiologists (AMJBS, 25 years experience in pediatric oncological imaging and EED, 8 years experience in pediatric oncological imaging) with 25 and 8 years of experience, respectively, in pediatric oncological imaging.

Inclusion criteria were:

  1. 1.

    Children or adolescents (up to 21 years) with any of the following primary solid tumors: neuroblastoma, soft-tissue sarcoma, Ewing sarcoma, osteosarcoma, hepatoblastoma or renal tumors. If both children/adolescents and adults were included, at least 90% of the study population had to be children/adolescents or data on children/adolescents could be separately extracted. If data were combined with other types of tumors, e.g., lymphoma, data on primary solid tumors could be extracted separately;

  2. 2.

    Number of patients ≥5;

  3. 3.

    Evaluation of WB-MRI compared with any type of reference standard (such as MIBG-, bone marrow biopsy, bone biopsy trephine, FDG-PET, CT, bone scan, etc.);

  4. 4.

    Evaluation of skeletal metastases by WB-MRI (if other types of metastases [extra skeletal] were included, at least 75% had to be skeletal metastases or data on skeletal metastases could be extracted separately);

  5. 5.

    Data on true positives, true negatives, false-positives and false-negatives could be extracted.

Data extraction of included papers

Data extraction was also performed by AMJBS and EED by using a standardized form. The following data were assessed: 1. Methodological quality characteristics; 2. Study design characteristics; 3. Patient characteristics; 4. Technical characteristics regarding WB-MRI; 5. Interpretation of WB-MRI, 6. Reference standard and flow of timing and 7. Data on accuracy. Disagreement was resolved through discussion amongst SB, AMJBS and EED.

For critical appraisal and assessment of the methodological quality of the included studies, the QUADAS 2 checklist was used (Quality Assessment of Diagnostic Accuracy Studies 2). Both risk of bias and concerns regarding applicability were checked [29].

The risk of bias was assessed for the following domains: patient selection, index test, reference test and patient flow. For each domain signaling questions were answered with yes, no, or unclear. With these answers, a risk of bias per domain was then assessed. The risk of bias was “low” if the answers to all questions were “yes.” If any of the questions was answered with “no,” a potential risk of bias exists, hence a judgment of “high.” The unclear category was used if insufficient data were reported to permit judgment.

Concerns regarding applicability were assessed for the following domains: patient selection, index test, reference test and patient flow.

Details on risk of bias and concerns regarding applicability are described in Table 1.

Table 1 QUADAS (Quality Assessment Tool for Diagnostic Accuracy Studies, version 2) by domain

The following study design characteristics were recorded: a. First author, year of publication; b. Period of recruitment; c. Country of origin; d. Design of study (single center or multicenter; if authors from different institutions were involved regardless of patient inclusion, the design of the study was considered multicenter); e. Department of first author (Radiology or other), f. Data collection (Prospective or Retrospective or Unclear; in case informed consent was obtained, we considered the study to be prospective.) and g. Institutional Review Board approval (approved and informed consent obtained or approved and requirement for informed consent waived or not approved or unclear).

The recorded patient characteristics were: a. Inclusion and exclusion criteria; b. Number of patients included (either total number, included, or analyzed); c. Age (mean + standard deviation [SD] or median + range) in years, d. Male: female ratio and e. Distribution of patients (type of tumor).

Technical MRI features that were assessed: a. Magnetic field strength; b. Coil(s) used (only body coil or also other); c. Types of sequences (T1-weighted and/or T2-weighted and/or diffusion weighted imaging [DWI] and/or short tau inversion recovery [STIR] or other); d. Imaging planes (axial and/or coronal and/or sagittal), e. Examination time and f. Sedation.

Execution of MRI was considered to be described sufficiently if a-c were given.

Data on interpretation of WB-MRI that were recorded: a. Number of observers for interpretation; b. Experience of the observers; c. In case >1 observer, how data of the observers analyzed for comparison with reference standard were reported (consensus reading or separately/independently or unclear) and d. Criteria for skeletal metastases on WB-MRI. Interpretation of MRI was considered to be described sufficiently if a-d were given.

All reference tests used for comparison were recorded for each study. In addition, the time interval between WB-MRI and reference standard and any patients who were initially included (based on inclusion and exclusion criteria) but who did not undergo reference standard or were excluded for other reasons were reported.

Data were extracted on per-lesion, per-region or per-patient basis. For each study, data were extracted/constructed for WB-MRI compared with the reference standard. Per lesion: numbers of true positive (TP), false-negative (FN) and false-positive (FP) results were extracted and per-lesion sensitivity was calculated as follows: TP/(TP+FN); and positive predictive value (PPV) as follows: TP/(TP+FP).

Per region and per patient: numbers of true positive (TP), false-negative (FN), false-positive (FP) and true negative (TN) results were extracted and both sensitivity (TP/[TP+FN]) and specificity (TN/[TN+FP]) were calculated.

If more than one dataset was given (e.g., more observers or more comparisons), all datasets were extracted.

Data analysis

Since the number of studies were limited and data was heterogeneous [30], meta-analysis with random-effect approach [31, 32] would not be suitable for pooling sensitivity, specificity or positive predictive values (PPV). All data are therefore presented per study.

Results

Search and selection of relevant studies

The searches in Medline and EMBASE databases resulted in 2,641 articles (Fig. 1). Based on title and abstract, 2,521 articles were excluded. The remaining 120 articles were double-checked on full text and an additional 106 articles were excluded as these were reviews (n=51), reported other outcome (n=14), case reports (n=5) and studies including a majority of adults (n=36). The remaining 14 articles were found to be relevant and were checked for fulfilling inclusion criteria.

Fig. 1
figure 1

Search, selection and inclusion of relevant papers.a Not relevant disease related (other disease, ovarian cancer, neurofibromatosis, cervix cancer, glioma).b Other type of imaging was evaluated: such as PET, scintigraphy, CT. cpotentially relevant studies were checked on two occasions two weeks apart. TP=true positive; TN=true negative; FN=false negative

Inclusion of relevant studies

Of the 14 studies, 9 were excluded due to inclusion of patients with malignancies other than solid tumors (n=4, all patients had lymphoma) [33,34,35,36]; proportion of children unknown and/or data on children/adolescents could not be extracted (n=2) [37, 38]; fewer than 5 patients included with solid tumors (n=2) [39, 40] and data on true positives, false-negatives, false-positives and true negatives could not be extracted (n=1) [41].

Five studies [24,25,26,27,28] were included in this review. The complete search is presented in Fig. 1.

Methodological quality characteristics

The risk of bias for patient selection was unclear in all studies for lack of information on whether patients were included consecutively or randomly (Q1) or whether patients were excluded inappropriately (Q3) (Table 2).

Table 2 Methodological quality criteria of the included studies: risk of bias and concerns regarding applicability

The risk of bias for WB-MRI was low for all studies; this might be explained by the fact that all studies were initiated by the department of radiology and therefore both the blinded interpretation (Q4) and the radiologic criteria for skeletal metastases (Q5) were described in detail.

The risk of bias for the reference standard was high. None of the studies used only histology or follow-up as the reference standard; other imaging modalities and even MRI were included in the reference standard (Q6). In most studies, it was also unclear whether the reference test was interpreted without knowledge of the findings on WB-MRI (Q7) [24, 26, 27].

The risk of bias for flow and timing was low for all studies (Q8), all patients underwent a reference test (Q9) and all included patients were analyzed (Q10).

The concerns regarding applicability for patient selection were high in two studies [25, 28] (Table 2). In one study [28], more than 50% of the patients had tumors other than primary solid tumors. In the other study [25], more than 50% of patients were examined following the start of treatment.

The concerns regarding applicability for WB-MRI were high in four studies [24, 26,27,28] since no description on the experience of the radiologists was provided. If we discard the data on experience, most of the studies would have low concerns.

The concerns regarding applicability for the reference test were high in all studies, as MRI was included in the reference test or it was unclear if MRI was included in the reference test.

Study design characteristics

All included studies were single center, prospectively designed by the department of radiology. All studies obtained informed consent. Studies were from Germany [24, 28], the United States [26], Korea [25] and India [27]. For 2 studies, the study period was unknown [24, 27]; the other studies were performed between March and December 2000 [26], May 2003 and September 2004 [25], and January 2004 and January 2006 [28].

Patient characteristics

A total of 132 patients were included (age range: 4 months-19 years) (Table 3). Ninety-six patients had solid tumors, including 39 Ewing sarcomas/PNET (40.6%), 23 (ganglio)neuroblastomas (24%), 19 rhabdomyosarcomas (19.8%), 6 osteosarcomas (6.3%), 3 germ cell tumors (3.1%), 1 hepatoblastoma (1%), 1 Wilms tumor (1%), 1 melanoma (1%), 1 small round cell tumor (1%), 1 alveolar sarcoma (1%) and 1 fibrosarcoma (1%).

Table 3 Patient characteristics of the included studies

Goo et al. [25] included a large number (25/36) of patients who had already been treated with chemotherapy and xMazumdar et al. [26] included two patients with a recurrence.

Technical characteristics regarding whole-body magnetic resonance imaging

All scans were performed on a 1.5-T scanner (Table 4). Different types of coils were used depending on patient size: Young children were examined with head coils, older children with body-, phased array- and/or spine coils. Kumar et al. [27] used a combination of coil elements for the older children.

Table 4 Magnetic resonance imaging (MRI) technical features and interpretation of the included studies

In all but one [24] study, a short tau inversion recovery (STIR) sequence was performed. In three studies, a subset of patients (young and/or uncooperative) were sedated for the WB-MRI-scan [25, 27, 28]; one study did not provide information on sedation [24] and in another study [26], WB-MRI was appended to another diagnostic test under sedation. The examination time varied from 15 to 60 min. None of the studies evaluated WB-DWI.

Interpretation of whole-body magnetic resonance imaging

In all studies data, were interpreted by two radiologists in consensus (Table 4). On T1-weighted images, a metastatic bone or bone marrow lesion was defined as focal or diffuse hypointense bone marrow signal intensity relative to adjacent (or, in the extremities, contralateral) normal bone marrow. On STIR images, it was defined as focal or diffuse hyperintensity of bone marrow relative to skeletal muscle or as destruction of cortical bone.

Reference test and flow of timing

All patients underwent a reference standard. Different reference standards were used, including MRI in three studies [25, 27, 28] (Table 5). The other two studies used clinical and imaging/radiologic follow-up without specifying which imaging modality was used [24, 26]. The time interval was less than 5 weeks and in studies without follow-up [26, 28] the interval between WB-MRI and reference standard was less than 13 days.

Table 5 Reference standard, and time interval between whole-body magnetic resonance imaging (WB-MRI) and reference standard

Data on per-lesion, per-region and per-patient

Data were reported on a per-lesion basis in three studies [24, 25, 28], per-region basis in one study [27] and per-patient basis in two studies [24, 26] (Table 6).

Table 6 Data on detection of skeletal lesions with whole-body magnetic resonance imaging

Daldrup-Link et al. [24] described 51 bone lesions in 21 patients, proven by follow-up imaging or biopsy. WB-MRI detected 42 of these lesions (sensitivity: 82.4%); however, 9 lesions were false-negative on WB-MRI.

Goo et al. [25] presented three datasets, i.e. subsets for comparison of WB-MRI with bone scintigraphy (n=58), MIBG scan (n=26) and CT scan (n=48). For all subsets, a high sensitivity of WB-MRI was found (range: 99.1–100%). There was only one false-negative finding, occurring in the subset comparing WB-MRI with bone scintigraphy. In the other two datasets, the numbers of false-positives were high, especially in the subset of patients who underwent WB-MRI and MIBG scan (PPV=7.7%).

In the study by Krohmer et al. [28], WB-MRI had a high sensitivity (100%) but also a high number of false-positives (88/130): These lesions were both skeletal and extraskeletal lesions. The number of skeletal false-positives was not given.

Kumar et al. [27] evaluated bone metastases in 208 regions. According to the reference standard, 40 regions showed bone (marrow) lesions; WB-MRI detected metastases in 39 of these regions, resulting in a sensitivity of 97.5%. The specificity of WB-MRI was also high (99.4%).

In the study by Daldrup-Link et al. [24], 21 of 39 patients showed bone metastases and WB-MRI detected metastases in 16 of these patients, with a sensitivity of 76.2% and a specificity of 100%.

In the study by Mazumdar et al. [26], two patients had skeletal metastases, which were correctly detected on WB-MRI (sensitivity and specificity were both 100%).

Discussion

Summary of findings

With this systematic review, we provide a summary of the available evidence on the diagnostic performance of WB-MRI for detecting skeletal metastases in children with a malignant primary solid tumor. Because all included studies had a reference standard with a high risk of bias, it is difficult to determine the true value of WB-MRI.

The sensitivity of WB-MRI for detecting bone metastases per lesion ranged from 82.4% to 100%. The sensitivity upper range of 100% most likely is an overestimation of the sensitivity of MRI by incorporation bias because MRI was part of the reference standard in 3/5 studies [25, 27, 28].

In the study by Daldrup-Link et al. [24], the false-negative rate of WB-MRI was higher compared to the other studies. The reason for this lower sensitivity of MRI is unclear, but it may be due to the use of a T1-weighted sequence, instead of a STIR sequence in the other studies, although the authors state that T1-weighted images showed fewer movement artifacts, had a higher spatial resolution, and had a comparable sensitivity but higher specificity than STIR images in children. The false-negatives in their study were generally small lesions in small or flat bones (n=7/9).

A few studies found a high number of false-positives for WB-MRI. These high rates of false-positives might be explained by the reference tests used: MIBG and CT [25]. These reference tests might be less sensitive than WB-MRI, leading to a high false-positive rate (for lesions that might be true positives). In the study comparing WB-MRI with MIBG [25], the high false-positive rate of WB-MRI might be explained by the low specificity of the STIR sequence, since hyperintense lesions represent an increase in water content and may also be of traumatic, infectious or inflammatory origin or represent areas of highly cellular hematopoietic marrow, cysts or other benign lesions [30]. Furthermore, a substantial number of patients in this study were examined after treatment and osteomedullary metastases may continue to be visible on WB-MRI when they have become fibrotic [42]. Krohmer et al. [28] also described a high number of false-positive lesions on WB-MRI, but they did not mention how many of the lesions were in the skeleton. They also described 33 of the 88 false-positive lesions as bone lesions that had been detected in the peripheral skeleton of only 2 patients in whom conventional cross-sectional imaging of these particular areas had not been performed.

Finally, it should be pointed out that in all studies data were interpreted by two radiologists in consensus, which is a known study limitation [43].

Strengths of this review

In the past 20 years, several studies have been conducted to study the diagnostic value of WB-MRI for detecting skeletal metastases in solid tumors [18,19,20]. A recent study by Kembhavi et al. [37] showed that WB-MRI using the STIR sequence detected all bone lesions both in adults and children.

However, differences between adults and children are particularly important in this context. The types of tumors children develop, and the biology of these types of tumors, generally differ from adult types. Furthermore, it is during childhood and adolescence that bone marrow conversion takes place, from predominantly hematopoietic (red marrow) at birth to fatty (yellow) marrow and this process complicates the interpretation of bone marrow lesions in the pediatric age group. This enhances the importance of summarizing the existing evidence on the diagnostic value of WB-MRI for detecting skeletal metastases specifically for children and adolescents.

Limitations of this review

One of the limitations of this review is the low number of included studies. Although there were a significant number of relevant studies, many studies included both adults and children, from which the data of children could not be extracted.

Another limitation is related to the reference standard: the lack of histopathological proof of many lesions and the fact that various different imaging modalities were used as reference standard, including MRI. However, all included patients did undergo a reference standard.

Furthermore, the included patient cohorts were rather heterogeneous: All studies pooled patients with different types of malignancies, and in most studies results were not specified per tumor type. Hence, no evidence per specific tumor type on the value of WB-MRI in the staging process could be extracted. Nevertheless, we can conclude that, in general, WB-MRI seems to be a sensitive tool for detecting skeletal metastatic lesions.

Finally, all studies have either an unclear or high risk of bias or high concern on applicability, although in some aspects, our criteria might have been too strict: We judged low for lack of information on experience of the observers, but if we discard these data, most of the studies would have low concerns.

Conclusion

WB-MRI seems a promising radiation-free alternative to the currently used modalities for detecting skeletal metastases in children with solid tumors. The heterogeneity and the small size of the studies, however, do not provide conclusive evidence that WB-MRI can offer equally good or better staging for each specific tumor type. In addition, before implementing WB-MRI in the staging process, the choice of sequence(s) and parameters should be standardized.