Introduction

To evaluate the efficacy of diagnostic imaging, one can assume a hierarchical model consisting of the following: (1) technical performance (can the target abnormality be visualized?), (2) diagnostic performance (is an accurate diagnosis possible?), (3) efficacy for patient management (is the management plan likely to be modified based on an accurate diagnosis?), (4) efficacy for promoting patient’s health (is the patient’s health likely to be improved as a result of the management plan modification?), and (5) social efficacy (is the diagnostic imaging cost-effective?). Evaluating these steps, in this order, is thought to be appropriate when conducting studies to determine the efficacy of diagnostic imaging [1, 2].

A large number of articles and books concerning the diagnostic performance of positron emission tomography using [18F]-fluorodeoxyglucose (FDG-PET) are already available worldwide [37]. Thus, the most important step in evaluating the efficacy of FDG-PET as a diagnostic imaging modality, at present, is an evaluation of its efficacy for patient management during clinical practice, i.e., an evaluation of whether the patient management strategy is modified based on the FDG-PET findings.

A randomized inter-group comparison and a comparison of patient management plans between pre- and post-diagnostic imaging periods (comparison between pre-test and post-test periods) are now available as two different approaches for evaluating the efficacy of diagnostic imaging for patient management. A randomized inter-group comparison involves two groups (a group receiving the diagnostic imaging and a group not receiving the diagnostic imaging). Using the inter-group comparison study design, it is difficult to assign the subjects (i.e., to select two groups of patients with strictly matched background variables) to two different evaluation groups. A comparison between pre-test and post-test periods is, on the other hand, a more efficient study design because a diversity of illnesses and pathophysiologies can be included by evaluating a single group of patients [8]. All the previous research on this topic has adopted a “comparison compares patient management plans between pre-test and post-test periods” study design [6, 927].

The present study was undertaken to analyze the proportion of patients whose management strategies were modified based on findings obtained from the addition of FDG-PET to their existing test menus (the strategy modification rate) using the concept of a comparison between pre-test and post-test periods. This study was initiated as a pre-marketing clinical study and was later modified to become a post-marketing clinical study because the manufacture and distribution of FDG were approved during the study period. Thus, this study was carried out in accordance with both Good Clinical Practice (GCP) guidelines and Good Post-marketing Study Practice (GPSP) guidelines in Japan [28, 29].

Materials and methods

This study was performed between April 5 and December 28, 2005, as a multicenter open study involving eight medical facilities. The participating facilities and the study organization are shown in the Appendix. Before the study commenced, the study protocol, case report form, informed consent form for patients, as well as other necessary documents, and the appropriateness of the study were reviewed by the Institutional Review Board (IRB) of each participating facility from ethical and scientific points of view. The IRB of each participating facility approved the study.

Subjects

The study involved patients with one of nine types of cancer (lung cancer, breast cancer, colorectal cancer, head/neck cancer, brain tumor, pancreas cancer, malignant lymphoma, malignant melanoma and cancer of unknown origin). Informed consent was obtained from each patient before the start of the study. Patients were eligible to participate in the study if their management strategy could be evaluated at both the time of study entry and after an FDG-PET examination conducted for one of the following purposes: (1) distinction between malignant and benign lung nodules and diagnosis of lung cancer metastasis/recurrence, (2) distinction between malignant and benign breast tumors and diagnosis of breast cancer metastasis/recurrence, (3) diagnosis of colorectal cancer metastasis/recurrence, (4) diagnosis of head/neck cancer metastasis/recurrence, (5) diagnosis of brain tumor recurrence, (6) distinction between malignant and benign pancreas tumors, (7) malignant lymphoma staging and diagnosis of recurrence, (8) identification of the primary location of cancer of unknown origin, and (9) diagnosis of malignant melanoma metastasis/recurrence.

The exclusion criteria were as follows: (1) pregnant or possibly pregnant women or lactating women, (2) patients who had participated in any other clinical trial involving the use of FDG, (3) patients who had been treated with any other test drug within the 6 months prior to FDG administration, and (4) patients who were judged by the investigators as being inappropriate for inclusion in a study evaluating FDG efficacy and safety.

Patient enrollment

Written informed consent to participate in the study was obtained from 578 patients. Of these patients, 565 were administered FDG (565/578, 97.8 %) and 13 were not administered FDG (13/578, 2.2 %) for reasons such as withdrawn consent, and others. After FDG administration, the study was not discontinued in any of the patients. Of the 565 patients who were administered FDG, 5 patients were excluded from the analysis because they failed to meet inclusion criteria or because they met exclusion criteria (5/578, 0.9 %). The remaining 560 patients (560/578, 96.9 %) were included in the analysis. Table 1 shows the demographic data of the patients who were analyzed. Overall, 263 males and 297 females with a mean age of 63.1 years (range: 21–89 years) were included in the analysis.

Table 1 Demographic data for the patients analyzed

PET scan

Each patient received an intravenous injection of 2 mL of FDG (185 MBq at reference time). The FDG was provided by Nihon Medi-Physics Co., Ltd. (Tokyo, Japan) or The Medical and Pharmacological Research Center Foundation (Ishikawa, Japan). Drip infusions of glucose were suspended and the patients were asked not to drink beverages containing alcohol or carbohydrates from 4 h before FDG administration until the end of PET scanning. Before FDG administration, each patient’s blood glucose level was measured to judge the appropriateness of the FDG administration (cutoff blood glucose level: 200 mg/dL). In diabetic patients receiving insulin for blood glucose control, the administration of insulin was suspended during the 4-h period before FDG administration to stabilize the blood glucose level before FDG administration by avoiding a sharp reduction in the blood glucose level.

PET scanning was started approximately 60 min after the FDG administration (mean 57.4 ± 13.7 min). The initial emission scan (2–3 min × 6–9 scans) was followed by a transmission scan. Image reconstruction was performed using a matrix size of 128 × 128, and preprocessing filters, such as Gaussian filters, FORE filters, or ramp filters (filters were not used at some facilities). OSEM filters were used as reconstruction filters. The presence or absence of a preprocessing filter had no effect on the image assessment. The PET/CT camera used for this study was a Discovery ST (GE) (306 patients) or a Biograph LSO Duo (Siemens) (136 patients). The PET camera used was an ECAT EXACT (Siemens) (60 patients) or an ECAT ACCEL (Siemens) (63 patients). In this study, 442 patients (78.2 %) were evaluated using PET/CT and the remaining 123 patients using PET.

Efficacy evaluations

Figure 1 illustrates the flow of the management strategy evaluation before and after FDG-PET. The percentage of patients whose management strategy as determined before FDG-PET was modified after FDG-PET (strategy modification rate) was analyzed. Investigators at each institution evaluated the management strategy. The parameters/indicators of the management strategy evaluation are shown in Table 2.

Fig. 1
figure 1

Flow of management strategy evaluation before and after FDG-PET

Table 2 Parameters and definitions of variables for management strategy evaluation

The investigators at each institution assessed the overall findings and the patient information available before FDG-PET and decided on the initial management strategy; they then transmitted the records by facsimile to the data center. After the FDG-PET examination, the investigators recorded the overall findings on the addition of the test findings obtained using FDG-PET and checked the presence/absence of additional diagnostic information arising from the FDG-PET examination (Table 2). Based on these overall findings and the additional diagnostic information, a new management strategy was decided (Table 2). The investigators devised the new management strategy based on the FDG-PET findings before any further tests were carried out and then transmitted the records regarding the new management decision by facsimile to the data center. In this manner, the study design minimized the factors affecting the two management strategies adopted before and after the FDG-PET examination, i.e., the modifications to the first management strategy were based only on the FDG-PET findings.

The definitions of management strategy modifications are shown below:

  1. 1.

    The planned tests were modified, supplemented, or skipped after FDG-PET

  2. 2.

    Test planning was difficult before FDG-PET, but became possible after FDG-PET

  3. 3.

    The planned treatments (including treatment methods) were modified, supplemented, or skipped after FDG-PET

  4. 4.

    Treatment planning was difficult before FDG-PET, but became possible after FDG-PET

The modification of the management strategy was investigated by checking the management strategy evaluation sheet filled in by the investigators before and after the FDG-PET examination, with reference to the definition of a modification of the management strategy mentioned above.

Of the variables/indicators analyzed, the primary endpoint was the strategy modification rate for patients with lung cancer, because the number of lung cancer patients who underwent FDG-PET examination was relatively large. The target modification rate was set at 25.6 % because that was the average rate (493/1924 lung cancer patients who underwent FDG-PET) found in 17 reports that we used to estimate the number of subjects needed for our study [5, 924]. The secondary endpoint was the percentage of patients with cancer other than lung cancer in whom the management strategy was modified. The target number of subjects with lung cancer for our study was set at 170 to allow estimation of the modification rate of 25.6 % set for the lung cancer patients, with the lower bound of the 95 % confidence interval [CI] being 7 % or less, based on an F distribution. The number of subjects was set at 600 (in total), based on the percentage of each type of disease among all patients who underwent FDG-PET during 1 month in 2004 (in addition to the estimated requirement for 170 lung cancer patients and taking into account the anticipated exclusion of some patients from the analysis).

As a post hoc subanalysis, major changes or minor changes in the treatment plan and changes in the intensity of treatment after FDG-PET, compared with the pre-FDG-PET period, were analyzed. The method used for this subanalysis was based on a report by the National Oncologic PET Registry [30]; the parameters/indicators that were analyzed and the definitions that were used are given in Table 3.

Table 3 Criteria for classification of treatment plan modification patterns

Evaluation of safety of FDG administration

The parameters/indicators of the safety evaluation included subjective symptoms, objective findings, heart rate, blood pressure, and laboratory parameters such as RBC count, hemoglobin, hematocrit, WBC count, differential leukocyte count (neutrophil, lymphocyte, monocyte, eosinophil, and basophil), platelet count, albumin, Al-P, AST (GOT), ALT (GPT), γ-GTP, LDH, total bilirubin, urea nitrogen, creatinine, Na, K, Cl, urinary protein, urinary glucose, urinary urobilinogen, and urinary occult blood. The safety parameters/indicators were evaluated within 7 days after the FDG administration and were compared with the corresponding values obtained before FDG administration.

Statistics analysis

Comparisons between the target modification rate (25.6 %) and the modification rate of this study were performed using the χ 2 test, and differences were considered statistically significant when p value was less than 0.05. SAS System ver. 9.2 (SAS Institute Japan, Tokyo, Japan) was used for the statistical analysis.

Results

Primary endpoint (lung cancer)

Management strategy modification after FDG-PET and calculation of the modification rate (analysis according to patient)

Table 4 shows the management strategy modification rate (percentage of patients whose test plan or treatment plan was modified) and the 95 % CI for patients with lung cancer (n = 208). The management strategy modification rate for lung cancer was 71.6 % (149/208 patients, 95 % CI 65.0–77.7 %), which was higher than the target modification rate (25.6 %). The difference was statistically significant (p < 0.01). Table 4 also shows management strategy modification rates for lung cancer patients subdivided according to the objective of the diagnostic imaging.

Table 4 Management strategy modification rate after FDG-PET (lung cancer, analysis according to patients)

Details of management strategy modification

In an analysis of modifications to the test plan, the most characteristic finding was the modification of the pre-FDG-PET judgment “difficult to devise a test plan at present” (n = 59) to the post-FDG-PET judgment “test plan available” (n = 25) or “no test plan” (n = 33) in 58 (98.3 %) of the 59 patients (Table 5a).

Table 5 Modification of lung cancer management strategy based on FDG-PET findings

In an analysis of modifications to the treatment plan, the most characteristic finding was the modification of the pre-FDG-PET judgment “difficult to devise a treatment plan at present” (n = 117) to the post-FDG-PET judgment “treatment needed” (n = 55), “follow-up needed” (n = 40), or “no treatment needed” (n = 2) in 97 (82.9 %) of the 117 patients (Table 5b).

Secondary endpoint (cancers other than lung cancer)

Table 6 shows the management strategy modification rates and their 95 % CIs for patients with cancers other than lung cancer. Because there was little number of cases, brain tumor (n = 3) and malignant melanoma (n = 7) were eliminated. The modification rate was in the range 44.4–85.0 % for each type of cancer.

Table 6 Management strategy modification rate based on FDG-PET findings (cancers other than lung cancer, analysis according to patients)

Subanalysis

Modification of treatment plan

Major and minor changes in the treatment plan based on FDG-PET findings were analyzed among the 560 patients with cancer divided into subgroups according to the objectives of the FDG-PET examination (differential diagnosis, disease staging, metastasis/recurrence diagnosis, and primary tumor location identification). The results are shown in Table 7. Modifications of the treatment plans based on the FDG-PET findings were made in 55.4 % of patients. Major changes in the treatment plans were in the range 43.3–68.2 % for patients categorized according to the objectives of the FDG-PET examination. The treatment plan was changed in more than 50 % of patients in the differential diagnosis group, the metastasis/recurrence diagnosis group, and the primary location identification group, with such changes occurring in 224 of the 410 patients. In the disease-staging group, the treatment plan was modified in 44.0 % (66/150) of the patients.

Table 7 Modification of treatment plan based on FDG-PET findings

Intensity of treatment

Modifications of the intensity of treatment were analyzed for patients who had been judged as “treatment needed” (n = 156) before the FDG-PET examination. These patients were divided into groups according to the objectives of the FDG-PET examination (differential diagnosis, disease staging, metastasis/recurrence diagnosis, and primary tumor location identification). The results are shown in Table 8. For 89 % (139/156) of patients, the intensity of treatment was unchanged after FDG-PET examination. However, the intensity of treatment was increased after FDG-PET examination in 14.3 % (1/7) of the patients in the differential diagnosis group and 8.8 % (9/102) of the patients in the disease-staging group, whereas it was decreased in 7.7 % (3/39) of the patients in the metastasis/recurrence diagnosis group.

Table 8 Modification of the intensity of treatment based on FDG-PET findings

Safety

Of the 565 patients who were administered FDG, 100 patients (17.7 %) experienced adverse reactions, including abnormal changes in the laboratory parameters. Frequent adverse reactions were urinary protein positive (15 cases, 2.7 %), urinary occult blood positive (13 cases, 2.3 %), urinary glucose positive (10 cases, 1.8 %), blood pressure increased (10 cases, 1.8 %) and nausea (5 cases, 0.9 %). All adverse reactions were of minimal severity and posed no clinical problems. No serious adverse reactions were noted during this study.

Discussion

The present study investigated the proportion of management strategies modified after an FDG-PET examination was performed in addition to the existing test plan. The study was performed using a design based on comparison between the pre-test and post-test periods, which is known to involve various possible biases. With this in mind, the following measures were taken to optimize the study design. In this manner, we sought to ensure the reliability of the efficacy evaluation in this study.

  • The data center sets the parameters/indicators for the management strategy evaluation in advance, taking into account the status of FDG-PET use at medical facilities and referring to textbooks, published papers, and guidelines relating to each illness. The data center then obtained detailed records of all modifications that were made to the management strategy after the FDG-PET examination, compared with the pre-FDG-PET strategy. In this way, the validity of the evaluation was ensured.

  • Individual investigators made a general assessment of the findings based on the results of tests available at the time of entry and filled in the pre-FDG-PET management strategy on the entry sheet, which was then transmitted by facsimile to the data center. The data center then checked the entry sheet received by facsimile and registered the patient. Only then was the FDG for use in that patient delivered. In this way, it was assured that a series of evaluations of the management strategy had indeed been made prior to the FDG-PET examination.

  • Individual investigators filled in the post-FDG-PET management strategy on the case report form by the day when the test affecting the decision on management strategy was performed. The filled-in case report forms were then transmitted by facsimile to the data center. This step was intended to eliminate biases between the pre- and post-FDG-PET periods.

For each lung cancer patient, the management strategy was investigated both before and after FDG-PET and the percentage of patients for whom the strategy was modified (management strategy modification rate) was calculated. The management strategy modification rate was 71.6 % (149/208 patients), which was higher than the target modification rate of 25.6 %. We thus judged that the primary endpoint for this study had been verified. In an analysis of the modification rates according to the objectives of the FDG-PET examinations, the modification rate was 88.0 % (73/83 patients) for the differential diagnosis group, 49.3 % (36/73 patients) for the disease-staging group, and 76.9 % (40/52 patients) for the metastasis/recurrence group; each of these rates markedly exceeded the target modification rate (25.6 %).

The target modification rate for this study was based on the modification rate for previous reports evaluating the efficacy of the FDG-PET examinations in patient management [5, 924]. The modification rates varied among these previous reports. The differences among the previous reports used for the study design and also between the target modification rate and the modification rate in this study could come from the different populations (e.g., the number of patients by the stage of lung cancer) and so on.

Among the patients enrolled in this study, the pre-FDG-PET judgments made regarding lung cancer and other types of cancer were sometimes “difficult to devise a test plan at present” or “difficult to devise a treatment plan at present”. This uncertainty seems to have contributed to the high strategy modification rate after the FDG-PET examination. The patient eligibility criteria for the present study were similar to the criteria used for coverage under the national health insurance system in Japan as of 2005. This situation probably explains why, among the patients enrolled in this study, there were many for whom devising a test or treatment plan was difficult before the FDG-PET examination.

In the present study, the management strategy modification rate was also within a favorable range (44.4–85.0 %) for cancers other than lung cancer overall (breast cancer, colorectal cancer, head/neck cancer, malignant lymphoma, pancreas cancer and cancer of unknown origin). These results suggest that FDG-PET contributes to the determination of management strategies not only in patients with lung cancer, but also in patients with other types of cancer as well.

According to the National Oncologic PET Registry (NOPR) report [30], major modifications were made to the management strategy in 30.3–39.7 % of patients undergoing a FDG-PET examination for the purposes of diagnosis, initial staging, restaging, or suspected recurrence. In the present study, major modifications of the treatment plan were made in 43.3–68.2 % of the patients who underwent FDG-PET examinations for these same purposes. The NOPR report additionally showed that minor changes in the management strategy were made at a frequency close to that of the major changes. In the present study, on the other hand, the highest frequency of minor changes was 1.5 % (for patients who underwent FDG-PET examinations for the diagnosis of metastasis/recurrence), and major changes were, instead, predominant. The reason why the results of this study differ from the results of the NOPR report was thought to be due to the different populations (the main types of cancer) and study design. Actually, our study design was one of the most objective and accurate evaluation method for strategy modification among previous reports. These results indicate that FDG-PET has a large impact on determining the treatment plan for the types of cancer for which the use of this imaging has been approved in Japan.

Regarding management plans devised before and after FDG-PET, the NOPR report [30] stated that the intensity of treatment was increased in 10 % of all cases and decreased in 22 % after FDG-PET. In the present study, the intensity of treatment was increased in 14.3 and 8.8 % of patients who underwent an FDG-PET examination for a differential diagnosis and disease staging, respectively, and was decreased in 7.7 % of patients who underwent an FDG-PET examination for the diagnosis of metastasis/recurrence. Thus, the intensity of treatment was changed in about 10 % of all patients after the FDG-PET examination. The increase in treatment intensity probably resulted from judgments regarding the need to use additional treatment methods based on the FDG-PET findings, and the reduction in the treatment intensity probably resulted from judgments regarding the feasibility of skipping some treatment methods. Thus, these results suggest that FDG-PET examinations also have an impact on treatment intensity.

The NOPR report [30] stated that it was not possible to judge whether modifications planned after FDG-PET were appropriate or would provide long-term benefits. In the present study, the management strategy modification rate was calculated and analyzed by comparing the pre-FDG-PET strategies and the post-FDG-PET strategies to evaluate step 3 in a hierarchical evaluation model (efficacy for patient management) to assess the efficacy of diagnostic imaging. However, since the present study did not collect data on the relationship between post-FDG-PET treatment and patient outcome, we cannot discuss such a relationship at this time. To resolve these limitations, a study evaluating step 4 of the hierarchical evaluation model (efficacy for promoting patient’s health) is needed. However, such a study will not be easy to implement because it will require a long period to follow-up patient outcomes.

The present study has several limitations. First, the number of patients in several types of cancer was small, which can be explained by the fact only a few patients undergo the FDG-PET examination. Second, this study used several types of PET or PET/CT cameras for the FDG-PET examination. The FDG-PET examinations are better to be performed under some kinds of standardization for PET imaging systems.

Conclusions

The present study was performed as a multicenter study using a design based on comparisons between pre-test and post-test strategies to evaluate the efficacy of FDG-PET for patient management. The patient management strategies for lung cancer patients were modified after the FDG-PET examinations in 71.6 % of patients analyzed, which was higher than the target modification rate of 25.6 %. Thus, the primary endpoint was verified. There were no serious adverse reactions to the FDG-PET examination, and no concerns were raised from the risk–benefit standpoint. FDG-PET appears to have an impact on decisions regarding the need for additional tests, judging the appropriateness of treatment, and adopting management strategies when dealing with patients encountered during clinical practice whose optimal test plans or treatment plans are difficult to devise.