Diffuse large B cell lymphoma (DLBCL) is the most common type of mature B-cell non-Hodgkin lymphoma, accounting for approximately one-third of all cases. This is a heterogeneous disease with different prognostic patterns. Patients benefit from immuno-chemotherapy; however, 30 to 40% of patients are refractory to it or rapidly relapse after first-line treatment, emphasizing the need to better characterize these patients.

Many prognostic scores have been elaborated over time to better categorize patients into different risk groups. Although the International Prognostic Index (IPI) is the most used scoring system at diagnosis in clinical practice [1], it fails to characterize the biological and molecular patterns of the disease and thus patients’ outcomes [2].

Immunohistochemical (IHC) algorithms have been used for risk stratification, according to the cell of origin (COO). It categorizes patients into germinal center B-cell like (GCB)-DLBCL, activated B-cell like (ABC)-DLBCL, about 10–15% of cases being unclassified DLBCL subtypes. Patients with the GCB subtype usually have better prognosis than patients with the ABC subtype. Although COO is useful for predicting patients’ outcome, the GCB and ABC subtypes remain heterogeneous, with better and worse prognostic subsets within each group. On the other hand, clinical studies investigating the capacity of COO by IHC algorithms in determining the efficacy of certain therapies showed conflicting results, which is partly due to the IHC techniques used.

A major advance in the determination of COO was the application of gene expression profiling (GEP), which facilitates the identification of multiple genetic abnormalities in DLBCL.

GEP can be performed using several molecular biology techniques. GEP used in formalin-fixed paraffin-embedded tissue (Affymetrix) [3] is probably the preferred method, and the digital gene expression-based assay using NanoString technology (Lymph2Cx) [4] shows promising results. However, these techniques are not yet available in most centres, expensive, and not used in routine practice. The reverse transcriptase–multiplex ligation-dependent probe-amplification (RT-MLPA) on frozen sections of the tumour specimen showed a good correlation with the Affymetrix technique, the gold standard classification in determining the COO of DLBCL [5]. RT-MLPA is cost effective and rapid to implement in routine practice.

FDG PET is now routinely used for baseline staging and response assessment in DLBCL [6]. Retrospective studies have shown that a high total metabolic tumour volume (TMTV) on baseline PET could be used as a prognosticator [7]. There is however to date no consensus on the method to be used to draw TMTV, and the different contouring methods available have been shown to affect TMTV and therefore to provide different thresholds for patients’ risk stratification. It should be noted that Cottereau et al. [8] showed in another NHL subtype that despite providing different thresholds, TMTV remains a strong prognosticator whatever the contouring method used. However, their study was performed using EARL-compliant data in four out of five centres, and the same TMTV delineation for the whole series of patients. Applying different contouring method for each centre in the multicentre setting would probably affect the capability of FDG PET to predict patients’ outcome. TMTV is also strongly affected by the newer reconstruction algorithms [9, 10]. These issues are exemplified by the figure below. Figure 1.

Fig. 1
figure 1

Representative examples of TMTV delineation using the 41% isocontour method recommended by the EANM, a SUVmax = 2.5 threshold or a more advanced gradient based method in EARL compliant images (left panels) or images reconstructed with unfiltered point spread function (PSF) reconstruction (right panels)

In this issue of the EJNMMI, Toledano et al. [11] present a retrospective analysis of baseline TMTV in 114 DLBCL patients. TMTV was computed using the 41% isocontour method recommended by the EANM guidelines and combined to GEP using RT-MLPA to predict patients’ outcome. Their study showed that the combination of TMTV and GEP led to the identification of three subgroups harbouring strikingly different prognoses: (i) patients with low (<261 cm3) TMTV, whatever the GEP, (ii) high TMTV (>261 cm3) and BCB phenotype, and (iii) high TMTV and ABC phenotype. The 5-year progression-free survival (PFS) of these patients was 72%, 51%, and 17% respectively, and both TMTV and GEP were found to be independent prognosis factors for PFS.

One can question how and when this combination of two biomarkers so far unused in clinical routine will make its way into our daily practice to achieve our goal of a tailored medicine: the identification of patients with the worst prognosis who could benefit from alternative therapeutic strategies. Confirmation of Toledano and colleagues’ findings will require larger series of patients, pooling data from different centres. Because of the issues mentioned above, this confirmation will need harmonization of PET methodology, from acquisition/reconstruction to TMTV contouring using an automatic or semi-automatic (to minimize inter- and intra-observer variability as much as possible), reliable, and user-friendly contouring tool. The last item is not trivial; computation of TMTV will certainly be accepted by the PET community in routine practice only if it minimally affects the time required for PET reporting in busy PET centres. While we know for sure that standardization of the methodology used to delineate TMTV is warranted, one can wonder how much standardization is needed for GEP and which technique is likely to be used on a large scale. RT-MLPA, the GEP technique used by Toledano et al. to determine the COO of DLBCL, is cost-effective and rapid to implement in routine practice.

Finally, evaluation of the combination of several biomarkers to predict PFS or OS using the Cox model implies statistical considerations, given that the number of patients required to reach an adequate statistical power depends on the number of events (relapse for PFS and death for OS) and the number of variables (biomarkers and other known prognostic factors such as the IPI score). Hence, larger series of patients would open the way to multivariable analysis with an adequate statistical power or even the use of machine-learning methodology [12], using multiple PET metrics such as total lesion glycolysis (TLG), and tumour heterogeneity using textural features (TFs) [13]. These features are extracted from the TMTV and are also known to be sensitive to reconstruction parameters [14], highlighting, again, the need for a standardization of PET data.

In conclusion, the study from Toledano et al. paves the way for the combination of FDG PET metrics with a simple molecular technique to determine the prognosis and eventually tailor the therapy of patients with DLBCL. To this end, a consensual TMTV delineation procedure is mandatory.