Introduction

Positron emission tomography integrated with computed tomography (PET/CT) imaging using 18F-2-fluoro-2-deoxy-d-glucose (18F-FDG) is widely used in oncology for diagnosis, staging, and treatment response evaluation [1,2,3,4,5,6,7,8,9,10,11]. The standardized uptake value (SUV), a semi-quantitative metric derived from PET images, is most commonly used as a surrogate of metabolic activity for quantifying 18F-FDG tumour uptake [1]. SUV can be derived from static PET acquisitions, typically initiated 1 h post-injection (pi) where every bed position is scanned once for 2–5 min [1, 12]. Through standardization methods regarding patient preparation (to avoid, e.g., high plasma glucose), PET acquisition settings, image reconstruction, and analysis methods, SUV variability can be limited to a great extent [1, 13, 14]. However, quantitative accuracy of SUV can also be influenced by changes in plasma kinetics due to treatment possibly causing inaccurate assessments [15,16,17].

Dynamic PET imaging allows for spatiotemporal activity concentration distribution measurement which can provide voxel-wise metabolic information when used by tracer pharmacokinetic modelling methods, for example, full kinetic analysis, i.e., Patlak analysis [18,19,20,21].

Up to recently, dynamic PET imaging was mainly performed using single-bed/single-axial field-of-view acquisitions. Currently, with state-of-the-art PET/CT systems, whole-body dynamic (Patlak) images can be obtained from a combined acquisition of first 6 min of dynamic imaging over the heart to obtain the arterial input function (IF) followed by multiple whole-body sweeps up to 60 min pi. This procedure followed by a standard static whole-body PET, however, can take a total examination time to 75 min [22], including the time needed for patient positioning and CT procedure. Initial PET examinations using whole-body Patlak imaging showed a high frequency of patients’ inability to comply with the long scan duration required for the protocol.

The use of a population-averaged input function (PIFs) could obviate the need for the first dynamic scan and minimize whole-body sweeps to an interval of 30–60 min pi [23] making whole-body dynamic Patlak imaging clinically feasible. There have been various studies in which using a PIF is explored in oncological whole-body dynamic 18F-FDG imaging [21, 23,24,25,26]. Promising results were obtained in comparison with using an arterial IF and an image derived IF; however, further evaluation of microparameter estimation (such as k3 and (if it exists) k4) is recommended before implementation [24]. Therefore, given that this approach may introduce some inaccuracy, this study explored the effects of (incorrect) PIFs on the accuracy of the proposed Patlak method using various simulations including variations in rate constants. In addition, the extent of mitigating these biases through rescaling of the PIF to image-derived values at 30–60 min pi was evaluated.

Materials and methods

To explore the effects of (possibly incorrect) PIFs on the accuracy of Patlak analysis based on dynamic whole-body PET acquisition from 30 to 60 min pi, various tumour time-activity curves (TACs) were simulated. To this aim, a representative IF from previously acquired data (acquisition and processing described in [21]) was used as the PIF as well as rate constants based on literature: K1 was 0.301 min−1, k2 was 0.600 min−1, and k3 was 0.047 min−1 [18, 27]. TACs were created according to Eq. 1:

$$ {C}_{\mathrm{tissue}}={C}_{\mathrm{blood}}\bigotimes \left[{k}_2{e}^{-\left({k}_2+{k}_3\right)t}+{k}_3\right]\frac{K_1}{k_2+{k}_3} $$
(1)

Here, Cblood represents the original representative left ventricle arterial IF, and Ctissue is the obtained TAC. The PET TAC was then generated using Eq. 2:

$$ {C}_{\mathrm{PET}}=\left(1-{V}_b\right)\times {C}_{\mathrm{tissue}}+{V}_b\times {C}_{\mathrm{blood}} $$
(2)

with Vb equal to the blood volume fraction.

Simulations were performed in the Python programming language using an in-house written code.

Through the multiplication of the IF with a positive and negative gradual linear bias using different slopes over 60 min, variations in IF were simulated. Differences in slope steepness included a positive and negative gradual linear bias from one to 5, 10, 15, 20, and 25% multiplied by the IF, e.g., the IF was multiplied by a positive and negative linear function starting at y = 1 to y = 1.25 and starting at y = 1 to y = 0.75 in the case of 25% bias. These simulations were also reversed, e.g., the IF was multiplied by a positive and negative linear function starting at y = 1.25 to y = 1 and starting at y = 0.75 to y = 1 in the case of 25% bias. Variations in IF accounted for a total number of 35 simulations.

Other variations that were combined with the gradual bias from one to different slope sizes included the use of rate constants (K1, k3, and both K1 and k2) multiplied by 2, 1.5, and 0.75 (fitting the mean and exceeding the range of rate constants observed in clinical data [27]). These rate constant variations combined with IF modifications accounted for a total number of 180 simulations. In addition, noise was added to the IF modifications (μ = 0 and σ = 5, 10, and 15% simulating high, medium, and low noise equivalent count rate [28], respectively) to account for the differences between PET/CT systems and reconstruction settings. The noise additions combined with IF modifications accounted for a total number of 75 simulations. For the simulations described above, the blood volume fraction was fixed at 8.9% [27].

To show the extent to which incorrect PIFs influence the accuracy of Patlak analysis, analyses using the original IF (representing the PIF) were used to obtain the influx constant (Ki) for the differently simulated TACs. Subsequently, the incorrect PIFs were scaled towards the correct value using the 30–60-min pi time-interval simulating scaling of the PIF to image-derived values. After rescaling, Ki were obtained using these adjusted PIFs to see the extent to which previously obtained biases in Ki could be mitigated. A schematic overview regarding the modification and rescaling of the IF is shown in Fig. 1.

Fig. 1
figure 1

Schematic overview of PIF (green) with a gradual linear modification (dashed red) towards a variety of (incorrect) IFs (red) (ac) and subsequent rescaling of the PIF at 30–60 min pi (d, e)

Results

In total, 290 variations were simulated (35 gradual in or decrease of the IF and reversed, 180 rate constant variations, and 75 noise simulations). Percentage bias in Ki using simulated incorrect IFs varied from − 16 to 16% depending on the simulation type, amplitude, addition of noise, and direction of the IF modifications. Subsequent scaling of the PIF reduced these Ki biases in most cases to between − 3 and 4% for the gradual in and decrease of the IF and reversed, between − 3.7 and 4% for the rate constant variations combined with gradual in and decrease of the IF, and between − 2.4 and 3% for the noise addition combined with gradual in and decrease of the IF.

Figure 2 shows the influence of positive, negative, and reversed gradual input function modification over 60 min of 5, 10, 15, 20, and 25% on the percentage difference in Ki before and after rescaling of the PIF. The effect of variations in rate constants combined with a gradual modification of the IF on the percentage difference in Ki before and after rescaling of the PIF is shown in Fig. 3. For the influence of added noise combined with a gradual modification of the IF on the percentage difference in Ki before and after rescaling of the PIF, see Fig. 4.

Fig. 2
figure 2

The effect of positive (a), negative (b), positive reversed (c), and negative reversed (d) gradual input function modification over 60 min of 5, 10, 15, 20, and 25% on the percentage difference in Ki. The empty markers represent the effect of the modification on Ki. The filled markers show the remainder of this bias after rescaling of the PIF

Fig. 3
figure 3

The effect of variations in rate constants combined with 15% gradual modification of the IF on the percentage difference in Ki. The empty markers represent the effect of the modification on Ki. The filled markers show the remainder of this bias after rescaling of the PIF

Fig. 4
figure 4

The effect of added noise combined with 15% gradual modification of the IF on the percentage difference in Ki. The original input function with added noise is represented by the filled black marker. The empty markers represent the effect of the modification on Ki. The filled markers show the remainder of this bias after rescaling of the PIF

Discussion

Tumour quantification using SUVpeak following the PERCIST guidelines describes stable metabolic disease as an increase or decrease in SUVpeak of less than 30% [29]. However, SUV can be affected by variations (between subjects or longitudinally) of the plasma kinetics which may cause inaccuracies in clinical assessments. Nevertheless, Patlak analysis also remains an estimate of the tumour’s kinetic behaviour and may be biassed in case of large blood volume fractions for example in cases with bulky tumours [21]; this should be taken into account. Based on the reasoning as described above, we determined that an accuracy level of 5% is acceptable, i.e., well within repeatability levels (which are in the order of 10 to 15% [30,31,32]).

In most cases, rescaling of the PIF reduced Ki biases to < 5%. However, there were three incidences (out of 290) which resulted in remaining biases > 5% after rescaling: the combination of 25% gradual IF modification and k3*2 resulted in a remaining Ki bias of 8%, the combination of 25% gradual IF modification and k3*1.5 resulted in a remaining Ki bias of 7%, and the combination of 20% gradual IF modification and k3*2 resulted in a remaining bias in Ki of 6%. When using PIFs for whole-body Patlak imaging in patients with high nuclear grade and/or high proliferation activity tumours (associated with higher k3 [33]), this should be taken into account.

Please note that when using Patlak analysis, bias may occur even with a perfectly correct IF. Bias in the Patlak analysis may occur as a result of not incorporating the fractional blood volume in the Patlak equations. Blood volume fractions of 0, 8.9, and 20% resulted in a Patlak Ki bias of − 1.2, − 10, and − 20%, respectively (data not shown). This indicates that the Ki bias through the use of a (possibly incorrect, but rescaled) PIF is small compared to possible biases in case of large blood volume fractions. With regard to the acceptable accuracy level of 5% when using a PIF for estimating the tumour kinetic behaviour, the acceptable total bias would be 25%, which is just within the PERCIST recommendations [29]. Yet, for most tumours, the blood volume fraction is typically smaller than 10% [27] resulting in a total acceptable bias of 15%, so again, well within the limits of agreement regarding repeatability and PERCIST criteria.

A similar study to develop a simplified Patlak protocol through using PIFs based on clinical data was performed by S. Yao et al. [26]. Here, similar biases (from − 20 to 20%) were induced to a representative IF to simulate and explore the effect of possible errors in PIFs when applied at 20 min pi. They concluded that whenever the IF modification remains below 20%, quantitative inaccuracy regarding Ki would be around 4% [26], which is in line with the results of our simulations.

Another study that explored the alternative of using a PIF instead of arterial blood sampling found a very high correlation between the two methods [23]. The addition of variability in rate constants and noise in our study provides a more comprehensive and realistic reflection of the possible range in kinetic parameters seen in tumours.

Conclusion

Simulations suggest that scaling of a possibly incorrect PIF to (image derived) IF values seen in whole-body dynamic imaging from 30 to 60 min pi could be a good strategy to obtain accurate Ki estimates. Consequently, dynamic Patlak imaging protocols may be performed for 30–60 min pi making whole-body Patlak imaging clinically feasible.