Introduction

An historical background

Time-of-flight (TOF) measurement was suggested as early as 1969 by Brownell et al. [5]. The first practical scanners using this technology were built in the early 80s by three leading groups. They used CsF [2] as the scintillator. CEA-LETI built a TOF PET, TTV01, in Grenoble [18] and the system was evaluated and used clinically at CEA-Orsay for at least a decade [3, 49]. The second system was built and used at Washington University in Saint Louis [55] and the third one at the University of Texas in Houston [68, 69]. Another fast scintillator, the BaF2, was studied [28] and used by both CEA-LETI in another TOF PET, TTV03, [35], which was also used clinically for about a decade, and a CTI PET system [22]. Budinger [6], Tomitani [57] and Lewellen et al. [29] also contributed significantly in understanding the benefits of TOF. At the system level, measurement between 470 and 750 ps were reached. However, while the light scintillation properties of CsF or BaF2 were favorable for TOF performance, both the scintillators had low density that prevented the possibility of high spatial resolution and sampling. Moreover, the ultraviolet emission of BaF2 made light collection difficult and required the use of more expensive photomultipliers (PMT) with quartz windows.

The need for high spatial resolution led to the parallel development of nonTOF PET during the late 70s and early 80s. The scintillator used was bismuth germanate (Bi4Ge3012 or BGO) [64], which has a high detection efficiency, acceptable light output, and a light emission wavelength of around 480 nm. BGO became the standard in positron emission tomography (PET). However, its long decay time (300 ns) and low light output (about 8000 photons MeV−1) made it unusable in TOF PET. With the success of BGO-based PET systems, which enabled higher spatial resolution and higher sensitivity (although lower count rate than CsF or BaF2), TOF PET development came to a halt.

The discovery of new scintillators in the 90s led to a new phase of TOF PET research and development. Cerium-doped lutetium oxyorthosilicate or LSO (Lu2SiO5) [36] has a relatively high light yield (about 30,000 photons MeV−1), high effective Z (Zeff ≈ 66), high density (7.4 g cm−3) and a short decay time (40 ns). These characteristics allowed for higher spatial resolution and a higher count rate and preserved high sensitivity. This, combined with TOF information, pushed LSO to become the standard PET detector for one of the major PET scanner manufacturers [20, 66]. The short decay time was immediately used to reduce the coincidence window from the 12 ns typical of BGO scanners to 6 ns, and it was later reduced to 4.5 ns with the development of faster electronics. The short time coincidence window, now defined by the desired dimension of the transverse field-of-view (FOV), reduced the random coincidences in the data.

The LSO characteristics could also be used for a new generation of TOF PET scanners. Time resolution down to 300 ps was reported with two single LSO crystals in coincidence, making the TOF PET a viable solution [37]. First attempts to perform TOF reconstruction on a commercial LSO PET scanner showed a measurable gain in signal-to-noise ratio (SNR) due to the TOF reconstruction [9], even with a poor 1.2 ns time resolution.

In parallel, the more recent discovery of cerium-doped lanthanum bromide (LaBr3) [60] opened another avenue for TOF instrumentation. While lanthanum bromide showed lower stopping power than LSO, it had a shorter decay time (16 ns), excellent energy resolution (about 3% at 662 keV), and twice as much light output, making it an attractive candidate for TOF PET [51]. The first LaBr3 TOF scanner was developed at the University of Pennsylvania [24]. A 7.5% energy resolution allowed for a reduced scatter fraction and a 460 ps time resolution was measured. Further improvements in the triggering technique brought the time resolution to 420 ps, and there is evidence that time resolution could be brought down to 315–330 ps [25].

A first commercial TOF PET scanner, the Gemini TF PET/CT, was introduced by Philips in 2006 [52], and uses a LYSO scintillator crystal. LYSO is a material with a structure very similar to LSO: a fraction of the lutetium atoms in the crystal are replaced with yttrium. LYSO properties are also very similar to those of LSO, the main difference being the lower density due to yttrium’s lower weight. The time resolution of the Gemini TF is 585 ps if measured with a low activity source.

At about the same time, the idea of extending the axial field-of view was proposed as a very efficient way to detect more radiation emitted inside the human body, for a given injection dose, in 3D acquisition mode [26].

Shortly after that, both General Electric and Siemens commercialized TOF PET/CT systems based on lutetium-based scintillators with similar TOF performances between 520 and 550 ps [4, 27].

The combination of extending the axial field of view (FOV) and improving TOF performance have contributed to further progress in PET performance. The former leads to more count rates of true event, the latter leads to better use of such counts and thus to the reconstruction of images with better SNR, as will be discussed in the part II of this paper.

Additional valuable sources of information on TOF PET history and research can be found in some recent review papers available in the literature [11, 13, 30, 38, 39, 54, 59].

The recent development of SiPM-based TOF PET

While the TOF capabilities in commercial systems was facilitated by the transition from the BGO scintillator to the fast LSO scintillator, the next evolution in reaching even better TOF performances is currently facilitated by the transition of the photodetector from a photomultiplier tube (PMT) to a silicon photomultiplier (SiPM). SiPM can provide a signal equivalent to PMTs through a similar gain but with the advantage of better energy and time resolution, and a much more compact packaging, as shown in Fig. 1. A very detailed review of the advantage of SiPM and other photodetectors has been written by Lewellen [31].

Fig. 1
figure 1

Photomultiplier tube (PMT) (left) and silicon photomultiplier (SiPM) (right)

In brief, SiPMs consist of an array of photosentitive cells, called single-photon avalanche photodiodes (SPAD), which are operated in Geiger mode. Each time a photon hits one of the cells a charge avalanche is produced, resulting in a discrete signal. Similar to a PMT, the signal resulting from all the cells determines the energy of the incident gamma ray on the scintillator crystal. The signal from all the cells can be processed in different ways. In a so-called analog SiPM, signals from each individual cell are first summed up then digitized to produce timing and energy information. In a so-called digital SiPM, the signal from each cell is first digitized and then the desired information is obtained through summation of the number of cells that produced a single count. A detailed description of both devices as well as respective performance information for individual detectors has been published by Gundacker [19].

Through the recent use of SiPM photodetectors, TOF performance was improved to 300–400 ps. The first system was introduced commercially by Philips in 2014 and a TOF performance of 345 ps was reported [41]. This was later further improved to 310 ps [43]. A second system was recently developed by General Electric, and a TOF performance of 375 ps was reported [21].

While SiPM alone helped to improve TOF performance, other factors in the detector design also contribute to its overall performance. For example, the length of the scintillator crystal used, the type of coupling between the scintillator and the photodetector, as well the electronics processing the signal from the SiPMs all contribute to the detector performance. Siemens recently commercialized a new PET system using SiPM and a TOF performance of 210 ps was reported [44, 61]. This new PET system also exploits the compactness of the SiPM using smaller 3.2 mm crystals, thus improving both TOF and spatial resolution at the same time.

In addition to TOF performance improvements, larger PET sensitivity through an extended axial FOV (to cover the whole body) is also being explored by other groups: it will open the path to new research opportunities, and likely create new clinical applications [8].

In the following section, we will expand the discussion on the concepts of TOF PET and the benefits of the recent technological advances.

The principles and advantages of TOF PET

Principles of TOF PET

Time-of-flight differences between two photons stopped in two detectors of the PET scanner are used to determine if the photons are in “time coincidence”, and, therefore, associated with the annihilation of a positron–electron pair. If the detection time difference between two photons is smaller than a coincidence window (traditionally 4–10 ns), the two events are considered physically associated to the same annihilation event. A line-of-response (LOR) joining the two detectors is drawn, and the source of the positron is assumed to be located in a undetermined position along the line. Without more accurate TOF information, it is impossible to determine which voxel along the line is the source of the two photons; therefore, all the voxels along the line are given the same probability of emission. From the full set of LORs derived from the acquired data, analytical or iterative reconstruction algorithms are used to reconstruct the original activity distribution. Accurate measurement of the time difference is not needed and not available in conventional nonTOF PET.

On the contrary, TOF PET scanners are characterized by a more accurate measurement of the TOF difference, which allows better localization of the source of annihilation along the LOR. The precision is dominated by the time resolution of the TOF scanner or the uncertainty of the measurement of time difference, which ranges from 200 to 600 ps. In fact, the key performance parameter of a TOF PET scanner is the time resolution, typically measured as the full-width-half-maximum (FWHM) of the time distribution of a point source at the center of the FOV.

Since the TOF difference t is proportional to the path length difference x of the two photons, it could be used to know the “exact” position, along the LOR, of the positron annihilation (and if we ignore positron range, this is also the position of the radioisotope or tracer). How “exact” is obviously related to the time resolution Δt of the scanner. Space and time uncertainty are proportional with c and the speed of light: ΔxcΔt/2.

If the time difference could be assessed with no uncertainty, no reconstruction algorithm would be necessary, since the position of the source along the line would be known. In today’s TOF PET, such time measurement accuracy is not achievable, but the TOF information is still used, together with its estimated uncertainty. Each coincidence event is back projected along its LOR, using a bell-shaped probability curve like a Gaussian distribution, with mean position x and FWHM Δx, derived from the measured time difference t and time resolution Δt.

This TOF weighting process localizes the events and reduces the propagation of the statistical noise in the image. In each spatial position, only coincidence events with TOF differences consistent with such positions are accumulated. Noise events and randoms with inconsistent time information are filtered out. The overall effect is lower noise and higher contrast recovery; in other words, higher SNR. It is self-evident that the narrower the TOF kernel Δt is, the more accurate the localization and the stronger the noise reduction.

In fact, the first studies on TOF PET demonstrated that TOF reduces the variance in the images of a factor proportional to Δx/D, where D is the size of the object to be imaged [6, 57].

The ratio VarnonTOF/VarTOF is used as a figure of merit and called TOF gain. Budinger estimated such variance gain or TOF gain as [6]:

$$({\text{SNR}}_{\text{gain}} )^{2} = \left( {\frac{{{\text{Var}}_{\text{nonTOF}} }}{{{\text{Var}}_{\text{TOF}} }}} \right) \cong \frac{D}{{{\raise0.7ex\hbox{$c$} \!\mathord{\left/ {\vphantom {c 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}}\Delta t}} = \frac{D}{\Delta x}.$$

Tomitani [57] mathematically derived the gain in image variance for a TOF confidence weighting filtered back projection (FBP) reconstruction, and obtained a gain reduced by a factor α2 = 1/1.6. More recently, Conti derived from experimental data a factor α2 = 1/1.47 [14]:

$$({\text{SNR}}_{\text{gain}} )^{2} = \left( {\frac{{{\text{Var}}_{\text{nonTOF}} }}{{{\text{Var}}_{\text{TOF}} }}} \right) \cong \alpha^{2} \frac{D}{{{\raise0.7ex\hbox{$c$} \!\mathord{\left/ {\vphantom {c 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}}\Delta t}} = \alpha^{2} \frac{D}{\Delta x}.$$

Finally, an extension of the previous formulas has been introduced that shows a dependence of TOF gain with random fraction: higher gain is observed at higher random fractions [10, 17].

All these estimates are theoretically valid only for the variance measured at the center of a cylindrical source in the middle of the FOV that has a diameter D and is uniform in attenuation properties and source distribution, and when an analytical or linear reconstruction is used, not an iterative method or a method in which image pixels are strongly correlated. Nevertheless, they represent a very good approximation of the TOF gain of a TOF PET scanner, and are commonly used also for general source distributions and iterative methods.

Since the SNR has been demonstrated to be proportional to the square root of the noise effective counts (NEC) in a PET scan [50], the TOF SNR gain is equivalent to a gain in counts, or NECgain = \({\text{SNR}}_{\text{gain}}^{ 2}\). In other terms, a TOF image is equivalent to a nonTOF image obtained with a larger number of counts. One can also consider TOF reconstruction as a “virtual counts or sensitivity amplifier” for a TOF PET scanner [13]. In describing the specifications of TOF PET scanners, sometimes the measured sensitivity and NEC performance are amplified by the estimated TOF gain, and defined “TOF-effective” or “effective” sensitivity and NEC.

To compare the image quality of a different time resolution, the same image quality phantom was imaged in the Siemens Biograph mCT PET/CT scanner with 527 ps intrinsic time resolution [27] and the new Siemens Biograph Vision with 210 ps intrinsic time resolution [44, 61]. The phantom had six hot spheres (5, 8, 10, 13, 17, and 22 mm diameter) with a contrast of about 4:1, and a total activity in the background of about 50 MBq of 18F. The data were reconstructed TOF and nonTOF in both scanners. In Fig. 2, we show one transaxial slice of the image obtained with iterative reconstruction with resolution recovery and no post-reconstruction filter: a nonTOF image from the mCT, a corresponding TOF image from the mCT (527 ps resolution), and a TOF image from the Biograph Vision (210 ps resolution). One can observe the improving image quality with improving time resolution: the smallest sphere is visible only using 210 ps time resolution TOF PET reconstruction.

Fig. 2
figure 2

Transaxial slices of an image quality phantom, reconstructed with high-resolution iterative reconstruction with resolution recovery. The scans were performed on a Siemens Biograph mCT PET/CT scanner (527 ps intrinsic time resolution) and the new Siemens Biograph Vision (210 ps intrinsic time resolution). From left to right: a non TOF image from mCT, b TOF image from Biograph mCT, and c TOF image from Biograph Vision. The phantom had six hot spheres (5, 8, 10, 13, 17, 22 mm diameter) with a contrast of about 4:1, and a total activity in the background of about 50 MBq of 18F

Robustness of TOF PET

There is a whole series of possibilities opened by a characteristic of TOF PET that is intrinsically tied to the extra time information available. The recent literature has pointed out that TOF PET reconstruction is able to better handle data that are inconsistent, incomplete, or incorrect [13, 54]: TOF PET reconstruction is less sensitive to mismatched attenuation correction [12, 34, 58], inaccurate normalization, and scatter correction [12, 63, 65]. To visually demonstrate this phenomenon, the same image quality phantom was scanned in the Siemens Biograph mCT PET/CT scanner (527 ps intrinsic time resolution) and the new Siemens Biograph Vision (210 ps intrinsic time resolution). The phantom had six hot spheres (5, 8, 10, 13, 17, and 22 mm diameter) with a contrast of about 4:1, and a total activity in the background of about 50 MBq of 18F. The data were reconstructed TOF and nonTOF in both scanners. To show the improving robustness of TOF reconstruction with improving time resolution, artifacts were intentionally introduced in the normalization, the attenuation correction, and scatter correction: we replaced the accurate corrections with inaccurate or artificially manipulated corrections. By doing so, artifacts appeared in the images. We observed that, while the inclusion of the TOF information dramatically decreases the artifacts compared to the corresponding nonTOF reconstruction, better time resolution produces images with even lower residual artifacts. Typical normalization ring artifacts are much reduced, attenuation artifacts almost disappear, and TOF images are less biased by inaccurate scatter correction. As an example, in Fig. 3, reconstruction was performed with no scatter correction, and one can observe that the typical hump in the center of the phantom is barely visible in the 210 ps TOF image.

Fig. 3
figure 3

Transaxial slices of an image quality phantom, scanned on a Siemens Biograph mCT PET/CT scanner (527 ps intrinsic time resolution), and the new Siemens Biograph Vision (210 ps intrinsic time resolution). a Non TOF image from mCT, b TOF image from mCT, c nonTOF image from Biograph Vision, and d TOF image from Biograph Vision. The scatter correction was replaced by no scatter correction, which results in a typical bump of activity towards the center of the phantom

Overall, TOF reconstruction provides a dramatic improvement in robustness of the image quality, since it is much less sensitive to any possible human or instrumentation error or even mismatch between PET and CT.

Fast convergence of TOF PET

It is well known that in iterative reconstruction, the choice of the number of iterations is somewhat arbitrary: both contrast recovery (or image sharpness) and noise increase with the iteration numbers, and typically a balance needs to be found.

The spatial information, provided by more precise TOF information of the new generation TOF PET scanners, results in a faster convergence of the TOF version of the same nonTOF iterative algorithm. One can take advantage of this property by pushing convergence higher and reaching higher contrast recovery at comparable noise levels (noise-equivalent images), or using a lower number of iterations for TOF reconstruction, with the result of having a much lower noise level for the same contrast recovery (contrast-equivalent images), or any intermediate choice is possible.

The better the time resolution, the more dramatic and fast the convergence of TOF reconstruction is; see Fig. 4 [14]. In this study, a uniform phantom with hot spheres (4:1) was simulated using Geant4. The contrast recovery of the smallest 10 mm sphere was studied vs. the iteration number or the noise in the background (image roughness).

Fig. 4
figure 4

Contrast recovery coefficient (CRC) vs. noise level, for a Monte Carlo simulation of a 10-mm sphere in a uniform cylinder (4:1 contrast). The nonTOF reconstruction and the TOF reconstructions: 600 ps, 500 ps, 400 ps, 300 ps and 200 ps. Iterative reconstruction: each point represents an iteration (from reference [14])

The clinical advantages of TOF PET

Clinical opportunities for TOF PET

Advantages of the TOF-advanced performance include reduced variance or noise in the images, and higher quality images for clinical diagnosis. This translates also in better quantitative accuracy of lesion uptake and improves confidence in the clinical diagnosis [16]. Furthermore, it improves detectability of small lesions, as demonstrated in past phantom and patient studies [13, 23, 34 and 53]. Since TOF gain increases with patient size, TOF reconstruction acts as a weight equalizer, obtaining consistent image quality over a patient population, regardless of size and weight [34, 56]. With the improvement of TOF performances, the use of higher matrix size (and consequently smaller voxel size) has been facilitated. It is well known [42] that higher spatial resolution requires improved SNR to be clinically useful. Thus, the gain offered by TOF can be useful to allow smaller voxel size, which in turn improves detectability of lesions in clinical images [33, 41].

A second avenue to exploit the improved TOF performance is the further reduction of the injected dose. There is a worldwide trend toward dose reduction, motivated by concerns for patients, population exposure in general, and for exposure of medical professionals [40, 45]. In addition, pediatric applications become more attractive if the dose can be dramatically reduced. A lower dose would make the use of repeated PET scans to monitor the response to therapy more feasible. This could also improve therapy efficacy and reduce therapies that are ineffective or not necessary.

A third opportunity is the reduction of the scan time: scan time can be shortened while keeping the same image quality with better clinical workflow and added comfort for the patient. A lower cost and larger use of PET can be a consequence of this trend.

Obviously, not all three objectives can be obtained at the same time; a trade-off must be reached based on the patient case and the hospital needs.

Emerging clinical applications enabled by high-performance TOF PET

The dramatic development of theranostics and immunoPET has taken place naturally in TOF PET technology, because of the powerful reduction in image noise. Many theranostics and immunoPET tracers use long half-life radioisotopes, often associated with spurious prompt gammas, and typically at very low doses [46]. The combination of these three characteristics produces low count images, high noise, and contamination by a prompt gamma background, which is hard to correct [15]. TOF reconstruction provides noise reduction, and at the same time tends to filter out coincidence data that are inconsistent with the space–time distribution of pure coincidence events [12].

An example can be found in 124I PET imaging for thyroid cancer, which is used in association with 131I radiotherapy. 124I is a radioisotope with a high prompt gamma fraction (about 1/3 of the coincidence events) and long half-life (100 h). It is administered in small doses and, moreover, patients cannot be scanned for longer than 10–20 min. This results in very noisy images, as visible in Fig. 5. The patient in the figure has metastatic thyroid cancer, was injected with 40 MBq of 124I, and was scanned for 4 min per bed on day 5 after injection. Total counts in the sinograms were very low (10–100 × 103). TOF reconstruction (Biograph mCT, 527 ps intrinsic time resolution) improves quantification accuracy and reduces noise in the image, in particular the number of noise artifact foci in the chest and the abdomen are much reduced.

Fig. 5
figure 5

Coronal slice of PET image of a patient injected with 40 MBq of 124I, day 5. Patient with differentiated thyroid cancer at high risk, bearing distant metastases. a NonTOF image and b TOF image. Data kindly provided by Walter Jentzen (University Clinic of Essen, Department of Nuclear Medicine, Germany). To better appreciate the difference of background noise, the color scale has been thresholded to 0.5% of the maximum value

Another example of TOF PET technology as enabler of new clinical applications is the imaging of 90Y radioembolization. Selective internal radionuclide therapy (SIRT) is a treatment for liver cancer based on injection of microspheres embedded with 90Y therapeutic radioisotopes. 90Y is characterized by a very small fraction of positron–electron pair annihilations (32 ppm), and, therefore, an extremely low PET signal (between 50 × 103 and 500 × 103 true counts per scan, which is orders of magnitude lower than a typical oncology scan). As soon as TOF PET became widely available in clinical practice, it showed its characteristics of superior quantitative performance [32], and quickly took off as the preferred technology. Multisite studies showed the superior capability of TOF PET scanners compared to older nonTOF models [7, 67]. Figure 6 shows an example of PET images of a patient undergoing SIRT treatment: noise reduction and better image quality characterize the TOF image compared to the nonTOF PET, in particular the extra hepatic artifact hot spots in the background tissue are greatly reduced.

Fig. 6
figure 6

Coronal slice of PET image of a patient liver undergoing SIRT treatment with 90Y-embedded resin microspheres in the liver. a NonTOF image and b TOF image. Data kindly provided by Kathy Willowson (Institute of Medical Physics, School of Physics, University of Sydney, Sydney, Australia). To better appreciate the difference of background noise, the color scale has been thresholded to 3% of the maximum value

PET/CT screening of patients at risk for lung cancer is a novel clinical application that might be enabled by high-performance TOF PET scanners. While CT lung cancer screening has been proposed [1], CT screening produces a false-positive rate of around 96% [1]. The addition of metabolic information from PET could improve accuracy and greatly reduce false positives [48, 62]. This becomes feasible if the PET dose can be greatly reduced, to reduce the dose to the population of patients at risk. The new generations of PET/CT scanners, with about 200 ps time resolution, can provide a sensitivity gain (compared to conventional non TOF) of about 13 for a patient of about 40 cm abdominal size, which directly translates to an equivalent dose reduction. Dose reduction could be even larger, if a higher level of noise is acceptable for PET/CT screening [47]. PET/CT screening for populations at risk would, therefore, become feasible.

Conclusions

The introduction of lutetium-based scintillators and new photosensors, the SiPMs, has allowed a performance leap in time resolution. This translates into a considerable increase in TOF gain in SNR, unseen reconstruction robustness to inconsistent data, and increased convergence speed. New applications are enabled for PET scanners in the field of low count imaging, such as 90Y imaging and theranostics, pediatric imaging, screening of patients at risk, and possibly many others.

We believe that even if the advantages of TOF PET technology have not yet been extensively studied and understood, the full adoption in clinical practice will continue to grow. While many scientific articles have already demonstrated the benefits of TOF, more clinical benefits and applications are yet to be exploited and advanced: this is an exciting time for PET scientists and clinicians.