1 Introduction

Precision measurements of the Higgs boson and of the top quark are key objectives of the FCC-ee physics programme [1], and define the requirements for the detector design and the reconstruction algorithms for the higher-energy stages of the machine at \(\sqrt{s}=\) 240 GeV and 350/365 GeV. The four- and six-fermion final states of HZ and \(t\bar{t}\) events are typically characterised by the presence of multiple hadronic jets, heavy quarks, leptons, and neutrinos. To fully capitalise on the potential for precision measurements provided by the high integrated luminosity and the clean experimental environment at the FCC-ee, specific detector technologies as well as reconstruction strategies and algorithms are needed to achieve an optimal performance for the complex Higgs boson and top quark events. Jet energy resolution, flavour tagging in hadronic jets, and global event and missing energy reconstruction are the key ingredients to achieve this goal. By taking advantage of the knowledge of the initial state, powerful kinematic constraints can be applied, which have the potential to significantly increase the precision of the overall event reconstruction as well as that of individual objects.

In the context of Higgs boson physics, the quality of jet energy and directional reconstruction enters at various levels, from the quality of the identification of Higgsstrahlung events for hadronic Z boson decays, to the precision of the measurement of the Higgs boson mass and to the identification of distinct decay modes. Flavour tagging is central for the latter, given the dominance of hadronic decays of the Higgs boson. A highly efficient discrimination of b, c, and gluon jets allows to access novel decay modes that cannot be identified at the LHC, adding qualitatively new dimensions to the Higgs physics programme.

The identification of the complex events of top quark pair production profits significantly from an accurate jet and global event reconstruction via an excellent measurement of the invariant mass of gauge bosons occurring in the decay of top quarks and in background processes, and from an highly precise measurement of the invariant mass of the overall final state. Also here, flavour tagging is crucial, since top quark pair events are characterised by the presence of two b quark jets. Their clean identification enables improved signal and background separation.

In the following, we discuss key reconstruction techniques that are being developed to achieve the goals for Higgs boson and top quark physics at lepton colliders. These are particle-flow algorithms for physics object and overall event reconstruction, and advanced flavour tagging algorithms. Following this, the potential and future opportunities are discussed with selected examples. Since studies for FCC-ee are still in early stages in many cases, concrete examples are also taken from linear collider-based studies, which have a high degree of overlap in the physics goals, very similar experimental conditions, and provide starting points for setting goals for FCC-ee detector concepts, reconstruction and analysis strategies. In this context, full-simulation studies of detector performance and selected physics channels performed in the context of CLIC at centre-of-mass energies of 350 GeV and 380 GeV and in the context of ILC at 250 GeV are particularly relevant for the higher-energy stages of FCC-ee at 240 GeV, at the top quark pair threshold around 350 GeV, and at 365 GeV.

2 Particle flow and beyond: reconstructing individual particles, jets, and global event kinematics

Particle flow (PF) aims to achieve the best possible reconstruction of individual final-state particles by optimally combining the information available from the different subsystems of the detector to achieve a global particle-level description of reconstructed events. Besides the clear benefits for jet energy reconstruction provided by improved single-particle measurements, this enables more performant heavy flavour tagging of jets, and the exploitation of kinematic constraints by global event kinematic fitting with advanced techniques, expanded further in the two following sections. A first generation of PF was developed for the ALEPH experiment at LEP [2], which enabled that experiment to achieve the best jet energy resolution of all four LEP experiments. However, the performance of this algorithm was limited by the relatively low granularity of the calorimeters, allowing the detection of neutral hadrons only via a significant excess of calorimeter energy compared to the sum of particle track momenta in the same geometric region. The CMS experiment at LHC has developed a sophisticated algorithm that results in significant improvements of the jet energy resolution compared to calorimeter-only reconstruction [3].

The detector concepts for the linear \(e^+e^-\) colliders ILC and CLIC are specifically designed for the optimal performance of PF algorithms, achieving a jet energy resolution of 3–4% over a wide energy range with sophisticated reconstruction algorithms [4,5,6], as required for the efficient separation of W and Z bosons in hadronic final states. The crucial detector feature to achieve this performance is a high spatial granularity in the calorimeter systems [7], which is exploited in the pattern recognition required to achieve an accurate matching of subdetector information, and also provides the potential for improved calorimetric reconstruction using software compensation methods [8, 9]. This principle is also adopted for the CLD detector concept [10] for FCC-ee, which is derived from the CLIC detector design, adjusted for the different experimental conditions in terms of collision energy and operation mode.

The accurate reconstruction of hadronic jets is of importance in many areas of Higgs boson and top quark physics due to the prevalence of hadronic final states in their decay. In general, improved performance in this area results in reduced backgrounds and higher signal significance or more accurate measurements. Comprehensive studies have been performed in the context of CLIC, which demonstrate the capabilities of a particle-flow-optimised detector, albeit for an energy range largely beyond that achievable at FCC-ee [11, 12]. Similarly, studies in the context of ILC show the potential provided by the accurate reconstruction of top quark pair production events, and by the precise reconstruction of jet directions for the measurement of the Higgs boson mass in HZ events with H \(\rightarrow \) \(b\bar{b}\) and Z decaying into quark pairs [6].

While the main motivation for PF, and for detector concepts optimised for its application, is generally given by the jet energy resolution, the benefits of this concept extend far beyond that one metric alone. PF algorithms improve background rejection, as implemented for CLIC in particular for the higher-energy stages. The additional precision of the reconstruction at the level of individual particles also supports more powerful flavour tagging of reconstructed jets, as discussed further in Sect. 3, and provides the potential for flavour tagging strategies exploiting the full event information. The information provided by a global particle-level PF-based event reconstruction also allows for a precise reconstruction of global event kinematics, which can be used together with the constraints provided by the well-known initial state of the collision in kinematic fits to significantly improve the measurement of kinematic variables. In the context of Higgs boson and top quark physics, this is for example beneficial for the rejection of physics background in hadronic and semi-leptonic final states, for the kinematic reconstruction of particle masses and for the constraints on the jet energy scales, as shown for top quark pair events in [13]. It is expected that significant additional improvement on several aspects of jet and event reconstruction with PF can be achieved with machine learning techniques that use the available information on various levels: from low-level object reconstruction, to jet finding, to global event reconstruction aspects such as jet pairing and other key aspects of precision measurements of the complex final states characterising many Higgs boson and top quark events at the FCC-ee.

3 Flavour tagging: separating bottom and charm quarks, and gluons

The identification of the flavour of the parton that initiated the formation of a jet, known as flavour tagging, is a crucial ingredient to determine the complex final states that result from Higgs boson and top quark decays. The measurement of the coupling of the Higgs boson to quarks and gluons directly relies on the ability to identify the flavour of these jets. One of the highest priorities of the FCC-ee physics programme is to measure with O(\(\sim 1\%\)) or better precision the coupling of the Higgs boson to bottom and charm quarks, and gluons. Moreover, measurements of top quark properties also crucially rely on the identification of bottom quarks. Yet, to achieve the desired precision requires advancements in the detector design and on the flavour tagging algorithms.

Jet flavour identification algorithms were already developed at LEP [14, 15]. The cornerstone of these algorithms are tracks with significant displacement from the beam axis, which originate from the large lifetime of the weak decays, \(\sim \) 1.5 (\(\sim \) 0.4–1.0) ps for B (C) hadrons, as opposed to “light” hadrons that are produced with a vanishing displacement. Such a property of the tracks can be used to identify possible displaced “secondary” vertices (SV) in the event. Therefore, the precision at which the track impact parameter can be measured directly translates into the discrimination power between heavy and light flavour jets. For future detector designs, vertex and tracking detectors with high granularity and as little material as possible, are necessary. In addition, constraints related to the beam pipe are parameters that have to be carefully studied. Ideally, the first layer of the vertex detector would be placed within the beam pipe. However, this approach requires shielding to reduce noise and interference from the beam current, which results in an increased material budget for the signal readout of this detector. An alternative approach that can be explored is to investigate different beam pipe sizes allowing to reduce the distance between the inner tracking layer and the interaction point.

Additional information such as the track multiplicity, the mass of the SV, and potentially the presence of a non-isolated lepton from semi-leptonic heavy flavour decays, were also some of the main ingredients of the flavour tagging algorithms. These algorithms were based on either simple selections on a combination of these SV properties, or, in more advanced implementations, exploited likelihood ratio methods using these properties. The first generation of jet flavour tagging algorithms at the LHC, e.g. in Refs. [16,17,18,19,20] were based on very similar principles, yet applied under the harsher data taking conditions of the LHC. However, taking advantage of the developments in computing (e.g. graphics processing units) and the availability of very large Monte Carlo simulated and collision data samples, a new era in jet flavour identification has begun. These algorithms explore directly much lower-level information (e.g. reconstructed particles from the PF algorithm or even reconstructed hits), which are processed using state-of-the-art machine learning (ML) algorithms, tailored to the jet-tagging problem. The full spatial and energy distribution can be exploited in jet flavour identification, particularly in the separation between quark and gluon jets [21].

Finally, the nature of each of the jet constituents, or particle identification (PID), is expected to provide another useful handle in discriminating between different jet species. Powerful particle identification capabilities based on ionisation energy loss (via dE/dx or cluster counting), or via precise time-of-flight measurements, could be highly beneficial for jet flavour tagging at the FCC-ee allowing the full jet content information to be exploited.

At the LHC, these new approaches can bring more than an order of magnitude improvement in background rejection compared to the traditional approaches [22]. They are now also being explored in the context of the FCC-ee, leading to the development new flavour tagging algorithms which aim to explore the full potential of the detectors and the significantly more benign FCC-ee environment. A recent development [23] explores directly the particles reconstructed by the PF algorithm, which are then used as input to ML-based flavour tagging algorithms. The algorithm is based on ParticleNet [24], a novel approach using state-of-the-art jet representation and a neural network (NN) architecture. Preliminary results of the performance of the algorithm are displayed in Fig. 1 in terms of receiver operating characteristic (ROC) curves, for bottom and charm quark tagging. Different jet flavours from hadronic Higgs decays are taken from a Higgsstrahlung sample at \(\sqrt{s} = 240\) GeV. The IDEA [25] detector concept was used and the detector’s response is simulated using the Delphes [26] fast detector simulation framework. Compared to previous flavour tagging algorithms, e.g. in Ref. [6], a significant improvement in performance is observed.

Fig. 1
figure 1

Background contamination efficiency as a function of jet-tagging efficiency for bottom (left), charm (centre), and strange (right) quark jets in HZ events at a centre-of-mass energy of 240 GeV in the IDEA detector

Another important aspect that is expected to further improve the physics performance is the development of flavour tagging algorithms with the ability to simultaneously discriminate between multiple different jet flavours. For example, in the \(\text {H}\rightarrow \text {c} \bar{\text {c}}\) process where the \(\text {H}\rightarrow \text {b} \bar{\text {b}}\) and \(\text {H}\rightarrow \text {g} \text {g}\) background processes are copiously produced, this is expected to bring important improvement. Having this in mind, the jet flavour algorithm under development provides a probability for each jet to originate from different particle species, namely, up or down, strange, charm, or bottom quark, or from a gluon. In addition, this approach gives the opportunity to explore the potential of strange quark tagging in virtually unexplored physics channels, such as \(\text {H}\rightarrow \text {s} \bar{\text {s}}\) or flavour changing neutral currents (FCNC) in the top sector. Further improvements can be obtained when PID information is also included in the algorithm. PID is already exploited at the LHC to provide powerful discrimination between pions, kaons, and protons [27,28,29]. Particularly for strange quark tagging, exploring such handles yields in an improvement in the discrimination power against first-generation quark jets. Figure 1 (right) shows preliminary results on strange jet tagging that indicate that a powerful discrimination against up, down and gluon jets can be obtained with the use of PID. Further studies will explore the impact of timing information on PID and subsequently on jet-tagging performance. Finally, apart from the clear challenges in the detector and algorithm design, another big challenge is the calibration of such algorithms with the required precision in order to achieve relative uncertainty for most of the Higgs coupling measurements better than 1%. The FCC-ee running scenario includes the operation at the Z pole with the goal to collect O(\(10^{12}\)) events. This will provide excellent conditions for the calibration of the jet flavour tagging algorithm with unprecedented precision.

4 Exploitation for precision measurements and opportunities for further development

The physics potential of the techniques outlined above has already been studied for Higgs boson and top quark physics, primarily in the linear collider context. In the following, we discuss one concrete example, and then highlight areas where we see significant potential for further developments that address key remaining challenges.

Fig. 2
figure 2

Reconstructed di-jet Z candidate mass, \(m_{qq}\) versus reconstructed hadronic recoil mass \(m_{rec}\) for a centre-of-mass energy of 250 and 350 GeV assuming the CLIC luminosity spectrum for signal HZ events (top), and standard model background (bottom). Figure taken from [30]

A concrete example for the exploitation of precise jet energy reconstruction in the context of Higgs physics is the reconstruction of HZ Higgsstrahlung events for hadronic decays of the Z boson. The recoil mass measurements in the HZ process give model-independent access to the total HZ cross section, and with that to the coupling of the Higgs boson to the Z, and thus also allow to constrain invisible Higgs boson decays. In principle, the extension from considering only Z \(\rightarrow \) \(\mu ^+\mu ^-,\, e^+e^-\) to the inclusion of hadronic Z boson decays increases the number of HZ signal events by one order of magnitude. The actual improvement achievable by adding these events depends on the level of background in the signal region, which in turn is influenced by the accuracy of the recoil mass reconstruction, which is determined by the jet reconstruction and the beam parameters. Concrete full-simulation studies have been carried out in the context of CLIC and ILC, using the PandoraPFA algorithm [4, 5]. Due to the constrained phase space at 250 GeV close to the threshold of ZH production, there is significant background in the signal region from four-quark final states that end up being reconstructed near the kinematic limit, resulting in only modest improvements in the measurement of the ZH cross section on the order of 20% compared to leptonic Z decays only, assuming the CLIC luminosity spectrum [30]. Nevertheless, Fig. 2 (left) demonstrates the capability for a clean reconstruction of the hadronic Z decays and the associated recoil mass. The technique unfolds its full potential at energies of around 350 GeV, where signal and background are well separated, as shown in Fig. 2 (right). Here, an improvement of the cross section measurement by a factor of 2.3 is achieved in a CLIC study [11, 30].

This example demonstrates that the potential for Higgs boson precision measurements profits significantly from increasing performance of the jet energy resolution, resulting in corresponding requirements on the calorimeter and tracker imposed by the PF reconstruction. At the same time, it shows that this performance alone does not guarantee precision, with kinematic boundary conditions and the quality of the association of final-state particles to jets also highly relevant.

In the context of top quark physics, we have already discussed in 2 with the example of [13] that the exploitation of kinematic constraints in the event reconstruction can improve quantities such as the reconstructed invariant mass, and with that can also serve as a means of signal selection. Measurements that use the top quarks as tools to explore physics beyond the standard model impose further requirements on the overall event reconstruction. It is crucial to assess the flavour and the correct association of the jets to the final-state partons for measuring asymmetries or searching for CP-violating couplings. Moreover, these will help also increase the sensitivity for the search for FCNC in the top sector, where the current expectation obtained with a traditional analysis approach is comparable to that for the HL-LHC [1].

In general, it is important to make use of the fully hadronic six jets final state. A kinematic fitting strategy is necessary to minimise the background contribution, but it is even more crucial to have the correct association of all the decay products to the top and anti-top. Fully exploiting all the event information, such as jet flavour, composition and charge, is necessary to maximise the precision of this measurement. This requires the extension of the PF reconstruction and the flavour tagging strategies outlined in earlier sections with algorithms that provide an accurate determination of the jet charge to enable a unique identification of the top and anti-top quarks.

With the object reconstruction that is required for precise hadronic energy measurements already well developed, the limitations on global event reconstruction imposed by the uncertainties of the association of individual particles to jets emerge as a key challenge for the full exploitation of the physics potential of the inherently clean \(e^+e^-\) collisions. This area has not yet been thoroughly explored, and may hold significant potential for further improvement with more advanced analysis strategies. For Higgs boson physics, strategies for “colour singlet clustering” [31,32,33,34], that exploit the physical properties of colour-neutral particles decaying into two hadronic jets, may improve the matching of particles to jets, resulting in a better measurement of the properties of the underlying original partons. Such techniques, if proven to be capable of improved precision for the reconstruction of signal final states while also being robust against backgrounds, would improve the measurement of the global event kinematics, and would result in a clearer separation of signal and background. Following this direction may enable further advances, and is expected to lead to new reconstruction paradigms that consider the overall event from the start. Further potential is provided by the increasing power and sophistication of machine learning techniques, which are expected to open new avenues, such as radically different approaches, where the PF candidates are processed directly via ML algorithms to obtain a full event reconstruction [35, 36]. Such methods would by-pass the more traditional approaches which rely on the precise reconstruction of the physics objects as an intermediate step. This is clearly a very bold approach, yet promising given the clean event topologies at the FCC-ee, as well as the current and future developments in the detector design and the PF algorithm.

5 Summary

Some of the major priorities of the FCC-ee physics programme, and of the High Energy Physics community in general, is the measurement with unprecedented precision of the Higgs boson and top quark properties. Breakthroughs in multiple areas from detector design, physics object reconstruction and identification, and analysis techniques, are necessary to achieve this goal. It is observed that the detector concepts currently being discussed for future \(e^+e^-\) colliders, with high resolution, low mass trackers, and highly granular calorimetry, enable object reconstruction and flavour tagging with unprecedented precision. At the same time, this increased precision puts new limitations in the focus, which currently drive the uncertainties in high-multiplicity final states as present in Higgs boson and top quark physics. Novel, more radical approaches to global event reconstruction combining physics-driven algorithms and machine learning techniques may hold significant additional potential for the full exploitation of the excellent experimental conditions provided by FCC-ee.