1 Introduction

In the pursuit of new physics at the CERN LHC, many scenarios have been proposed in which production of particles that leave no trace in collider detectors is accompanied also by production of a standard model (SM) particle, which balances the transverse momentum in an event. The final state considered in this analysis is the production of a pair of leptons (\(\ell ^{+}\ell ^{-}\), where \(\ell =\mathrm {e}\) or \(\mathrm {\mu }\)), consistent with originating from a \(\text{ Z } \) boson, together with large missing transverse momentum (\(p_{\mathrm {T}} ^\text {miss} \)). This final state is well-suited to probe such beyond the SM (BSM) scenarios, as it has relatively small and precisely known SM backgrounds.

One of the most significant puzzles in modern physics is the nature of dark matter (DM). In the culmination of over a century of observations, the “\({\varLambda }_{\mathrm {CDM}}\)” standard model of cosmology has established that, in the total cosmic energy budget, known matter only accounts for about 5%, DM corresponds to 27%, and the rest is dark energy [1]. Although several astrophysical observations indicate that DM exists and interacts gravitationally with known matter, there is no evidence yet for nongravitational interactions between DM and SM particles. While the nature of DM remains a mystery, there are a number of models that predict a particle physics origin. If DM particles exist, they can possibly be produced directly from, annihilate into, or scatter off SM particles. Recent DM searches have exploited various methods including direct [2] and indirect [3] detection. If DM can be observed in direct detection experiments, it must have substantial couplings to quarks and/or gluons, and could also be produced at the LHC [4,5,6,7,8,9].

A promising possibility is that DM may take the form of weakly interacting massive particles. The study presented here considers one possible mechanism for producing such particles at the LHC [10]. In this scenario, a \(\text{ Z } \) boson, produced in proton-proton (pp) collisions, recoils against a pair of DM particles, \(\chi \overline{\chi }\). The \(\text{ Z } \) boson subsequently decays into two charged leptons, producing a low-background dilepton signature, together with \(p_{\mathrm {T}} ^\text {miss} \) due to the undetected DM particles. In this analysis, the DM particle \(\chi \) is assumed to be a Dirac fermion. Four simplified models of DM production via an s-channel mediator exchange are considered. In these models, the mediator has a spin of 1 (0) and vector or axial-vector (scalar or pseudoscalar) couplings to quarks and DM particles. The free parameters of each model are the masses \(m_\text {med}\) and \(m_\mathrm {DM}\) of the mediator and DM particle, respectively, as well as the coupling constant \(g_{\mathrm {q}}\) (\(g_\mathrm {DM}\)) between the mediator and the quarks (DM particles). The vector coupling model can be described with the following Lagrangian:

$$\begin{aligned} \mathcal {L}_{\text {vector}} = g_\mathrm {DM} {Z'}_{\mu }\overline{\chi }\gamma ^{\mu }\chi + g_{\mathrm {q}} \sum _{\mathrm {q}} {Z'}_{\mu } \overline{\mathrm {q}}\gamma ^{\mu }\mathrm {q}, \end{aligned}$$

where the spin-1 mediator is denoted as \(\text{ Z } '\) and the SM quark fields are referred to as \(\text{ q }\) and \(\overline{\text{ q }}\). The Lagrangian for an axial-vector coupling is obtained by making the replacement \(\gamma ^\mu \rightarrow \gamma ^5\gamma ^\mu \). In the case of a spin-0 mediator \(\phi \), the couplings between mediator and quarks are assumed to be Yukawa-like, with \(g_{\mathrm {q}}\) acting as a multiplicative modifier for the SM Yukawa coupling \({y_{\mathrm {q}} = \sqrt{2}m_{\mathrm {q}}/v}\) (where \(v = 246 \,\text {GeV} \) is the SM Higgs field vacuum expectation value), leading to the Lagrangian:

$$\begin{aligned} \mathcal {L}_{\text {scalar}} = g_\mathrm {DM} {\phi }\overline{\chi }\chi + g_{\mathrm {q}} \frac{\phi }{\sqrt{2}}\sum _{\mathrm {q}} y_{\mathrm {q}} \overline{\mathrm {q}}\mathrm {q}. \end{aligned}$$

The Lagrangian with pseudoscalar couplings is obtained by inserting a factor of \(i\gamma ^5\) into each of the two terms (i.e., \({\bar{\chi }}\chi \rightarrow i{\bar{\chi }}\gamma ^5\chi \) and \({\bar{\mathrm {q}}} \mathrm {q}\rightarrow i{\bar{\mathrm {q}}}\gamma ^5 \mathrm {q}\)). Example diagrams of DM production via spin-1 and spin-0 mediators are shown in Fig. 1 (upper left and right, respectively).

Fig. 1
figure 1

Feynman diagrams illustrative of the processes beyond the SM considered in this paper: (upper left) DM production in a simplified model with a spin-1 mediator \(\text{ Z } '\); (upper right) DM production in a simplified model with a spin-0 mediator \(\phi \); (lower left) production of a Higgs boson in association with Z boson with subsequent decay of the Higgs boson into invisible particles; (lower right) unparticle or graviton production. The diagrams were drawn using the TikZ-Feynman package [11]

A primary focus of the LHC physics program after the discovery of a Higgs boson (H) [12,13,14] by the ATLAS and CMS Collaborations is the study of the properties of this new particle. The observation of a sizable branching fraction of the Higgs boson to invisible states [15,16,17] would be a strong sign of BSM physics. Supersymmetric (SUSY) models embodying R-parity conservation contain a stable neutral lightest SUSY particle (LSP), e.g., the lightest neutralino [18], leading to the possibility of decays of the Higgs boson into pairs of LSPs. Certain models with extra spatial dimensions predict graviscalars that could mix with the Higgs boson [19]. As a consequence, the Higgs boson could oscillate to a graviscalar and disappear from the SM brane. The signature would be equivalent to an invisible decay of the Higgs boson. There could also be contributions from Higgs boson decays into graviscalars [20]. With the same effect as the simplified DM models presented earlier, “Higgs portal” models [21,22,23] construct a generic connection between SM and DM particles via a Higgs boson mediator. This analysis considers decays into invisible particles of an SM-like Higgs boson produced in association with a \(\text{ Z } \) boson, as shown in Fig. 1 (lower left).

Another popular BSM paradigm considered here is the Arkani-Hamed–Dimopoulos–Dvali (ADD) model with large extra spatial dimensions [24,25,26], which is motivated by the hierarchy problem, i.e., the disparity between the electroweak unification scale (\(M_\mathrm {EW} \sim 1\,\text {TeV} \)) and the Planck scale (\(M_\mathrm {Pl} \sim 10^{16}\,\text {TeV} \)). This model predicts graviton (\(\text{ G }\)) production via the process \(\text{ q } \overline{\text{ q }} \rightarrow \text{ Z } + \text{ G }\). The graviton escapes detection, leading to a mono-\(\text{ Z } \) signature (Fig. 1, lower right). In the ADD model, the apparent Planck scale in four space-time dimensions is given by \(M_\mathrm {Pl}^2 \approx M_\mathrm {D}^{n+2}R^n\), where \(M_\mathrm {D}\) is the true Planck scale of the full n+4 dimensional space-time and R is the compactification radius of the extra dimensions. Assuming \(M_\mathrm {D}\) is of the same order as \(M_\mathrm {EW}\), the observed large value of \(M_\mathrm {Pl}\) points to an R of order 1 mm to 1 fm for 2 to 7 extra dimensions. The consequence of the large compactification scale is that the mass spectrum of the Kaluza–Klein graviton states becomes nearly continuous, resulting in a broad \(\text{ Z } \) boson transverse momentum (\(p_{\mathrm {T}}\)) spectrum.

The final BSM model considered in this analysis is the phenomenologically interesting concept of unparticles, which appear in the low-energy limit of conformal field theories. In the high-energy regime, a new, scale invariant Banks–Zaks field with a nontrivial infrared fixed point is introduced [27]. The interaction between the SM and Banks–Zaks sectors is mediated by particles of large mass scale \(M_{\textsf {U}}\), below which the interaction is suppressed and can be treated via an effective field theory (EFT). The low-energy regime will include unparticles, which have phase space factors equivalent to those of a noninteger number of ordinary particles [28,29,30]. In this analysis, the emission of spin-0 unparticles from SM quarks is considered. Because of the weakness of the unparticle interactions with the SM fields, the unparticle evades detection. The EFT Lagrangian used to interpret the results is defined as follows:

$$\begin{aligned} \mathcal {L}_{U} = \frac{\lambda }{{\varLambda }_{\textsf {U}} ^{d_{\textsf {U}}-1}} \mathcal {O}_{\textsf {U}} \overline{\text{ q }}\text{ q }, \end{aligned}$$

where \(\lambda \) represents the coupling between the SM and unparticle fields, \({\varLambda }_{\textsf {U}}\) is the cutoff scale of the EFT, and \(d_{\textsf {U}}\) is the characteristic scaling dimension of the theory. The unparticle operator is denoted as \(\mathcal {O}_{\textsf {U}}\). A representative Feynman diagram of the interaction is shown in Fig. 1 (lower right).

The search described in this paper is based on a data set recorded with the CMS detector in 2016, which corresponds to an integrated luminosity of \(35.9 \pm 0.9{\,\text {fb}^{-1}} \) of pp collisions at a center-of-mass energy of 13\(\,\text {TeV}\).

The paper is organized as follows: after a brief review of previous work in Sect. 2, followed by a description of the CMS detector in Sect. 3, we discuss the background composition in Sect. 4. Simulated samples are reviewed in Sect. 5, followed by the event reconstruction and event selection description in Sects. 6 and 7, respectively. The details of the background estimation are given in Sect. 8. The multivariate analysis of invisible Higgs boson decays is summarized in Sect. 9, followed by the discussion of selection efficiencies and systematic uncertainties in Sect. 10. The results are given in Sect. 11, and Sect. 12 summarizes the paper.

2 Review of previous work

A search by the CMS Collaboration in the same topology using an earlier data set corresponding to an integrated luminosity of 2.3\(\,\text {fb}^{-1}\) of pp collisions collected in 2015 at \(\sqrt{s}=13\,\text {TeV} \) found no evidence for BSM physics [31]. In addition to the order of magnitude increase in the integrated luminosity, significant differences with respect to the previous analysis include: new techniques for estimating irreducible backgrounds, which were not viable with the previous data set; improvements in the event selection; and a broader range of BSM models probed.

In the previous CMS result [31], under the same simplified model assumptions as used in this paper, DM mediator masses of up to 290 (300)\(\,\text {GeV}\) were excluded for fixed vector (axial-vector) couplings of \(g_{\mathrm {q}}=0.25\) and \(g_\mathrm {DM}=1.0\). Here and in what follows all limits are given at 95% confidence level (CL), unless explicitly specified otherwise. Similar DM models have been also probed in the \(\gamma \)+\(p_{\mathrm {T}} ^\text {miss}\)  [32] and jet+\(p_{\mathrm {T}} ^\text {miss}\)  [33] topologies at \(\sqrt{s} = 13\,\text {TeV} \) by the ATLAS Collaboration, excluding mediators with vector (axial-vector) couplings up to masses of 1.2 (1.25) \(\,\text {TeV}\). The most stringent limits on DM production in this context were obtained in a CMS analysis of events with a jets+\(p_{\mathrm {T}} ^\text {miss}\) topology performed on a subset of the present data set, corresponding to an integrated luminosity of 12.9\(\,\text {fb}^{-1}\)  [34]. In that analysis, mediator masses of up to 1.95\(\,\text {TeV}\) were excluded for both vector and axial-vector couplings. In the case of a scalar mediator coupled only to quarks and DM particles with \(g_{\mathrm {q}} = g_\mathrm {DM} = 1\), no exclusion was set. For the pseudoscalar mediator, under the same assumptions, masses below 430\(\,\text {GeV}\) were excluded.

Invisible decays of the SM Higgs boson—hereafter H(inv.)—have been targeted by both ATLAS and CMS. These searches used both the Z+\(p_{\mathrm {T}} ^\text {miss}\) and jets+\(p_{\mathrm {T}} ^\text {miss}\) topologies, the latter including gluon fusion and vector fusion processes as well as associated production with a vector boson reconstructed as a single jet. The most stringent constraints were obtained from a combination of searches in these final states at \(\sqrt{s} = 8\,\text {TeV} \) by ATLAS [35] and at multiple center-of-mass energies by CMS [36], which, under the assumption of SM production, exclude a branching fraction for H(inv.) decays larger than \(25\%\) for ATLAS and \(24\%\) for CMS.

Real emission of gravitons in the ADD scenario has been most recently probed in the jet+\(p_{\mathrm {T}} ^\text {miss}\) topology by CMS at \(\sqrt{s} = 8\,\text {TeV} \) [37] and by ATLAS at \(\sqrt{s} = 13\,\text {TeV} \) [38]. In these analyses, the fundamental Planck scale \(M_\mathrm {D}\) of the n+4 dimensional theory has been constrained to be larger than 3.3–5.6\(\,\text {TeV}\) (CMS) and 4.1–6.6\(\,\text {TeV}\) (ATLAS), for the number of extra dimensions between 6 and 2. Previous CMS analyses in the same final state as this analysis have excluded unparticle cutoff scales from \(400\,\text {GeV} \) at large values of the scaling dimension \(d_{\textsf {U}} =2.2\), up to hundreds of \(\,\text {TeV}\) at low values of \(d_{\textsf {U}} \approx 1\) [31, 39].

3 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6\(\text {\,m}\) internal diameter, providing a magnetic field of 3.8\(\text {\,T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Forward calorimeters extend the pseudorapidity coverage provided by the barrel and endcap detectors. Muons are detected in gas-ionization chambers embedded in the steel flux-return yoke outside the solenoid.

Events of interest are selected using a two-tiered trigger system [40]. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100\(\text {\,kHz}\) within a time interval of less than 4\(\,\mu \text {s}\). The second level, known as the high-level trigger, consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing, and reduces the event rate to around 1\(\text {\,kHz}\) before data storage.

A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [41].

4 Background composition

Several SM processes can produce the dilepton+\(p_{\mathrm {T}} ^\text {miss} \) final state. Since none of the BSM physics signals probed in this analysis are expected to produce a resonance peak in the \(p_{\mathrm {T}} ^\text {miss} \) distribution, adequate modeling of each SM background process is necessary. The following SM background processes have been considered in this analysis:

  • \(\text{ Z } \text{ Z } \rightarrow 2\ell 2\nu \) production, which yields the same final state as the signal and contributes approximately 60% of the total background.

  • \(\mathrm {W}\text{ Z } \rightarrow \ell \nu \ell \ell \) production, where the lepton from the \(\mathrm {W}\) boson decay is not identified either because it fails the lepton identification, or because it falls outside the detector acceptance or kinematic selections. This process contributes approximately 25% of the total background, and the kinematic distributions are similar to those for the \(\text{ Z } \text{ Z } \rightarrow 2\ell 2\nu \) process.

  • \(\mathrm {W}\mathrm {W}\rightarrow \ell \nu \ell \nu \) events, where the dilepton invariant mass falls into the \(\text{ Z } \) boson mass window. These events constitute approximately 5% of the background.

  • Events with leptonically decaying top quarks (mostly \({\text{ t }\overline{\text{ t }}} \) and \(\text{ t } \mathrm {W}\)), where the dilepton invariant mass falls into the \(\text{ Z } \) boson mass window, and which contribute about 5% of the total background.

  • Drell–Yan (DY) production, \(\text{ Z }/\gamma ^*\rightarrow \ell \ell \), which can produce events with large \(p_{\mathrm {T}} ^\text {miss} \) caused mainly by jet energy mismeasurement and detector acceptance effects. It amounts to approximately 5% of the total background.

  • Triboson processes (e.g., \(\mathrm {W}\mathrm {W}\mathrm {W}\)), which have a small cross section and contribute less than 1% of the total background.

Processes that were found to have a negligible contribution to the signal region include: \(\mathrm {W}\)+jets, because of the very low probability for a jet to be reconstructed as a lepton and the dilepton system to be within the \(\text{ Z } \) boson mass window; the SM process \(\text{ Z } (\rightarrow \ell \ell )\text{ H } (\rightarrow \text{ Z } \text{ Z } \rightarrow 4\nu )\), which is a subset of the \(\text{ Z } \text{ H } (\text {inv.})\) signal and accounts for 0.1% of SM Higgs boson decays; and \({\mathrm{gg} \rightarrow \text{ H } (\rightarrow \mathrm {W}\mathrm {W})}\), which has similar topology to continuum \(\mathrm {W}\mathrm {W}\) production but makes a negligible contribution after the full selection.

5 Simulation

Simulated Monte Carlo (MC) samples are used to estimate backgrounds, to validate the background estimation techniques using control samples in data, to calculate signal efficiency, and to optimize the analysis.

Diboson production (VV, where \(\mathrm {V}=\) W or Z) via \(\text{ q } \overline{\text{ q }} \) annihilation, as well as \(\text{ Z } \text{ H } \) production via \(\text{ q } \overline{\text{ q }} \) annihilation and gluon fusion, are generated at next-to-leading order (NLO) in quantum chromodynamics (QCD) with powheg 2.0 [42,43,44,45]. The \(\mathrm{gg \rightarrow \mathrm {W}\mathrm {W}}\) and \(\mathrm{gg \rightarrow \text{ Z } \text{ Z }}\) processes are simulated at NLO with mcfm v7.01 [46]. The \(\text{ Z } \)+jets, \(\text{ Z } \gamma \), \({\text{ t }\overline{\text{ t }}} \), \({\text{ t }\overline{\text{ t }}} \mathrm {V}\), and \(\mathrm {VVV}\) samples are generated at NLO with either MadGraph 5_amc@nlo v2.3.2 [47] or powheg.

Samples of DM particle production in the simplified model framework are generated using DmSimp [48,49,50] interfaced with MadGraph 5_amc@nlo v2.4.3. Samples are generated over a range of values for the masses \(m_\text {med}\) and \(m_\mathrm {DM}\). For the vector and axial-vector models, samples are generated at NLO in QCD with up to one additional parton in the matrix element calculations, and the mediator couplings to the SM and DM fields are set to \(g_{\mathrm {q}} = 0.25\) and \(g_\mathrm {DM} = 1\), respectively. For the scalar and pseudoscalar models, samples are generated at leading order in QCD, and the couplings are set to \(g_{\mathrm {q}} = g_\mathrm {DM} = 1\). This choice of couplings is recommended by the ATLAS/CMS dark matter forum [10] and by the LHC dark matter working group [51]. For all DM particle production samples, the central values of the renormalization and factorization scales are set to the \(m_\mathrm {T}^{2}\) scale after \(k_\mathrm {T}\)-clustering of the event.

Events for the ADD scenario of large extra dimensions and for the unparticle model are generated at leading order (LO) using an EFT implementation in pythia 8.205  [52,53,54]. In the ADD case, event samples are produced for \(M_\mathrm {D} = 1\), 2, and 3 \(\,\text {TeV}\), each with \(n =\) 2–7. In order to ensure the validity of the EFT, the signal is truncated for \(\hat{s} > M_\mathrm {D}^2\), where \(\hat{s}\) is the center-of-mass energy squared of the incoming partons. Events above this threshold are suppressed by an additional weight of \(M_\mathrm {D}^4 / \hat{s}^2\). In general, this procedure has a larger effect for large values of n, for which the distribution of \(\hat{s}\) is shifted towards higher values [53]. For the unparticle case, samples are generated for scaling dimensions \(d_{\textsf {U}}\) between 1.01 and 2.2, with the cutoff scale \({\varLambda }_{\textsf {U}}\) set to \(15 \,\text {TeV} \) and the coupling \(\lambda \) set to 1. Since both \({\varLambda }_{\textsf {U}}\) and \(\lambda \) modify the cross sections of the signal prediction, but not its kinematic distributions [54], a simple rescaling of cross sections is performed to obtain signal predictions for alternative values of these parameters. No truncation is performed for the unparticle signal so that the results can be compared with those of previous searches.

In all cases, pythia versions 8.205 or higher is used for parton showering, hadronization, and the underlying event simulation, using tune CUETP8M1 [55]. The merging of jets from matrix element calculations and parton shower descriptions is done using the MLM [56] (FxFx [57]) scheme for LO (NLO) samples. The NNPDF3.0 [58] parton distribution function (PDF) set is used, with the order corresponding to the one used for the signal or background simulation.

For all MC samples, the detector response is simulated using a detailed description of the CMS detector, based on the \({\textsc {Geant4}} \) package [59]. Minimum bias events are superimposed on the simulated events to emulate the additional pp interactions per bunch crossing (pileup). All MC samples are corrected to reproduce the pileup distribution as measured in the data. The average number of pileup events per bunch crossing is approximately 23 in the data sample analyzed.

6 Event reconstruction

In this analysis, the particle-flow (PF) event reconstruction algorithm [60] is used. The PF algorithm is designed to leverage information from all CMS detector components to reconstruct and identify individual particles, namely: electrons, muons, photons, and charged and neutral hadrons. The reconstructed vertex with the largest value of summed physics-object \(p_{\mathrm {T}} ^2\) is taken to be the primary \(\mathrm {p}\mathrm {p}\) interaction vertex. The physics objects are the track-jets, clustered using the jet finding algorithm [61, 62] with the tracks assigned to the vertex as inputs, and the associated missing transverse momentum, taken as the negative vector sum of the \(p_{\mathrm {T}} \) of those jets.

Electron candidates are reconstructed using an algorithm that combines information from the ECAL, HCAL, and the tracker [63]. To reduce the electron misidentification rate, electron candidates are subjected to additional identification criteria, which are based on the distribution of the electromagnetic shower in the ECAL, the relative amount of energy deposited in the HCAL in the cluster, a matching of the trajectory of an electron track with the cluster in the ECAL, and its consistency with originating from the selected primary vertex. Candidates that are identified as originating from photon conversions in the detector material are removed.

Muon candidate reconstruction is based on two main algorithms: in the first, tracks in the silicon tracker are matched to track stubs (or segments) reconstructed in the muon detectors; in the second algorithm, a combined fit is performed to signals in both the silicon tracker and the muon system [64]. The two resulting collections are merged, with the momentum measurement of the latter algorithm taking precedence. To reduce the muon misidentification rate, further identification criteria are applied on the basis of the number of measurements in the tracker and in the muon system, the quality of the muon track fit, and its consistency with the selected primary vertex location.

Leptons produced in the decay of \(\text{ Z } \) bosons are expected to be isolated from hadronic activity in the event. The isolation is defined from the sum of the momenta of all PF candidates found in a cone of radius \(R=\sqrt{\smash [b]{({\varDelta }\eta )^2+({\varDelta }\phi )^2}} = 0.4\) built around each lepton, where \({\varDelta }\phi \) and \({\varDelta }\eta \) are, respectively, the differences in the azimuthal angle (measured in radians) and in the pseudorapidity between the lepton and the PF candidate. The contribution to the isolation from the lepton candidate itself is removed. For muons, the isolation sum is required to be smaller than 15% of the muon \(p_{\mathrm {T}} \). For electrons in the ECAL barrel (endcap), the limit on this isolation sum is 6.9 (8.2)% of the electron \(p_{\mathrm {T}} \). In order to mitigate the dependence of the isolation variable on the number of pileup interactions, charged hadrons are included in the sum only if they are consistent with originating from the selected primary vertex of the event. To correct for the contribution to the isolation sum of neutral hadrons and photons from pileup interactions, different strategies are adopted for electrons and muons. For electrons, a median energy density (\(\rho \)) is determined on an event-by-event basis using the method described in Ref. [65]. The contribution of the pileup particles is then estimated as a product of \(\rho \) and the effective area of the isolation cone and is subtracted from the isolation sum. For muon candidates, the correction is performed instead by subtracting half the sum of the \(p_{\mathrm {T}} \) of the charged-hadron candidates in the cone of interest, which are not associated with the primary vertex. The factor of one half corresponds to the average ratio of neutral to charged particles in pileup interactions.

Jets are constructed from PF candidates using the anti-\(k_{\mathrm {T}}\) clustering algorithm [61] with a distance parameter \(R = 0.4\), as implemented in the fastjet package [62, 66]. The jet momentum is defined as the vectorial sum of all PF candidate momenta assigned to the jet, and is found in the simulation to be within 5 to 10% of the true momentum over the entire \(p_{\mathrm {T}}\) range and detector acceptance used in this analysis. An overall energy subtraction is applied to correct for the extra energy clustered in jets due to pileup interactions, following the procedure in Refs. [65, 67]. Corrections to the jet energy scale and resolution are derived from measurements both in simulation and in data of the energy balance in dijet, multijet, \(\gamma \)+jet, and leptonic Z+jet events [68, 69].

The missing transverse momentum vector, \({\vec p}_{\mathrm {T}}^{\text {miss}} \), is defined as the projection of the negative vector sum of the momenta of all reconstructed PF candidates in an event onto the plane perpendicular to the beams. Its magnitude is referred to as \(p_{\mathrm {T}} ^\text {miss} \). Several event-level filters are applied to discard events with anomalous \(p_{\mathrm {T}} ^\text {miss} \) arising from specific well-understood issues with the detector components or event reconstruction [70]. Jet energy corrections are propagated to the missing transverse momentum by adjusting the momentum of the PF candidate constituents of each reconstructed jet.

For the purpose of rejecting events involving top quark production, jets originating from b quark fragmentation (b jets) are identified by “b tagging.” The b tagging technique employed is based on the “combined secondary vertex” CSVv2 algorithm [71, 72]. The algorithm is calibrated to provide, on average, 80% efficiency for tagging jets originating from b quarks, and 10% probability of light-flavor jet misidentification.

For the purpose of rejecting events containing \(\tau \) leptons, hadronically decaying \(\tau \) leptons (\(\tau _\mathrm {h} \)) are identified using the “hadron-plus-strips” algorithm [73]. The algorithm identifies a jet as a \(\tau _\mathrm {h} \) candidate if a subset of the particles assigned to the jet is consistent with the hadronic decay products of a \(\tau \) lepton [73]. In addition, \(\tau _\mathrm {h} \) candidates are required to be isolated from other activity in the event.

7 Event selection

Events with electrons (muons) are collected using dielectron (dimuon) triggers, with the thresholds of \(p_{\mathrm {T}} > 23\) (17)\(\,\text {GeV}\) and \(p_{\mathrm {T}} > 12\) (8)\(\,\text {GeV}\) for the leading and subleading electron (muon), respectively. Single-electron and single-muon triggers (with \(p_{\mathrm {T}} \) thresholds of 27 and 24\(\,\text {GeV}\), respectively) are also used in order to recover residual trigger inefficiencies.

Events are required to have exactly two (\(N_{\ell } = 2\)) well-identified, isolated leptons of the same flavor and opposite electric charge (\(\mathrm {e}^+\mathrm {e}^-\) or \(\mathrm {\mu ^+}\mathrm {\mu ^-}\)). The leading electron (muon) of the pair must have \(p_{\mathrm {T}} > 25\) (20)\(\,\text {GeV}\), while \(p_{\mathrm {T}} > 20\,\text {GeV} \) is required for the subleading lepton. The dilepton invariant mass is required to be within 15\(\,\text {GeV}\) of the established \(\text{ Z } \) boson mass \(m_\mathrm{Z}\) [74]. The dilepton \(p_{\mathrm {T}} \) (\(p_{\mathrm {T}} ^{\, \ell \ell }\)) must be larger than 60\(\,\text {GeV}\) to reject the bulk of the \(\text{ Z }/\gamma ^*\rightarrow \ell \ell \) background. Since little hadronic activity is expected in this final state, events having more than one jet with \(p_{\mathrm {T}} >30~\,\text {GeV} \) are rejected. The top quark background is suppressed by applying a b jet veto: events with at least one b-tagged jet with \(p_{\mathrm {T}} > 20\,\text {GeV} \) reconstructed within the tracker acceptance, \(|\eta | < 2.4\), are removed. To reduce the \(\mathrm {W}\text{ Z } \) background in which both bosons decay leptonically, events containing additional electrons (muons) with \(p_{\mathrm {T}} > 10\) (5)\(\,\text {GeV}\) and events with loosely identified hadronically decaying \(\tau \) leptons (\(\tau _\mathrm {h} \)) with \(p_\mathrm {T}>18\,\text {GeV} \) are removed.

The event selection is optimized using three variables: the \(p_{\mathrm {T}} ^\text {miss} \), the azimuthal angle formed between the dilepton \(p_{\mathrm {T}}\) and the missing transverse momentum vector, \({\varDelta }\phi ({\vec p}_{\mathrm {T}} ^{\, \ell \ell },{\vec p}_{\mathrm {T}}^{\text {miss}})\), and the \(p_{\mathrm {T}} ^\text {miss}\)-\(p_{\mathrm {T}} ^{\, \ell \ell }\) balance ratio, \(|p_{\mathrm {T}} ^\text {miss}-p_{\mathrm {T}} ^{\, \ell \ell }|/p_{\mathrm {T}} ^{\, \ell \ell }\). The latter two variables are powerful in suppressing reducible background processes, such as DY and top quark production. The selection criteria applied to these variables are optimized in order to obtain the best expected signal sensitivity for a wide range of DM parameters that are considered. For each possible set of selections, the full analysis is repeated, including the estimation of backgrounds from control samples in data and the systematic uncertainties. The final selection criteria obtained after optimization are: \(p_{\mathrm {T}} ^\text {miss} > 100\,\text {GeV} \), \({\varDelta }\phi ({\vec p}_{\mathrm {T}} ^{\, \ell \ell },{\vec p}_{\mathrm {T}}^{\text {miss}}) > 2.6\) rad, and \(|p_{\mathrm {T}} ^\text {miss}-p_{\mathrm {T}} ^{\, \ell \ell }|/p_{\mathrm {T}} ^{\, \ell \ell } < 0.4\).

To avoid positive biases in the \(p_{\mathrm {T}} ^\text {miss} \) calculation due to jet mismeasurement, in events with one jet a threshold is applied on the azimuthal angle between this jet and the missing transverse momentum, \({\varDelta }\phi ({\vec p}_{\mathrm {T}} ^{\, j},{\vec p}_{\mathrm {T}}^{\text {miss}}) > 0.5\text {\,rad} \). To reduce the contribution from backgrounds such as \(\mathrm {W}\mathrm {W}\) and \({\text{ t }\overline{\text{ t }}} \), a requirement on the distance between the two leptons in the \((\eta ,\phi )\) plane, \({\varDelta }R_{\ell \ell } < 1.8\), is applied.

There are two types of analyses performed in this paper. The main analysis method is based on fitting the \(p_{\mathrm {T}} ^\text {miss}\) spectrum in data after applying the above selection criteria defining the signal region (SR). For the specific interpretation of this analysis involving invisible decays of the SM (125\(\,\text {GeV}\)) Higgs boson, a multivariate boosted decision tree (BDT) classifier is employed to increase the sensitivity of the analysis. We use the following set of twelve variables to train a multiclass BDT classifier:

  • \(\left| m_{\ell \ell }-m_{\text{ Z }}\right| \) (dilepton mass);

  • \(p_{\mathrm {T}} ^{\ell 1}\) (leading lepton transverse momentum);

  • \(p_{\mathrm {T}} ^{\ell 2}\) (subleading lepton transverse momentum);

  • \(p_{\mathrm {T}} ^{\, \ell \ell }\) (dilepton transverse momentum);

  • \(| \eta ^{\ell 1} |\) (leading lepton pseudorapidity);

  • \(| \eta ^{\ell 2} |\) (subleading lepton pseudorapidity);

  • \(p_{\mathrm {T}} ^\text {miss} \) (missing transverse momentum);

  • \(m_{T}(p_{\mathrm {T}} ^{\ell 1}, \) \(p_{\mathrm {T}} ^\text {miss}\)) (leading lepton transverse mass);

  • \(m_{T}(p_{\mathrm {T}} ^{\ell 2}, \) \(p_{\mathrm {T}} ^\text {miss}\)) (subleading lepton transverse mass);

  • \({\varDelta }\phi ({\vec p}_{\mathrm {T}} ^{\, \ell \ell },{\vec p}_{\mathrm {T}}^{\text {miss}})\) (azimuthal separation between dilepton and missing momentum);

  • \({\varDelta }R_{\ell \ell }\) (separation between leptons); and

  • \(| \cos \theta ^\mathrm{CS}_{\ell 1} |\) (cosine of the polar angle in the Collins–Soper frame [75] for the leading lepton).

Several classes of event samples are considered for the multiclass BDT: ZH(inv.) signal; ZZ; WZ; DY; and flavor-symmetric or nonresonant backgrounds. A BDT is trained targeting each class, and the final discriminator is taken to be the likelihood assigned to ZH(inv.) production, normalized to the sum of the likelihoods of all processes. The SR selection for the BDT analysis is slightly altered from that of the \(p_{\mathrm {T}} ^\text {miss} \)-based analysis: the dilepton mass requirement is relaxed to be within \(30\,\text {GeV} \) of the \(\text{ Z } \) boson mass, and the selections on \({\varDelta }\phi ({\vec p}_{\mathrm {T}} ^{\, \ell \ell },{\vec p}_{\mathrm {T}}^{\text {miss}})\), \(|p_{\mathrm {T}} ^\text {miss}-p_{\mathrm {T}} ^{\, \ell \ell }|/p_{\mathrm {T}} ^{\, \ell \ell }\), and \({\varDelta }R_{\ell \ell }\) are omitted. The selection for training the BDT additionally requires the missing transverse momentum to be greater than \(130\,\text {GeV} \), where differentiating between the diboson background and signal is most challenging. The BDT performance in the untrained region of \(100\le p_{\mathrm {T}} ^\text {miss} \le 130\,\text {GeV} \) is found to be adequate, whereas a BDT trained on event samples including this region was found to have significantly degraded performance in the \(p_{\mathrm {T}} ^\text {miss} >130\,\text {GeV} \) region.

A summary of the selection criteria for the SR of both the \(p_{\mathrm {T}} ^\text {miss}\)-based analysis and the BDT analysis is given in Table 1.

Table 1 Summary of the kinematic selections for the signal region of both the the \(p_{\mathrm {T}} ^\text {miss} \)-based analysis and the BDT analysis. Where the selections for the two analyses differ, the BDT requirement is given in parentheses

8 Background estimation

Background contributions are estimated using combined information from simulation and control regions (CRs) in data. The normalizations of the dominant background processes are constrained by using a simultaneous maximum likelihood fit to the SR, as well as to the CRs that are described in this section. The contributions of minor backgrounds in both SR and CRs are predicted from simulation.

8.1 Diboson background

The \(\text{ Z } \text{ Z } \) and \(\mathrm {W}\text{ Z } \) processes contribute to the SR via the \({\text{ Z } \text{ Z } \rightarrow \ell \ell \nu \nu }\) and \({\mathrm {W}\text{ Z } \rightarrow \ell \nu \ell \ell }\) decay modes, respectively, where the decay products of one boson are not detected. The background estimate for these processes is improved by selecting CRs with alternative decay modes that not only provide a normalization based on CRs in data, but also probe the lost-boson \(p_{\mathrm {T}}\) distribution, which is expected to be independent of the decay mode. In this way, the \(p_{\mathrm {T}} ^\text {miss} \) spectra of these processes are constrained with respect to their theoretical predictions.

The ability of the simulation to correctly model the lost-boson rapidity is important, as the SR rapidity acceptance of the lost boson is necessarily larger than the rapidity acceptance of the proxy boson in each CR, due to the fact that the visible decay products of the proxy boson in the CR must be inside the detector acceptance. The impact of possible data-to-simulation discrepancies in the high-rapidity portion of diboson background in the SR is suppressed by the fact that, as measured in simulation, the majority of the \(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) contamination in the SR is comprised of events where the lost boson is within the rapidity range of the CRs. In addition, the proxy boson rapidity distributions in the CRs (or its visible lepton, in the case of the \(\mathrm {W}\text{ Z } \) CR) show a good agreement between data and simulation.

8.1.1 The WZ control region

The \(\mathrm {W}\text{ Z } \) control region is formed from events with three well-reconstructed charged leptons. In this case, the CR is populated by events with the same decay mode as the SR, but no leptons are lost to identification or acceptance requirements. A \(\text{ Z } \) boson candidate is selected in the same manner as for the SR, and an additional electron or muon, with identical quality requirements as applied to the leptons in the SR, is required. To enhance the purity of the \(\mathrm {W}\text{ Z } \) selection, \(p_{\mathrm {T}} ^\text {miss} \) of at least 30\(\,\text {GeV}\) is required, the invariant mass of three leptons is required to be larger than \(100\,\text {GeV} \), and the invariant masses of all opposite-sign, same-flavor lepton pairs are required to be larger than \(4\,\text {GeV} \). Backgrounds in this CR are similar to those in the SR, with a sizeable nonprompt background from the DY+jets process, where a jet is misidentified as a lepton. All background estimates for this CR are taken from simulation.

The \(\mathrm {W}\) boson \(p_{\mathrm {T}} \) (“emulated \(p_{\mathrm {T}} ^\text {miss}\) ”) is estimated by calculating the vectorial sum of the \({\vec p}_{\mathrm {T}}^{\text {miss}} \) vector and the transverse momentum vector (\({\vec p}_{\mathrm {T}} \)) of the third charged lepton. In simulation, the majority (over \(70\%\)) of \(\mathrm {W}\text{ Z } \) background contamination in the signal region originates from events where over \(90\%\) of the \(\mathrm {W}\) boson transverse momentum is carried by one or more neutrinos from the \(\mathrm {W}\) boson decay. Thus, the majority of the W boson rapidity distribution in the SR is central, although it is less central than in the WZ CR. Neither the SR nor the \(\mathrm {W}\text{ Z } \) CR topology can probe the W boson rapidity directly. However, for the \(\mathrm {W}\text{ Z } \) CR, good agreement between data and simulation in the third lepton pseudorapidity distributions is observed.

A minor source of WZ background contamination in the SR originates from events where the visible lepton from a \(\mathrm {W}\) boson decay failed identification requirements. Data-to-simulation discrepancies in this contribution would also manifest in the measured \(\mathrm {W}\text{ Z } \) CR \(p_{\mathrm {T}} ^\text {miss} \) distribution, for which no such mismodeling effects are evident.

Using the emulated \(p_{\mathrm {T}} ^\text {miss} \) in place of the reconstructed \(p_{\mathrm {T}} ^\text {miss} \), the same selection is applied as for the SR. However, since there is no danger of CR contamination from \(\mathrm {W}\text{ Z } \rightarrow \tau \nu \ell \ell \) or top quark backgrounds, no veto on additional \(\tau _\mathrm {h} \) or b jet candidates is applied. The resulting emulated \(p_{\mathrm {T}} ^\text {miss} \) spectrum is shown in Fig. 2 (upper left).

Fig. 2
figure 2

Emulated \(p_{\mathrm {T}} ^\text {miss} \) distribution in data and simulation for the \(\mathrm {W}\text{ Z } \rightarrow 3\ell \nu \) (upper left) and \(\text{ Z } \text{ Z } \rightarrow 4\ell \) (upper right) CRs, and the ratio between both distributions (lower). No events are observed with emulated \(p_{\mathrm {T}} ^\text {miss} >500\,\text {GeV} \) in either channel. Uncertainty bands correspond to the combined statistical and systematic components

8.1.2 The ZZ control region

The \(\text{ Z } \text{ Z } \) control region is formed from events with four well-reconstructed charged leptons. In addition to a signal-like \(\text{ Z } \) boson candidate, a second \(\text{ Z } \) boson candidate is required, the constituents of which only need to pass relaxed lepton quality requirements. This choice reflects the very high purity of the four-lepton selection. For both candidates, the same \(\text{ Z } \) boson mass constraint as in the SR is applied. Backgrounds, dominated by triboson processes, are almost negligible in this CR and are taken from simulation.

Similar to the \(\mathrm {W}\text{ Z } \) case, the emulated \(p_{\mathrm {T}} ^\text {miss} \) is calculated as the vectorial sum of the \({\vec p}_{\mathrm {T}}^{\text {miss}} \) and the \({\vec p}_{\mathrm {T}} \) of the \(\text{ Z } \) boson with the larger mass difference to the nominal value of \(m_{\text{ Z }}\) of the two identified in the event. The choice of which \(\text{ Z } \) boson to use as a proxy for an invisibly decaying one does not significantly alter the emulated \(p_{\mathrm {T}} ^\text {miss} \) spectrum. In this CR, the rapidity of the proxy boson is observable, for which good agreement between data and simulation is found.

The same selection as in the SR is then applied using the emulated \(p_{\mathrm {T}} ^\text {miss} \) in place of the reconstructed \(p_{\mathrm {T}} ^\text {miss} \), with the exception of the \(\tau \) lepton and b jet vetoes. The resulting emulated \(p_{\mathrm {T}} ^\text {miss} \) spectrum is shown in Fig. 2 (upper right).

8.1.3 The VV ratio constraints

Due to a limited event count in the \(\text{ Z } \text{ Z } \) control region, the normalizations of the \(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) processes in the \(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) CRs and the SR are controlled by a single free parameter in the maximum likelihood fit, with their relative normalizations fixed by the theoretical predictions for the \(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) processes in each \(p_{\mathrm {T}} ^\text {miss}\) bin. The predictions for these processes are obtained from fully reconstructed simulated events generated as described in Sect. 5 with the following additional higher-order corrections applied:

  • a constant (approximately \(10\%\)) correction for the \(\mathrm {W}\text{ Z } \) cross section from NLO to NNLO in QCD calculations [76];

  • a constant (approximately \(3\%\)) correction for the \(\mathrm {W}\text{ Z } \) cross section from LO to NLO in electroweak (EW) calculations, considering also photon-quark initial states, according to Ref. [77];

  • a \({\varDelta }\phi (\text{ Z },\text{ Z })\)-dependent correction, varying in magnitude up to \(15\%\), to \(\text{ Z } \text{ Z } \) production cross section from NLO to next-to-next-to-leading order (NNLO) in QCD calculations [78];

  • a \(p_{\mathrm {T}} \)-dependent correction, varying in magnitude up to \(20\%\) at high \(p_{\mathrm {T}} ^\text {miss} \), to the \(\text{ Z } \text{ Z } \) cross section from LO to NLO in EW calculations, following Refs. [77, 79, 80], which is the dominant correction in the signal region.

We use the product of the above NLO EW corrections and the inclusive NLO QCD corrections [81] as an estimate of the missing \(\text {NLO EW}\times \text {NLO QCD}\) contribution, which is not used as a correction, but rather assigned as an uncertainty. The uncertainties in the \(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) EW corrections are assumed to be anticorrelated as a conservative measure. The uncertainty associated with the NNLO QCD corrections for both processes is represented by the QCD scale variation uncertainties evaluated on the NLO QCD simulation sample for the respective process, as described in Sect. 10. Figure 2 (lower) shows the ratio of \(\text{ Z } \text{ Z } \) to \(\mathrm {W}\text{ Z } \) CR yields per \(p_{\mathrm {T}} ^\text {miss} \) bin, which probes the validity of taking the relative normalizations from simulation. Good agreement is observed between data and simulation.

8.2 Nonresonant backgrounds

The contribution of the nonresonant flavor-symmetric backgrounds is estimated from a CR based on events with two leptons of different flavor (\(\mathrm {e}^{\pm }\mathrm {\mu }^{\mp }\)) that pass all other analysis selections. Nonresonant background (NRB) consists mainly of leptonic \(\mathrm {W}\) boson decays in \({\text{ t }\overline{\text{ t }}} \), \(\text{ t } \mathrm {W}\), and \(\mathrm {W}\mathrm {W}\) events, where the dilepton mass happens to fall inside the \(\text{ Z } \) boson mass window. Small contributions from single top quark events produced via s- and t-channel processes, and \(\text{ Z } \rightarrow \tau \tau \) events in which \(\tau \) leptons decay into light leptons and neutrinos are also considered in the NRB estimation.

The method assumes lepton flavor symmetry in the final states of these processes. Since the leptonic decay branching fraction to the \(\mathrm {e}\mathrm {e}\), \(\mu \mu \), and \(\mathrm {e}\mu \) final states from NRB are 1:1:2, the \(\mathrm {e}\mu \) events selected inside the \(\text{ Z } \) boson mass window can be extrapolated to the \(\mathrm {e}\mathrm {e}\) and \(\mu \mu \) channels. To account for differences in efficiency for electrons and muons, a correction factor \(k_{\mathrm {e}\mathrm {e}}\) is derived by comparing the NRB yields for the \(\mathrm {e}\mathrm {e}\) and \(\mu \mu \) channels:

$$ k_{\mathrm {e}\mathrm {e}} = \frac{\epsilon _{\mathrm {e}}}{\epsilon _{\mu }} = \sqrt{\frac{N^{\mathrm {e}\mathrm {e}}_\mathrm {NRB}}{N^{\mu \mu }_\mathrm {NRB}}} $$

under the assumption that there are no efficiency correlations between the two leptons. In simulation, \(k_{\mathrm {e}\mathrm {e}}\) is found to be about 0.88 for the final selection. With this correction factor, the relation between the NRB yields in the SR and CR is:

$$\begin{aligned} N^{\ell \ell }_\mathrm {NRB} = \frac{1}{2} \left( k_{\mathrm {e}\mathrm {e}} + \frac{1}{k_{\mathrm {e}\mathrm {e}}} \right) N^{\mathrm {e}\mu }_\mathrm {NRB}. \end{aligned}$$

The ratio of the NRB contributions in the SR and CR is fixed by this relation. Their normalization is controlled by a common scaling parameter that is left to float in the maximum likelihood fit. Perturbations in the predicted transfer factor due to data-to-simulation discrepancies in \(k_{\mathrm {e}\mathrm {e}}\) are suppressed upon summing the \(\mathrm {e}\mathrm {e}+ \mu \mu \) channels. The uncertainty in the transfer factor is set conservatively to 20%.

8.3 The Drell–Yan background

The DY background is dominant in the region of low \(p_{\mathrm {T}} ^\text {miss} \). This process does not produce undetectable particles, therefore any nonzero \(p_{\mathrm {T}} ^\text {miss} \) arises from the limited detector acceptance and mismeasurement. The estimation of this background uses simulated DY events, for which the normalization is taken from data in a sideband CR of \(50 \le p_{\mathrm {T}} ^\text {miss} \le 100 \,\text {GeV} \), with all other selections applied. In two CRs where a larger DY background contribution is expected, regions with inverted selections on \({\varDelta }\phi ({\vec p}_{\mathrm {T}} ^{\, \ell \ell },{\vec p}_{\mathrm {T}}^{\text {miss}})\) and on \(|p_{\mathrm {T}} ^\text {miss}-p_{\mathrm {T}} ^{\, \ell \ell }|/p_{\mathrm {T}} ^{\, \ell \ell }\), the simulation is found to model the data well. The sideband CR is included in the maximum likelihood fit, for which the normalization factor is found to be consistent with unity, and a 100% uncertainty is assigned to the resulting DY estimate in order to cover the extrapolation from this CR to the SR. This uncertainty has little effect on the results owing to the small overall contribution from the DY process in the high-\(p_{\mathrm {T}} ^\text {miss} \) SR of this analysis.

9 Multivariate analysis

For the specific interpretation of this analysis involving invisible decays of the SM (125\(\,\text {GeV}\)) Higgs boson, a maximum likelihood fit is performed to the spectrum of the BDT classifier values for events satisfying the BDT SR criteria described in Sect. 7, with the classifier value between 0.2 and 1. The CR strategy is identical to that in the \(p_{\mathrm {T}} ^\text {miss} \)-based analysis, as described in Sect. 8. The three- and four-lepton events shown in Fig. 3 are chosen using the same CR selections as in the \(p_{\mathrm {T}} ^\text {miss} \)-based analysis.

The multivariate classifier improves the sensitivity of the analysis to the SM H(inv.) model by 10% compared to the \(p_{\mathrm {T}} ^\text {miss} \)-based analysis. Other than the \(p_{\mathrm {T}} ^\text {miss} \) itself, the variables that provide the most discrimination power are the transverse masses of each lepton with respect to the \({\vec p}_{\mathrm {T}}^{\text {miss}} \), along with the azimuthal separation between the \({\vec p}_{\mathrm {T}}^{\text {miss}} \) and the dilepton system momentum. Utilization of this classifier for the other signal models considered in this paper was not pursued, as many of the models’ kinematic distributions can vary considerably over the relevant parameter space.

Fig. 3
figure 3

Distribution of the BDT classifier in the diboson CRs: (left) \(\mathrm {W}\text{ Z } \) CR; (right) \(\text{ Z } \text{ Z } \) CR. Uncertainty bands correspond to the combined statistical and systematic components

10 Efficiencies and systematic uncertainties

The efficiency for all backgrounds is estimated using simulation. The uncertainties in the yields from missing higher-order corrections in signal as well as \(\text{ Z } \text{ Z } \) and \(\mathrm {W}\text{ Z } \) background cross sections are evaluated by independently varying up and down the factorization and renormalization scales by a factor of two. The effect of these variations on the yields is between 5 and 10%. For the \(\text{ Z } \text{ Z } \) and \(\mathrm {W}\text{ Z } \) backgrounds, additional uncertainties related to known higher-order corrections are applied, as discussed in Sect. 8.

For the Higgs boson signal, the PDF and \(\alpha _{s}\) uncertainties comprise the cross section normalization uncertainties computed by the LHC Higgs cross section working group [82] and the effect on the signal acceptance of varying the PDFs and \(\alpha _{s}\) within their uncertainties [83]. For other signal models, as well as the \(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) backgrounds, the effects of the PDF and \(\alpha _{s}\) uncertainties in the signal acceptance are taken into account following the PDF4LHC prescription [83]. The PDF and \(\alpha _{s}\) uncertainties on these processes are found to be about 1–2%.

The efficiencies for triggering on, reconstructing, and identifying isolated leptons are obtained from simulation, and corrected with scale factors determined via a “tag-and-probe” technique [84] applied to \(\text{ Z } \rightarrow \ell ^+\ell ^-\) events in data. The associated uncertainty is about 1–2% per lepton. An additional 3% uncertainty associated with the \(\mathrm {W}\text{ Z } \rightarrow \ell \nu \ell \ell \) events, where the reconstructed lepton from the \(\mathrm {W}\) boson decay fails identification, is also included.

In order to reproduce b tagging efficiencies and light-flavor jet mistag rates observed in data, an event-by-event reweighting using data-to-simulation scale factors [72] is applied to simulated events. The uncertainty associated with this procedure is obtained by varying the event-by-event weight by \({\pm }1\) standard deviation (s.d.). The impact on the final yields due to the b tagging efficiency and mistag rate uncertainties is around 1% for both signal and background.

The impacts of the jet energy scale and resolution uncertainties are estimated by shifting reconstructed jet energies in simulation by \({\pm }1\) s.d., and each is found to have an effect of about 2% on the yields of the simulated processes after all selections are applied. The impacts of the electron and muon energy scales are evaluated in the same manner, and have a similar effect. Uncertainties in the \(p_{\mathrm {T}} ^\text {miss} \) measurement due to the energy resolution of unclustered PF candidates (i.e., those not associated with an electron, muon, or jet) amount to about 2%.

The uncertainty in the expected yields due to the finite size of the MC samples is considered, and is around 1% for the signal and main backgrounds. The simulated MC samples are reweighted to reproduce the pileup conditions observed in data. The uncertainty related to this procedure is obtained by varying the central value of the estimated inelastic cross section by 5% [85], and is found to be below 1%. The uncertainty assigned to the integrated luminosity measurement is 2.5% [86].

The effect of the systematic uncertainties on the shape of the distribution of the discriminating variable (\(p_{\mathrm {T}} ^\text {miss} \) or BDT classifier) is taken into account by varying the value of the quantity associated with the uncertainty, and observing the resulting variations in the individual bins of \(p_{\mathrm {T}} ^\text {miss} \).

In addition to all of the sources of systematic uncertainty in the \(p_{\mathrm {T}} ^\text {miss} \)-based analysis, the following systematic uncertainties in the BDT-based analysis affect the BDT classifier shape. The most important sources of uncertainty in the BDT classifier shape are the lepton energy scale and \(p_{\mathrm {T}} ^\text {miss}\) uncertainties; their impact on the signal (\(\mathrm {W}\text{ Z } \) and \(\text{ Z } \text{ Z } \) backgrounds) amounts to about 2 (6)% and translates into an additional 2% uncertainty in the expected limit on the H(inv.) branching fraction.

All these sources of uncertainty are summarized in Table 2. The combined uncertainty in the signal efficiency and acceptance is estimated to be about 5% and is dominated by the theoretical uncertainty due to missing higher-order corrections and PDF uncertainties. The total uncertainty in the background estimations in the signal region is about 15%, dominated by the theoretical uncertainties in the \(\text{ Z } \text{ Z } \) and \(\mathrm {W}\text{ Z } \) process description.

Table 2 Summary of the systematic uncertainties for the \(p_{\mathrm {T}} ^\text {miss}\)- and BDT-based analyses. Each uncertainty represents the variation of the relative yields of the processes in the SR. Each uncertainty is fully correlated across processes to which it contributes, including those processes that are also present in CRs. The symbol “—” indicates that the systematic uncertainty does not contribute or is deemed negligible. For minor backgrounds, systematic uncertainties are omitted because of the smallness of their contribution. For shape uncertainties (indicated with a *), the numbers correspond to the overall effect of the shape variation on the yield or acceptance. The impact on the expected upper limit for the signal strength, i.e., the relative decrease in the median expected upper limit for the signal strength upon removing the nuisance term, is evaluated with respect to the SM H(inv.) signal and presented in the last column. In this column the number in parentheses shows the impact on the BDT-based analysis, if different from that for the \(p_{\mathrm {T}} ^\text {miss}\)-based analysis. The last part of the table provides the additional uncertainties in the BDT-based analysis

11 Results

The numbers of observed and expected events for the \(p_{\mathrm {T}} ^\text {miss}\)-based analysis are shown in Table 3. There is no significant difference between the dielectron and dimuon channels in terms of signal-to-background ratio, and hence both are treated together when obtaining the final results. The observed number of events in the \(\mathrm {e}\mathrm {e}\) (\(\mu \mu \)) channel is 292 (406), and the number of events expected from simulation is \(301\pm 23\) (\(391\pm 26\)). Figure 4 shows the \(p_{\mathrm {T}} ^\text {miss} \) distribution in the \(\mathrm {e}\mathrm {e}+\mu \mu \) channel in the SR. The total background estimates and the observed numbers of events in each \(p_{\mathrm {T}} ^\text {miss} \) bin are listed in Table 4, for both a combined background-only fit to the SR and the CRs, as well as for a fit to the CRs only. The latter results can be used in conjunction with the SR bin correlation matrix presented in the supplemental material 1 to recast these results in the simplified likelihood framework [87].

Table 3 Signal predictions, post-fit background estimates, and observed numbers of events in the \(p_{\mathrm {T}} ^\text {miss}\)-based analysis. The combined statistical and systematic uncertainties are reported
Table 4 Expected event yields in each \(p_{\mathrm {T}} ^\text {miss} \) bin for the sum of background processes in the SR. The background yields and their corresponding uncertainties are obtained after performing a fit to data. Two sets of background yields are reported: one from a background-only fit to data in both the SR and the CRs, and one from a fit to data in all CRs, but excluding data in the SR. The observed numbers of events in each bin are also included
Fig. 4
figure 4

Distribution of the \(p_{\mathrm {T}} ^\text {miss} \) in the combination of the \(\mathrm {e}\mathrm {e}\) and \(\mu \mu \) channels after the full selection. The last bin also includes any events with \(p_{\mathrm {T}} ^\text {miss} >600\,\text {GeV} \). The uncertainty band includes both statistical and systematic components. The \(\text{ Z } \text{ H } (\text {inv.})\) signal normalization assumes SM production rates and the branching fraction \(\mathcal {B}(\text{ H } \rightarrow \text {inv.}) = 1\)

No deviation from the SM background expectation is found. Upper limits on the contribution of events from new physics are computed by using the modified frequentist approach \(CL_s\) [88, 89] based on asymptotic formulas [90, 91], via a simultaneous maximum likelihood fit to the SR and the CRs. The expected numbers of background events and signal events, scaled by a signal strength modifier, are combined in a profile likelihood test statistic, in which the systematic uncertainties are incorporated as nuisance parameters. For the dominant backgrounds in the SR, additional parameters are introduced to link the background expectations in the SR to their respective contributions in the CRs discussed in Sect. 8. To compute limits in all models, a binned likelihood test statistic is employed, based on the \(p_{\mathrm {T}} ^\text {miss} \) distribution in Fig. 4 and also on the BDT classifier distribution in the case of invisible decays of the SM Higgs boson.

11.1 Dark matter interpretation

Figure 5 shows the 95% CL expected and observed limits for vector and axial-vector scenarios with couplings \(g_{\mathrm {q}}=0.25\), \(g_\mathrm {DM}=1\). Figure 6 shows the 95% CL expected and observed limits for couplings \(g_{\mathrm {q}}=g_\mathrm {DM}= 1\) in the scalar and pseudoscalar scenarios. In Fig. 7, limits on the DM-nucleon scattering cross section are set at 90% CL as a function of the DM particle mass and compared to selected results from direct detection experiments. Both spin-dependent and spin-independent cases are considered. In both cases, couplings \(g_{\mathrm {q}}=0.25\) and \(g_\mathrm {DM}=1\) are used.

Fig. 5
figure 5

The 95% CL expected and observed limits on \(\sigma _\text {obs}/\sigma _\text {theo}\) for the vector (left) and axial-vector (right) mediators with \(g_{\mathrm {q}}=0.25\) and \(g_\mathrm {DM} = 1\). Limits are not shown for far off-shell (\(2m_\mathrm {DM} > 1.5 m_\text {med}\)) regions of the parameter space

Fig. 6
figure 6

The 95% CL expected and observed limits on \(\sigma _\text {obs}/\sigma _\text {theo}\) for the scalar (left) and pseudoscalar (right) mediated DM scenario with \(g_{\mathrm {q}}=g_\mathrm {DM}=1\). The limits are parameterized as a function of mediator mass \(m_\text {med}\) for a fixed dark matter mass \(m_\mathrm {DM}=1\,\text {GeV} \)

Fig. 7
figure 7

Observed 90% CL limits on the DM-nucleon scattering cross sections in both spin-independent (left) and spin-dependent (right) cases, assuming a mediator-quark coupling constant \(g_{\mathrm {q}} = 0.25\) and mediator-DM coupling constant \(g_\mathrm {DM} = 1\). Limits from the CRESST-II [92], CDMSLite [93], PandaX-II [94], LUX [95], and XENON1T [96] experiments are shown for the spin-independent case (vector couplings). Limits from the PICASSO [97], PICO-60 [98], Super-Kamiokande [99], and IceCube [100, 101] experiments are shown for the spin-dependent case (axial-vector couplings)

11.2 Limits on invisible Higgs boson decays

Upper limits are derived for the Higgs boson production cross section using the same \(p_{\mathrm {T}} ^\text {miss} \)-shape analysis as for the DM model. In addition, for \(m_{\text{H }}= 125\,\text {GeV} \), a shape analysis using the multivariate classifier distribution, as described in Sect. 9, is performed. The resulting post-fit signal region is shown in Fig. 8. The 95% CL expected and observed upper limits on the product of the production cross section and the branching fraction, \(\sigma _{\text{Z} \text{H}} \, \mathcal {B}(\text{ H } \rightarrow \text {inv.})\), computed with the asymptotic \(CL_s\) method are shown as a function of the SM-like Higgs boson mass in Fig. 9 for the \(p_{\mathrm {T}} ^\text {miss} \)-shape analysis. For \(m_{\text{ H }}= 125\,\text {GeV} \), the search can be interpreted as an upper limit on \(\mathcal {B}(\text{ H } \rightarrow \text {inv.})\) assuming the SM production rate of a Higgs boson in association with a \(\text{ Z } \) boson. Assuming the SM production rate, the 95% observed (expected) CL upper limit on \(\mathcal {B}(\text{ H } \rightarrow \text {inv.})\) is 0.45 (0.44) using the \(p_{\mathrm {T}} ^\text {miss} \)-shape analysis, and 0.40 (0.42) using the multivariate analysis. The \(\mathrm {gg} \rightarrow \text{ Z } (\ell \ell )\text{ H } \) process is considered only for the 125 GeV mass point, and only when interpreting the result as a limit on branching fraction. For SM-like Higgs production, considering only the \(\mathrm {qq}\rightarrow \text{ Z } (\ell \ell )\text{ H } \) process, upper limits on \(\mathcal {B}(\text{ H } \rightarrow \text {inv.})\) are presented as a function of \(m_{\text{ H }}\) in the supplemental material 1.

Fig. 8
figure 8

Post-fit distribution of the BDT classifier in the multivariate analysis signal region for the SM H(inv.) decay hypothesis with \(\mathcal {B}(\text{ H } \rightarrow \text {inv.}) = 100\%\). Uncertainty bands correspond to the combined statistical and systematic components

Fig. 9
figure 9

Expected and observed 95% CL upper limits on the product of the production cross section and the branching fraction, \(\sigma_{\mathrm{qq}\to\mathrm{ZH}}\mathcal{B}(\mathrm{H}\to\text{inv.})\), as a function of the SM-like Higgs boson mass. The limits consider only quark-induced Higgs boson production. In addition, for the SM (125\(\,\text {GeV}\)) Higgs boson, the limit on branching fraction assuming SM production rate (considering also gluon fusion) is presented. The vertical gray line indicates that the result at \(m_{\text{H }}=125\,\text {GeV} \) should not be read from the plot, as the gluon contribution is known for that point

11.3 Unparticle interpretation

In the unparticle scenario, a shape analysis of the \(p_{\mathrm {T}} ^\text {miss} \) spectrum is performed. Upper limits are set at 95% CL on the Wilson coefficient \(\lambda / {\varLambda }_{\textsf {U}} ^{d_{\textsf {U}}-1}\) of the unparticle-quark coupling operator, and are shown in Fig. 10 as a function of the scaling dimension \(d_\textsf {U}\).

Fig. 10
figure 10

The 95% CL upper limits on the Wilson coefficient \(\lambda / {\varLambda }_{\textsf {U}} ^{d_{\textsf {U}}-1}\) of the unparticle-quark coupling operator. The results from an earlier CMS search in the same final state [39] are shown for comparison

11.4 The ADD interpretation

In the framework of the ADD model of large extra dimensions, we calculate limits depending on the number of extra dimensions n and the fundamental Planck scale \(M_\mathrm {D}\). For each value of n, cross section limits are calculated as a function of \(M_\mathrm {D}\). By finding the intersection between the theory cross section line, calculated in the fiducial phase space of the graviton transverse momentum \(p_{\mathrm {T}} ^\mathrm{G} > 50\,\text {GeV} \), with the observed and expected excluded cross sections, and projecting that point onto the \(M_\mathrm {D}\) axis, we find limits on \(M_\mathrm {D}\) as a function of n, as shown in Fig. 11.

The observed and expected exclusion of \(M_\mathrm {D}\) ranges between 2.3 and 2.5\(\,\text {TeV}\) for n between 2 and 7, at 95% CL.

Fig. 11
figure 11

Expected and observed 95% CL cross section exclusion limits for the example case \(n=4\) in the ADD scenario (left) and exclusion limits on \(M_\mathrm {D}\) for different values of n (right). In both plots, the markers for the expected exclusion are obscured by the close overlap with those for the observed exclusion. The red solid line in the left plot shows the theoretical cross section for the case \(n = 4\). Cross sections are calculated in the fiducial phase space of \(p_{\mathrm {T}} ^\mathrm{G} > 50\,\text {GeV} \). The vertical line in the left plot shows the projection onto the \(M_\mathrm {D}\) axis of the intersection of the theory curve with the expected and observed exclusion limits

12 Summary

A search for new physics in events with a leptonically decaying \(\text{ Z } \) boson and large missing transverse momentum has been presented. The search is based on a data set of proton-proton collisions collected with the CMS experiment in 2016, corresponding to an integrated luminosity of \(35.9 \pm 0.9{\,\text {fb}^{-1}} \) at \(\sqrt{s} = 13\,\text {TeV} \). No evidence for physics beyond the standard model is found. Compared to the previous search in the same final state [31], the exclusion limits on dark matter and mediator masses are significantly extended for spin-1 mediators in the simplified model interpretation, and exclusion limits for unparticles are also extended. Results for dark matter production via spin-0 mediators in the simplified model interpretation, as well as graviton emission in a model with large extra dimensions, are presented in this final state for the first time. In the case of invisible decays of a standard-model-like Higgs boson, the upper limit of 40% on their branching fraction is set at 95% confidence level, using data not included in the previously published combined analysis [36].