1 Introduction

Extensions to the Standard Model (SM) may include heavy gauge bosons that could be discovered at the Large Hadron Collider (LHC) [1]. For example, heavy gauge bosons are predicted in left-right symmetric models [2, 3] or in the little Higgs model [4]. Conceptually, these particles are heavier versions of the SM W and Z bosons and are generically referred to as \(W^\prime \) and \(Z^\prime \) bosons. The Sequential Standard Model (SSM) [5] posits a \(W'_{\mathrm {SSM}}\) boson with couplings to fermions that are identical to those of the SM W boson. This model represents a good benchmark as the results can be interpreted in the context of other models of new physics, and is useful for comparing the sensitivity of different experiments.

This paper presents a search for a \(W^\prime \) boson conducted in the \(W^\prime \rightarrow \ell \nu \) channel. In the following, the term lepton (\(\ell \)) is used to refer to an electron or a muon. The analysis uses events with a high transverse momentumFootnote 1 (\(p_{\text {T}}\)) lepton and significant missing transverse momentum \(E_{\text {T}}^{\text {miss}}\), that is used to infer the presence of the neutrino in the event as it escapes direct detection. It is based on 36.1 \(\text{ fb }^{-1}\) of pp collision data collected with the ATLAS detector in 2015 and 2016 at a centre-of-mass energy of \(\sqrt{s} = 13\) TeV. The results are interpreted in the context of the SSM. The signal discriminant is the transverse mass, which is defined as \(\smash [b]{m_{\mathrm T}} = \sqrt{\smash [b]{2p_{\text {T}} E_{\text {T}}^{\text {miss}} (1-\cos \phi _{\ell \nu })}}\), where \(\smash [b]{\phi _{\ell \nu }}\) is the azimuthal angle between the directions of the lepton \(p_{\text {T}}\) and the \(E_{\text {T}}^{\text {miss}}\) in the transverse plane.

The most stringent limits on the mass of a \(W'_{\mathrm {SSM}}\) boson to date come from the searches in the \(W^\prime \rightarrow e \nu \) and \(W^\prime \rightarrow \mu \nu \) channels by the ATLAS and CMS collaborations using data taken at \(\sqrt{s} = 13\) TeV in 2015. The ATLAS analysis was based on data corresponding to an integrated luminosity of 3.2 \(\text{ fb }^{-1}\) and sets a 95% confidence level (CL) lower limit on the \(W'_{\mathrm {SSM}}\) mass of 4.07 TeV [6]. The CMS Collaboration used 2.4 \(\text{ fb }^{-1}\) of data and excludes \(W'_{\mathrm {SSM}}\) masses below 4.1 TeV at 95% CL [7]. The sensitivity of the search presented here is significantly improved compared to these earlier searches due to the larger dataset.

2 ATLAS detector

The ATLAS experiment [8] at the LHC is a multipurpose particle detector with a forward-backward symmetric cylindrical geometry and a near \(4 \pi \) coverage in solid angle. It consists of an inner detector (ID) for tracking surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field, electromagnetic (EM) and hadronic calorimeters, and a muon spectrometer (MS). The ID covers the pseudorapidity range \(|\eta |<2.5\). It consists of a silicon pixel detector including an additional inner layer located at a radius of 3.2 cm since 2015 [9], followed by silicon microstrip and transition radiation tracking detectors. Lead/liquid-argon (LAr) sampling calorimeters provide EM energy measurements with high granularity. A hadronic (steel/scintillator-tile) calorimeter covers the central pseudorapidity range (\(|\eta |<1.7\)). The end-cap and forward regions are instrumented with LAr calorimeters for both the EM and hadronic energy measurements up to \(|\eta |=4.9\). The muon spectrometer surrounds the calorimeters and is based on three large air-core toroidal superconducting magnets with eight coils each. The field integral experienced by tracks in the toroidal field ranges between 2.0 and 6.0 T m for most pseudorapidities. The MS includes a system of precision tracking chambers, over \(|\eta | < 2.7\), and fast detectors for triggering, over \(|\eta | < 2.4\). A two-level trigger system is used to select events [10]. The first-level trigger is implemented in hardware and uses a subset of the detector information. This is followed by a software-based trigger system that reduces the accepted event rate to less than 1 kHz.

3 Analysis strategy and modelling of signal and background processes

A \(W^\prime \) signal would appear as an excess of events above the SM background at high \(m_{{\mathrm {T}}}\). The SM background mainly arises from processes with at least one prompt final-state lepton, with the largest source being the charged-current Drell–Yan (DY) W boson production, where the W boson decays into an electron or muon and a neutrino. The second largest source is top-quark pair (\(t\bar{t}\)) and single-top-quark production, denoted in the following as “top-quark background”. Other non-negligible contributions are from the neutral-current DY (\(Z/\gamma ^*\)) process, diboson production, as well as from events in which one final-state jet or photon satisfies the lepton selection criteria. This last component of the background, referred to in the following as the multijet background, receives contributions from multijet, heavy-flavour quark and \(\gamma \) + jet production. The multijet background is determined using a data-driven method, while the other backgrounds are modelled by Monte Carlo (MC) simulations.

The backgrounds from \(W \rightarrow \ell \nu \), \(Z/\gamma ^*\rightarrow \ell \ell \), \(W \rightarrow \tau \nu \), and \(Z/\gamma ^*\rightarrow \tau \tau \) were simulated using the Powheg-Box v2 [11] matrix-element calculation up to next-to-leading order (NLO) in perturbative quantum chromodynamics (pQCD), interfaced to the Pythia 8.186 [12] parton shower model and using the CT10 parton distribution function (PDF) set [13]. The final-state photon radiation (QED FSR) was modelled by the Photos [14] MC simulation. The samples are normalised as a function of the boson invariant mass to a next-to-next-to-leading order (NNLO) pQCD calculation using the numerical programme VRAP  which is based on Ref. [15] and the CT14NNLO PDF set [16]. Compared to the NLO prediction using CT10, the NNLO prediction using CT14 gives a higher cross-section by about 5% at a boson invariant mass of 1 TeV and 10% at 5 TeV. In addition to the modelling of QED FSR, a fixed-order electroweak (EW) correction to NLO is calculated as a function of the boson mass with the Mcsanc [17, 18] event generator at leading order (LO) in pQCD. This correction is added to the NNLO QCD cross-section prediction in the so-called additive approach (see Sect. 6.2) because of a lack of calculations of mixed QCD and EW terms, and lowers the predicted cross-section by an increasing amount as function of the mass, reaching about 10% at 1 TeV and 20% at 5 TeV. The \(W \rightarrow \ell \nu \) and \(Z/\gamma ^*\rightarrow \ell \ell \) events were simulated as multiple samples covering different ranges of the boson invariant mass. This ensures that a large number of MC events is available across the entire \(m_{{\mathrm {T}}}\) region probed in this analysis.

The background from \(t\bar{t}\) production was generated using Powheg-Box v2, with parton showering and hadronisation modelled by Pythia 6.428 [19], using the CT10 PDF set. The \(t\bar{t}\) cross-section is normalised to \(\sigma _{t\bar{t}}= 832\) pb as calculated with the Top++2.0 program at NNLO in pQCD, including soft-gluon resummation to next-to-next-to-leading logarithmic accuracy (see Ref. [20] and references therein). The top-quark mass is set to 172.5 GeV. The single-top-quark production in the Wt channel and EW t-channel was simulated using the same event generators and PDF sets as for the \(t\bar{t}\) process, with the exception that the Powheg-Box v1 program was used for producing events in the t-channel. Diboson events were simulated with the Sherpa 2.1.1 [21] event generator using the CT10 PDF set. As the simulated top-quark and diboson samples are statistically limited at large \(m_{{\mathrm {T}}}\), the expected number of events from each of these backgrounds is extrapolated into the high-\(m_{{\mathrm {T}}}\) region. This is achieved by fitting the lower part of the \(m_{{\mathrm {T}}}\) distributions to functions of the form \(F (x) = a \, x^{b+c \log x}\) and \(F (x) = d/ (x + e)^g\), where \(x = m_{\mathrm T}/\sqrt{s}\), and using the fitted function to predict the background at higher \(m_{{\mathrm {T}}}\). Various fit ranges are used, which typically start between 140 and 360 GeV and extend up to 500–1300 GeV. The fits with the best \(\chi ^2/{\text {d.o.f.}}\) are used for the extrapolation and the results of these fits are used in the high-\(m_{{\mathrm {T}}}\) tail.

The multijet background is estimated from data using the same data-driven matrix method as used in the previous ATLAS analysis [6]. The first step of the matrix method is to calculate the fraction f of lepton candidates that pass the nominal lepton identification and isolation requirements (tight), with respect to a sample of loose lepton candidates in a background-enriched sample. These loosely selected candidates satisfy only a subset of the nominal criteria, which is stricter than the trigger requirements imposed. Potential contamination of prompt final-state leptons in the background-enriched sample is accounted for using MC simulation. In addition, the fraction r of real leptons in the sample of loose candidates satisfying the nominal requirements is used. This fraction is computed from MC simulation. The number of jets and photons misidentified as leptons (\(N_{\mathrm T}^{\mathrm {multijet}}\)) in the total number of candidates passing the signal selection (\(N_{\mathrm T}\)) is

$$\begin{aligned} N_{\mathrm T}^{\mathrm {multijet} } = f \, N_{\mathrm F} = \frac{f}{r-f} \left( \, r\, (N_{\mathrm L} + N_{\mathrm T}) - N_{\mathrm T} \right) , \end{aligned}$$
(1)

where \(N_\mathrm {F}\) is the number of fake leptons and \(N_\mathrm {L}\) corresponds to leptons that pass the loose requirements but fail the nominal requirements. As this background estimate is statistically limited at large \(m_{{\mathrm {T}}}\), the expected number of events is extrapolated into the high-\(m_{{\mathrm {T}}}\) region using a method similar to that for the diboson and top-quark backgrounds.

The SSM signal \(W^\prime \rightarrow e \nu \) and \(W^\prime \rightarrow \mu \nu \) samples were generated at LO in QCD using the Pythia 8.183 event generator and the NNPDF2.3 LO PDF set [22]. As assumed in the SSM, the couplings to fermions are equal to those of the SM W boson. The \(W^\prime \) boson is assumed not to couple to the SM W and Z bosons and interference between the \(W^\prime \) and the SM W boson production amplitudes is neglected. The decay \(W^\prime \rightarrow \tau \nu \), where the \(\tau \) subsequently decays leptonically, is not treated as part of the signal as this contribution was quantified previously and found to give a negligible contribution to the sensitivity [23]. Mass-dependent correction factors are applied to normalise the samples to the same mass-dependent NNLO pQCD calculation as used for the W background. Compared to the LO prediction using NNPDF2.3 LO, the corrections increase the cross-section by about 40% around a boson invariant mass of 1–2 TeV, and by about 10% at 5 TeV. Further EW corrections beyond QED FSR are not considered for the signal. The resulting cross-sections times branching ratio for \(W'_{\mathrm {SSM}}\) masses of 3, 4 and 5 TeV are 15.3, 2.25 and 0.51 fb, respectively. For these \(W^\prime \) masses the branching ratio to each lepton generation from Pythia is 8.2%.

The MC samples were processed through a simulation of the detector geometry and response [24] using the Geant4 [25] framework. The software used for the reconstruction is the same for both simulated and real data. The average number of pile-up interactions (additional pp collisions in the same or a nearby bunch crossing) observed in the data is about 23. The effect of pile-up is modelled by overlaying simulated inelastic pp collision events selected using very loose trigger requirements (“minimum bias”). All MC samples are reweighted so that the distribution of the number of collisions per bunch crossing matches the data. Correction factors to account for differences observed in the detector response between data and simulation are applied to the lepton trigger, reconstruction, identification [26, 27] and isolation efficiencies as well as the lepton energy/momentum resolutions and scales [27, 28].

4 Event reconstruction

The analysis makes use of electrons, muons, and missing transverse momentum, whose reconstruction and identification are explained in the following.

Electrons are reconstructed from ID tracks that are matched to energy clusters in the electromagnetic calorimeter obtained using a sliding-window algorithm in the range \(|\eta | < 2.47\). Candidates in the transition region between different electromagnetic calorimeter components, \(1.37< |\eta | < 1.52\), are rejected. Electrons must satisfy identification criteria based on measurements of shower shapes in the calorimeter and measurements of track properties from the ID combined in a likelihood discriminant. Depending on the desired level of background rejection, loose, medium and tight working points are defined. Full details of the electron reconstruction, identification and selection working points can be found in Ref. [26].

Muon candidates are identified from MS tracks that match tracks in the ID, with \(|\eta |<2.5\)  [27]. These muons are required to pass a track quality selection based on the number of hits in the ID. They are rejected if the absolute value of the difference between their charge-to-momentum ratios measured in the ID and MS divided by the sum in quadrature of the corresponding uncertainties is large. To ensure optimal muon resolution at high \(p_{\text {T}}\), additional requirements are imposed on the quality of the MS track. The track is required to have at least three hits in each of the three separate layers of MS chambers. Furthermore, to avoid \(p_{\text {T}}\) mismeasurements, muons are removed if they cross either poorly aligned MS chambers, or regions in which the ID and the MS are not well aligned relative to one another.

The ID tracks associated with electron and muon candidates are required to be consistent with originating from the primary interaction vertex, which is defined as the vertex whose constituent tracks have the highest sum of \(p_{\mathrm T}^2\). The transverse impact parameter with respect to the beam line, \(d_0\), divided by its estimated uncertainty must satisfy \(|d_0|/\sigma (d_0) < 5\ (3)\) for electrons (muons). For muons, the longitudinal impact parameter, which is the distance between the z-position of the point of closest approach of the muon track in the ID to the beamline and the z-coordinate of the primary vertex, must fulfil \(|\Delta z_0| \times \sin \theta < 0.5\) mm. Both the electrons and muons are required to be isolated with respect to other particles in the event. The sum of the \(p_{\text {T}}\) of tracks that fall inside an isolation cone around the lepton (excluding the track of the lepton itself) divided by the lepton \(p_{\text {T}}\) has to be below a \(p_{\text {T}}\)-dependent threshold. The isolation cone size \(\Delta R = \sqrt{(\Delta \eta )^2 + (\Delta \phi )^2}\) is defined as \(10~\text {GeV}\) divided by the lepton \(p_{\text {T}}\) and the cone has a maximum size of \(\Delta R = 0.3\) for muons and \(\Delta R = 0.2\) for electrons. For electrons, calorimeter-based isolation is also required. The sum of the calorimeter transverse energy deposits in an isolation cone of fixed size \(\Delta R = 0.2\) (excluding the energy deposits of the electron itself) divided by the electron \(p_{\text {T}}\) is used as the discriminating variable. The calorimeter- and track-based isolation criteria depend on \(p_{\text {T}}\) and \(\eta \), and are optimised for an overall efficiency of 98% (99%) for electrons (muons).

The missing transverse momentum is reconstructed as the negative vectorial sum of the calibrated momenta of electrons, muons and jets, where the electrons and muons are required to satisfy the selection criteria described above [29]. The jets used in the calculation are reconstructed in the region \(|\eta | < 4.9\) from topological clusters [30] in the calorimeter using the anti-\(k_t\) algorithm [31] with a radius parameter of 0.4. They are calibrated using the method described in Ref. [32] and are required to have \(p_{\text {T}} ~>~20\) GeV. The computation of \(E_{\text {T}}^{\text {miss}}\) also includes tracks associated with the primary vertex from activity not associated with electrons, muons or jets.

5 Event selection and background estimation

Events in the muon channel were recorded by a trigger requiring that at least one muon with \(p_{\text {T}} > 50\) GeV is found. These muons must be reconstructed in both the MS and the ID. In the electron channel during the 2015 (2016) data-taking period, events were recorded by a trigger requiring at least one electron with \(p_{\text {T}} > 24~(60)\) GeV which satisfied the medium identification criteria, or at least one electron with \(p_{\text {T}} > 120~(140)\) GeV which satisfied the loose identification criteria. The identification criteria for electrons at trigger level are similar to those used in the offline reconstruction [26].

Events recorded by the trigger are further selected by requiring that they contain exactly one lepton. In the muon channel, the magnitude of \(E_{\text {T}}^{\text {miss}}\) must exceed 55 GeV and the muon has to fulfil the tight requirements for high-\(p_{\text {T}}\) muons detailed in Sect. 4 and have \(p_{\text {T}} > 55\) GeV. In the electron channel, the electron must satisfy the tight identification criteria, and the electron \(p_{\text {T}}\) and the magnitude of \(E_{\text {T}}^{\text {miss}}\) must both exceed 65 GeV. Events in both channels are vetoed if they contain additional leptons satisfying loosened selection criteria, namely electrons with \(p_{\text {T}} > 20\) GeV satisfying the medium identification criteria or muons with \(p_{\text {T}} > 20\) GeV passing the muon selection without the stringent requirements on the MS track quality. In addition, the transverse mass is required to exceed 110 (130) GeV in the muon (electron) channel. The acceptance times efficiency, defined as the fraction of simulated signal events that pass the event selection described above, is \(50\%\) (\(47\%\)) for the muon channel and \(81\%\) (\(77\%\)) for the electron channel for a \(W^\prime \) mass of 2 TeV (4 TeV). The difference in lepton sensitivity results from lower muon trigger efficiency and, due to the very strict muon selection criteria applied, a lower muon identification efficiency.

Fig. 1
figure 1

Transverse mass distributions for events satisfying all selection criteria in the a electron and b muon channels. The distributions in data are compared to the stacked sum of all expected backgrounds. As examples, expected signal distributions for three different SSM \(W^\prime \) boson masses are shown on top of the SM prediction. The bin width is constant in \(\log (m_{\mathrm {T}})\). The middle panels show the ratios of the data to the expected background, with vertical bars representing both data and MC statistical uncertainties. The lower panels show the ratios of the data to the adjusted expected background (“post-fit”) that results from the statistical analysis. The bands in the ratio plots indicate the sum in quadrature of the systematic uncertainties discussed in Sect. 6, including the uncertainty in the integrated luminosity

Table 1 The numbers of expected events from the total SM background and SSM \(W^\prime \) signal and the numbers of observed events in data in the electron (top) and muon (bottom) channels in bins of \(m_{{\mathrm {T}}}\). The uncertainties given are the combined statistical and systematic uncertainties. The systematic uncertainty includes all systematic uncertainties except the one from the integrated luminosity (3.2%)

The expected number of background events is calculated as the sum of the data-driven and simulated background estimates described in Sect. 3. Figure 1 displays the \(m_{{\mathrm {T}}}\) distribution in the electron and muon channels. The expected and observed number of events for some wider \(m_{{\mathrm {T}}}\) ranges are shown also in Table 1. For all values of \(m_{{\mathrm {T}}}\), the background is dominated by \(W \rightarrow \ell \nu \) production, which constitutes about 85% of the total background at \(m_{{\mathrm {T}}}\) \(>1\) TeV. As examples, Fig. 1 also shows the expected signal distributions for three assumed \(W'_{\mathrm {SSM}}\) boson masses on top of the SM prediction. The effect of the momentum resolution is clearly visible when comparing the shapes of the three reconstructed \(W'_{\mathrm {SSM}}\) signals in the electron and muon channels. The middle panels of Fig. 1 show the ratio of the data to the SM predictions. The data are systematically above the predicted background at low \(m_{{\mathrm {T}}}\), but still within the total systematic uncertainty, which is dominated by the \(E_{\text {T}}^{\text {miss}}\) -related systematic uncertainties in this region. The bottom panels of Fig. 1 show the ratio of the data to the adjusted background that results from a common fit to the electron and muon channels within the statistical analysis described in Sect. 7. This ratio agrees well with unity.

6 Systematic uncertainties

Table 2 Systematic uncertainties in the expected number of events as estimated for the total background and for signal with a \(W'_{\mathrm {SSM}}\) mass of 2 (4) TeV. The uncertainty is estimated with the binning shown in Fig. 1 at \(m_{{\mathrm {T}}}\) \(=\) 2 (4) TeV for the background and in a three-bin window around \(m_{{\mathrm {T}}}\) \(=\) 2 (4) TeV for the signal. Uncertainties that are not applicable are denoted “n/a”, and “negl.” means that the uncertainty is not included in the statistical analysis. Sources of uncertainties not included in the table are neglected in the statistical analysis.

The systematic uncertainties arise from experimental and theoretical sources. They are summarised in Table 2 and described in the following subsections.

6.1 Uncertainties from the reconstruction of electrons, muons, and \(E_{\text {T}}^{\text {miss}}\)

Experimental systematic uncertainties arise from the trigger, reconstruction, identification and isolation efficiencies for leptons [26, 27], and the calculation of the missing transverse momentum [29]. They include also the effects of the energy and momentum scale and resolution uncertainties [27, 28, 32].

The electron and muon offline reconstruction, identification and isolation efficiencies, and their respective uncertainties, are assessed up to \(p_{\text {T}} \approx 100\) GeV using leptonic decays of Z boson candidates found in data. The ratio of the efficiency measured in data to that of the MC simulation is then used to correct the MC prediction [26, 27]. For higher-\(p_{\text {T}}\) electrons, an additional uncertainty of 1.5% is estimated for the tight identification working point. This uncertainty is based on the differences observed in the electron shower shapes in the EM calorimeters between data and MC simulation around the \(Z \rightarrow ee\) mass peak, which are propagated to the high-\(E_{\text {T}}\) electron sample. For the isolation efficiencies, an uncertainty of 2 and 5% is estimated for \(150< p_{\text {T}} < 500\) GeV and above 500 GeV, respectively, using \(Z/\gamma ^*\) candidates in data. For the identification of high-\(p_{\text {T}}\) muons, the uncertainty is determined conservatively from simulation studies and amounts to 2–3% per TeV. For the isolation criterion, the uncertainty associated with the extrapolation to high-\(p_{\text {T}}\) muons is estimated to be 1%. Systematic uncertainties related to the electron trigger are negligible. For the muon trigger the systematic uncertainty is estimated using the same methodology as in Ref. [33], which results in an overall uncertainty of about 2%.

The main systematic uncertainties in \(E_{\text {T}}^{\text {miss}}\) arise from the jet energy resolution uncertainties [32] and the contribution from tracks originating from the primary vertex and arising from activity not associated with electrons, muons or jets [29]. The uncertainties due to the jet energy and \(E_{\text {T}}^{\text {miss}}\) resolutions are small at large \(m_{{\mathrm {T}}}\), while they are the dominant contributions to the total uncertainty at small \(m_{{\mathrm {T}}}\). The jet energy scale uncertainties are found to be negligible.

6.2 Theoretical uncertainties

Theoretical uncertainties are related to the production cross-sections estimated from MC simulation. The effects when propagated to the total background estimate are significant for W and \(Z/\gamma ^*\) production, but negligible for top-quark and diboson production. No theoretical uncertainties are considered for the \(W^\prime \) boson signal in the statistical analysis.

Theoretical uncertainties in the W and \(Z/\gamma ^*\) background prediction arise from the PDF uncertainties, the value of the strong coupling constant \(\alpha _{\mathrm {s}}\), and higher-order corrections. The dominant effect comes from the PDF uncertainty, which is obtained from the 90% CL CT14NNLO PDF uncertainty set using VRAP to calculate the NNLO cross-section as a function of the boson mass. Rather than using the original 28 CT14 uncertainty eigenvectors, a re-diagonalised set of seven PDF eigenvectors, as provided by the authors of the CT14 PDF using MP4LHC [34, 35], is used. The cross-section variation associated with each of these eigenvectors has a characteristic mass dependence and the sum in quadrature of these eigenvector variations matches the original CT14NNLO uncertainty envelope well. This sum is shown as “PDF variation” in Table 2. An additional uncertainty is derived to account for the choice of the nominal PDF set used. The central values of the CT14NNLO PDF set are compared to the MMHT2014 [36] and NNPDF3.0 [37] PDF sets. A comparison between these PDF sets shows that the central value for NNPDF3.0 falls outside the “PDF variation” uncertainty at large \(m_{{\mathrm {T}}}\). Thus, an envelope of the “PDF variation” and the NNPDF3.0 central value is formed, where the former is subtracted in quadrature from this envelope, and the remaining part, which is non-zero only when the NNPDF3.0 central value is outside the “PDF variation” uncertainty, is quoted as “PDF choice”. The PDF uncertainties are the same at the generator level for the electron and muon channels, but result in different uncertainties at reconstruction level. The uncertainty is larger in the electron channel due to the better energy resolution: there is less migration of events with low generator-level invariant mass, where the PDF uncertainty is smaller, into the high-\(m_{{\mathrm {T}}}\) region in this channel.

Uncertainties in the electroweak corrections are determined as the difference between the additive approach (\(1+\delta _{\mathrm {EW}} + \delta _{\mathrm {QCD}}\)) and a factorised approach (\((1+\delta _{\mathrm {EW}}) \times (1+\delta _{\mathrm {QCD}})\)) for the EW corrections in the combination of higher-order EW (\(\delta _{\mathrm {EW}}\)) and QCD (\(\delta _{\mathrm {QCD}}\)) effects. Uncertainties due to higher-order QCD corrections on the \(Z/\gamma ^*\) process are estimated by varying the renormalisation and factorisation scales simultaneously up or down by a factor of two. The uncertainty due to \(\alpha _{\mathrm {s}}\) is assessed by changing the value of \(\alpha _{\mathrm {s}}\) by as much as 0.003 from the nominal value \(\alpha _{\mathrm {s}}(m_Z) = 0.118\) used by the CT14NNLO PDF set. The uncertainties from the scales and \(\alpha _{\mathrm {s}}\) are both found to be negligible.

Theoretical uncertainties are also considered for the top-quark and diboson backgrounds. An uncertainty in the \(t\bar{t}\) cross-section of \(^{+20}_{-29}\) pb arises from the independent variation of the factorisation and renormalisation scales, while an uncertainty of \(\pm 35\) pb is associated with variations in the PDF and \(\alpha _{\mathrm {s}}\), following the PDF4LHC prescription (see Ref. [38] and references therein) with the MSTW2008 68% CL NNLO [39], CT10 NNLO [40] and NNPDF2.3 NNLO [22] PDF sets. As this background constitutes only a small fraction of the overall background, these normalisation uncertainties are negligible. Furthermore, the modelling of the top-quark background is found to be adequate in a data control region defined by requiring the presence of an additional muon (electron) in events passing the electron (muon) selection. For the diboson background, the theoretical normalisation uncertainty is conservatively taken to be 30%, and this has a negligible effect due to the small contribution of this background.

6.3 Background modelling uncertainties

The dominant systematic uncertainties in the multijet, top-quark and diboson backgrounds at high \(m_{{\mathrm {T}}}\) are due to the extrapolations. These uncertainties are evaluated by varying both the functional form of the fit functions and the fit range as detailed in Sect. 3. The envelope of all variations is assigned for the uncertainty. This results in the largest source of background-related systematic uncertainty at large \(m_{{\mathrm {T}}}\) values in this analysis.

The multijet background uncertainty in the electron (muon) channel includes a 15% (100%) normalisation uncertainty. This uncertainty is dominated by the dependence of the factor f (see Sect. 3) on the selection requirements used for the background-enriched sample definition.

For the \(m_{{\mathrm {T}}}\) region below 700–800 GeV, for which there are not many more MC events than data events, the MC statistical uncertainty is accounted for in the analysis.

The modelling of the pile-up especially affects the calculation of \(E_{\text {T}}^{\text {miss}}\) . A pile-up modelling uncertainty is estimated by varying the distribution of pile-up events in the reweighting of the MC, as detailed in Sect. 3, to cover the uncertainty on the ratio between the predicted and measured inelastic cross-sections [41].

6.4 Luminosity

The uncertainty in the combined 2015 and 2016 integrated luminosity is 3.2%. Following a methodology similar to that detailed in Ref. [42], it is derived from a preliminary calibration of the luminosity scale using xy beam-separation scans performed in August 2015 and May 2016.

7 Results

For the statistical analysis of the results presented in this section, the same methodology is applied as in the previous ATLAS \(W^\prime \) search [6] and is described briefly here. The compatibility between the data and the predicted background is evaluated with a profile-likelihood ratio test quantifying the probability that the background fluctuates to give a signal-like excess equal to or larger than what is observed. The likelihood functions in the ratio are products of Poisson probabilities over all bins in the transverse mass distribution (as shown in Fig. 1) and log-normal constraints for the variations in signal and background yields associated with systematic uncertainties. In the denominator of the likelihood ratio, the likelihood function is maximised assuming the presence of a signal above the expected background, and in the numerator assuming the background-only hypothesis. To model the signal, \(W'_{\mathrm {SSM}}\) templates binned in \(m_{{\mathrm {T}}}\) are used for a series of \(W'_{\mathrm {SSM}}\) masses in the search range \(150~\text {GeV}\ \le \) \(m_{W'}\) \(\le 6000~\text {GeV}\). Figure 1 displays a few examples of these templates. No significant excesses are observed in the data. The most significant excess is at \(m_{W^\prime } = 350\) GeV in the electron channel, with a local significance of \(2.0 \sigma \). In the muon channel, the most significant excess is at high mass, with a maximum local significance of \(1.8 \sigma \) at \(m_{W^\prime } \approx 5\) TeV. These excesses correspond to a global significance of \(0.1 \sigma \) in each channel when the look-elsewhere effect [43] is taken into account.

Based on the above findings, upper limits on the cross-section for producing a \(W'_{\mathrm {SSM}}\) boson times its branching ratio to only one lepton generation (\(\sigma \times {\textit{BR}}\)) are computed at the 95% CL as a function of the \(W'_{\mathrm {SSM}}\) boson mass. The limits are calculated in a Bayesian analysis [44] with a uniform positive prior probability distribution for \(\sigma \times {\textit{BR}}\). The observed upper limits are extracted by comparing data to the expected background and signal using \(W'_{\mathrm {SSM}}\) templates for the same range of signal masses as for the profile-likelihood ratio test. The expected limits are derived from pseudo-experiments obtained from the estimated background distributions. The median of the distribution of the limits from the pseudo-experiments is taken as the expected limit, and \(1 \sigma \) and \(2 \sigma \) bands are defined as the ranges containing respectively 68 and 95% of the limits obtained with the pseudo-experiments.

Fig. 2
figure 2

Observed (solid black line) and expected (dashed black line) upper limits on cross-section times branching ratio (\(\sigma \times {\textit{BR}}\)) as a function of the SSM \(W^\prime \) boson mass in the a electron, b muon and c combined electron and muon channels. The \(1\sigma \) (green) and \(2\sigma \) (yellow) expected limit bands are also shown. The predicted \(\sigma \times {\textit{BR}}\) for SSM \(W^\prime \) production is shown as a red solid line. For illustration the uncertainties in \(\sigma \times {\textit{BR}}\) from the PDF, \(\alpha _{\mathrm {s}}\) and the renormalisation and factorisation scales are also shown as red-dashed lines

The 95% CL upper limits on \(\sigma \times {\textit{BR}}\) as a function of the \(W'_{\mathrm {SSM}}\) mass are shown in Fig. 2 separately for the electron and muon channels and for the combination of the two channels. The theoretical uncertainties and the uncertainties in \(E_{\text {T}}^{\text {miss}}\) , jet energy resolution and luminosity are treated as correlated between the channels. The expected upper limit on \(\sigma \times {\textit{BR}}\) is stronger in the electron channel. This results from the larger acceptance times efficiency and the better momentum resolution (see Sect. 5). Figure 2 also shows the predicted \(\sigma \times {\textit{BR}}\) for the \(W'_{\mathrm {SSM}}\) boson as a function of its mass as well as the uncertainties from the PDF, \(\alpha _{\mathrm {s}}\) and the factorisation and renormalisation scales derived using the same prescription as used for the W boson production. The observed (expected) lower mass limit for a \(W'_{\mathrm {SSM}}\) boson, as summarised in Table 3, is 5.1 (5.2) TeV for the combination of the electron and muon channels. This corresponds to an improvement of approximately 1 TeV in mass reach compared to the previous ATLAS analysis [6], which was based on a subset of the data used in this analysis.

Table 3 Expected and observed 95% CL lower limit on the \(W'_{\mathrm {SSM}}\) mass in the electron and muon channels and their combination

8 Conclusion

The results of a search for a new heavy gauge boson decaying to final states with a high-\(p_{\text {T}}\) electron or muon and large missing transverse momentum are reported. The analysis uses 36.1 \(\text{ fb }^{-1}\) of \(\sqrt{s} = 13\) TeV pp collision data recorded by the ATLAS detector at the Large Hadron Collider in 2015 and 2016. Examining the transverse mass spectrum, no significant excess above the expected Standard Model background is observed. Exclusion limits at 95% CL are placed on the mass of benchmark Sequential Standard Model \(W^\prime \) bosons. Masses for \(W'_{\mathrm {SSM}}\) bosons up to 5.1 TeV are excluded by the combination of the electron and muon channels. This exceeds the previous limit from ATLAS, derived from a similar analysis based on 3.2 \(\text{ fb }^{-1}\) of \(\sqrt{s} = 13\) TeV data, by 1 TeV.