1 Introduction

The Standard Model (SM) has proven to be a very successful theory. The discovery of the Higgs boson boson [1,2,3,4] by the ATLAS and CMS collaborations confirmed the predicted electroweak symmetry breaking, but also highlighted the hierarchy problem [5,6,7,8]. Supersymmetry (SUSY) [9,10,11,12,13,14] is a theoretical framework which assumes supersymmetric particles differing from their SM partners by a half unit of spin. By introducing a new fermionic (bosonic) supersymmetric partner for each boson (fermion) in the SM, SUSY provides a possible solution to the hierarchy problem. In SUSY models conserving R-parity [15], SUSY particles are produced in pairs. The lightest supersymmetric particle (LSP) has to be stable and is possibly weakly interacting, constituting a viable dark-matter candidate [16, 17].

The partner particles of the SM fermions (quarks and leptons) are the scalar squarks (\(\tilde{q}\)) and sleptons (\(\tilde{\ell }\)). In the boson sector, the supersymmetric partners of the gluons are the fermionic gluinos (\(\tilde{g}\)). The fermionic supersymmetric partners of the Higgs scalars (higgsinos) and of the electroweak gauge bosons (winos and bino) mix to form charged mass eigenstates (charginos) and neutral mass eigenstates (neutralinos). In the minimal supersymmetric extension of the Standard Model (MSSM) [18, 19], two scalar Higgs doublets along with their higgsino partners are necessary, resulting in two charginos (\(\tilde{\chi }^{\pm }_{1,2}\)) and four neutralinos (\(\tilde{\chi }^{0}_{1,2,3,4}\)).

Squarks and gluinos, in R-parity-conserving scenarios, are produced in pairs through the strong interaction. If strongly interacting gluinos or squarks are present at the scale, they should be produced copiously in the 13  pp collisions at the Large Hadron Collider (LHC). With the recorded integrated luminosity and the predicted cross-sections for squark and gluino production, the searches are expected to be sensitive to sparticle masses of a few .

This paper targets two simplified SUSY models [20, 21] describing gluino and first two generation squark (\(\tilde{u}_L, \tilde{d}_L, \tilde{c}_L, \tilde{s}_L\)) production processes and decays. These models, introduced in Ref. [22], serve as benchmarks. In the models, referred to as the gluino and squark one-step models, gluinos or squarks are produced in pairs: gluinos subsequently decay via a virtual squark into a \(\tilde{\chi }_{1}^{\pm }\) and two light quarks, while squarks decay into a \(\tilde{\chi }_{1}^{\pm }\) and one light quark (\(q \in \{u, d, s, c\}\)). The \(\tilde{\chi }_{1}^{\pm }\) then decay into a \(W^{\pm }\) boson and a \(\tilde{\chi }_{1}^{0}\). The corresponding diagrams are shown in Fig. 1. It is further assumed that \(\tilde{\chi }_{1}^{\pm }\) is wino-like and the \(\tilde{\chi }_{1}^{0}\) is bino-like. In both models, the branching fractions for SUSY particles are assumed to be 100% for the aforementioned processes squark/gluino decay into \(\tilde{\chi }_{1}^{\pm }\) and quarks, and \(\tilde{\chi }_{1}^{\pm } \rightarrow \tilde{\chi }_{1}^{0} W^{\pm }\). The SM particles are assumed to decay following their known branching fractions. All other sparticles, which do not explicitly appear in the decay chains, are set to be kinematically inaccessible and decoupled.

In this search, two different types of mass spectra are considered. In the first one, the \(\tilde{\chi }_{1}^{\pm }\) mass is set to be exactly midway between the masses of the gluino (squark) and the \(\tilde{\chi }_{1}^{0}\), so that the relative mass splitting \(x= (m(\tilde{\chi }_{1}^{\pm })-m(\tilde{\chi }_{1}^{0}))/(m(\tilde{g}/\tilde{q})-m(\tilde{\chi }_{1}^{0}))\) is equal to 1/2. In the second mass spectrum, the \(\tilde{\chi }_{1}^{0}\) mass is set to be 60 GeV while the gluino (squark) mass and the relative mass splitting are free parameters.

The experimental signature of interest consists of a single charged lepton (electron or muon) produced by the leptonic decay of one of the W bosons, at least two jets, and large missing transverse momentum (\(E_{\mathrm {T}}^{\mathrm {miss}}\), defined in Sect. 4) from the undetected neutrino and the two neutralinos. The sparticle masses determine the energy available in their decays, so the number of jets and their kinematic properties depend on the mass spectrum chosen. To provide sensitivity to a broad range of mass spectra in the gluino and squark one-step models, four signal regions with differing jet multiplicity requirements from \(\ge 2\) to \(\ge 6\) are defined. Furthermore, the signal regions are categorised by allowing or forbidding the presence of jets originating from b quarks (b-tag and b-veto signal regions, respectively) to be sensitive to a wider class of decay processes, e.g. gluino decays producing top quarks.

The results presented in this paper are based on the ATLAS data collected in proton–proton collisions at the LHC during 2015–2018 at a centre-of-mass energy of 13 , corresponding to an integrated luminosity of 139 \(\hbox {fb}^{-1}\). This analysis supersedes the previous ATLAS search with an integrated luminosity of 36.1 \(\hbox {fb}^{-1}\)[23]. Similar searches for gluinos and squarks with decays via intermediate supersymmetric particles were performed by the CMS Collaboration [24, 25].

Fig. 1
figure 1

Diagrams for a gluino and b squark pair production with subsequent decays into quarks and a . The decays and a W boson. This analysis targets final states in which one W decays leptonically and the other hadronically

2 ATLAS detector

The ATLAS detector [26,27,28] is a multipurpose particle detector with nearly 4\(\pi \) coverage in solid angle.Footnote 1 It consists of an inner tracking detector surrounded by a thin superconducting solenoid providing a 2  T axial magnetic field, electromagnetic and hadron calorimeters, and a muon spectrometer. The inner tracking detector covers the pseudorapidity range \(|\eta | < 2.5\). It consists of silicon pixel, silicon microstrip, and transition radiation tracking detectors. Lead/liquid-argon (LAr) sampling calorimeters provide electromagnetic (EM) energy measurements with high granularity. A steel/scintillator-tile hadron calorimeter covers the central pseudorapidity range (\(|\eta | < 1.7\)). The endcap and forward regions are instrumented with LAr calorimeters for EM and hadronic energy measurements up to \(|\eta | = 4.9\). The muon spectrometer surrounds the calorimeters and is based on three large air-core toroidal superconducting magnets with eight coils each. The field integral of the toroids ranges between 2.0 and 6.0 Tm across most of the detector. The muon spectrometer includes a system of precision tracking chambers and fast detectors for triggering. A two-level trigger system [29] is used to select events. The first-level trigger is implemented in hardware and uses a subset of the detector information to keep the accepted rate below 100 kHz. This is followed by a software-based trigger that reduces the accepted event rate to 1 kHz on average.

3 Dataset and simulated events

The search is performed using 139 \(\mathrm {fb}^{-1}\) of LHC pp collision data collected between 2015 and 2018 by the ATLAS detector, with a centre-of-mass energy of 13  and a 25 ns proton bunch crossing interval. The average number of interactions per bunch crossing (pile-up) evolved over the data-taking period from \(\langle \mu \rangle =13\) in 2015, to \(\langle \mu \rangle =25\) in 2016, \(\langle \mu \rangle =38\) in 2017, and \(\langle \mu \rangle =36\) in 2018. The uncertainty in the combined 2015–2018 integrated luminosity is 1.7% [30], obtained using the LUCID-2 detector [31] for the primary luminosity measurements.

The SM background modelling, signal selection efficiencies, and signal event yield are evaluated using Monte Carlo (MC) simulated event samples. All the samples are produced by a fast simulation [32] procedure that combines a parameterisation of the calorimeter response with a \(\textsc {Geant4} \) [33] simulation of the other detector systems implemented in the ATLAS simulation infrastructure [34].

To model the pile-up observed in data, inelastic pp events were generated with Pythia 8.186 [35] using the NNPDF2.3LO set of parton distribution functions (PDF) [36] and a set of tuned parameters called the A3 tune [37]. These events were overlaid on all simulated hard-scatter events to model the additional proton–proton interactions in the same and nearby bunch crossings. The pile-up overlay was reweighted to match the observed distribution in data. The simulated events are reconstructed with the same algorithms as used for data.

Signal gluino (squark) pair production samples were produced with MadGraph5_aMC@NLO v2.6.2 [38] at next-to-leading order for the hard-scattering matrix element and Pythia 8.212 (Pythia 8.230) for underlying event, parton shower and hadronization. Signal cross-sections are calculated to approximate next-to-next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at next-to-next-to-leading-logarithm accuracy (approximate NNLO+NNLL) [39,40,41,42,43,44,45,46]. The nominal cross-section and its uncertainty are derived using the PDF4LHC15_mc PDF set, following the recommendations of Ref. [47]. A typical cross-section for gluino production with \(m_{\tilde{g}}= 2000\) GeV, and  GeV is \(1.01 \pm 0.20\) fb, while for squarks with \(m_{\tilde{q}}= 1200\) GeV and  GeV a typical cross-section is \(6.8 \pm 0.9\) fb when the four partners of the left-handed first two generation quarks (\(\tilde{u}_\mathrm {L}\), \(\tilde{d}_\mathrm {L}\), \(\tilde{s}_\mathrm {L}\), and \(\tilde{c}_\mathrm {L}\)) are assumed to be mass-degenerate. A ‘single squark flavour’ limit is also given assuming that only one such left-handed first and second generation quarks is kinematically accessible.

All relevant SM backgrounds are considered: \(t\bar{t}\) pair production; single-top production (s-channel, t-channel, and associated Wt production); W/Z+jets production; \(t\bar{t}\) production with an electroweak boson (\(t\bar{t}+V\)); and diboson (WW, WZ, ZZ) production. Different MC event generators were used to produce the background samples, depending on their production process. The MC-produced events are then normalised to data using the corresponding theoretical cross-sections. The event generators, the routines for parton showering and hadronisation, and the parameter tunes and parton distribution functions for all background processes produced are summarised in Table 1.

The W+jets events were generated using Sherpa: the generation process includes up to two partons at NLO and four partons at LO using Comix [48] and OpenLoops [49, 50]. The matrix element was merged with the Sherpa parton shower [51] according to the ME+PS@NLO prescription [52,53,54,55] using the set of tuned parameters developed by the Sherpa authors. To simulate the properties of the bottom- and charm-hadron decays, the EvtGen v1.2.0 [56] program was used for all samples showered with Pythia.

Systematic uncertainties, for both signal and background samples, derived from the MC generator configuration are evaluated using samples produced without detector simulation. The uncertainties account for variations of the renormalisation and factorisation scales, the CKKW-L [57] matching scale, as well as different PDF sets and fragmentation/hadronisation models. Details of the MC modelling uncertainties are discussed in Sect. 7.

Table 1 Overview of MC generators used for different simulated event samples for background

4 Object reconstruction

Each event is required to have at least one reconstructed interaction vertex with a minimum of two associated tracks, each having \(p_{\mathrm {T}}>500\) . In events with multiple vertices, the one with the highest sum of squared transverse momenta of associated tracks is chosen as the primary vertex (PV) [72]. Baseline quality criteria are applied to reject events with non-collision backgrounds or detector noise [73].

Two levels of object definition for leptons and jets are used: ‘baseline’ and ‘signal’. Loose quality requirements define baseline objects, which are used in the calculation of missing transverse momentum and in the overlap removal procedure described below. Signal objects, obtained by applying more selective identification criteria to objects passing the baseline requirements, are used as input for the actual search region definitions. Isolation criteria applied to a set of track-based and calorimeter-based variables, are used to discriminate between signal leptons and semileptonic heavy-flavour decays, photon conversions as well as jets misidentified as leptons.

Energy deposits in the electromagnetic (EM) calorimeter that are matched to charged-particle tracks in the inner detector (ID) [74] provide electron candidates. The \(p_{\mathrm {T}}\) of electron is calculated based on the energy deposited in the EM calorimeter. Baseline electrons must have \(p_{\mathrm {T}}>7\) GeV and \(|\eta |<2.47\) and must satisfy the Loose working point provided by a likelihood-based algorithm, described in Ref. [74]. The longitudinal impact parameterFootnote 2\(z_0\) relative to the PV is required to satisfy \(|z_0\sin \theta | <0.5\) mm. The number of hits on the track is used to discriminate between electrons and converted photons. Signal electron candidates are required to satisfy the Tight likelihood operating point and the requirement \(|d_0/\sigma (d_0)|<5\). The Loose and HighPtCaloOnly isolation working points, described in Ref. [74], are applied to signal electrons having \(p_{\mathrm {T}}< 200\) GeV and \(p_{\mathrm {T}}> 200\) GeV, respectively.

Signal electrons with \(p_{\mathrm {T}}< 200\) GeV are refined using the Loose isolation working point, while those with larger \(p_{\mathrm {T}}\) are required to pass the HighPtCaloOnly isolation working point, as described in Ref. [74].

Muon candidates are reconstructed from matching tracks in the ID and muon spectrometer, refined through a global fit using the hits from both subdetectors [75]. Baseline muons are required to satisfy \(p_{\mathrm {T}}>6\) GeV and \(|\eta |<2.7\). They are identified using the Medium identification criteria [75]. As with the electrons, baseline muons are required to satisfy \(|z_0\sin \theta | <0.5\) mm. Signal muon candidates must also satisfy tighter pseudorapidity and transverse impact parameter requirements, \(|\eta |<2.5\) and \(|d_0/\sigma (d_0)|<3\), and the FixedCutLoose isolation working point requirements.

Jet candidates are reconstructed from three-dimensional topological energy clusters in the calorimeters using the anti-\(k_t\) algorithm [76] with a radius parameter \(R=0.4\) [77]. Baseline jets must have \(|\eta |<4.5\) and \(p_{\mathrm {T}}>20\) GeV. To suppress pile-up interactions, those jets having \(|\eta |<2.8\) and \(p_{\mathrm {T}}<120\) GeV are required to pass the Medium working point of the jet vertex tagger (JVT), a multivariate algorithm that identifies jets originating from the PV using track information [78, 79]. Signal jets must also have \(|\eta |<2.8\) and \(p_{\mathrm {T}}>30\) GeV.

Jets with \(p_{\mathrm {T}}>20\) GeV in the region \(|\eta |<2.5\) that contain b-hadrons can be ‘b-tagged’ with high efficiency by the MV2c10 [80], which is a boosted decision tree with improved light-flavour jet and c-jet rejection. The b-tagging working point provides an efficiency of 77% for jets containing b-hadrons in simulated \(t\bar{t}\) events, with rejection factors of 110 and 4.9 for light-flavour jets and jets containing c-hadrons, respectively [81]. Signal b-jets should also have \(p_{\mathrm {T}}>30\) GeV.

An overlap removal procedure is applied to the baseline objects defined above to resolve reconstruction ambiguities between electrons, muons and jets. First, any electron sharing the same ID track with a muon is rejected. If two electrons share the same ID track, the one with lower \(p_{\mathrm {T}}\) is discarded. Next, jets are rejected if they lie within \(\Delta R = 0.2\) of an electron and then electrons are removed if they are within a cone of \(p_{\mathrm {T}}\)-dependent size \(\Delta R = \min (0.4,0.04+10GeV/p_{\mathrm {T}})\) around a jet. Subsequently, jets are rejected if they are within \(\Delta R = 0.2\) of a muon or if the muon is matched to the jet through ghost association [82]. Finally, muons within a cone, defined in the same way as for electrons, around any remaining jet are removed.

The missing transverse momentum, \({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\), with magnitude, \(E_{\mathrm {T}}^{\mathrm {miss}}\), is calculated as the negative vectorial sum of the transverse momenta of all reconstructed baseline objects (electrons, muons, jets and photons [83]) and a soft term. The soft term includes all selected tracks associated with the PV but not matched to any reconstructed baseline object. To suppress contributions from pile-up and improve the \(E_{\mathrm {T}}^{\mathrm {miss}}\) resolution, tracks not associated with the PV are excluded from the \(E_{\mathrm {T}}^{\mathrm {miss}}\) calculation [84, 85].

The efficiency differences in the trigger, lepton identification and reconstruction between data and simulated events are closely evaluated in independent measurements, and are accounted for by applying the corresponding corrections to the simulation in this analysis.

5 Event selection

To retain acceptance for soft leptons, events satisfying the \(E_{\mathrm {T}}^{\mathrm {miss}}\) trigger selection were recorded [86] and used in the search. The trigger efficiency is higher than 98% for offline \(E_{\mathrm {T}}^{\mathrm {miss}}\) values above 250 GeV. To target the signal-like events, selected events are required to have exactly one signal lepton, either an electron or a muon. Events with additional baseline leptons are rejected to suppress dilepton \(t\bar{t}\), single-top (Wt-channel), Z+jets and diboson backgrounds. The following observables are used to further reduce background contributions and increase the sensitivity for signal:

  • The transverse mass, \(m_{\mathrm {T}}\), is defined from the lepton transverse momentum \({\varvec{p}}_\mathrm {T}^{\ell }\) and \({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\) as

    $$\begin{aligned} m_{\mathrm {T}} = \sqrt{2 p_{\mathrm {T}}^{\ell } E_{\mathrm {T}}^{\mathrm {miss}}\left( 1-\cos [\Delta \phi ({\varvec{p}}_{\mathrm {T}}^{\ell },{\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}})]\right) }, \end{aligned}$$

    where \(\Delta \phi ({\varvec{p}}_{\mathrm {T}}^{\ell },{\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}})\) is the azimuthal angle between \({\varvec{p}}_{\mathrm {T}}^{\ell }\) and \({\varvec{p}}_{\mathrm {T}}^{\mathrm {miss}}\). It has an upper endpoint at the W boson mass for W+jets events and for semileptonic \(t\bar{t}\) events. The \(m_{\mathrm {T}}\) distribution for signal events extends significantly beyond that endpoint.

  • The effective mass, \(m_{\mathrm {eff}}\), is the scalar sum of the \(p_{\mathrm {T}}\) of the signal lepton and all signal jets and \(E_{\mathrm {T}}^{\mathrm {miss}}\):

    $$\begin{aligned} m_{\mathrm {eff}}= p_{\mathrm {T}}^\ell + \sum _{j=1}^{N_\mathrm {jet}}p_{\mathrm {T},j} + E_{\mathrm {T}}^{\mathrm {miss}}. \end{aligned}$$

    The effective mass provides good discrimination against background events, especially for the signal scenarios with energetic jets. It can also help to distinguish between different signal channels. For example gluino production shows higher jet multiplicity than squark production. High-mass gluinos and squarks are expected to produce harder jets than low-mass ones. Thus, the optimal \(m_{\mathrm {eff}}\) value depends on the different signal scenarios. To achieve a wide-range sensitivity to various SUSY models with a limited number of signal regions, multiple intervals in \(m_{\mathrm {eff}}\) are used in the final model-dependent signal regions.

  • The aplanarity is a variable designed to provide more global information about the full momentum tensor of the event. It is constructed from the lepton and the jets, and is defined as \((3/2) \times \lambda _{3}\), where \(\lambda _{3}\) is the smallest eigenvalue of the sphericity tensor [87]. Typical measured aplanarity values lie in the range 0–0.3, with values near zero indicating highly planar background-like events. Strongly produced SUSY signals tend to have high aplanarity values, since they are more spherical than background events due to the multiple objects emitted in the gluino/squark decay chains.

Four mutually exclusive signal regions (SRs) are designed to enhance the signal sensitivity. The selection criteria for the four SRs are summarised in Table 2. Each SR is optimised for specific SUSY scenarios, as discussed below. They are labelled by the minimum required number of jets and, where relevant, the characteristics of the targeted supersymmetric mass spectrum: 2J, 4J high-x, 4J low-x, and 6J. When setting model-dependent exclusion limits (‘excl’), each SR is divided in \(m_{\mathrm {eff}}\) intervals and in b-veto/b-tag categories, and a simultaneous fit is performed across all bins of the four SRs. This choice enhances the sensitivity to a range of new-physics scenarios with or without b-quarks in the final states, and with different mass splittings. For model-independent limits and null-hypothesis tests (‘disc’ for discovery), the event yield in each SR is used to search for an excess over the SM background using an optimised minimum \(m_{\mathrm {eff}}\) value. The discovery SRs require the b-veto and are separately optimised for gluino and squark cases. The systematic uncertainties, fits, and results discussed in the following sections for the simplfied models are based on the exclusion SRs, while the model-independent results are based on the discovery SRs.

Table 2 Overview of the selection criteria for the signal regions used for gluino/squark one-step models. The requirements that only apply to the exclusion (discovery) SRs are marked with ‘excl’ (‘disc’). The \(m_{\mathrm {eff}}\) bins are of even width and the ‘+’ indicates that overflow events are included in the last bin
Table 3 Overview of the control region selection criteria. The top and W+jets control regions are defined by the b-tag and b-veto requirements, respectively. The \(m_{\mathrm {eff}}\) bins are of even width and the ‘+’ indicates that overflow events are included in the last bin
Fig. 2
figure 2

The \(m_{\mathrm {T}}\) distributions in the signal regions after all of the selection requirements other than the \(m_{\mathrm {T}}\) cut (noted as ‘N-1’ in the figures). Due to the removal of the \(m_\mathrm {T}\) requirement, these plots effectively show the CRs, VRs and SRs for each jet multiplicity. The uncertainty bands include all statistical and systematic uncertainties. Overflow events are included in the last bin. The dashed lines represent benchmark signal points for gluino and squark pair production

The 2J SR targets compressed scenarios where differences between \(m_{\tilde{g}/\tilde{q}}\), , and are small and the decay products tend to have low \(p_{\mathrm {T}}\). Thus, events are required to have one low-\(p_{\mathrm {T}}\) lepton and at least two jets. The lower \(p_{\mathrm {T}}^\ell \) bound is \(7\ (6)~GeV\) for the electron (muon), and the upper \(p_{\mathrm {T}}^\ell \) bound increases with the jet multiplicity up to \(25~GeV\). The upper \(p_{\mathrm {T}}^\ell \) requirement ensures the orthogonality between the 2J SR and other signal regions. The jet multiplicity dependence maintains the balance between background rejection and signal acceptance: the leptons are more energetic for signals with increasing mass splittings. Stringent requirements are placed on \(E_{\mathrm {T}}^{\mathrm {miss}}\) and on \(E_{\mathrm {T}}^{\mathrm {miss}}/m_{\mathrm {eff}}\) to enhance the signal sensitivity by selecting signal events with boosted final-state neutralinos recoiling against energetic initial-state radiation (ISR) jets. Compared to other SRs, a less stringent lower bound on \(m_{\mathrm {eff}}\) is preferred.

The 4J high-x SR provides sensitivity to the models with a fixed of 60 GeV and a high x value, i.e. and \(m_{\tilde{g}/\tilde{q}}\) are relatively close. Events with four or five jets are selected for this scenario. The mass-splitting between and is large enough to produce a boosted W boson that decays into a high-\(p_{\mathrm {T}}\) lepton and a neutrino. Large \(m_{\mathrm {T}}\) is thus the most distinguishing characteristic of this SR. Relatively soft jets are expected to be emitted from the gluino or squark decays. The SM background is further suppressed by tight requirements on \(E_{\mathrm {T}}^{\mathrm {miss}}\), aplanarity, and \(E_{\mathrm {T}}^{\mathrm {miss}}/m_{\mathrm {eff}}\). Compared to the 2J SR, a tighter \(m_{\mathrm {eff}}\) selection is applied due to higher jet activity.

The 4J low-x SR is optimised for the models where is fixed to 60 GeV and \(x \approx 0\), i.e. is close to . The jet multiplicity requirement is the same as in the 4J high-x SR. In contrast to the high-x scenarios, the small mass-splitting between and tends to produce an off-shell W boson, leading to small \(m_{\mathrm {T}}\). To keep this SR orthogonal to the 4J high-x SR, an upper bound is applied to \(m_{\mathrm {T}}\). Other than that, the requirements on \(m_{\mathrm {eff}}\), \(E_{\mathrm {T}}^{\mathrm {miss}}\), aplanarity, and \(E_{\mathrm {T}}^{\mathrm {miss}}/m_{\mathrm {eff}}\) are identical to the ones used in the 4J high-x SR.

The 6J SR targets signal scenarios with high gluino/squark mass, and is optimised for models with \(x \approx 1/2\). Events with one high-\(p_{\mathrm {T}}\) lepton and at least six jets are selected. Large aplanarity is required, reflecting the heavy gluino/squark produced in the targeted signature. Tight requirements on \(m_{\mathrm {T}}\) and \(E_{\mathrm {T}}^{\mathrm {miss}}\) are imposed to reduce the SM background. To achieve high sensitivity for a wide range of \(m_{\tilde{g}/\tilde{q}}\), four exclusive bins are defined in \(m_{\mathrm {eff}}\) and used in the fit. The lowest mass bin starts from 700 GeV, and the highest from 2800 GeV.

6 Background estimation

Robust prediction of the SM background event yields in SRs is important in any search like the one presented in this paper. Different approaches for calculating the SM event yields in the SRs are used depending on the background process of interest. The yields of \(t\bar{t}\), single-top, and W+jets processes are estimated from data using a set of dedicated control regions (CRs), while contributions from Z+jets, \(t\bar{t}\) produced in association with a W or Z boson, and dibosons (WW, WZ, ZZ) are evaluated from MC simulation. The details are described below.

Three sets of CRs, 2J, 4J, 6J, are defined for estimating the backgrounds in 2J, 4J high-x, 4J low-x and 6J signal regions. The CRs satisfy the criteria of high purity for the targeted background process and low signal contamination from the model of interest. The purity varies from 57 to 88% for the top backgrounds (\(t\bar{t}\) and single top) in top CRs and from 73 to 92% for W+jets in W+jets CRs. Each of the CRs is defined with kinematic boundaries close to the corresponding SR in order to reduce the theoretical and detector uncertainties from the extrapolation. The contributions of the top and W+jets backgrounds in the SRs are evaluated with a fit based on the profile likelihood method. The normalised background predictions are obtained from a simultaneous fit across all control regions, as described in Sect. 8. The control regions for top and W+jets are presented in Table 3. Events in the top control region require at least one b-tagged signal jet in the event, while W+jets control regions are defined by vetoing all events containing any b-tagged signal jets. The CRs are crafted in the same way as signal regions, thus each CR is defined as a function of \(m_{\mathrm {eff}}\), with the same binning as the corresponding SR. This permits extrapolation from each b-tag/b-veto and \(m_{\mathrm {eff}}\) bin in CRs to the corresponding bin in the SRs. The extrapolation from CRs to SRs is performed via the \(m_{\mathrm {T}}\) variable, which is found to be well modelled in simulation as shown in Fig. 2.

Table 4 Overview of the validation region selection criteria. The top and W+jets validation regions are defined by the b-tag and b-veto requirements, respectively. The \(m_{\mathrm {eff}}\) bins are of even width and the ‘+’ indicates that overflow events are included in the last bin

In order to validate the background fit results, cross-checks of the background estimates are performed in validation regions (VRs) situated between the SRs and the CRs in \(m_{\mathrm {T}}\), while remaining orthogonal to both regions. The VRs are also defined as functions of \(m_{\mathrm {eff}}\) in the same way as the corresponding CRs and SRs, to ensure an \(m_{\mathrm {eff}}\)-dependent validation. The highest \(m_{\mathrm {eff}}\) bin in the 6J VR is not used because its signal contamination would be too large. Similar to the control regions, events in the top VRs require a b-tag, while events in the W+jets VRs require a b-veto. The VRs are not used to constrain the fit; they serve only to verify that the normalised background predictions agree with the observed data. The VR definitions and their graphical representation are shown in Table 4 and Fig. 3.

The background contributions from Z+jets, \(t\bar{t}+V\) and diboson events are evaluated from simulation. The simulated event samples are normalised to the relevant theoretical cross-sections. No dedicated control regions for the diboson background are used, as the modelling of this background by simulation is found to be sufficiently good when compared with the data in the validation regions. The data and MC predictions yield, obtained from the overall background estimate, differ in all validation regions by less than two standard deviations. The background originating from misidentified leptons, real leptons coming from jets produced by heavy-flavour quarks or photons converted to electrons is evaluated using both MC and data-driven methods, and it is found to be negligible within the statistical error of the data due to the stringent requirements on \(E_{\mathrm {T}}^{\mathrm {miss}}\).

As a representative example, the \(m_{\mathrm {eff}}\) distributions in the 6J top and W+jets control regions are shown in Fig. 4 before and after a fit which constrains only the control regions. The fit strategy is discussed in Sect. 8. A trend is observed, as it was in previous similar searches [23], whereby the MC overestimates the expected yields at large values of \(m_{\mathrm {eff}}\). This is accounted for by applying different normalisation parameter values for each \(m_{\mathrm {eff}}\) bin in the corresponding fit, which effectively corrects the mismodelling. In the post-fit distributions, the data and the background expectation agree well within the uncertainties.

Fig. 3
figure 3

Graphical illustration of the control and validation region configuration corresponding to the 2J (top left), 4J (top right), and 6J (bottom) regions. The variables shown on the horizontal and vertical axes indicate the selections that differ between the corresponding control regions, validation regions and signal regions. The dotted lines show the boundaries of the \(m_{\mathrm {eff}}\) binning of the exclusion SRs

Fig. 4
figure 4

The \(m_{\mathrm {eff}}\) distribution in 6J top (left, labelled as ‘TR’) and W+jets (right, labelled as ‘WR’) control regions before (top) and after (bottom) the fit. The uncertainty bands include all statistical and systematic uncertainties. Overflow events are included in the last bin

7 Systematic uncertainties

The expected yields for both the signal and background events are subject to theoretical and experimental systematic uncertainties. The theoretical uncertainties for the backgrounds normalised to data influence only the transfer factors from CR(s) to VR(s) and from CR(s) to SR(s), while for the other backgrounds, the uncertainties affect the inclusive cross-section of each process and the acceptance of the analysis selection.

Following the procedure described in Sect. 6, both the experimental and the theoretical uncertainties are computed separately for each \(m_{\mathrm {eff}}\) bin. Both theoretical and experimental uncertainties are correlated across all \(m_{\mathrm {eff}}\) bins. For the single-top and \(t\bar{t}\) backgrounds, the theoretical uncertainties due to parton shower and hadronisation/fragmentation are estimated by comparing the predictions obtained with the Powheg-Box generator interfaced with two different parton shower generators, Pythia 8 and Herwig 7 [88], while those due to the hard-scattering are evaluated by a Powheg-BoxPythia 8 to MG5_aMC@NLO + Pythia 8 comparison. Diagram removal (DR) and diagram subtraction (DS) samples, modelled by Powheg-BoxPythia 8, are used to determine the impact of the interference between single-top Wt and \(t\bar{t}\) production [89]. In order to evaluate the impact of the uncertainties coming from the emission of initial- and final-state radiation, the renormalisation, and factorisation scales and showering are varied.

Uncertainties for \(t\bar{t}+V\), W/Z+jets and dibosons coming from scale variations are evaluated by considering the envelope of the seven-point variations of the renormalisation and factorisation scales. The resummation and the CKKW matching variations for W/Z+jets are estimated by varying the corresponding scale parameters up and down by a factor of two relative to the nominal value for each region. The PDF uncertainties for W/Z+jets are considered following the recommendation in PDF4LHC15 [47], while those for \(t\bar{t}\) were found to be negligible in all the regions. Systematic uncertainties of 5% and 6% are assigned to the inclusive cross-sections of the \(t\bar{t}+V\) and diboson processes [90], respectively. For the other background processes such as Z+jets, a systematic uncertainty in the inclusive cross-section is included at the 5% level.

Table 5 Breakdown of the dominant systematic uncertainties in the background estimates in the various signal regions. The numbers are a yield weighted average over the bins. The individual uncertainties can be correlated and do not necessarily add in quadrature to the total background uncertainty. The percentages show the size of the uncertainty relative to the total expected background
Table 6 Breakdown of the dominant systematic uncertainties in the background estimates in the various signal regions. The numbers are a yield weighted average over the bins. The individual uncertainties can be correlated and do not necessarily add in quadrature to the total background uncertainty. The percentages show the size of the uncertainty relative to the total expected background

The theoretical uncertainties in the expected yields for the two signal models are considered and estimated using MG5_aMC@NLO + Pythia 8 samples by varying the parameters corresponding to the factorisation, renormalisation and CKKW-L matching scales.

Detector-related systematic uncertainties include uncertainties from jet energy scale (JES), jet energy resolution (JER), lepton reconstruction and identification, b-tagging, \(E_{\mathrm {T}}^{\mathrm {miss}}\) modelling, pile-up, and the trigger efficiency. The dominant experimental systematic uncertainties stem from the JES and JER uncertainties. They are derived as a function of \(p_{\mathrm {T}}\) and \(\eta \) of the jet, the pile-up conditions and the jet flavour composition [91]. The uncertainties due to the lepton identification, momentum/energy scale and resolution are estimated from samples of \(Z\rightarrow \ell ^+\ell ^-\), \(J/\psi \rightarrow \ell ^+\ell ^-\) and \(W\rightarrow \ell \nu \) decays [74, 75]. The \(E_{\mathrm {T}}^{\mathrm {miss}}\) modelling systematic uncertainties are evaluated by accounting for the uncertainties in the energy and momentum scale of each object used in the calculation, as well as the uncertainties in the soft term’s resolution and scale. The uncertainty due to pile-up modelling is computed by varying the reweighting factor by \(\pm 4\%\). Uncertainties related to the b-tagging efficiency are derived from data-driven measurements in \(t\bar{t}\) events [80, 92], while uncertainties associated with the probability of mistakenly b-tagging a jet which does not contain a b-hadron are determined using dijet samples [93]. The uncertainties in the dominant background normalisation are obtained when performing the fit including the background control regions.

Tables 5 and 6 detail the size of the different systematic uncertainties in the signal regions, summed over all \(m_{\mathrm {eff}}\) bins. The uncertainty in the hard-scattering for \(t\bar{t}\) dominates in many regions. The determination with Monte Carlo samples of the \(t\bar{t}\) uncertainties reported in the tables includes also the statistical component, arising from limited MC statistics, in the uncertainties for all regions. Jet-related uncertainties dominate the detector-related systematic uncertainties.

8 Results

The statistical interpretation of the data is based on a profile likelihood method using the HistFitter framework [94, 95]. The likelihood function includes a set of Poisson functions representing the yields in each of the control and signal regions. These Poisson functions depend on the observed number of data events in the respective region and the expected numbers of signal and background events. Different parameters are included in the likelihood function to control the normalisation of the backgrounds and the signal or to reflect statistical and systematic uncertainties. The normalisations of the \(t\bar{t}\), single-top and W+jets backgrounds are controlled by the respective normalisation factors assigned individually for each \(m_{\mathrm {eff}}\) bin in the SRs. This configuration corrects for the mismodelling of the \(m_{\mathrm {eff}}\) distribution in the Monte Carlo simulation, as discussed in Sect. 6. An exception is made for the 4J case in the \(m_{\mathrm {eff}}> 1600\) GeV region, where for each of the \(t\bar{t}\) and single-top backgrounds the same normalisation factor is used across the two \(m_{\mathrm {eff}}\) bins due to the low statistics in the highest \(m_{\mathrm {eff}}\) bin. The yields in the corresponding control regions are sufficient to allow for two different normalisation factors for W+jets, one in the range \(m_{\mathrm {eff}}\in [1600, 2200]\) GeV and one for \(m_{\mathrm {eff}}> 2200\) GeV. In total, the fit includes nine normalisation factors for \(t\bar{t}\) and single top and ten normalisation factors for W+jets. The normalisation of the signal is controlled by one common normalisation factor applied to all bins included in the fit. Systematic uncertainties are accommodated through the use of nuisance parameters which are constrained by a Gaussian auxiliary term added to the likelihood.

In a background-only fit, only the control regions are used to constrain the likelihood. A signal contribution is neglected in the fit, so the signal normalisation parameter is dropped. The observed yields in the VRs are found to be compatible with the background expectation obtained from this fit, with the largest deviation of data from MC over the 18 bins having a statistical significance of about \(2 \sigma \). Background predictions in the signal regions are compared with the observed data in Tables 7, 8, 9, 10 and illustrated in Figures 5, 6, 7. No significant excess of events is observed.

Table 7 Observed event yields and the background expectation obtained by a background fit in the 2J SRs with an integrated luminosity of 139 \(\hbox {fb}^{-1}\). Each column corresponds to a bin in \(m_{\mathrm {eff}}\). Uncertainties reported for the fitted background estimates combine statistical (in the simulated event yields) and systematic uncertainties. The uncertainties in this table are symmetrised for error propagation purposes
Table 8 Observed event yields and the background expectation obtained by a background fit in the 4J high-x SRs with an integrated luminosity of 139 \(\hbox {fb}^{-1}\). Each column corresponds to a bin in \(m_{\mathrm {eff}}\). Uncertainties reported for the fitted background estimates combine statistical (in the simulated event yields) and systematic uncertainties. The uncertainties in this table are symmetrised for error propagation purposes but are truncated at zero to remain within the physical boundaries
Table 9 Observed event yields and the background expectation obtained by a background fit in the 4J low-x SRs with an integrated luminosity of 139 \(\hbox {fb}^{-1}\). Each column corresponds to a bin in \(m_{\mathrm {eff}}\). Uncertainties reported for the fitted background estimates combine statistical (in the simulated event yields) and systematic uncertainties. The uncertainties in this table are symmetrised for error propagation purposes but are truncated at zero to remain within the physical boundaries
Table 10 Observed event yields and the background expectation obtained by a background fit in the 6J SRs with an integrated luminosity of 139 \(\hbox {fb}^{-1}\). Each column corresponds to a bin in \(m_{\mathrm {eff}}\). Uncertainties reported for the fitted background estimates combine statistical (in the simulated event yields) and systematic uncertainties. The uncertainties in this table are symmetrised for error propagation purposes but are truncated at zero to remain within the physical boundaries
Fig. 5
figure 5

Comparison of the observed and expected event yields in all signal regions in the background-only fit

Fig. 6
figure 6

Post-fit \(m_\mathrm {eff}\) distributions in the exclusion 2J and 4J high-x signal regions. The uncertainty bands include all statistical and systematic uncertainties. The dashed lines represent benchmark signal points. Overflow events are included in the last bin

Fig. 7
figure 7

Post-fit \(m_\mathrm {eff}\) distributions in the exclusion 4J low-x and 6J signal regions. The uncertainty bands include all statistical and systematic uncertainties. The dashed lines represent benchmark signal points. Overflow events are included in the last bin

Table 11 Results of the model-independent limit fits. For each SR defined in Table 2, the observed events and the total SM background, and the observed 95% CL upper limits on the visible cross-section \(\langle \epsilon \sigma \rangle _{\mathrm {obs}}^{95}\) and on the number of signal events \(S_{\mathrm {obs}}^{95}\) are given. The sixth column, \(S_{\mathrm {exp}}^{95}\), shows the 95% CL upper limit on the number of signal events, given the expected number (and \(\pm 1\sigma \) excursions of the expectation) of background events. The last two columns indicate the \(\hbox {CL}_\text {b}\) value, i.e. the confidence level observed for the background-only hypothesis, the discovery p-value \(p(s = 0)\) and the significance Z. In case of fewer events than the fitted background estimate observed, the p-values are capped at 0.5
Fig. 8
figure 8

Exclusion limits for the gluino one-step \(x=1/2\) (top left), gluino one-step variable-x (top right), squark one-step \(x=1/2\) (bottom left) and squark one-step variable-x (bottom right) scenarios. The red solid line corresponds to the observed limit, with the red dotted lines indicating the \(\pm 1 \sigma \) variation of the limit due to the effect of theoretical scale and PDF uncertainties in the signal cross-section, for scenarios where the four left-handed squarks of the first two generations (\(\tilde{u}_\mathrm {L}, \tilde{d}_\mathrm {L}, \tilde{c}_\mathrm {L}, \tilde{s}_\mathrm {L}\)) are mass degenerate. The dark grey dashed line indicates the expected limit with the yellow band representing the impact of the \(\pm 1 \sigma \) variation of the median expected limit due to the experimental and theoretical uncertainties. The orange solid and the dashed lines show the squark one-step \(x=1/2\) (left) and squark one-step variable-x (right) scenarios for cases in which only a single squark flavour is kinematically accessible. For reference, exclusion bounds from previous searches with 36.1 \(\hbox {fb}^{-1}\) of data at 13  centre-of-mass energy [23] are overlaid as the grey area

Using the discovery signal regions defined in Table 2, a test is performed for the presence of beyond-the-SM physics in a model-independent fit in each signal region. The signal contribution is only considered in the respective signal region, and not in the CRs, and therefore a conservative background estimate is obtained in the signal regions. Table 11 shows the observed and expected upper limits (\(S_{\mathrm {obs}}^{95}\) and \(S_{\mathrm {exp}}^{95}\), respectively) on the number of signal events, at 95% confidence level (CL) using the \(\hbox {CL}_\text {s}\) prescription [96]. Also reported is the visible cross-section upper limit (\(\langle \epsilon \sigma \rangle _{\mathrm {obs}}^{95}\)), which is the upper limit on the cross-section times the reconstruction efficiency and region acceptance. The table also presents the discovery p-values (\(p_{0}\)), which quantify the probability to observe at least as many events as expected in the background-only assumption, the \(\hbox {CL}_\text {b}\) value, i.e. the confidence level observed for the background-only hypothesis, and the associated significance.

Observed and expected exclusion limits at 95% CL are calculated for the gluino and squark one-step models using all statistically independent binned signal and control regions in a model-dependent fit. For this exclusion fit, the signal contribution, adjusted using a single floating normalisation factor, is considered in all control and signal regions. The background normalisation factors are simultaneously determined in the same fit. Specific sparticle masses in the gluino or squark one-step models can be excluded if the upper limit of the signal normalisation factor is less than unity.

Figure 8 shows the expected and observed exclusion limits. Gluino masses up to 2.2  and 2.05  can be excluded for masses less than 400 GeV and 1  respectively, while squark masses up to 1.37  can be excluded for low-mass . Benefiting from the increased integrated luminosity, the current observed limit exceeds the previous ATLAS limit by about 100 GeV in \(m_{\tilde{g}}\) and in \(m_{\tilde{q}}\) for low-mass . In squark one-step models in which only a single squark flavour is kinematically accessible, squark masses up to about 1.0  can be excluded.

9 Conclusion

A search for gluinos and squarks in events with one isolated lepton, jets and missing transverse momentum is presented. The analysis uses \(139~\mathrm {fb}^{-1}\) of proton–proton collision data at a centre-of-mass energy of 13  collected by the ATLAS experiment at the LHC. Four signal regions requiring from at least two to at least six jets are used to cover a broad spectrum of the targeted SUSY model parameter space. Three signal regions defined using high-\(p_{\mathrm {T}}\) lepton selections target models with large mass differences between the supersymmetric particles. A separate, low-\(p_{\mathrm {T}}\) lepton region is designed to enhance the sensitivity to models with compressed mass spectra. The data agree with the Standard Model background prediction in the signal regions. For all signal regions, limits on the visible cross-section are derived in models of new physics within the kinematic requirements of this search. In addition, exclusion limits are placed on models with gluino/squark production and subsequent decays via an intermediate chargino to the lightest neutralino. This search extends the exclusion limit by 100 GeV (gluino) and 180 GeV (squark) for a massless LSP with respect to the previous search [23] owing to a more solid background estimation technique and an increased statistical sample. Gluino (Squark) masses up to around 2.2 (1.4)  are excluded for a mass lower than 200 GeV, while for scenarios with a single accessible squark flavour, squark masses up to around 1.04  are excluded.