1 Introduction

One of the most promising extensions of the Standard Model, supersymmetry (SUSY) [19], has been the target of a large number of searches at the LHC. Prompted by the large predicted production cross-section of coloured SUSY particles (sparticles), ATLAS and CMS have performed inclusive searches for strongly produced squarks and gluinos, the superpartners of quarks and gluons [1017]. Assuming R-parity conservation [1822], these sparticles are produced in pairs and decay into energetic jets, possibly leptons, and the lightest SUSY particle (LSP, typically the lightest neutralino \(\tilde{\chi}_{1}^{0}\)), which escapes detection and results in missing transverse momentum. For these searches, the selections adopted to discriminate the signal processes from the background typically include requirements on the missing transverse momentum (\(E_{\mathrm{T}}^{\mathrm{miss}}\)) and the scalar sum of transverse momenta of all selected physics objects (H T) plus the scalar \(E_{\mathrm{T}}^{\mathrm{miss}}\) (effective mass, M eff).

This paper presents a search for strongly produced sparticles that makes use of a variety of final states including high transverse momentum jets and zero, one, or two leptons (electrons or muons). The events are also separated according to the presence of a jet identified as originating from a b-quark (b-tagged jet). Several mutually exclusive search channels are defined, facilitating a simultaneous search in all of the typical final states and increasing the search sensitivity. The search employs a set of observables, called the “razor variables” [23], which make use of both longitudinal and transverse event information. Because of the inclusion of longitudinal information, the requirements on the transverse information to reduce the background are effectively relaxed, making the search sensitive to different regions of kinematic phase space relative to other \(E_{\mathrm{T}}^{\mathrm{miss}}\)-based searches. Thus, these search results complement those already performed by ATLAS. These variables were first employed in SUSY searches by CMS [24, 25].

This paper is organised as follows. The main features of the ATLAS detector are presented in Sect. 2. Section 3 introduces the razor variables. Section 4 describes the data sample, basic event selection, and the Monte Carlo simulation used to model the data. Section 5 defines the basic physics objects and event-level variables that are used through the analysis. The search technique is described in Sect. 6, and the background estimation is presented in Sect. 7. The performance of the search and interpretation of the results are presented in Sect. 8. Finally, Sect. 9 includes a summary of the analysis and of its findings.

2 ATLAS detector

The ATLAS detector comprises an inner tracking detector, a calorimeter, and a muon system [26]. The inner detector includes a silicon pixel detector, a silicon microstrip detector, and a transition radiation tracker. It is immersed in a 2 T axial field and precisely measures the tracks of charged particles in the pseudorapidity regionFootnote 1 |η|<2.5. The calorimeter covers the region |η|<4.9 and is divided into electromagnetic and hadronic compartments. The electromagnetic calorimetry in the central (|η|<3.2) region is provided by liquid argon sampling calorimeters with lead absorbers. In the barrel region (|η|<1.4), the hadronic calorimetry is provided by scintillator tiles with steel absorbers, and the more forward (1.4<|η|<3.2) region is covered by a liquid argon and copper sampling hadronic calorimeter. The forward calorimetry (|η|>3.2) uses liquid argon and copper or tungsten absorbers. The muon spectrometer covers |η|<2.7 and includes a system of air-core toroidal magnets. A variety of technologies are used to provide precision muon tracking and identification for |η|<2.7 and rapid response for triggering for |η|<2.4.

ATLAS uses a three-tier trigger system to select events. The first-level (L1) trigger is hardware-based and only uses coarse calorimeter information and muon system information. The calorimeter information available at the lowest level includes basic objects with rough calibration and simple identification of electromagnetic objects (electrons and photons) as distinct from hadronic objects (jets). The second-level (L2) trigger and event-filter (EF) compose the software-based high-level trigger (HLT), in which full event reconstruction is run, similar to that used offline, in order to accurately identify and measure objects. The L2 only examines η/ϕ regions that triggered the L1. The EF fully reconstructs events that pass L2.

3 Razor variable definitions

Searches for sparticles in R-parity-conserving scenarios generally make the assumption that the sparticles are pair-produced and decay subsequently to an LSP that is invisible in the detector. The heavy sparticles produced are either the same type of particle (pair-production) or are at the same mass scale (i.e. scenarios with associated squark–gluino production are most relevant when m squarkm gluino). Thus, the production mass and visible energy in the decays are fairly symmetric. Most analyses make use of the transverse balance of typical pp collision events, or exploit the event symmetry in the transverse plane. The razor variables attempt to also include longitudinal information about the event by making several assumptions motivated by the kinematics of the models of interest.

In the rest frame of each heavy sparticle, called the R-frame, the sparticle decays are symmetric. In an attempt to reconstruct the primary produced sparticle pair, the razor calculation clusters all final-state particles into a pair of objects with four-momenta called “mega-jets”. Each of these mega-jets is associated with one of the two SUSY decay chains and represents the visible energy-momentum of that produced sparticle. All possible combinations of the four-vectors of the visibly reconstructed/selected objects (signal jets and leptons) are considered when constructing the two mega-jets. The pair of mega-jets, j 1 and j 2, that minimises the sum of the squared masses of the four-vectors is selected. Following the prescription in Ref. [23] and for consistency with Ref. [24], all jets and the mega-jets are forced to be massless by setting their energy equal to the magnitude of their three momenta. Studies indicate that neither this choice nor the mega-jet selection, based on minimizing the mega-jet mass squared, have a significant impact on the reach of the razor-based search.

In the R-frames, each heavy sparticle should be nearly at rest with some mass m Heavy. The sparticle decay may then be approximated as a two-body decay to some visible object (a mega-jet) and the invisible, stable LSP. The final visible decay products (i.e. the final-state quarks and gluons, or the observable jets and leptons) have masses far below the SUSY mass scale and can therefore be approximated as being massless. Then the energy of each mega-jet in the R-frame, E 1 and E 2, becomes:

$$ E_1 = E_2 = \frac{ m_{\text{Heavy}}^2 - m_{\text{LSP}}^2 }{2 \times m_{\text{Heavy}}}, $$
(1)

where m LSP is the mass of the LSP. This leads to a characteristic mass, M R , in the R frame of M R =2×E 1=2×E 2, which for m Heavym LSP is identical to m Heavy. Therefore, in events where heavy particles are pair-produced, M R , which is a measure of the scale of the heaviest particles produced, should form a bump [23, 24]. In \(t\bar{t}\) or WW events, for example, the characteristic mass M R m top or m W . Like the Jacobian peak of the transverse mass distribution in Wℓν events, the width of the bump is dominated by the kinematics of the invisible particles in the event. The product of M R and the Lorentz factor for the boost from the lab to R-frame, \(M_{R}'=\gamma_{R} \times M_{R}\), is useful for characterisation of the sparticle mass scale, in part because of its close relation to m Heavy, and in part because Standard Model backgrounds tend to have small values of \(M_{R}'\). When expressed in terms of the mega-jet quantities in the lab frame, the expression is given by:

$$ M_{R}' = \sqrt{ ( j_{1,E} + j_{2,E} )^2 - ( j_{1,z} + j_{2,z} )^2 } , $$
(2)

where j i,E and j i,z are the energy and longitudinal momentum, respectively, of mega-jet i. The transverse information of the system is taken into account by constructing a transverse mass for the mega-jets, assuming half of the \(E_{\mathrm{T}}^{\mathrm{miss}}\) is associated with each jet:

(3)

where \(\mathbf {E_{\mathrm{T}}^{\mathrm{miss}}}\) is the two-dimensional vector of the \(E_{\mathrm{T}}^{\mathrm{miss}}\) in the transverse plane. When an event contains “fake” \(E_{\mathrm{T}}^{\mathrm{miss}}\) from a detector defect or mismeasurement, the system will tend to have back-to-back mega-jets. In such cases, the vector sum of the two mega-jet momenta will be small. If, on the other hand, there is real \(E_{\mathrm{T}}^{\mathrm{miss}}\), the mega-jets may not be back-to-back and may even point in the same direction. In these cases, the vector sum, and thus \(M_{T}^{R}\), will have a large value. \(M_{\text{T}}^{R}\) is another measure of the scale of the event that only uses transverse quantities in contrast to longitudinal quantities in \(M_{R}'\).

Finally a razor variable is defined to discriminate between signal and background:

$$ R = \frac{ M_{\text{T}}^R }{ M_{R}' } . $$
(4)

This variable takes low values for multijet-like events and tends to be uniformly distributed between 0 and 1 for sparticle decay-like events, providing good discrimination against backgrounds without genuine \(E_{\mathrm{T}}^{\mathrm{miss}}\). The impact of some important experimental uncertainties, like the jet energy scale uncertainty, are reduced in this ratio. In an analysis based on the razor variables, a cut on R can be used to eliminate these backgrounds before a SUSY search is made in the distribution of the variable \(M_{R}'\).

4 Data and Monte Carlo samples

The data included in this analysis were collected between March and October 2011. After basic trigger and data quality requirements, the full dataset corresponds to 4.7±0.2 fb−1 [27, 28].

Events in the zero-lepton channels are selected using a trigger that requires a jet with transverse momentum p T>100 GeV at L1. In the event filter, H T>400 GeV is required, where H T is calculated through a scalar sum of the p T of all calorimeter objects with p T>30 GeV and |η|<3.2. With the exception of a cross-check of the multijet background estimate, which uses prescaled single-jet triggers, this trigger requirement is fully efficient for the offline selection used in the analysis.

The one- and two-lepton channels make use of the lowest-p T single-lepton triggers available for the entire running period. The muon triggers require a muon with p T>18 GeV, and the electron triggers require an electron with p T>22 GeV. Offline, the leading lepton in the event is required to have p T>20 GeV (p T>25 GeV) if it is a muon (electron), in order to ensure that the triggers are fully efficient with respect to the offline event selection. For the two-lepton analysis, where there are overlaps in the triggers, the electron trigger takes priority over the muon trigger.

Offline, an event is required to have at least one vertex with at least five tracks associated to it, each with \(p_{\mathrm{T}} ^{\rm track}>400~\mbox{MeV}\). This requirement reduces cosmic ray and beam-related backgrounds. The primary vertex is defined as the one with the largest \(\sum( p_{\mathrm{T}} ^{\rm track})^{2}\) of the associated tracks. Events that suffer from sporadic calorimeter noise bursts or data integrity errors are also rejected.

Monte Carlo (MC) simulated events were used to develop the analysis and assist in estimations of background rates. All MC samples are processed through ATLAS’s full detector simulation [29] based on Geant4 [30], which was run with four different configurations corresponding to detector conditions of four distinct operating periods of 2011. The fractions of MC simulation events in these four periods match the fractions of data in each period. During the data collection, the average number of proton–proton collisions per bunch crossing in addition to the one of interest (“event pile-up” or simply “pile-up”) increased from approximately two to twelve. To mimic the effect of pile-up, additional inelastic proton–proton collisions are generated using Pythia [31] and overlaid on top of every MC event. Within each period, the profile of the average number of events per bunch crossing (〈μ〉) is re-weighted to match the data in that period. The same trigger selection is applied to the MC simulation events, which are then passed through the same analysis code as the data. Reconstruction and trigger efficiency scale factors are applied to the MC simulation in order to take into account small discrepancies between the data and the MC simulation.

Table 1 lists the major backgrounds along with the chosen estimation method (described in Sect. 7) and the primary and alternative MC generators used in this analysis. In all cases, MC@NLO and Alpgen are interfaced to Herwig and Jimmy for the parton shower, hadronisation, and underlying event modelling. The multijet background is normalised to the leading order generator cross-section predicted by Pythia. The \(t\bar{t}\) production cross-section of 166.8 pb is calculated at approximate NNLO in QCD using Hathor [32] with the MSTW2008 NNLO PDF sets [33]. The calculation is cross-checked with an NLO+NNLL calculation [34] implemented in Top++ [35]. The single-top production cross-sections are calculated separately for s-channel, t-channel, and Wt production at NNLO [3638].

Table 1 Background estimation methods, primary and alternative MC event generators, and normalisation uncertainties for each of the major backgrounds. The backgrounds are constrained using various Control Regions (CRs) that are enriched in certain samples (see Sect. 6). The \(t\bar{t}\) background estimate includes small contributions from \(t\bar{t} W\) and \(t\bar{t} Z\), generated with MadGraph. The diboson WW background estimate also includes W ± W ± jj generated with MadGraph. The last column of the table indicates the uncertainty on the normalization in the simultaneous fit used to test signal hypotheses. “None” indicates that the normalization is fully constrained in the fit. The grouping indicates the samples that are combined and jointly varied in the fit. Within a group, the relative normalizations are fixed

The W and Z (including Drell–Yan with m ℓℓ >40 GeV) production cross-sections of 10.46 nb and 0.964 nb are calculated at NNLO using FEWZ [39]. For the production of vector bosons in association with heavy flavour, in accordance with ATLAS measurements [40], the production cross-section for \(W+ \bar{b} \) and \(W+ c\bar{c} \) are scaled by 1.63, and the cross-section for W+c is scaled by 1.11 compared to the NLO cross-section [41]. Additional uncertainties on the production of W and Z bosons in association with heavy flavour of 45 % for \(W+ \bar{b} \) and \(W+ c\bar{c} \), 32 % for W+c, and 55 % for \(Z+b \bar{b} \) are included. Alpgen describes the jet multiplicity and inclusive \(M_{R}'\) distributions well, but it does not correctly model the vector boson p T distribution. Therefore, the boson p T in the Alpgen samples is re-weighted according to the distribution produced Sherpa. Half of the difference between the weight and unity is applied as a systematic uncertainty on the re-weighting procedure. Further systematic uncertainties on the shapes of Alpgen samples are derived by systematically varying the generator parameters, including matching and factorisation scales. Diboson production cross-sections of 44.92 pb, 17.97 pb, and 9.23 pb for WW, WZ, and ZZ (including off-shell production with m ℓℓ >12 GeV) are calculated at NLO using MCFM [42]. In order to avoid low-mass resonances, all dilepton events are required to have the invariant mass m ℓℓ >20 GeV. These cross-sections provide the starting normalisations for all background processes.

Two SUSY-inspired simplified models are used for the interpretation of the results from this search. The first considers gluino pair-production, with the gluino decaying to a \(t\bar{t}\) pair and the LSP via an off-shell stop. This model is generated using Herwig++ [49], with the gluino and LSP masses being the only free parameters. The top quarks are required to be on-shell, limiting the mass splitting between the gluino and the LSP to greater than 2×m top.

The second considers gluino pair-production, with the gluino decaying to two quarks and a chargino via an off-shell squark. The chargino then decays to a W boson and the LSP. The free parameters of this model are the masses of the gluino, chargino, and LSP. For convenience, two two-dimensional planes are generated: one with the chargino mass exactly between the masses of the gluino and the LSP and one with the mass of the LSP fixed to 60 GeV. Because initial-state radiation can be important for the acceptance of these models when the mass splitting between the gluino and LSP is small, this model is generated using MadGraph [50] with at most one additional jet in the matrix element. Pythia is used for the parton shower and hadronisation. Systematic uncertainties on matrix element matching and initial-state radiation modelling are included, leading to 20 % uncertainties for small mass splittings and small gluino masses, but no uncertainty for mass splittings above 200 GeV and masses above 400 GeV.

Additionally, the results are interpreted in terms of SUSY signal models based on the constrained minimal supersymmetric model (CMSSM or MSUGRA) [1822]. The parameters of this model are the high-energy-scale universal scalar mass, m 0, the universal gaugino mass, m 1/2, the ratio of the vacuum expectation values of the two Higgs fields, tan(β), the tri-linear coupling strength, A 0, and the sign of the Higgsino mass parameter, μ. Samples are generated in a two-dimensional grid of the m 0m 1/2 parameters where tan(β)=10 and A 0=0 are fixed and μ is set positive. This MC data grid is generated using Herwig++ [49], with a more dense population of points at low mass. IsaSUSY [51] is used to run the high-energy-scale parameters down to the weak-scale.

Signal cross-sections are calculated to next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at next-to-leading-logarithmic accuracy (NLO+NLL) [5256]. The nominal cross-section and the uncertainty are taken from an envelope of cross-section predictions using different PDF sets and factorisation and renormalisation scales, as described in Ref. [57]. For each of these signal models, the luminosity systematic uncertainty of 3.9 % [27, 28] and statistical uncertainty, typically of order 10 %, is included.

5 Physics object identification and selection

Events are categorised into six exclusive samples defined by the presence of zero, one, or two leptons, with or without b-tagged jets. The particle candidate selections that define these samples are referred to as the “baseline” object selection. Since a particle may simultaneously satisfy multiple particle hypotheses (e.g. electron and jet), an overlap removal procedure (described below) assigns a unique interpretation to each candidate. The selections are then refined to enhance signal candidates whilst removing leptons not originating from gauge bosons, tau-leptons or sparticles.

Baseline electrons are required to have E T>10 GeV, be within the fiducial acceptance of the inner detector (|η|<2.47), and pass a version of the “medium” selection criteria [58] updated for 2011 running conditions, which requires hadronic calorimeter energy deposition and a calorimetric shower shape consistent with an electron and a match to a good quality inner detector track. Signal electrons are required to be isolated from other objects and satisfy “tight” selections. The tight selection applies stricter track quality and matching than medium and ensures the number of hits in the transition radiation tracker is consistent with the electron hypothesis. The isolation requirement is that the sum of the p T of all charged particle tracks associated with the primary vertex within ΔR=0.2, where \(\Delta R = \sqrt{(\Delta\eta)^{2} + (\Delta\phi)^{2}}\), of the electron is less than 10 % of the electron E T. In the leptonic channels, if the leading lepton in a data event is an electron, it is additionally required to match an EF trigger electron. MC simulation events are re-weighted to compensate for mis-modelling of the single-lepton trigger efficiency. The energy of electrons in simulated events is also smeared prior to object selection in order to reproduce the resolution in Z and J/ψ data. Finally, in order to account for percent-level differences in electron reconstruction efficiency, η- and E T-dependent scale factors, derived from Z, W, and J/ψ events in the data, are applied to each simulated electron satisfying overlap removal and selection requirements.

Baseline muons are reconstructed as either a combined track in the muon spectrometer and inner detector, or as an inner detector track matching with a muon spectrometer segment [59]. Tracks are required to have good quality, and the muon is required to have p T>10 GeV and |η|<2.4. Signal muons are required to be isolated by ensuring that the sum of the p T of all charged particle tracks associated with the primary vertex within ΔR=0.2 of the muon is less than 1.8 GeV. Matching to EF trigger muons in data, MC event trigger re-weighting, muon momentum smearing, and MC/data efficiency scaling are performed in a similar way for muons as electrons (described above) [6062]. These corrections are typically percent or sub-percent level.

Calorimeter jets are reconstructed from topological clusters of energy deposited in the calorimeter calibrated at the electromagnetic (EM) scale [63] using the anti-k t jet algorithm [64, 65] with a four-momentum recombination scheme and a distance parameter of 0.4. Jets reconstructed with an EM-scale p T>7 GeV are calibrated to the hadronic scale (particle level) using p T and η-dependent factors, derived from simulation and validated with test beam and collision data [66]. In order to remove specific non-collision backgrounds, events are rejected if they contain a reconstructed jet that does not pass several quality and selection criteria [66]. Signal jets are selected if they lie within |η|<2.5 with a jet vertex fraction (JVF) of at least 75 %, where the JVF is the fraction of summed p T of the tracks associated with the jet that is carried by tracks consistent with the primary vertex of the event, thus associating the jet with the pp collision of interest. Jets are tagged as heavy flavour using the combined neural network “jet fitter” algorithm [67] with the 60 % efficiency working point. Scale factors for heavy flavour jets are used in MC simulation in order to reproduce the expected b-jet identification performance in data.

In order to ensure that objects are not double counted, overlaps between objects are removed using a hierachical procedure. If any two baseline electrons lie within a distance of ΔR=0.1 of one another, the electron with the lower calorimeter E T is discarded. Next, jets passing basic selections are required to be at least 0.2 units away from all surviving baseline electrons in ηϕ. Electrons are then required to be at least 0.4 units away from surviving jets. Finally, in order to mitigate the effect of jets which have deposited significant energy in the muon spectrometer on mass measurements and reduce the number of events with badly measured missing transverse momentum, muons with p T>250 GeV within ΔR=0.2 of a jet with p T>500 GeV are removed. A negligible number of events in the data are removed by this cut.

Following these overlap removal procedures, the missing transverse momentum and razor variables are calculated. The determination of the missing transverse momentum uses all baseline electrons with E T>20 GeV, all baseline muons, all calibrated jets with p T>20 GeV, and EM scale topological calorimeter clusters not belonging to any object. Note that in the MC simulation, objects enter this calculation after the energy or p T smearing described above.

In counting leptons for event classification, baseline electrons and muons are then required to be at least 0.4 units away from all good jets in ηϕ. If an electron and muon are separated by ΔR cone<0.1, neither is counted.

In order to remove events with large missing transverse momentum due to cosmic rays, events are vetoed if they contain a muon in which the transverse and longitudinal impact track parameters are greater than 0.2 mm and 0.1 mm with respect to the primary vertex, respectively. The vertex resolution is significantly smaller than either of these requirements, typically <0.05 mm. Also vetoed are events with badly measured, non-isolated muons. These muons with large momentum uncertainties are rare in both the signal and background events and can have significant impact on the \(E_{\mathrm{T}}^{\mathrm{miss}}\) and razor variables.

During a portion of the run period, a hardware failure resulted in a region of the calorimeter not being read out. For data collected during this period, and for a corresponding fraction of the MC samples, events are rejected if they fail the “smart LAr hole veto” [13]. This ensures that if an event contains one or more jets pointing to the dead region and those jets may contribute substantially to the missing transverse momentum in the event, the event is discarded.

Signal regions are defined after all overlap removal is complete. Events with no baseline leptons and events with the highest-p T lepton below the leading lepton requirement (25 GeV for electrons, 20 GeV for muons) are accepted into the zero-lepton regions. Events with one leading lepton satisfying all requirements, including that on leading lepton p T (above), and no other baseline leptons with p T>10 GeV are accepted into the one-lepton regions. Events with exactly one additional signal lepton above 10 GeV and no other baseline leptons are accepted into the two-lepton regions.

6 Search technique

After sorting events into the six samples described in the previous section, each sample is further divided in the R\(M_{R}'\) plane into control regions (CR), which are choosen so that they are dominated by a specific background, and signal regions (SR). Additionally, validation regions (VR) are constructed, which do not constrain the background but are used to evaluate the agreement between data and MC simulation. Table 2 lists these regions, which are also visualised in the R\(M_{R}'\) plane in Fig. 1. These regions are binned in either R or \(M_{R}'\) and then simultaneously fit to MC estimates for background and signal rates with correlations from sample to sample and region to region taken into account. The hadronic (had.) and one-lepton signal regions are divided into events with and without b-tagged jets (“b-tag” and “b-veto,” respectively). The two-lepton events are divided into regions with opposite-sign (OS) and same-sign (SS) leptons and regions with opposite-flavour (OF) and same-flavour (SF) leptons. While some background components are sufficiently constrained by the CRs to be left free in the fit, others are constrained to estimates derived from other techniques or MC simulation. Table 1 summarises the backgrounds, the estimation technique, the source of the estimate, and the normalization uncertainty used in the fit. Finally, systematic uncertainties on all backgrounds are included as nuisance parameters. The result is a maximum likelihood fit that encapsulates all knowledge about the background and signal consistently across all channels.

Fig. 1
figure 1

A visual representation of the zero-lepton (top), one-lepton (middle), and two-lepton (bottom) control validation (VR), and signal (SR) regions. The CR and VR regions also indicate the respective dominant background. Regions with two leptons are classified as same-sign (SS) or opposite-sign (OS) events and as same-flavor (SF) or opposite-flavor (OF) events

Table 2 Background control, signal, and validation regions. All signal regions include the overflow in the highest bin. “N/A” means that there is no requirement. Regions with two leptons are classified as same-sign (SS) or opposite-sign (OS) events and as same-flavor (SF) or opposite-flavor (OF) events. The binning of the validation regions does not affect the results, so they are listed as “N/A”

When evaluating a signal hypothesis, any signal contamination in the control regions is taken into account for each signal point, as the control region fits are performed for each signal hypothesis. Separately, each signal region (one at a time), along with all control regions, is also fit under the background-only hypothesis. This fit is used to characterise agreement in each signal region with the background-only hypothesis and to extract visible cross-section limits and upper limits on the production of events from new physics (N BSM).

The fit considers several independent background components:

  • \(t\bar{t}\) and single top. A total of five top control regions are defined in the one- and two-lepton channels. The normalisation of this component is allowed to vary freely in the fit.

  • Bosons, except diboson WW. The inclusion of the WZ and ZZ diboson samples is motivated by the dominance of leptonic Z decays in the two-lepton signal regions, the dominance of Zνν in the zero-lepton signal regions, and the dominance of \(WZ\rightarrow\ell\nu q\bar{q} \) in the one-lepton signal regions. In all of these cases, the experimental uncertainties affect the samples in the same way as they do W+jets or Z+jets, and therefore they are combined in order to treat them as fully correlated. The normalisation of this sample is allowed to vary freely in the fit. Independent validation of the Z+jets background is carried out in two-lepton control regions. The agreement is good between data and MC simulation in both normalisation and shape.

  • Diboson WW. This sample is constrained with a 30 % cross-section systematic uncertainty. The constraint is necessary because of the relatively small contribution of the sample in most signal and control regions and because no WW-dominated control region can be constructed; if the background were allowed to vary freely, then the fit may find a minimum with an unreasonably large or small contribution from diboson WW events and hide some other effect with an artificial WW normalisation.

  • Charge flip. Charge mis-identification can occur due to physical effects, like lepton Bremsstrahlung, and detector effects, especially for high-p T leptons with almost straight tracks. These effects generate background in the same-sign dielectron and electron-muon channels. This background is negligible in the dimuon channel, where the contribution from both physical and detector effects is far smaller. The electron charge-flip rate is measured as a function of η in the data [68], allowing MC simulation to model the lesser dependence on p T. These charge-flip rates are applied to opposite-sign MC simulation events, providing an estimate of the overall contribution from charge flip in these channels. The electron p T is additionally shifted and smeared to mimic the effect of charge mis-identification. This shift in the p T is propagated through to the razor variables. The uncertainty from the charge flip probabilities dominates the uncertainty of this background.

  • Fake leptons. The multijet background in the one-lepton signal regions, as well as the W+jets, semi-leptonic \(t\bar{t}\), and multijet background in the two-lepton signal regions, comes predominantly from hadrons faking electrons and muons. This background is estimated using the “matrix method” [13, 68], using the number of baseline leptons not passing signal lepton requirements. The efficiency for a real lepton passing the baseline lepton requirements to pass the signal lepton requirements is estimated using Z MC simulation events. The rejection rate for fake leptons is estimated in data, using samples enriched in fake leptons. For electrons, the factors are derived and applied separately for inclusive samples of events and samples requiring a b-tagged jet. Because this background accounts for all fake background, MC events in the one-lepton (two-lepton) channels are required to have at least one (two) prompt lepton(s) from a τ lepton, W boson, Z boson, or sparticle. The uncertainty on this background estimate has a statistical component from the number of events in the control region and a systematic component from the uncertainty on the scale factors.

    Some fraction of the events with same-sign, baseline leptons in the data may be due to charge flip. Thus, the matrix method overestimates somewhat the fake lepton background in the dilepton channels. In order to correct for this overlap, opposite-sign events in data containing baseline leptons that do not pass the signal lepton requirements are used. Each event is assigned a weight representing the likelihood of that event being subject to charge mis-identification. The weighted events are then presented as a negative component to the same-sign fake background distribution, such that the contribution to the same sign fake background from originally oppositely charged leptons is subtracted.

  • Multijets in zero-lepton channel. Two specific control regions constrain this background, and its normalisation is allowed to vary freely in the fit. Several different approaches are used to cross-check this estimate. Prescaled single jet triggers are used to construct independent multijet-enriched control regions at low \(M_{R}'\) that is free from the inefficiency of the H T-based trigger. The observed number of events in this region are then projected into the signal region using transfer factors from MC simulation. Alternatively, in order to model the mis-measurement of jets in the calorimeter, jets in events collected with these single-jet triggers are smeared according to response functions estimated using data [10]. Both of these methods result in an estimate consistent with that derived in the main fit.

The systematic effects included as nuisance parameters in the fit are: the jet energy scale and resolution uncertainties; b-tagging uncertainties; uncertainty on the MC simulation modelling of the JVF; the additional cross-section uncertainty on the production of heavy flavour in association with a vector boson; the uncertainties on trigger efficiency and matching and reconstruction efficiency; a systematic uncertainty on the re-weighting of the W-boson p T; uncertainties on the missing transverse momentum pile-up dependence and the calibration of energy not associated with an object in the event; the matrix method statistical and systematic uncertainties; the charge flip systematic uncertainties; the diboson WW shape systematic uncertainty taken from comparing Herwig to Alpgen. Where the systematic uncertainties affect object definitions, corrections are propagated to the missing transverse momentum and razor variable calculations. The effects of other uncertainties on the final results are negligible. These uncertainties affect the signal yield and shape in the signal regions, as well as the allowed variation in signal-region background estimates after the control region constraints. In most signal regions, the jet energy scale uncertainty is the dominant experimental uncertainty (from 10 % to 25 %).

7 Background fit

Figure 2 shows the distributions of \(M_{R}'\) and jet multiplicity in the zero-lepton multijet control region with a b-tagged jet requirement, with results from the fit to the control regions overlayed. By design, the multijet background is dominant in these regions. The small contribution from \(t\bar{t}\) and W+jets backgrounds are constrained by other control regions in the simultaneous fit. The hatched area indicates the total systematic uncertainty after the constraints imposed by the fit.

Fig. 2
figure 2

The distribution of \(M_{R}'\) (left) and the number of jets with p T>30 GeV (right) in the multijet control region with a b-tagged jet requirement (dots with error bars), the expectation from the control region fit for various backgrounds (filled), and the systematic uncertainty (hatched)

The distributions of R and jet multiplicty for the backgrounds after the control region fit in the Wμν+jets control region are shown in Fig. 3. The control region at low R is dominated by fake backgrounds, and at moderate-to-high R they are dominated by W+jets. The use of an alternate control region with a cut on transverse mass, which significantly reduces the fake contribution, results in a negligible change in the final search results.

Fig. 3
figure 3

The distribution of R (left) and the number of jets with p T>30 GeV (right) in the Wμν+jets control region (dots with error bars), the expectation from the control region fit for various backgrounds (filled), and the systematic uncertainty (hatched). Error bands on the ratios are only shown for bins with non-zero MC simulation predictions. In the high jet-multiplicity bins, the MC simulation statistics are poor

Figures 4 and 5 show the one-lepton and two-lepton \(t\bar{t}\) control regions, respectively. The fit reduces the normalisation of the \(t\bar{t}\) background in the one-lepton control region by approximately 15–20 % with respect to the unmodified expectation from MC simulation. This shift predominantly affects the semi-leptonic \(t\bar{t}\) background. In the two-lepton analysis, there is a significant contribution to the background expectation from Z-boson events with heavy flavour, particularly at low \(E_{\mathrm{T}}^{\mathrm{miss}}\). The lowest \(M_{R}'\) bin shows the most significant disagreement, which demonstrates the importance of shape profiling by binning the control regions. Although in that lowest bin, particularly in the two-muon channel, the MC simulation underestimates the amount of data, a single-binned normalisation of the \(t\bar{t}\) background would result in an overestimation of the background at high \(M_{R}'\). The distributions of missing transverse momentum are also shown for the two-lepton control regions.

Fig. 4
figure 4

The distribution of R in the one-lepton \(t\bar{t}\) control regions (dots with error bars), the expectation after the control region fit for various backgrounds (filled), and the systematic uncertainty (hatched)

Fig. 5
figure 5

The distribution of \(M_{R}'\) (left) and missing transverse momentum (right) in the two-lepton \(t\bar{t}\) control regions (dots with error bars), the expectation after the control region fit for various backgrounds (filled), and the systematic uncertainty (hatched)

The distributions of \(M_{R}'\) for the two Z+jets control regions are shown in Fig. 6. After the control region fit, good agreement is observed in both the electron and muon channels.

Fig. 6
figure 6

Top, the distribution of \(M_{R}'\) in the Z+jets control regions. Bottom, the distribution of \(M_{R}'\) (left) and dilepton mass (right) in the charge flip control region (dots with error bars), the expectation after the control region fit for various backgrounds (filled), and the systematic uncertainty (hatched)

The charge flip background is significant in the same-sign two-electron channel. Figure 6 also shows the distribution of \(M_{R}'\) in a charge flip enriched control region. There remain significant uncertainties on the background even after the control region fit, since it is dominated by charge flip and fake leptons, both of which have large systematic uncertainties associated with them. The distribution of dilepton mass is also shown.

The contributions to each of the control regions before and after the fit to the control regions are shown in Tables 3 and 4.

Table 3 The number of observed events and the results of the background-only fit to the control regions in the zero- and one-lepton control regions, for an integrated luminosity of 4.7 fb−1. Nominal MC expectations (normalised to MC cross-sections) are given for comparison. The errors shown are the statistical plus systematic uncertainties
Table 4 The number of observed events and the results of the background-only fit to the control regions in the two-lepton control regions, for an integrated luminosity of 4.7 fb−1. Nominal MC expectations (normalised to MC cross-sections) are given for comparison. The errors shown are the statistical plus systematic uncertainties

Various tests of the fit are carried out in order to ensure its stability. As a test of the multijet background constraint and the validity of fitting the \(M_{R}'\) distributions in those control regions, the control region fit is instead performed in the number of jets with p T>30 GeV. The p T cut is raised from the baseline selection to make the fit less sensitive to pile-up effects. The expectation for the multijet background in the signal regions is consistent with the main result.

The yields and distributions in the validation regions show good agreement with the Standard Model expectation. The significance of the deviation of the observation from the expectation in each of the signal and validation regions are shown in Fig. 7. There is some tension in the pre-fit results between the same-flavour and opposite-flavour dilepton \(t\bar{t}\) validation regions, but there is no indication of a systematic mis-modelling of any of the major backgrounds. The yields of all validation regions are within 1.2σ of the SM expectations.

Fig. 7
figure 7

Pull distributions of the numbers of events in the validation regions (VR) and signal regions (SR). The filled (dashed) bars show the agreement after (before) the background-only fit to the control regions has been performed

The numbers of expected events in each signal region before and after the fit to the control regions are shown in Tables 5 and 6. Additionally, the probability (p 0-value) that a background-only pseudo-experiment is more signal-like than observed is given for each individual signal region. To obtain these p 0-values, the fit in the signal region proceeds in the same way as the control-region-only fit, except that the number of events observed in the signal region is included as an input to the fit. Then, an additional parameter for the non-Standard-Model signal strength, constrained to be non-negative, is fitted. The shape of the distributions in the signal region is neglected in this fit. Therefore, in order to provide tighter constraints on non-Standard-Model production, in some of the high-count signal regions the \(M_{R}'\) requirements are tightened. In all other ways, these signal regions follow the definitions in Table 2. Within the fiducial region defined using the same requirements on lepton and jet multiplicities and the razor variables, but using the MC event generator output to define all objects, the typical efficiencies for the models studied are near 100 %. The observed number of events in each of these regions is then compared to the expectation from the Standard Model backgrounds. The significance of the excess is given, along with the model-independent upper limit on the number of events and cross-section times acceptance times efficiency from non-Standard-Model production.

Table 5 The number of observed events and the results of the background-only fit to the control regions in the zero- and one-lepton signal regions, for an integrated luminosity of 4.7 fb−1. Nominal MC expectations (normalised to MC cross-sections) are given for comparison. The errors shown are the statistical plus systematic uncertainties. The p 0-values and significances are given for single-bin signal regions with somewhat tighter \(M_{R}'\) cuts, along with the 95 % Confidence Level upper limit on the number events, N BSM, and cross-section, σ, for non-Standard-Model production within each signal region. In parentheses are given the expected upper limit and the upper limit under a one-σ upward (↑) or downward (↓) fluctuation in the observation
Table 6 The number of observed events and the results of the background-only fit to the control regions in the two-lepton signal regions, for an integrated luminosity of 4.7 fb−1. Nominal MC expectations (normalised to MC cross-sections) are given for comparison. The errors shown are the statistical plus systematic uncertainties. The p 0-values and significances are given for single-bin signal regions, along with the 95 % Confidence Level upper limit on the number events, N BSM, and cross-section, σ, for non-Standard-Model production within each signal region. In parentheses are given the expected upper limit and the upper limit under a one-σ upward (↑) or downward (↓) fluctuation in the observation

The distributions in all signal regions as a function of \(M_{R}'\) of background expectations, after the fit to the control region has been performed, are shown in Figs. 8 and 9. No significant deviations from the expected background are found. The most significant excess is 1.50 standard deviations from the expectation, in the one electron, b-tagged jet signal region.

Fig. 8
figure 8

The all-hadronic (top) and one-lepton (bottom) signal regions with a b-tagged jet veto (left) and requirement (right), after the fit to the control regions has been performed

Fig. 9
figure 9

The two-lepton signal regions for same-flavour (top) and opposite-flavour (bottom) leptons of the same sign (left) and opposite sign (right), after the fit to the control regions has been performed

8 Exclusion results

Using these signal regions, the CL S  [69] prescription is applied to find 95 % Confidence Level (CL) one-sided limits on the production of SUSY events in various models. The limits on visible cross-section derived in the previous section can be applied to any new physics model. However, in order to compare the exclusion power of the regions to previously published ATLAS results, model-dependent limits are produced. In each case, the exclusion limits are compared to the strongest published ATLAS result. This comparison provides valuable information about the relative strengths of comparable searches using kinematically independent regions. However, as the overlap of the signal regions in this search and the others (discussed in more detail below) is non-zero, a rigorous statistical combination with previously published results is complex and not attempted here.

A likelihood is constructed, taking into account signal shape information provided by the binning of the signal regions. All fitted nuisance parameters, with their correlations, are included in the likelihood. Because the typical \(M_{R}'\) of the signal may vary across a signal grid, the use of shape information results in an observed exclusion that is not consistently above or below the expected. The observed limits for the separate zero-, one-, and two-lepton signal regions are also constructed additionally, with all control regions included as constraints.

Figure 10 shows exclusion contours for a simplified model with gluino pair-production, where the gluinos decay to a chargino and two quarks and the chargino subsequently decays to a W boson and the LSP. Two planes are shown for this simplified model. The first fixes the chargino mass to be exactly half-way between the LSP and gluino mass and shows the exclusion in the gluino-mass–LSP-mass plane. The production cross-section falls smoothly and exponentially with m heavy, while \(M_{R}'\) and therefore the acceptance times efficiency for a signal region typically rises with the mass splitting, m heavym LSP. In the second plane, the LSP mass is fixed to 60 GeV and the exclusion is shown in the gluino mass-x plane, where x=(m charginom LSP)/(m gluinom LSP). The zero- and one-lepton signal regions with a b-tagged jet requirement do not contribute to the exclusion because these simplified models have only light quarks in the matrix element final state.

Fig. 10
figure 10

The observed and expected exclusion in a simplified model with gluino pair-production, where the gluinos decay to a chargino via the emission of two quarks and the chargino decays to the LSP and a W boson. Top, for the chargino mass exactly half-way between the gluino and LSP mass, in the gluino mass–LSP mass plane, and bottom, for the LSP mass fixed to 60 GeV, in the gluino mass-x plane, with x=(m charginom LSP)/(m gluinom LSP). The exclusion is shown for the combination, as well as for each individual channel (labelled 0-lepton, 1-lepton, and 2-lepton). The observed and expected limit of the ATLAS single leptons search [13] (ATLAS 1-lep. (obs.) and ATLAS 1-lep. (exp.), respectively) are indicated as separate contours

At high x, although the leptons have high-p T, the larger branching fraction of the W to quarks allows the zero-lepton channel to dominate. At moderate x, the leptons allow better discrimination between signal and background in the one- and two-lepton channels. At low x, the leptons have too low p T, and the zero-lepton channel again dominates. At high x, the limit set by this analysis exceeds somewhat that of the dedicated 0-lepton and 1-lepton ATLAS searches [10, 13], which have strong \(E_{\mathrm{T}}^{\mathrm{miss}} \) requirements and use M eff to define signal regions. At low x the limit is weaker. This dependence on x observed in this analysis, which is not apparent in the other ATLAS searches, is produced by differences in kinematics in these two regions of the plane. At high x, the charginos are almost at rest in the lab frame, and the event topology is dominated by a two-body decay, \(\tilde{\chi}_{1}^{\pm}\rightarrow W^{\pm}\tilde {\chi}_{1}^{0}\). At low x, on the other hand, the chargino is highly boosted, and the topology is dominated by a three-body decay, \(\tilde{g}\rightarrow q\bar{q}' \tilde {\chi}_{1}^{\pm}\). Thus, the high-x events typically have a higher R than the low-x events, and the two have approximately the same \(M_{R}'\) distribution.

Figure 11 shows exclusion contours in simplified models with gluino pair-production, where the gluinos decay to the LSP via the emission of a \(t\bar{t}\) pair. The exclusion is presented in the gluino mass–LSP mass plane, and, since all top quarks are required to be on-shell, only points with m gluino>m LSP+2×m top are considered. The zero- and one-lepton signal regions with a b-tagged jet veto do not contribute to the exclusion, because these models include four top quarks per event. At small mass splitting, the limits here are somewhat stronger than the ATLAS dedicated multi-b-jet analysis [12]. At larger mass splittings, the three b-tagged jet requirement suppresses the background substantially while preserving the signal acceptance because of the four tops in the event. The combined limit on LSP mass falls more quickly than that of the multi-b-jet analysis because \(M_{R}'\) is proportional to the mass splitting in the event, here m gluinom LSP. The zero-lepton razor analysis is limited in this case by the use of the H T trigger, which was chosen to avoid a bias in the \(M_{R}'\) distribution.

Fig. 11
figure 11

The observed and expected exclusion in a simplified model with gluino pair-production, where the gluinos decay to the LSP via the emission of a \(t\bar{t}\) pair. The exclusion is shown for the combination, as well as for each individual channel (labelled 0-lepton, 1-lepton, and 2-lepton). The observed and expected limit of the ATLAS 3 b-jets search [12] (ATLAS 3 b-jets (obs.) and ATLAS 3 b-jets (exp.), respectively) are indicated as separate contours

Finally, Fig. 12 shows exclusion contours in a plane of MSUGRA with tan(β)=10, A 0=0, and μ>0. At low m 0, where squark pair-production is dominant, the zero-lepton channel dominates the exclusion, although it is affected somewhat by the jet multiplicity requirement that is not applied in the dedicated signal region of Ref. [10], which therefore has more stringent limits. The leptonic channels enter at high m 0, particularly where longer decay chains are common. The robustness of the individual limits have also been cross checked by removing some of the control regions. For example, removing the zero- and one-lepton control regions from the calculation of the two-lepton limit, the MSUGRA limit changes by less than 20 GeV in m 1/2. In the m gluinom squark region, these limits are consistent with those of earlier ATLAS analyses [10, 11, 13], which rely on transverse information only. In this region, the single mass-splitting scale of the main strong production modes should produce a somewhat sharper peak in \(M_{R}'\), allowing an improved limit in the shape fit. At large m 0, the high \(M_{R}'\) requirement of the all-hadronic signal regions, resulting from the H T trigger use, produce a somewhat weaker limit than the ATLAS multijet analysis [11].

Fig. 12
figure 12

The observed and expected exclusion in a plane of the constrained minimal supersymmetric model. The exclusion is shown for the combination, as well as for each individual channel (labelled 0-lepton, 1-lepton, and 2-lepton). The observed and expected limit of the ATLAS 0-lepton search [10] (ATLAS 0-lepton (obs.) and ATLAS 3 0-lepton (exp.), respectively) are indicated as separate contours

The complementarity of a search using razor variables can be quantified by studying the overlap of the signal regions with the dedicated searches. Various signal models have been studied to understand this overlap, including both simplified models and full SUSY production models. The overlap between the signal regions presented here and other searches in ATLAS [10, 11, 13, 70] is typically 10–50 %, with similar overlaps in the data. The signal regions of this search access kinematic regions that are different from those of the standard searches. In simplified models in particular, the overlap between the dominant signal regions in the standard ATLAS analyses and the signal regions presented here is below 10–15 %. Thus, the regions of SUSY parameter space and kinematic phase space excluded by this search complement those excluded by earlier ATLAS searches using the same data sample.

In the control regions, the overlaps between this analysis and the others are much larger, as they all attempt to select dominant backgrounds with reasonable statistics. The fits that are performed in the various searches, however, look at different properties of the control regions to understand the agreement between data and MC simulation, and therefore the post-fit results may differ somewhat. The background treatments in this search and those previously published are similar enough to consider them correlated in control regions. However, the edges of kinematic phase space explored by the signal regions in these searches may suffer from different features or mis-modelings in MC event generators. Moreover, the treatment of systematic uncertainties and backgrounds varies somewhat between analyses, and because in a simultaneous fit the effects of these uncertainties are convolved, a combination of the various analyses discussed here is beyond the scope of this paper.

9 Summary

A search for supersymmetry including final states with zero, one, and two leptons, with and without b-tagged jets, in 4.7 fb−1 of \(\sqrt{s}=7\mbox{ TeV}\) pp collisions has been presented. Mutually exclusive signal regions exploiting these final states are combined with the use of variables that include both transverse and longitudinal event information. No significant excess of events beyond the Standard Model background expectation was observed in any signal region. Fiducial cross section upper limits on the production of new physics beyond the Standard Model are shown. Exclusion contours at 95 % CL are provided in SUSY-inspired simplified models and in the constrained minimal supersymmetric extension of the Standard Model.