1 Introduction

Supersymmetry (SUSY) [19] proposes the existence of new particles with spin differing by one half unit from that of their Standard Model (SM) partners. In the Minimal Supersymmetric Standard Model (MSSM) [1014], charginos, \(\tilde{\chi }_{1,2}^\pm \), and neutralinos, \(\tilde{\chi }_{1,2,3,4}^0\), are the mass-ordered eigenstates formed from the linear superposition of the SUSY partners of the Higgs and electroweak gauge bosons (higgsinos, winos and bino). In \(R\)-parity-conserving models, SUSY particles are pair-produced in colliders and the lightest SUSY particle (LSP) is stable. In many models the LSP is assumed to be a bino-like \(\tilde{\chi }_1^0\), which is weakly interacting. Naturalness arguments [15, 16] suggest that the lightest of the charginos and neutralinos may have masses at the electroweak scale, and may be accessible at the Large Hadron Collider (LHC) [17]. Furthermore, direct pair production of charginos and neutralinos may be the dominant production of supersymmetric particles if the superpartners of the gluon and quarks are heavier than a few TeV.

In SUSY scenarios where the masses of the pseudoscalar Higgs boson and the superpartners of the leptons are larger than those of the produced chargino and neutralino, the chargino decays to the lightest neutralino and the \(W\) boson, while the next-to-lightest neutralino decays to the lightest neutralino and the SM-like Higgs or \(Z\) boson. This paper focuses on SUSY scenarios where the decay to the Higgs boson is the dominant one. This happens when the mass splitting between the two lightest neutralinos is larger than the Higgs boson mass and the higgsinos are much heavier than the winos, causing the composition of the lightest chargino and next-to-lightest neutralino to be wino-like and nearly mass degenerate.

A simplified SUSY model [18, 19] is considered for the optimisation of the search and the interpretation of results. It describes the direct production of \(\tilde{\chi }_1^\pm \) and \(\tilde{\chi }_2^0\), where the masses and the decay modes of the relevant particles (\(\tilde{\chi }_1^\pm \), \(\tilde{\chi }_1^0\), \(\tilde{\chi }_2^0\)) are the only free parameters. It is assumed that the \(\tilde{\chi }_1^\pm \) and \(\tilde{\chi }_2^0\) are pure wino states and degenerate in mass, while the \(\tilde{\chi }_1^0\) is a pure bino state. The prompt decays \(\tilde{\chi }_1^\pm \rightarrow W^\pm \tilde{\chi }_1^0\) and \(\tilde{\chi }_2^0\rightarrow h\tilde{\chi }_1^0\) are assumed to have 100 % branching fractions. The Higgs boson mass is set to \(125{\mathrm {\ GeV}}\), which is consistent with the measured value [20], and its branching fractions are assumed to be the same as in the SM. The latter assumption is motivated by those SUSY models in which the mass of the pseudoscalar Higgs boson is much larger than the \(Z\) boson mass.

The search presented in this paper targets leptonic decays of the \(W\) boson and three Higgs boson decay modes as illustrated in Fig. 1. The Higgs boson decays into a pair of \(b\)-quarks, or a pair of photons, or a pair of \(W\) bosons where at least one of the bosons decays leptonically. The final states therefore contain missing transverse momentum from neutrinos and neutralinos, one lepton (\(\ell =e\) or \(\mu \)), and one of the following: two \(b\)-quarks (\(\ell {}bb\)), or two photons (\(\ell \gamma \gamma \)), or an additional lepton with the same electric charge (\(\ell ^{\pm }\ell ^{\pm }\)). The Higgs boson candidate can be fully reconstructed with the \(\ell {}bb\) and \(\ell \gamma \gamma \) signatures. The \(\ell ^{\pm }\ell ^{\pm }\) signature does not allow for such reconstruction and it is considered because of its small SM background. Its main signal contribution is due to \(h\rightarrow {}WW\), with smaller contributions from \(h\rightarrow {}ZZ\) and \(h\rightarrow {}\tau \tau \) when some of the visible decay products are missed during the event reconstruction.

Fig. 1
figure 1

Diagrams for the direct pair production of \(\tilde{\chi }_1^\pm \tilde{\chi }_2^0\) and the three decay modes studied in this paper. For the same-sign dilepton channel (c), only the dominant decay mode is shown. a One lepton and two \(b\)-quarks channel, b one lepton and two photons channel and c same-sign dilepton channel

The analysis is based on 20.3 \(\mathrm {fb}^{-1}\) of \(\sqrt{s}=8{\mathrm {\ TeV}}\) proton–proton collision data delivered by the LHC and recorded with the ATLAS detector. Previous searches for charginos and neutralinos at the LHC have been reported by the ATLAS [2123] and CMS [24, 25] collaborations. Similar searches were conducted at the Tevatron [26, 27] and LEP [2832].

The results of this paper are combined with those of the ATLAS search using the three-lepton and missing transverse momentum final state, performed with the same dataset [21]. The three-lepton selections may contain up to two hadronically decaying \(\tau {}\) leptons, providing sensitivity to the \(h\rightarrow \tau \tau /WW/ZZ\) Higgs boson decay modes. The statistical combination of the results is facilitated by the fact that all event selections were constructed not to overlap.

This paper is organised in the following way: the ATLAS detector is briefly described in Sect. 2, followed by a description of the Monte Carlo simulation in Sect. 3. In Sect. 4 the common aspects of the event reconstruction are illustrated; Sects. 5, 6, and 7 describe the channel-specific features; Sect. 8 discusses the systematic uncertainties; the results and conclusions are presented in Sects. 9 and 10.

2 The ATLAS detector

ATLAS is a multipurpose particle physics experiment [33]. It consists of detectors forming a forward-backward symmetric cylindrical geometry.Footnote 1 The inner detector (ID) covers \(|\eta |<2.5\) and consists of a silicon pixel detector, a semiconductor microstrip tracker, and a transition radiation tracker. The ID is surrounded by a thin superconducting solenoid providing a \(2\,\mathrm{T}\) axial magnetic field. A high-granularity lead/liquid-argon (LAr) sampling calorimeter measures the energy and the position of electromagnetic showers within \(|\eta |<3.2\). Sampling calorimeters with LAr are also used to measure hadronic showers in the endcap (1.5 \(<\) \(|\eta |\) \(<\) 3.2) and forward (3.1 \(<\) \(|\eta |\) \(<\) 4.9) regions, while a steel/scintillator tile calorimeter measures hadronic showers in the central region (\(|\eta |\) \(<\) 1.7). The muon spectrometer (MS) surrounds the calorimeters and consists of three large superconducting air-core toroid magnets, each with eight coils, precision tracking chambers (\(|\eta |\) \(<\) 2.7), and fast trigger chambers (\(|\eta |\) \(<\) 2.4). A three-level trigger system selects events to be recorded for permanent storage.

3 Monte Carlo simulation

Table 1 Simulated samples used for background estimates. “Tune” refers to the choice of parameters used for the underlying-event generation

The event generators, the accuracy of theoretical cross sections, the underlying-event parameter tunes, and the parton distribution function (PDF) sets used for simulating the SM background processes are summarised in Table 1.

The SUSY signal samples are produced with Herwig ++  [57] using the CTEQ6L1 PDF set. Signal cross sections are calculated at next-to-leading order (NLO) in the strong coupling constant using Prospino2  [58]. These agree with the NLO calculations matched to resummation at next-to-leading-logarithmic (NLL) accuracy within \(\sim \)2 % [59, 60]. For each cross section, the nominal value and its uncertainty are taken respectively from the centre and the spread of the cross-section predictions using different PDF sets and their associated uncertainties, as well as from variations of factorisation and renormalisation scales, as described in Ref. [61].

The propagation of particles through the ATLAS detector is modelled with GEANT4  [62] using the full ATLAS detector simulation [63] for all Monte Carlo (MC) simulated samples, except for \(t\bar{t}\) production and the SUSY signal samples in which the Higgs boson decays to two \(b\)-quarks, for which a fast simulation based on a parametric response of the electromagnetic and hadronic calorimeters is used [64]. The effect of multiple proton–proton collisions in the same or nearby beam bunch crossings (in-time or out-of-time pile-up) is incorporated into the simulation by overlaying additional minimum-bias events generated with Pythia6 onto hard-scatter events. Simulated events are weighted so that the distribution of the average number of interactions per bunch crossing matches that observed in data, but are otherwise reconstructed in the same manner as data.

4 Event reconstruction

The data sample considered in this analysis was collected with a combination of single-lepton, dilepton, and diphoton triggers. After applying beam, detector, and data-quality requirements, the dataset corresponds to an integrated luminosity of 20.3 \(\mathrm {fb}^{-1}\), with an uncertainty of 2.8 % derived following the methodology detailed in Ref. [65].

Vertices compatible with the proton-proton interactions are reconstructed using tracks from the ID. Events are analysed if the primary vertex has five or more tracks, each with transverse momentum \(p_\mathrm {T}>400{\mathrm {\ MeV}}\), unless stated otherwise. The primary vertex of an event is identified as the vertex with the largest \(\sum p_\mathrm {T}^2\) of the associated tracks.

Electron candidates are reconstructed from calibrated clustered energy deposits in the electromagnetic calorimeter and a matched ID track, which in turn determine the \(p_\mathrm {T}\) and \(\eta \) of the candidates respectively. Electrons must satisfy “medium” cut-based identification criteria, following Ref. [66], and are required to have \(p_\mathrm {T}>10{\mathrm {\ GeV}}\) and \(|\eta |<2.47\).

Muon candidates are reconstructed by combining tracks in the ID and tracks or segments in the MS [67] and are required to have \(p_\mathrm {T}>10{\mathrm {\ GeV}}\) and \(|\eta |<2.5\). To suppress cosmic-ray muon background, events are rejected if they contain a muon having transverse impact parameter with respect to the primary vertex \(|d_0|>0.2\) mm or longitudinal impact parameter with respect to the primary vertex \(|z_0|>1\) mm.

Photon candidates are reconstructed from clusters of energy deposits in the electromagnetic calorimeter. Clusters without matching tracks as well as those matching one or two tracks consistent with a photon conversion are considered. The shape of the cluster must match that expected for an electromagnetic shower, using criteria tuned for robustness under the pile-up conditions of 2012 [68]. The cluster energy is calibrated separately for converted and unconverted photon candidates using simulation. In addition, \(\eta \)-dependent correction factors determined from \(Z\rightarrow e^+e^-\) events are applied to the cluster energy, as described in Ref. [68]. The photon candidates must have \(p_\mathrm {T}>20{\mathrm {\ GeV}}\) and \(|\eta |<2.37\), excluding the transition region \(1.37<|\eta |<1.56\) between the central and endcap electromagnetic calorimeters. The tighter \(\eta \) requirement on photons, as compared to electrons, reflects the poorer photon resolution in the transition region and for \(2.37\le |\eta |<2.47\).

Jets are reconstructed with the anti-\(k_t\) algorithm [69] with a radius parameter of 0.4 using three-dimensional clusters of energy in the calorimeter [70] as input. The clusters are calibrated, weighting differently the energy deposits arising from the electromagnetic and hadronic components of the showers. The final jet energy calibration corrects the calorimeter response to the particle-level jet energy [71, 72]; the correction factors are obtained from simulation and then refined and validated using data. Corrections for in-time and out-of-time pile-up are also applied, as described in Ref. [73]. Events containing jets failing to meet the quality criteria described in Ref. [71] are rejected to suppress non-collision background and events with large noise in the calorimeters.

Jets with \(p_\mathrm {T}>20{\mathrm {\ GeV}}\) are considered in the central pseudorapidity (\(|\eta |<2.4\)) region, and jet \(p_\mathrm {T}>30{\mathrm {\ GeV}}\) is required in the forward (\(2.4<|\eta |<4.5\)) region. For central jets, the \(p_\mathrm {T}\) threshold is lower since it is possible to suppress pile-up using information from the ID, the “jet vertex fraction” (JVF). This is defined as the \(p_\mathrm {T}\)-weighted fraction of tracks within the jet that originate from the primary vertex of the event, and is \(-1\) if there are no tracks within the jet. Central jets can also be tagged as originating from bottom quarks (referred to as \(b\)-jets) using the MV1 multivariate \(b\)-tagging algorithm based on quantities related to impact parameters of tracks and reconstructed secondary vertices [74]. The efficiency of the \(b\)-tagging algorithm depends on the operating point chosen for each channel, and is reported in Sects. 5 and 7.

Hadronically decaying \(\tau \) leptons are reconstructed as 1- or 3-prong hadronic jets within \(|\eta |<2.47\), and are required to have \(p_\mathrm {T}>20{\mathrm {\ GeV}}\) after being calibrated to the \(\tau \) energy scale [75]. Final states with hadronically decaying \(\tau \) leptons are not considered here; however, identified \(\tau \) leptons are used in the overlap removal procedure described below, as well as to ensure that the same-sign lepton channel does not overlap with the three-lepton search [21] that is included in the combined result.

Potential ambiguities between candidate leptons, photons and jets are resolved by removing one or both objects if they are separated by \(\Delta {}R\equiv \sqrt{(\Delta \phi )^2+(\Delta \eta )^2}\) below a threshold. This process eliminates duplicate objects reconstructed from a single particle, and suppresses leptons and photons contained inside hadronic jets. The thresholds and the order in which overlapping objects are removed are summarised in Table 2. In the same-sign channel, \(e^+e^-\) and \(\mu ^+\mu ^-\) pairs with \(m_{\ell ^+\ell ^-}<12~{\mathrm {\ GeV}}{}\) are also removed. The remaining leptons and photons are referred to as “preselected” objects.

Table 2 Summary of the overlap removal procedure. Potential ambiguities are resolved by removing nearby objects in the indicated order, from top to bottom. Different \(\Delta {}R\) separation requirements are used in the three channels

Isolation criteria are applied to improve the purity of reconstructed objects. The criteria are based on the scalar sum of the transverse energies \(E_\mathrm {T}\) of the calorimeter cell clusters within a radius \(\Delta {}R\) of the object (\(E_\mathrm {T}^{\mathrm {cone}\Delta {}R}\)), and on the scalar sum of the \(p_\mathrm {T}\) of the tracks within \(\Delta {}R\) and associated with the primary vertex (\(p_\mathrm {T}^{\mathrm {cone}\Delta {}R}\)). The contribution due to the object itself is not included in either sum. The values used in the isolation criteria depend on the channel; they are specified in Sects. 5, 6 and 7.

The missing transverse momentum, \(\mathbf {p}_\mathrm {T}^\mathrm {\,miss}\) (with magnitude \(E_{\mathrm {T}}^{\mathrm {miss}}\)), is the negative vector sum of the transverse momenta of all preselected electrons, muons, and photons, as well as jets and calorimeter energy clusters with \(|\eta |\) \(<\) 4.9 not associated with these objects. Clusters that are associated with electrons, photons and jets are calibrated to the scale of the corresponding objects [76, 77].

The efficiencies for electrons, muons, and photons to satisfy the reconstruction and identification criteria are measured in control samples, and corrections are applied to the simulated samples to reproduce the efficiencies in data. Similar corrections are also applied to the trigger efficiencies, as well as to the jet \(b\)-tagging efficiency and misidentification probability.

5 One lepton and two \(b\)-jets channel

Table 3 Selection requirements for the signal, control and validation regions of the one lepton and two \(b\)-jets channel. The number of leptons, jets, and \(b\)-jets is labelled with \(n_\mathrm {lepton}\), \(n_\mathrm {jet}\), and \(n_{b-\mathrm {jet}}\)respectively

5.1 Event selection

The events considered in the one lepton and two \(b\)-jets channel are recorded with a combination of single-lepton triggers with a \(p_\mathrm {T}\) threshold of 24 GeV. To ensure that the event is triggered with a constant high efficiency, the offline event selection requires exactly one signal lepton (\(e\) or \(\mu \)) with \(p_\mathrm {T}>25{\mathrm {\ GeV}}\). The signal electrons must satisfy the “tight” identification criteria of Ref. [66], as well as \(|d_0|/\sigma _{d_0}<5\), where \(\sigma _{d_0}\) is the error on \(d_0\), and \(|z_0\sin \theta |<0.4\) mm. The signal muons must satisfy \(|\eta |<2.4\), \(|d_0|/\sigma _{d_0}<3\), and \(|z_0\sin \theta |<0.4\) mm. The signal electrons (muons) are required to satisfy the isolation criteria \(E_\mathrm {T}^{\mathrm {cone}0.3}/p_\mathrm {T}<0.18\) (0.12) and \(p_\mathrm {T}^{\mathrm {cone}0.3}/p_\mathrm {T}<0.16\) (0.12).

Events with two or three jets are selected, and the jets can be either central (\(|\eta |<2.4\)) or forward (\(2.4<|\eta |<4.9\)). Central jets have \(p_\mathrm {T}>25~{\mathrm {\ GeV}}\), and forward jets have \(p_\mathrm {T}>30~{\mathrm {\ GeV}}\). For central jets with \(p_\mathrm {T}<50{\mathrm {\ GeV}}\), the \(\mathrm {JVF}{}\) must be \(>0.5\). Events must contain exactly two \(b\)-jets and these must be the highest-\(p_\mathrm {T}\) central jets. The chosen operating point of the \(b\)-tagging algorithm identifies \(b\)-jets in simulated \(t\bar{t}\) events with an efficiency of 70 %; it misidentifies charm jets 20 % of the time and light-flavour (including gluon-induced) jets less than \(1\,\%\) of the time.

After the requirement of \(E_{\mathrm {T}}^{\mathrm {miss}}>100\) GeV, the dominant background contributions in the \(\ell {}bb\) channel are \(t\bar{t}\), \(W+\mathrm {jets}\), and single-top \(Wt\) production. Their contributions are suppressed using the kinematic selections described below, which define the two signal regions (SR) SR\(\ell bb\)-1 and SR\(\ell bb\)-2 summarised in Table 3.

The contransverse mass \(m_\mathrm {CT}\)  [78, 79] is defined as

$$\begin{aligned} m_\mathrm {CT} = \sqrt{(E_\mathrm {T}^{b_1}+E_\mathrm {T}^{b_2})^2 - |\mathbf {p}_\mathrm {T}^{b_1}-\mathbf {p}_\mathrm {T}^{b_2}|^2}, \end{aligned}$$
(1)

where \(E_\mathrm {T}^{b_i}\) and \(\mathbf {p}_\mathrm {T}^{b_i}\) are the transverse energy and momentum of the \(i\)th \(b\)-jet. The SM \(t\bar{t}\) background has an upper endpoint at \(m_\mathrm {CT} \) of approximately \(m_t\), and is efficiently suppressed by requiring \(m_\mathrm {CT} >160{\mathrm {\ GeV}}\).

The transverse mass \(m_\mathrm {T}^W\), describing \(W\) candidates in background events, is defined as

$$\begin{aligned} m_\mathrm {T}^W= \sqrt{2 E_\mathrm {T}^\ell E_{\mathrm {T}}^{\mathrm {miss}}- 2 \mathbf {p}_\mathrm {T}^\ell \cdot \mathbf {p}_\mathrm {T}^\mathrm {miss}}, \end{aligned}$$
(2)

where \(E_\mathrm {T}^\ell \) and \(\mathbf {p}_\mathrm {T}^\ell \) are the transverse energy and momentum of the lepton. Requiring \(m_\mathrm {T}^W>100{\mathrm {\ GeV}}\) efficiently suppresses the \(W\) \(+\) jets background. The two SRs are distinguished by requiring \(100<m_\mathrm {T}^W<130{\mathrm {\ GeV}}\) for SR\(\ell bb\)-1 and \(m_\mathrm {T}^W>130{\mathrm {\ GeV}}\) for SR\(\ell bb\)-2. The first signal region provides sensitivity to signal models with a mass splitting between \(\tilde{\chi }_1^0\) and \(\tilde{\chi }_2^0\) similar to the Higgs boson mass, while the second one targets larger mass splittings.

In each SR, events are classified into five bins of the invariant mass \(m_{bb}\) of the two \(b\)-jets as 45–75–105–135–165–195 GeV. In the SRs, about 70 % of the signal events due to \(h\rightarrow b\bar{b}\) populate the central bin of 105–135 GeV. The other four bins (sidebands) are used to constrain the background normalisation, as described below.

Fig. 2
figure 2

Distributions of contransverse mass \(m_\mathrm {CT}\), transverse mass of the \(W\)-candidate \(m_\mathrm {T}^W\), number of \(b\)-jets, and invariant mass of the \(b\)-jets \(m_{bb}\) for the one lepton and two \(b\)-jets channel in the indicated regions. The stacked background histograms are obtained from the background-only fit. The hashed areas represent the total uncertainties on the background estimates after the fit. The rightmost bins in ad include overflow. The distributions of a signal hypothesis are also shown without stacking on the background histograms. The vertical arrows indicate the boundaries of the signal regions. The lower panels show the ratio of the data to the SM background prediction. a \(m_\mathrm {CT}\) in CR\(\ell bb\)-T, SR\(\ell bb\)-1 and SR\(\ell bb\)-2, central \(m_{bb}\) bin, b \(m_\mathrm {CT}\) in CR\(\ell bb\)-T, SR\(\ell bb\)-1 and SR\(\ell bb\)-2, \(m_{bb}\) sidebands, c \(m_\mathrm {T}^W\) in VR\(\ell bb\)-2, SR\(\ell bb\)-1 and SR\(\ell bb\)-2, central \(m_{bb}\) bin, d \(m_\mathrm {T}^W\) in VR\(\ell bb\)-2, SR\(\ell bb\)-1 and SR\(\ell bb\)-2, \(m_{bb}\) sidebands, e number of \(b\)-jets in SR\(\ell bb\)-1 and SR\(\ell bb\)-2 without the \(b\)-jet multiplicity requirement, central \(m_{bb}\) bin, f \(m_{bb}\) in SR\(\ell bb\)-1 and SR\(\ell bb\)-2

Table 4 Event yields and SM expectation in the one lepton and two \(b\)-jets channel obtained with the background-only fit. “Other” includes \(Z\) \(+\) jets, \(WW\), \(WZ\), \(ZZ\), \(Zh\) and \(Wh\) processes. The errors shown include statistical and systematic uncertainties

5.2 Background estimation

The contributions from the \(t\bar{t}\) and \(W+\mathrm {jets}\) background sources are estimated from simulation, and normalised to data in dedicated control regions defined in the following paragraphs. The contribution from multi-jet production, where the signal lepton is a misidentified jet or comes from a heavy-flavour hadron decay or photon conversion, is estimated using the “matrix method” described in Ref. [22], and is found to be less than 3 % of the total background in all regions and is thus neglected. The remaining sources of background (single top, \(Z\) \(+\) jets, \(WW\), \(WZ\), \(ZZ\), \(Zh\) and \(Wh\) production) are estimated from simulation.

Two control regions (CR), CR\(\ell bb\)-T and CR\(\ell bb\)-W, are designed to constrain the normalisations of the \(t\bar{t}\) and \(W+\mathrm {jets}\) backgrounds respectively. The acceptance for \(t\bar{t}\) events is increased in CR\(\ell bb\)-T by modifying the requirement on \(m_\mathrm {CT}\) to \(100<m_\mathrm {CT} <160{\mathrm {\ GeV}}\). The acceptance of \(W+\mathrm {jets}\) events is increased in CR\(\ell bb\)-W by requiring \(m_\mathrm {T}^W{}>40{\mathrm {\ GeV}}\) and exactly two jets, of which only one is \(b\)-tagged. These two control regions are summarised in Table 3. The control regions are defined to be similar to the signal regions in order to reduce systematic uncertainties on the extrapolation to the signal regions; at the same time they are dominated by the targeted background processes and the expected contamination by signal is small.

As in the signal regions, the control regions are binned in \(m_{bb}\) (\(m_{bj}\) in the case of CR\(\ell bb\)-W). A “background-only” likelihood fit is performed, in which the predictions of the simulated background processes without any signal hypothesis are fit simultaneously to the data yields in eight \(m_{bb}\) sideband bins of the SRs and the ten \(m_{bb}\) bins of the CRs. This fit, as well as the limit-setting procedure, is performed using the HistFitter package described in Ref. [80]. The two free parameters of the fit, namely the normalisations of the \(t\bar{t}\) and \(W+\mathrm {jets}\) background components, are constrained by the number of events observed in the control regions and signal region sidebands, where the number of events is described by a Poisson probability density function. The remaining nuisance parameters correspond to the sources of systematic uncertainty described in Sect. 8. They are taken into account with their uncertainties, and adjusted to maximise the likelihood. The yields estimated with the background-only fit are reported in Table 4, as well as the resulting predictions in SR\(\ell bb\)-1 and SR\(\ell bb\)-2 for \(105<m_{bb} <135\) GeV. While CR\(\ell bb\)-T is dominated by \(t\bar{t}\) events, CR\(\ell bb\)-W is populated evenly by \(t\bar{t}\) and \(W+\mathrm {jets}\) events, which causes the normalisations of the \(t\bar{t}\) and \(W+\mathrm {jets}\) contributions to be negatively correlated after the fit. As a result, the uncertainties on individual background sources do not add up quadratically to the uncertainty on the total SM expectation. The normalisation factors are found to be \(1.03\pm 0.15\) for \(t\bar{t}\) and \(0.79\pm 0.07\) for \(W+\mathrm {jets}\), where the errors include statistical and systematic uncertainties.

To validate the background modelling, two validation regions (VR) are defined similarly to the SRs except for requiring \(40<m_\mathrm {T}^W<100{\mathrm {\ GeV}}\), and requiring \(100<m_\mathrm {CT} <160{\mathrm {\ GeV}}\) for VR\(\ell bb\)-1 and \(m_\mathrm {CT} >160{\mathrm {\ GeV}}\) for VR\(\ell bb\)-2 as summarised in Table 3. The yields in the VRs are shown in Table 4 after the background-only fit, which does not use the data in the VRs to constrain the background. The data event yields are found to be consistent with background expectations. Figure 2 shows the data distributions of \(m_\mathrm {CT}\), \(m_\mathrm {T}^W\), \(n_{b-\mathrm {jet}}\) and \(m_{bb}\) compared to the SM expectations in various regions. The data agree well with the SM expectations in all distributions.

6 One lepton and two photons channel

6.1 Event selection

Events recorded with diphoton or single-lepton triggers are used in the one lepton and two photons channel. For the diphoton trigger, the transverse momentum thresholds at trigger level for the highest-\(p_\mathrm {T}\) (leading) and second highest-\(p_\mathrm {T}\) (sub-leading) photons are \(35{\mathrm {\ GeV}}\) and \(25{\mathrm {\ GeV}}\) respectively. For these events, the event selection requires exactly one signal lepton (\(e\) or \(\mu \)) and exactly two signal photons, with \(p_\mathrm {T}\) thresholds of \(15{\mathrm {\ GeV}}\) for electrons, \(10{\mathrm {\ GeV}}\) for muons, and 40 (27) GeV for leading (sub-leading) photons. In addition, events recorded with single-lepton triggers, which have transverse momentum thresholds at trigger level of 24 GeV, are used. For these events, the selection requires \(p_\mathrm {T}\) thresholds of \(25{\mathrm {\ GeV}}\) for electrons and muons, and 40 (20) GeV for leading (sub-leading) photons.

Table 5 Selection requirements for the signal and validation regions of the one lepton and two photons channel. The number of leptons and photons is labelled with \(n_\mathrm {lepton}\) and \(n_{\gamma }\) respectively
Fig. 3
figure 3

Distributions of missing transverse momentum \(E_{\mathrm {T}}^{\mathrm {miss}}\), azimuth difference between the \(W\) and Higgs boson candidates \(\Delta \phi (W,h)\), transverse mass of the \(W\) and photon system \(m_{\mathrm {T}}^{{W\!\gamma _1}}\) and \(m_{\mathrm {T}}^{{W\!\gamma _2}}\) in the one lepton and two photons signal regions for the Higgs-mass window (\(120<m_{\gamma \gamma }<130{\mathrm {\ GeV}}\)). The vertical arrows indicate the boundaries of the signal regions. The filled and hashed areas represent the stacked histograms of the simulation-based background cross check and the total uncertainties. The contributions from non-Higgs backgrounds are scaled by 10 GeV/50 GeV \(=\) 0.2 from the \(m_{\gamma \gamma }\) sideband (\(100<m_{\gamma \gamma }<120{\mathrm {\ GeV}}\) and \(130<m_{\gamma \gamma }<160{\mathrm {\ GeV}}\)) into the Higgs-mass window. The rightmost bins in a, c, and d include overflow. Scaled data in the sideband are shown as squares, while events in the Higgs-mass window are shown as circles. The distributions of a signal hypothesis are also shown without stacking on the background histograms. a \(E_{\mathrm {T}}^{\mathrm {miss}}\) in SR\(\ell \gamma \gamma \)-1 and SR\(\ell \gamma \gamma \)-2 without \(E_{\mathrm {T}}^{\mathrm {miss}}\) cut, b \(\Delta \phi (W,h)\) in SR\(\ell \gamma \gamma \)-1 and SR\(\ell \gamma \gamma \)-2 without \(\Delta \phi (W,h)\) cut, c \(m_{\mathrm {T}}^{{W\!\gamma _1}}\) in SR\(\ell \gamma \gamma \)-1 and SR\(\ell \gamma \gamma \)-2 without \(m_{\mathrm {T}}^{{W\!\gamma _i}}\) cuts, d \(m_{\mathrm {T}}^{{W\!\gamma _2}}\) in SR\(\ell \gamma \gamma \)-1 and SR\(\ell \gamma \gamma \)-2 without \(m_{\mathrm {T}}^{{W\!\gamma _i}}\) cuts

Fig. 4
figure 4

Results of the background-only fit to the diphoton invariant mass, \(m_{\gamma \gamma }\), distribution in the one lepton and two photons signal and validation regions. The contributions from SM Higgs boson production are constrained to the MC prediction and associated systematic uncertainties. The band shows the systematic uncertainty on the fit. The fit is performed on events with 100 GeV \(< m_{\gamma \gamma }<\) 160 GeV, with events in SR\(\ell \gamma \gamma \)-1 or SR\(\ell \gamma \gamma \)-2 in the Higgs-mass window (120 GeV \(\le m_{\gamma \gamma }\le \) 130 GeV), indicated by the arrows, excluded from the fit. a SR\(\ell \gamma \gamma \)-1, b SR\(\ell \gamma \gamma \)-2, c VR\(\ell \gamma \gamma \)-1, d VR\(\ell \gamma \gamma \)-2

In this channel, a neural network algorithm, based on the momenta of the tracks associated with each vertex and the direction of flight of the photons, is used to select the primary vertex, similarly to the ATLAS SM \(h\rightarrow \gamma \gamma \) analysis described in Ref. [81]. Signal muons must satisfy \(|d_0|<1\) mm and \(|z_0|<10\) mm. The isolation criteria for both the electrons and muons are \(E_\mathrm {T}^{\mathrm {cone}0.4}/p_\mathrm {T}<0.2\) and \(p_\mathrm {T}^{\mathrm {cone}0.2}/p_\mathrm {T}<0.15\). Signal photons are required to satisfy \(E_\mathrm {T}^{\mathrm {cone}0.4}<6{\mathrm {\ GeV}}\) and \(p_\mathrm {T}^{\mathrm {cone}0.2}<2.6{\mathrm {\ GeV}}\).

The two largest background contributions are due to multi-jet and \(Z\gamma \) production, with leptons or jets misreconstructed as photons. These background contributions are suppressed by requiring \(E_{\mathrm {T}}^{\mathrm {miss}}>40{\mathrm {\ GeV}}\).

The \(\mathbf {p}_\mathrm {T}\) of the \(W\rightarrow \ell \nu \) system, reconstructed assuming background events with neutrino \(\mathbf {p}_\mathrm {T}=\mathbf {p}_\mathrm {T}^\mathrm {\,miss}\), is required to be back-to-back with the \(\mathbf {p}_\mathrm {T}\) of the \(h\rightarrow \gamma \gamma \) candidate (\(\Delta \phi (W,h)>2.25\)). Only events with a diphoton invariant mass, \(m_{\gamma \gamma }\), between 100 and 160 GeV are considered. Events in the sideband, outside the Higgs-mass window between 120 and 130 GeV, are included to constrain the non-Higgs background as described in Sect. 6.2.

Selected events are split into two SRs with different expected signal sensitivities based on two variables \(m_{\mathrm {T}}^{{W\!\gamma _1}}\) and \(m_{\mathrm {T}}^{{W\!\gamma _2}}\), which are defined as

$$\begin{aligned} m_\mathrm {T}^{{W\!\gamma _i}} = \sqrt{ (m_\mathrm {T}^W)^2 + 2 E_\mathrm {T}^W E_\mathrm {T}^{\gamma _i} - 2 \mathbf {p}_\mathrm {T}^W\cdot \mathbf {p}_\mathrm {T}^{\gamma _i}}, \end{aligned}$$
(3)

where \(m_\mathrm {T}^W\), \(E_\mathrm {T}^W\) and \(\mathbf {p}_\mathrm {T}^W\) are the transverse mass, energy and momentum of the \(W\) candidate, and \(E_\mathrm {T}^{\gamma _i}\) and \(\mathbf {p}_\mathrm {T}^{\gamma _i}\) are the transverse energy and momentum of the \(i\)th, \(p_\mathrm {T}\)-ordered, photon. Including a photon in the transverse mass calculation provides a means to identify leptonically decaying \(W\) bosons in the presence of a final-state radiation photon. Events with \(m_{\mathrm {T}}^{{W\!\gamma _1}}>150{\mathrm {\ GeV}}\) and \(m_{\mathrm {T}}^{{W\!\gamma _2}}>80{\mathrm {\ GeV}}\) are classified into SR\(\ell \gamma \gamma \)-1, and those with either \(m_{\mathrm {T}}^{{W\!\gamma _1}}<150{\mathrm {\ GeV}}\) or \(m_{\mathrm {T}}^{{W\!\gamma _2}}<80{\mathrm {\ GeV}}\) into SR\(\ell \gamma \gamma \)-2. Most of the sensitivity to the signal is provided by SR\(\ell \gamma \gamma \)-1, while SR\(\ell \gamma \gamma \)-2 assists in constraining systematic uncertainties.

Two overlapping validation regions are defined by inverting and modifying the \(E_{\mathrm {T}}^{\mathrm {miss}}\) and \(\Delta \phi (W,h)\) criteria relative to those of the signal regions. The first region VR\(\ell \gamma \gamma \)-1 requires \(E_{\mathrm {T}}^{\mathrm {miss}}<40{\mathrm {\ GeV}}\) and has no requirement on \(\Delta \phi (W,h)\), and the second region VR\(\ell \gamma \gamma \)-2 requires \(\Delta \phi (W,h)<2.25\) and has no requirement on \(E_{\mathrm {T}}^{\mathrm {miss}}\). The signal and validation regions are summarised in Table 5.

Table 6 Event yields and SM expectation in the Higgs-mass window of the lepton plus two photon channel (\(120<m_{\gamma \gamma }<130{\mathrm {\ GeV}}\)) after the background-only fit. The Higgs-mass window is excluded from the fit in the two signal regions. The errors shown include statistical and systematic uncertainties

Distributions in the Higgs-mass window of the four kinematic variables used to define the SRs are shown in Fig. 3. For illustration purposes, the observed yield in the sideband region is shown for each distribution, scaled into the corresponding Higgs-mass window by the relative widths of the Higgs-mass window and the sideband region, 10 GeV/50 GeV \(=\) 0.2. Also shown, for each distribution, is a simulation-based cross-check of the background estimate. To reduce statistical uncertainties originating from the limited number of simulated events, the non-Higgs contributions are obtained in the sideband and scaled into the Higgs-mass window by 0.2. The simulation-based prediction of the non-Higgs background is estimated from the W/Z(\(\gamma ,\gamma \gamma \)) \(+\) jets samples, after applying a data-driven correction for the probability of electrons or jets to be reconstructed as photons. The contribution from backgrounds with jets reconstructed as leptons is determined by using the “fake factor” method described in Ref. [82]. This simulation-based background estimate is only used as a cross-check of the sideband-data-based background estimate described above. It gives results consistent with the data estimate, but it is not used for limit setting.

6.2 Background estimation

The contribution from background sources that do not contain a \(h\rightarrow \gamma \gamma \) decay can be statistically separated by a template fit to the full \(m_{\gamma \gamma }\) distribution, from 100 to 160 GeV. The approach followed is similar to the one in Ref. [81]: the non-Higgs background is modelled as \(\exp (-\alpha m_{\gamma \gamma })\), with the constant \(\alpha \) as a free, positive parameter in the fit. Alternative functional models are used to evaluate the systematic uncertainty due to the choice of background modelling function. The \(h\rightarrow \gamma \gamma \) template, used for the Higgs background and signal, is formed by the sum of a Crystal Ball function [83] for the core of the distribution and a Gaussian function for the tails. This functional form follows the one used in the SM \(h\rightarrow \gamma \gamma \) analysis [81], with the nominal values and uncertainties on the fit parameters determined by fits to the simulation in SR\(\ell \gamma \gamma \)-1 and SR\(\ell \gamma \gamma \)-2. The results of the fit to the simulation are used as an external constraint on the template during the fit to data. The width of the Gaussian core of the Crystal Ball function quantifies the detector resolution and is determined in simulation to be 1.7 GeV in SR\(\ell \gamma \gamma \)-1 and 1.8 GeV in SR\(\ell \gamma \gamma \)-2. This is comparable to the resolution found in the SM \(h\rightarrow \gamma \gamma \) analysis [81].

Contributions from SM processes with a real Higgs boson decay are estimated by simulation and come primarily from \(Wh\) associated production, with smaller amounts from \(t\bar{t}h\) and \(Zh\). The contributions from SM Higgs boson production via gluon fusion or vector boson fusion are found to be negligible. Systematic uncertainties on the yields of these SM processes are discussed in Sect. 8. Figure 4 shows the background-only fits to the observed \(m_{\gamma \gamma }\) distributions in the signal and validation regions, with the signal region Higgs-mass window (\(120<m_{\gamma \gamma }<130{\mathrm {\ GeV}}\)) excluded from the fit. Table 6 summarises the observed event yields in the Higgs-mass window and the background estimates, from the background-only fits, in the signal and validation regions. The errors are dominated by the statistical uncertainty due to the number of events in the \(m_{\gamma \gamma }\) sidebands.

7 Same-sign dilepton channel

7.1 Event selection

Events recorded with a combination of dilepton triggers are used in the same-sign dilepton channel. The \(p_\mathrm {T}\) thresholds of the dilepton triggers depend on the flavour of the leptons. The triggers reach their maximum efficiency at \(p_\mathrm {T}\) values of about \(14\)\(25\) GeV for the leading lepton and \(8\)\(14\) GeV for the sub-leading lepton.

Table 7 Selection requirements for the signal regions of the same-sign dilepton channel

The offline event selection requires two same-sign signal leptons (\(ee\), \(e\mu \) or \(\mu \mu \)) with \(p_\mathrm {T}>30{\mathrm {\ GeV}}\) or \(20{\mathrm {\ GeV}}\) as shown in Table 7 and no additional preselected lepton. The signal electrons must satisfy the “tight” identification criteria from Ref. [66], \(|d_0|/\sigma _{d_0}<3\), and \(|z_0\sin \theta {}|<0.4\) mm. The signal muons must satisfy \(|\eta |<2.4\), \(|d_0|/\sigma _{d_0}<3\), and \(|z_0\sin \theta {}|<1\) mm. The isolation criteria for electrons (muons) are \(E_\mathrm {T}^{\mathrm {cone}0.3}/\) \(\min (p_\mathrm {T},60{\mathrm {\ GeV}})\) \(<0.13\) (0.14) and \(p_\mathrm {T}^{\mathrm {cone}0.3}/\) \(\min (p_\mathrm {T},60{\mathrm {\ GeV}})\) \(<0.07\) (0.06). Events containing a hadronically decaying preselected \(\tau \) lepton are rejected in order to avoid statistical overlap with the three-lepton final states [21].

Events are required to contain one, two, or three central (\(|\eta |<2.4\)) jets with \(p_\mathrm {T}>20{\mathrm {\ GeV}}\). If a central jet has \(p_\mathrm {T}{}<50{\mathrm {\ GeV}}\) and has tracks associated to it, at least one of the tracks must originate from the event primary vertex. To reduce background contributions with heavy-flavour decays, all the jets must fail to meet the \(b\)-tagging criterion at the 80 % efficiency operating point. There must be no forward (\(2.4<|\eta |<4.9\)) jet with \(p_\mathrm {T}>30{\mathrm {\ GeV}}\).

The dominant background contributions in the \(\ell ^{\pm }\ell ^{\pm }\) channel are due to SM diboson production (\(WZ\) and \(ZZ\)) leading to two “prompt” leptons and due to events with “non-prompt” leptons (heavy-flavour decays, photon conversions and misidentified jets). These background contributions are suppressed with the tight identification criteria described above, and with the kinematic requirements summarised in Table 7. The requirements were optimised separately for each lepton flavour combination (\(ee\), \(\mu \mu \), and \(e\mu \)), and for different numbers of reconstructed jets, leading to six signal regions.

The dilepton invariant mass \(m_{\ell \ell }\) is required to differ by at least 10 GeV from the \(Z\)-boson mass for the \(ee\) channel, in which contamination due to electron charge misidentification is significant.

The visible mass of the Higgs boson candidate is defined for the one jet signal regions as the invariant mass (\(m_{\ell j}\)) of the jet and the lepton that is closest to it in terms of \(\Delta R\), and for the two or three jet signal regions as the invariant mass (\(m_{\ell jj}\)) of the two highest-\(p_\mathrm {T}\) jets and the lepton that is closest to the dijet system. In the signal regions, \(m_{\ell j}{}<90\) GeV is required for SR\(\ell \ell \)-1 and \(m_{\ell jj}{}<120\) GeV for SR\(\ell \ell \)-2.

Fig. 5
figure 5

Distribution of effective mass \(m_\mathrm {eff}\) in the validation region of the same-sign \(e\mu \) channel. This validation region is defined by requiring one, two, or three jets, and reversing the \(m_{\ell j}\), \(m_{\ell jj}\) criteria. The hashed areas represent the total uncertainties on the background estimates that are depicted with stacked histograms. The distribution of a signal hypothesis is also shown without stacking on the background histograms. The lower panel shows the ratio of the data to the SM background prediction

Fig. 6
figure 6

Distributions of effective mass \(m_\mathrm {eff}\), largest transverse mass \(m_\mathrm {T}^\mathrm {max}\), invariant mass of lepton and jets \(m_{\ell j}\) and \(m_{\ell jj}\) for the same-sign dilepton channel in the signal regions with one jet (left) and two or three jets (right). SR\(\ell \ell \)-1 is the sum of SR\(ee\)-1, SR\(e\mu \)-1, and SR\(\mu \mu \)-1; SR\(\ell \ell \)-2 is the sum of SR\(ee\)-2, SR\(e\mu \)-2, and SR\(\mu \mu \)-2. All selection criteria are applied, except for the one on the variable being shown. The vertical arrows indicate the boundaries of the signal regions, which may not apply to all flavour channels. The hashed areas represent the total uncertainties on the background estimates that are depicted with stacked histograms. The distributions of a signal hypothesis are also shown without stacking on the background histograms. The lower panels show the ratio between data and the SM background prediction. The rightmost bins of each distribution include overflow. a \(m_\mathrm {eff}\) in SR\(\ell \ell \)-1 without \(m_\mathrm {eff}\) cut, b \(m_\mathrm {eff}\) in SR\(\ell \ell \)-2 without \(m_\mathrm {eff}\) cut, c \(m_\mathrm {T}^\mathrm {max}\) in SR\(\ell \ell \)-1 without \(m_\mathrm {T}^\mathrm {max}\) cut, d \(m_\mathrm {T}^\mathrm {max}\) in SR\(\ell \ell \)-2 without \(m_\mathrm {T}^\mathrm {max}\) cut, e \(m_{\ell j}\) in SR\(\ell \ell \)-1 without \(m_{\ell j}\) cut, f \(m_{\ell jj}\) in SR\(\ell \ell \)-2 without \(m_{\ell jj}\) cut

Table 8 Event yields and SM expectation in the same-sign dilepton channel signal regions. The \(WW\) background includes both \(W^{\pm }W^{\pm }\) and \(W^{\pm }W^{\mp }\) production, the latter due to electron charge mis-measurement. “Other” background includes \(t\bar{t}\), single top, \(Z+\mathrm {jets}\), \(Zh\) and \(Wh\) production. The errors shown include statistical and systematic uncertainties

Depending on the final state, additional kinematic variables are used to further reduce the background. Requiring the pseudorapidity difference between the two leptons \(\Delta \eta _{\ell \ell }<1.5\) decreases the \(WZ\) and \(ZZ\) background. Requirements on \(E_{\mathrm {T}}^{\mathrm {miss,rel}}\), defined as

$$\begin{aligned} E_{\mathrm {T}}^{\mathrm {miss,rel}}= \left\{ \begin{array}{ll} E_{\mathrm {T}}^{\mathrm {miss}}&{}\quad \mathrm {if}\,\,\ \Delta \phi >\pi /2, \\ E_{\mathrm {T}}^{\mathrm {miss}}\sin (\Delta \phi ) &{} \quad \mathrm {if}\,\,\ \Delta \phi <\pi /2, \\ \end{array} \right. \end{aligned}$$
(4)

where \(\Delta \phi \) is the azimuthal angle difference between \(\mathbf {p}_\mathrm {T}^\mathrm {miss}\) and the nearest lepton or jet, reduce the \(Z\) \(+\) jets and non-prompt lepton background in the \(ee\) channel. The \(E_{\mathrm {T}}^{\mathrm {miss,rel}}\) is defined so as to reduce the impact on \(E_{\mathrm {T}}^{\mathrm {miss}}\) of any potential mismeasurement, either from jets or from leptons. The scalar sum \(m_\mathrm {eff}\) of the transverse momenta of the leptons, jets and the missing transverse momentum is used to suppress the diboson background. Requiring \(m_\mathrm {T}^\mathrm {max}>110{\mathrm {\ GeV}}\), where \(m_\mathrm {T}^\mathrm {max}\) is the larger of the two \(m_\mathrm {T}^W\) values computed with one of the leptons and the missing transverse momentum, suppresses background events with one leptonically decaying \(W\) boson, whose transverse mass distribution has an endpoint at \(m_W\).

To test the non-prompt lepton and charge mismeasurement backgrounds, validation regions are defined by applying only the number of jets \(n_\mathrm {jet}\) and lepton \(p_\mathrm {T}\) requirements from Table 7 and requiring \(m_{\ell j}>90{\mathrm {\ GeV}}\) or \(m_{\ell jj}>120{\mathrm {\ GeV}}\).

7.2 Background estimation

The irreducible background in the same-sign dilepton channel is dominated by \(WZ\) and \(ZZ\) diboson production, in which both vector bosons decay leptonically and one or two leptons do not satisfy the selection requirements, mostly the kinematic ones. These contributions are estimated from the simulation.

Background contributions due to non-prompt leptons are estimated with the matrix method described in Ref. [22]. It takes advantage of the difference between the efficiencies for prompt and non-prompt leptons, defined as the fractions of prompt and non-prompt preselected leptons respectively, that pass the signal-lepton requirements. The number of events containing non-prompt leptons is obtained from these efficiencies and the observed number of events using four categories of selection with preselected or signal leptons. The efficiencies for prompt and non-prompt leptons are derived, as a function of \(p_\mathrm {T}\) and \(\eta \), for each process leading to either prompt or non-prompt leptons using the generator-level information from simulated events. They are then corrected for potential differences between simulation and data with correction factors measured in control regions, as described in Ref. [22]. The contributions from each process leading to either prompt or non-prompt leptons are then used to compute a weighted-average efficiency, where the weight for each process is determined as its relative contribution to the number of preselected leptons in the region of interest.

Same-sign background events where the lepton charge is mismeasured are usually due to a hard bremsstrahlung photon with subsequent asymmetric pair production. The charge mismeasurement probability, which is negligible for muons, is measured in data as a function of electron \(p_\mathrm {T}\) and \(|\eta |\) using \(Z\rightarrow e^+e^-\) events where the two electrons are reconstructed with the same charge. The probability, which is below \(1\,\%\) for most of the \(p_\mathrm {T}\) and \(\eta \) values, is then applied to the simulated opposite-sign \(ee\) and \(e\mu \) pairs to estimate this background [84]. Although any process with the \(e^{\pm }e^{\mp }\) or \(e^{\pm }\mu ^{\mp }\) final state can mimic the same-sign signature with charge mismeasurement, most of this background contribution is due to the production of \(Z+\mathrm {jets}\) events, amounting to less than \(10\,\%\) of the background yield in each of the \(\ell ^{\pm }\ell ^{\pm }\) signal regions.

Estimates of non-prompt lepton and charge mismeasurement background are tested in the validation regions; the number of observed events agrees with the expected background in all validation regions. Figure 5 shows the distribution of \(m_\mathrm {eff}\) in the validation region of the same-sign \(e\mu \) channel.

The number of observed and expected events in each signal region is reported in Table 8. Figure 6 shows the data distributions of \(m_\mathrm {eff}\), \(m_\mathrm {T}^\mathrm {max}\), \(m_{\ell j}\), and \(m_{\ell jj}\) compared to the SM expectations in the same-sign dilepton signal regions. No significant excess is observed over the SM background expectations in any channel.

8 Systematic uncertainties

Table 9 Summary of the statistical and main systematic uncertainties on the background estimates, expressed in per cent of the total background yields in each signal region. Uncertainties that are not considered for a particular channel are indicated by a “–”. The individual uncertainties can be correlated, and do not necessarily add in quadrature to the total background uncertainty

Table 9 summarises the dominant systematic uncertainties on the total expected background yields in the six signal regions.

For the one lepton and two \(b\)-jets channel, theoretical uncertainties on the \(t\bar{t}\) and single-top background estimates are the most important. They are evaluated by comparing different generators (Powheg, MC@NLO  [85, 86] and AcerMC) and parton shower algorithms (Pythia6 and Herwig  [87, 88]), varying the QCD factorisation and renormalisation scales up and down by a factor of two, and taking the envelope of the background variations when using different PDF sets. Statistical uncertainties from the data in the CRs result in uncertainties on the normalisations of the \(t\bar{t}\) and \(W+\mathrm {jets}\) backgrounds, while the limited number of simulated events yields uncertainty on the shape of the background \(m_{bb}\) distributions. The largest experimental systematic uncertainties are those on the jet energy scale [72] and resolution [89], derived from a combination of test-beam data and in-situ measurements, followed by the uncertainty on the \(b\)-jet identification efficiency [90]. The uncertainty on the \(W\) boson background modelling is dominated by the uncertainty on the cross section for the production of the \(W\) boson in association with heavy-flavour jets, and is reported within the “Other sources”. The \(W\) boson background component is small in \(\ell {}bb\) SRs, and its uncertainty is constrained by the CRs with a similar composition.

For the one lepton and two photons channel, the background uncertainties are dominated by the data statistics in the \(m_{\gamma \gamma }\) sidebands. The only source of systematic uncertainty on the non-Higgs background estimate is the choice of \(m_{\gamma \gamma }\) model. The systematic uncertainties on the Higgs background estimates are dominated by the theoretical uncertainties on the \(Wh\), \(Zh\), and \(t\bar{t}h\) production cross sections and the photon reconstruction. The main theoretical uncertainties are those on the QCD scales and the parton distribution functions [55]. The effect of scale uncertainties on the modelling of Higgs boson production is evaluated by reweighting the simulated Higgs boson \(p_\mathrm {T}\) distribution to account for doubling and halving the scales. The experimental systematic uncertainty from photon reconstruction is determined with the tag-and-probe method using radiative \(Z\) decays [91].

For the same-sign dilepton channel, the two main sources of systematic uncertainty are related to the non-prompt lepton estimate, and to the modelling of the \(WZ\) background. The uncertainty on the non-prompt estimate originates mainly from the limited accuracy of the efficiency correction factors, and on the production rate of non-prompt leptons, in particular their \(\eta \) dependence. The uncertainty on the \(WZ\) background modelling is determined using a same-sign, \(WZ\)-enriched sample used to validate the Sherpa prediction. This validation sample is selected by requiring three leptons, two of which must have same flavour, opposite sign, \(|m_{\ell \ell }{}-m_{Z}|<10~{\mathrm {\ GeV}}{}\), and then considering only the highest-\(p_\mathrm {T}\) same-sign pair. None of the other requirements from Table 7 are applied, except for the lepton \(p_\mathrm {T}\) and \(n_\mathrm {jet}\) selections.

9 Results and interpretations

The event yields observed in data are consistent with the Standard Model expectations within uncertainties in all signal regions. The results are used to set exclusion limits with the frequentist hypothesis test based on the profile log-likelihood-ratio test statistic and approximated with asymptotic formulae [92].

Table 10 From left to right, observed 95 % CL upper limits (\(\langle \sigma _\mathrm{vis}\rangle _{\mathrm {obs}}^{95}\)) on the visible cross sections, the observed (\(S_\mathrm {obs}^{95}\)) and expected (\(S_\mathrm {exp}^{95}\)) 95 % CL upper limits on the number of signal events with \(\pm 1\sigma \) excursions of the expectation, the observed confidence level of the background-only hypothesis (\(CL_B\)), and the discovery \(p\)-value (\(p_0\)), truncated at \(0.5\)
Fig. 7
figure 7

Observed (solid line) and expected (dashed line) 95 % CL upper limits on the cross section normalised by the simplified model prediction as a function of the common mass \(m_{\tilde{\chi }_1^\pm \tilde{\chi }_2^0}\) for \(m_{{\tilde{\chi }_1^0}}=0\). The combination in d is obtained using the result from the ATLAS three-lepton search [21] in addition to the three channels reported in this paper. The dash-dotted lines around the observed limit represent the results obtained when changing the nominal signal cross section up or down by the \(\pm 1\sigma _{\text {theory}}^{\text {SUSY}}\) theoretical uncertainty. The solid band around the expected limit represents the \(\pm 1\sigma _{\text {exp}}\) uncertainty band where all uncertainties, except those on the signal cross sections, are considered. a One lepton and two \(b\)-jets channel, b one lepton and two photons channel, c same-sign dilepton channel, d combination

Fig. 8
figure 8

Observed (solid line) and expected (dashed line) 95 % CL exclusion regions in the mass plane of \(m_{\tilde{\chi }_1^0}\) vs.  \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }\) in the simplified model. The combination in d is obtained using the result from the ATLAS three-lepton search [21] in addition to the three channels reported in this paper. The dotted lines around the observed limit represent the results obtained when changing the nominal signal cross section up or down by the \(\pm 1 \sigma _{\text {theory}}^{\text {SUSY}}\) theoretical uncertainty. The solid band around the expected limit shows the \(\pm 1\sigma _{\text {exp}}\) uncertainty band where all uncertainties, except those on the signal cross sections, are considered. a One lepton and two \(b\)-jets channel, b one lepton and two photons channel, c same-sign dilepton channel and d combination

Exclusion upper limits at the 95 % confidence level (CL) on the number of beyond-the-SM (BSM) signal events, \(S\), for each SR are derived using the CL\(_\mathrm {s}\) prescription [93], assuming no signal yield in other signal and control regions. Normalising the upper limits on the number of signal events by the integrated luminosity of the data sample provides upper limits on the visible BSM cross section, \(\sigma _\mathrm{vis} = \sigma \times A \times \epsilon \), where \(\sigma \) is the production cross section for the BSM signal, \(A\) is the acceptance defined as the fraction of events passing the geometric and kinematic selections at particle level, and \(\epsilon \) is the detector reconstruction, identification and trigger efficiency.

Table 10 summarises, for each SR, the observed 95 % CL upper limits (\(\langle \sigma _\mathrm{vis}\rangle _{\mathrm {obs}}^{95}\)) on the visible cross section, the observed (\(S_\mathrm {obs}^{95}\)) and expected (\(S_\mathrm {exp}^{95}\)) 95 % CL upper limits on the number of signal events with \(\pm 1\sigma \) excursions of the expectation, the observed confidence level (\(CL_B\)) of the background-only hypothesis, and the discovery \(p\)-value (\(p_0\)), truncated at \(0.5\).

The results are also used to set exclusion limits on the common mass of the \(\tilde{\chi }_1^\pm \) and \(\tilde{\chi }_2^0\) for various values of the \(\tilde{\chi }_1^0\) mass in the simplified model of \(pp\rightarrow \tilde{\chi }_1^\pm \tilde{\chi }_2^0\) followed by \(\tilde{\chi }_1^\pm \rightarrow W^\pm \tilde{\chi }_1^0\) and \(\tilde{\chi }_2^0\rightarrow h\tilde{\chi }_1^0\). In this hypothesis test, all the CRs and SRs, including the data in the Higgs-mass windows of the \(\ell {}bb\) and \(\ell \gamma \gamma \) channels, are fitted simultaneously, taking into account correlated experimental and theoretical systematic uncertainties as common nuisance parameters. The signal contamination in the CRs is accounted for in the fit, where a single non-negative normalisation parameter is used to describe the signal model in all channels.

Systematic uncertainties on the signal expectations stemming from detector effects are included in the fit in the same way as for the backgrounds. Theoretical systematic uncertainties on the signal cross section described in Sect. 3 are not included directly in the fit. In all resulting exclusions the dashed (black) and solid (red) lines show the 95 % CL expected and observed limits respectively, including all uncertainties except for the theoretical signal cross-section uncertainty. The (yellow) bands around the expected limit show the \(\pm 1\sigma _{\text {exp}}\) expectations. The dotted \(\pm 1\sigma _{\text {theory}}^{\text {SUSY}}\) (red) lines around the observed limit represent the results obtained when changing the nominal signal cross section up or down by its theoretical uncertainty, and reported limits correspond to the \(-1\sigma {}\) variation.

Figure 7 shows the 95 % CL upper limits on the signal cross section normalised by the simplified-model prediction as a function of \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }\) for \(m_{{\tilde{\chi }_1^0}}=0\). The sensitivity of the individual one lepton and two \(b\)-jets, one lepton and two photons, and same-sign dilepton channels is illustrated in Fig. 7a–c respectively. The corresponding limit combining all channels and the ATLAS three-lepton search is shown in Fig. 7d. For \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }>250\) GeV the same-sign dilepton channel is not considered. In Fig. 7a, the expected exclusion region below \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }=140\) GeV is largely due to SR\(\ell bb\)-1, which targets models with small mass splitting between the neutralinos, while the expected exclusion region around \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }=240\) GeV is driven by SR\(\ell bb\)-2 designed for larger mass splittings. The upper limit shows slow variation with increasing \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }\) as the acceptance of SR\(\ell bb\)-2 increases and compensates for the decrease of the production cross section. Figure 7d shows that in the \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }<170\) GeV range all channels show similar sensitivity, while for \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }>170\) GeV the one lepton and two \(b\)-jets channel is the dominant one. Nevertheless, the contribution from the other channels to the combination is important to extend the excluded range significantly compared to Fig. 7a.

Figure 8a–c show the 95 % CL exclusion regions in the \((m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }, m_{\tilde{\chi }_1^0})\) mass plane of the simplified model obtained from the individual one lepton and two \(b\)-jets, one lepton and two photons, and same-sign dilepton signal regions, respectively. Figure 8d shows the corresponding exclusion region obtained by combining the three channels described in this paper with the ATLAS three-lepton search, which by itself excludes \(m_{\tilde{\chi }_2^0{}, \tilde{\chi }_1^\pm {}}\) up to 160 GeV for \(m_{\tilde{\chi }_1^0}=0\) as seen in Fig. 8d. The combination of these four independent searches improves the sensitivity significantly, and the 95 % CL exclusion region for \(m_{{\tilde{\chi }_1^0}}=0\) is extended to 250 GeV. The wide uncertainty bands of the expected limits in Fig. 8 are due to the slow variation of the sensitivity with increasing \(m_{\tilde{\chi }_2^0,\tilde{\chi }_1^\pm }\) and \(m_{\tilde{\chi }_1^0}\), as can also be seen in Fig. 7. In a similar search by the CMS Collaboration [25], the observed limit on \(m_{\tilde{\chi }_2^0{}, \tilde{\chi }_1^\pm {}}\) is 210 GeV for \(m_{{\tilde{\chi }_1^0}}=0\).

10 Conclusions

A search for the direct pair production of a chargino and a neutralino \(pp\rightarrow \tilde{\chi }_1^\pm \tilde{\chi }_2^0\) followed by \(\tilde{\chi }^\pm \rightarrow \tilde{\chi }_1^0(W^{\pm }\rightarrow \ell ^{\pm }\nu )\) and \(\tilde{\chi }_2^0\rightarrow \tilde{\chi }_1^0(h\rightarrow bb/\gamma \gamma /\ell ^{\pm }\nu qq)\) has been performed using 20.3 \(\mathrm {fb}^{-1}\) of \(\sqrt{s}=8{\mathrm {\ TeV}}\) proton–proton collision data delivered by the Large Hadron Collider and recorded with the ATLAS detector. Three final-state signatures are considered: one lepton and two \(b\)-jets, one lepton and two photons, and two same-sign leptons, each associated with missing transverse momentum. Observations are consistent with the Standard Model expectations. Limits are set in a simplified model, combining these results with the three-lepton search presented in Ref. [21]. For the simplified model, common masses of \(\tilde{\chi }_1^\pm \) and \(\tilde{\chi }_2^0\) are excluded up to 250 GeV for a massless \(\tilde{\chi }_1^0\).