1 Introduction

Many theories beyond the Standard Model (SM) predict new phenomena which give rise to dilepton final states, such as new resonances. These have been searched for using the ATLAS detector at the Large Hadron Collider (LHC) and are reported elsewhere [1]. In this paper, a complementary search is performed for new phenomena that appear as broad deviations from the SM in the dilepton invariant mass distribution or in the angular distribution of the leptons (where the leptons considered in this analysis are electrons or muons). The phenomena under investigation are contact interactions (CI) and large extra dimensions (LED).

2 Theoretical motivation

The presence of a new interaction can be detected at an energy much lower than that required to produce direct evidence of the existence of a new gauge boson. The charged weak interaction responsible for nuclear \(\beta \) decay provides such an example. A non-renormalizable description of this process was successfully formulated by Fermi in the form of a four-fermion contact interaction [2]. A contact interaction can also accommodate deviations from the SM in proton–proton scattering due to quark and lepton compositeness, where a characteristic energy scale \(\Lambda \) corresponds to the binding energy between fermion constituents. A new interaction or compositeness in the process \(q\overline{q} \rightarrow \ell ^+\ell ^-\) can be described by the following four-fermion contact interaction Lagrangian [3, 4]

$$\begin{aligned} \fancyscript{L}&= \frac{g^2}{\Lambda ^2}\;[\eta _\mathrm{LL} (\overline{q}_\mathrm{L}\gamma _{\mu } q_\mathrm{L})\,(\overline{\ell }_\mathrm{L}\gamma ^{\mu }\ell _\mathrm{L}) \\&+\eta _\mathrm{RR} (\overline{q}_\mathrm{R}\gamma _{\mu } q_\mathrm{R}) \,(\overline{\ell }_\mathrm{R}\gamma ^{\mu }\ell _\mathrm{R}) \\&+\eta _\mathrm{LR} (\overline{q}_\mathrm{L}\gamma _{\mu } q_\mathrm{L}) \,(\overline{\ell }_\mathrm{R}\gamma ^{\mu }\ell _\mathrm{R}) \\&+\eta _\mathrm{RL} (\overline{q}_\mathrm{R}\gamma _{\mu } q_\mathrm{R}) \,(\overline{\ell }_\mathrm{L}\gamma ^{\mu }\ell _\mathrm{L}) ], \end{aligned}$$

where \(g\) is a coupling constant chosen by convention to satisfy \(g^2/4\pi = 1\), \(\Lambda \) is the contact interaction scale, and \(q_\mathrm{L,R}\) and \(\ell _\mathrm{L,R}\) are left-handed and right-handed quark and lepton fields, respectively. The parameters \(\eta _{ij}\), where \(i\) and \(j\) are L or R (left or right), define the chiral structure of the new interaction. Different chiral structures are investigated here, with the left-right model obtained by setting \(\eta _\mathrm{LR} = \eta _\mathrm{RL} = \pm 1\) and \(\eta _\mathrm{LL} = \eta _\mathrm{RR} = 0\). Likewise, the left-left and right-right models are obtained by setting the corresponding parameters to \(\pm 1\), and the others to zero. The sign of \(\eta _{ij}\) determines whether the interference is constructive (\(\eta _{ij} = -1\)) or destructive (\(\eta _{ij} = +1\)). The cross-section for the process \(q \overline{q} \rightarrow \ell ^+\ell ^-\) in the presence of these contact interaction models can be written as:

$$\begin{aligned} \sigma _\mathrm{tot} = \sigma _\mathrm{DY} - \eta _{ij} \frac{F_\mathrm{I}}{\Lambda ^2} + \frac{F_\mathrm{C}}{\Lambda ^4}, \end{aligned}$$
(1)

where the first term accounts for the \(q \overline{q} \rightarrow Z/\gamma ^* \rightarrow \ell ^+\ell ^-\) Drell–Yan (DY) process, the second term corresponds to the interference between the DY and CI processes, and the third term describes the pure CI process. These two latter terms include \(F_\mathrm{I}\) and \(F_\mathrm{C}\), respectively, which are functions of the cross-section, and do not depend on \(\Lambda \). The relative impact of the interference and pure CI terms depends on both the dilepton mass and \(\Lambda \). For example, the magnitude of the interference term for dilepton masses above 600 GeV is about twice as large as that of the pure CI term at \(\Lambda = 14\) TeV; the interference becomes increasingly dominant for higher values of \(\Lambda \).

There are other models which predict deviations from the SM in the dilepton mass spectrum, and seek to address the vast hierarchy between the electroweak (EW) and Planck scales, such as the solution proposed by Arkani-Hamed, Dimopoulos and Dvali (ADD) [5]. In this model, gravity is allowed to propagate in large flat extra spatial dimensions, thereby diluting its apparent effect in 3+1 spacetime dimensions. The flat \(n\) extra dimensions are of common size \(R\) (\(\sim \)1 µm–1 mm for \(n\) = 2) and are compactified on an \(n\)-dimensional torus. The fundamental Planck scale in (4+\(n\))-dimensions, \(M_\mathrm{D}\), is related to the Planck scale, \(M_\mathrm{Pl}\), by Gauss’s law \(M^{2}_\mathrm{Pl}\) \(\sim \) \(M_\mathrm{D}^{n+2} R^n\). It is thus possible for \(M_\mathrm{D}\) to be in the TeV range for sufficiently large volumes (\(\propto R^n\)). In this model, the SM particles and their interactions are confined to a four-dimensional submanifold, whereas gravitons may also propagate into extra dimensions of size \(R\). This gives rise to a tower of Kaluza–Klein (KK) graviton modes with a mass spacing inversely proportional to \(R\). Values for \(M_\mathrm{D}\) at the TeV scale imply very small mass differences between KK modes and thus an essentially continuous mass spectrum.

The production of dileptons via virtual KK graviton exchange involves a sum over many KK modes that needs to be cut off at some value. In this paper, the ultraviolet cutoff is chosen to be the string scale, \(M_\mathrm{S}\) [6], which sets the context in which this search and its results should be interpreted, and is chosen for consistency with previous searches. This scale is related to \(M_\mathrm{D}\) via the Gamma function, \(\Gamma \), by [7]

$$\begin{aligned} M_\mathrm{S} = 2 \sqrt{\pi } \left[ \Gamma \left( \frac{n}{2} \right) \right] ^{1/(n+2)} M_\mathrm{D}. \end{aligned}$$

The cross-section for \(q \overline{q}/gg \rightarrow \ell ^+\ell ^-\) in the presence of large extra dimensions can be expressed as

$$\begin{aligned} \sigma _\mathrm{tot} = \sigma _\mathrm{DY} + \fancyscript{F}\frac{F_\mathrm{int}}{M_\mathrm{S}^4} + \fancyscript{F}^{2}\frac{F_\mathrm{G}}{M_\mathrm{S}^8}, \end{aligned}$$
(2)

where \(\sigma _\mathrm{DY}\) is the DY cross-section, and \(F_\mathrm{int}\) and \(F_\mathrm{G}\) are functions of the cross-sections (they do not depend on \(M_\mathrm{S}\)) involving the interference and pure KK graviton effects, respectively. The strength of the interaction is characterized by \(\fancyscript{F}\)/\(M^{4}_\mathrm{S}\), where the dimensionless parameter \(\fancyscript{F}\) varies in the different calculations provided by Giudice–Rattazzi–Wells (GRW) [8], Hewett [9] and Han–Lykken–Zhang (HLZ) [10]. The different values are:

$$\begin{aligned} \fancyscript{F}&= 1, \mathrm{~~(GRW)} \\ \fancyscript{F}&= \frac{2 \lambda }{\pi } = \frac{\pm 2}{\pi },\quad \mathrm{(Hewett)} \\ \fancyscript{F}&= \log \left( \frac{M_\mathrm{S}^2}{s} \right) \quad \mathrm{for~} n = 2, \mathrm{~~(HLZ)} \\ \fancyscript{F}&= \frac{2}{n-2}\quad \mathrm{for~} n > 2. \mathrm{~(HLZ)}. \end{aligned}$$

In the Hewett formalism, \(\lambda = \pm 1\) is introduced to allow for constructive or destructive interference with the DY process. Unlike the situation with contact interactions described above, interference effects between the DY and virtual KK graviton processes are small due to dilepton production by virtual KK gravitons being predominantly gluon-induced rather than quark-induced.

Previous searches for CI have been carried out in neutrino–nucleus and electron–electron scattering [11], as well as electron–positron [12, 13], electron–proton [14], and proton–antiproton colliders [15, 16]. Searches for CI have also been performed by the ATLAS and CMS Collaborations [17, 18]. The strongest exclusion limits for \(\ell \ell q q\) CI in which all quark flavours contribute come from the previous ATLAS non-resonant dilepton analysis conducted using 5 fb\(^{-1}\) of proton–proton (\(pp\)) collision data at \(\sqrt{s}=7\) TeV [17]. That combined analysis of the dielectron and dimuon channels set lower limits at 95 % credibility level (C.L.) on the left-left model of \(\Lambda \) \(>\) 13.9 TeV and \(\Lambda \) \(>\) 10.2 TeV, for constructive and destructive interference, respectively, given a uniform positive 1/\(\Lambda ^2\) prior.

Previous searches for evidence of ADD-model extra dimensions via virtual KK graviton exchange have been performed at electron–positron [19], electron–proton [20], and proton–antiproton colliders [16]. Searches have also been performed at the LHC by the ATLAS and CMS Collaborations [17, 21]. The most stringent results come from the ATLAS search in the dilepton channel and subsequent combination with the diphoton channel result using 5 fb\(^{-1}\) of \(pp\) collision data at \(\sqrt{s}\) = 7 TeV [17]. That analysis set lower limits on \(M_\mathrm{S}\) at 95 % C.L. in the GRW formalism of 3.5 TeV and 3.4 TeV for 1/\(M_\mathrm{S}^4\) and 1/\(M_\mathrm{S}^{8}\) priors, respectively.

3 The ATLAS detector

The ATLAS detector [22] consists of an Inner Detector (ID) surrounded by a solenoid magnet for tracking charged particles, and a calorimeter for capturing particles that interact electromagnetically or hadronically, to measure their energy. A Muon Spectrometer (MS) and toroidal magnet system provide tracking for muons, which typically escape the calorimeter.

The ID is immersed in a 2.0 T axial magnetic field and provides charged-particle tracking up to \(|\eta |\) of 2.5.Footnote 1 It is composed of a pixel detector, a silicon-strip tracker, and a transition radiation tracker.

The calorimeter system surrounds the solenoid and extends up to \(|\eta |=4.9\). One of its main components is a lead and liquid-argon electromagnetic sampling calorimeter, covering \(|\eta | <\) 3.2 with a fine segmentation varying by layer. This provides precise energy and position measurements for electrons and photons. Another electromagnetic calorimeter, in the forward direction up to \(|\eta |=4.9,\) uses liquid-argon active elements and copper as an absorber. Further from the interaction point lies an iron and scintillator tile calorimeter up to \(|\eta |=1.7\) and a copper and liquid-argon calorimeter up to \(|\eta |=3.2\) for hadronic energy measurements. A hadronic calorimeter in the forward region, up to \(|\eta |=4.9,\) uses liquid-argon active elements combined with tungsten as an absorber.

The outermost detector is the MS, which consists of layers of precision tracking chambers and trigger chambers to enable reconstruction of muons with \(|\eta | <\) 2.7. Precision tracking is provided by monitored drift tube chambers, complemented by a layer of cathode strip chambers in the innermost layer in the forward region. Triggering is handled by resistive plate chambers in the barrel (\(|\eta | <\) 1.05) and thin-gap chambers in the endcap (1.05 \(< |\eta | <\) 2.4). One barrel and two endcap toroidal magnet systems provide the bending force to measure muon momentum.

The triggering of events to be recorded by the ATLAS detector is handled by a three-level system [22] which consists of a level-1 hardware trigger, and the high-level trigger (HLT). The HLT is made up of the level-2 trigger, which uses regions of interest, and the event filter, which is based on standard ATLAS event reconstruction and analysis algorithms.

4 Data and Monte Carlo samples

This search uses the LHC 2012 dataset from \(pp\) collisions at \(\sqrt{s}=8\) TeV, corresponding to an integrated luminosity of approximately 20 fb\(^{-1}\). The peak luminosity during this period was \(7.7 \times 10^{33}~\)cm\(^{-2}\) s\(^{-1}\), with an average number of \(pp\) interactions per bunch crossing (pile-up) of \(\langle \)µ\(\rangle =20.7.\)

The main background comes from the irreducible DY process. The photon-induced (PI) process is also an irreducible contribution which produces two leptons, arising from a \(\gamma \gamma \) initial state via \(\hat{t}\)– and \(\hat{u}\)– channel processes. The PI process is not a major background, although it is important in the description of the lepton angular distribution. The reducible, but non-negligible, backgrounds are \(t\bar{t}\) and single top-quark production, multi-jet, \(W\)+jets, and diboson (\(WW\), \(WZ\), and \(ZZ\)) processes. Monte Carlo (MC) simulation is used to estimate all of these backgrounds, with the exception of the multi-jet and \(W\)+jets backgrounds, which are estimated with a data-driven fake-factor method, as described in Sect. 5. The multi-jet and \(W\)+jets backgrounds are found to be negligible in the dimuon channel [1].

All MC samples were passed through a simulation of the ATLAS detector using Geant4 [2325]. The DY background is generated with Powheg [26] for the next-to-leading-order (NLO) matrix elements using the CT10 [27] parton distribution functions (PDF) and Pythia 8.165 [28] for parton showering and hadronization. To correct the DY cross-section from NLO to next-to-next-to-leading-order (NNLO), a dilepton mass-dependent QCD+EW \(K\)-factor is calculated with FEWZ 3.1 [29] using the MSTW2008NNLO [30, 31] PDF (with CT10 as the base NLO PDF) to take into account higher-order QCD and EW corrections. The photon-induced background is generated with Pythia 8.165 at LO using the MRST2004QED [32] PDF. The top-quark production processes are simulated using MC@NLO 4.06 [33] with the CT10 PDF to generate the matrix elements, Jimmy 4.31 [34] to describe multiple parton interactions, and Herwig 6.520 [35] to describe the remaining underlying event and parton showers. Higher-order corrections are calculated with Top\(++\) 2.0 [36] to derive a \(K\)-factor which scales this background description from NLO to NNLO in QCD, including resummation of next-to-next-to-leading-logarithmic (NNLL) soft gluon terms. The diboson processes are generated with Herwig 6.520 at leading-order (LO) using the CTEQ6L1 PDF [37], and these cross-sections are extrapolated to NLO using dilepton mass-independent \(K\)-factors.

The CI signal processes are generated using Pythia 8.165 at LO with the MSTW2008LO PDF. The CI cross-section is scaled from LO to NNLO, again using FEWZ with the MSTW2008NNLO PDF to calculate a dilepton mass-dependent QCD+EW \(K\)-factor. The ADD LED signal process is simulated with the multi-leg LO generator Sherpa 1.3.1 [7] using the CTEQ6L1 PDF. No higher-order correction is applied to the ADD LED cross-section.

To ensure adequate modelling of the data by the MC simulation, data-derived corrections are applied to the simulation. These include electron energy scale corrections [38], muon momentum corrections [39], and pile-up corrections. They also include trigger, lepton identification, and reconstruction scale factors [38, 39], which are all found to be very close to unity. A summary of the generator, parton shower, and PDF information used for all signal and background MC samples used in this search can be found in Table 1.

Table 1 Summary of MC sample information for signal and background processes used in this search. The columns from left to right give the process of interest, generator, matrix-element order, parton shower program, and PDF utilized

5 Event selection and background estimation

Events in the \(ee\) channel are required to have passed a two-object trigger with transverse momentum (\(p_\mathrm{T}\)) thresholds of 35 GeV and 25 GeV. Events in the \(\mu \mu \) channel are required to have passed at least one of two single-object triggers with \(p_\mathrm{T}\) thresholds of 36 GeV and 24 GeV. The higher threshold trigger is used to recover small efficiency losses due to the online muon isolation requirement imposed by the lower threshold trigger. The overall efficiency for dilepton events to fire either of these triggers is found to be \(>\) 99 %. In both channels, events are required to have at least one primary vertex with more than two tracks.

In the dielectron channel, events are retained if at least two electrons fulfil the following criteria: the electrons satisfy \(|\eta |\) \(<\) 2.47 (excluding the transition region between the barrel and endcap, 1.37 \(<\) \(|\eta |\) \(<\) 1.52), the leading and sub-leading electrons have \(p_\mathrm{T}\) \(>\) 40 GeV and 30 GeV, respectively, and the electrons satisfy a set of electron identification criteria which are designed to reject jets misidentified as electrons [38]. For the leading and sub-leading electrons, the calorimeter isolation must be less than \((0.007 \times E_\mathrm{T}) + 5.0\) GeV, and \((0.022 \times E_\mathrm{T}) + 6.0\) GeV, respectively (where \(E_\mathrm{T}\) is the transverse energy in units of GeV). The electron calorimeter isolation is calculated as the \(\sum {E_\mathrm{T}}\) in a cone of \(\Delta R = \sqrt{(\Delta \eta )^2 + (\Delta \phi )^2}\) = 0.2, excluding the electron \(E_\mathrm{T}\). This measure of isolation is corrected for \(E_\mathrm{T}\)-dependent leakage, and pile-up effects which are parameterised as a function of the number of primary vertices in the event. If more than one electron pair exists in the event, the one with the largest scalar sum of \(E_\mathrm{T}\) is chosen. The two electrons in the selected pair are then required to have opposite charge and have dilepton mass greater than 80 GeV.

In the dimuon channel, events are retained if at least two muons fulfil the following criteria: the muons have \(p_\mathrm{T} >\) 25 GeV, pass track quality requirements, and meet longitudinal (\(|z_0|\) \(<\) 1 mm) and transverse (\(|d_0|\) \(<\) 0.2 mm) track impact parameter requirements with respect to the primary vertex. Muons are also required to be isolated: the \(\sum {p_\mathrm{T}}\) of all additional tracks within \(\Delta R = 0.3\) of the muon must be less than 5 % of the muon \(p_\mathrm{T}\). Muons are reconstructed by combining tracks from both the ID and MS systems. The MS hit requirements are particularly stringent to improve the momentum resolution, minimize tails in the dimuon mass distribution, and improve modelling by the simulation. Muon tracks are required to include at least three hits in each of three precision MS chambers and have at least one hit in the non-bending plane (\(\phi \)) of two separate chambers to determine the \(\phi \) coordinate and thus a good estimate of the non-uniform toroidal magnetic field. If those tracks include hits in precision chambers that have either no alignment or poor alignment, the tracks are rejected. Finally, the independent ID and MS track \(q/p_\mathrm{T}^\mathrm{{track}}\) must agree within five standard deviations of the standalone measurement uncertainties added in quadrature. The muon acceptance is highest in the pseudorapidity region up to approximately 2.5. If more than two muons satisfy these criteria, the pair of oppositely charged muons with the highest scalar sum of \(p_\mathrm{T}\) is selected. The final requirement is that the dimuon mass must be greater than 80 GeV.

The event selection detailed above is applied to the data and all MC background samples. The acceptance times efficiency for DY events with dilepton mass of 1 TeV (2 TeV) is found to be 67 % (67 %) in the dielectron channel and 47 % (45 %) in the dimuon channel. The selection efficiency is lower for the dimuon channel mainly because of the strict MS hit requirements.

The dominant DY background, as well as the PI background, \(t\bar{t}\) and single top-quark production processes, and diboson processes, are all modelled with MC as described in Sect. 4. The combined multi-jet & \(W\)+jets background which only affects the electron channel is estimated using a data-driven method designed to describe events which contain a maximum of one real lepton, and one or more jets or photons which are misidentified as a lepton. The details of this method are provided in Ref. [1]. For the top and combined multi-jet & \(W\)+jets backgrounds, fits are used to describe the shape of the background dilepton mass distribution with a phenomenologically motivated three-parameter (\(p_1\), \(p_2\), \(p_3\)) function (\(y(x) = p_1\,x^{p_2+p_3 \log x}\), where \(x =\) m\(_{\ell \ell }\)) at high masses, where the statistical uncertainty becomes large. The fit to the top-quarks background is performed in a similar manner to Ref. [1] using the mass range 200–700 GeV to match the fit to the MC distribution, and the resultant fit as the extrapolated top background estimate above 500 GeV. The choice of extrapolation point is found to have a negligible effect on the fit; however, the range of the fit and the uncertainty on the fit parameters are included in the systematic uncertainty. For the fit describing the combined multi-jet & \(W\)+jets background at high mass, the lower edge of the fit range is varied from 425 GeV to 600 GeV and the upper edge from 700 GeV to 1200 GeV, with the extrapolation point fixed to 500 GeV. The uncertainty on this fit is negligible compared to the systematic uncertainty assigned to the data-driven method, as described in Sect. 7.

In this analysis, the normalization, control, and search regions are defined based on the dilepton mass. In the normalization region with mass between 80 GeV and 120 GeV, the total background estimate is scaled to data. This protects the analysis against mass-independent systematic uncertainties. The control region, defined by the mass range from 120 GeV to 400 GeV, is used to check the quality of the background modelling since the signal contribution is negligible in this region. After the normalization procedure, good agreement is found in the control region, as displayed in Fig. 1. The small deviation observed in the first bin of the dielectron mass distribution (Fig. 1) corresponds to an effect that is less than 0.2 % of the total number of events in the normalization region. Thus it has a negligible effect. The search is then conducted in the mass region 400–4500 GeV.

Fig. 1
figure 1

Reconstructed dielectron (top) and dimuon (bottom) mass distributions for data and the SM background estimate. Also shown are the predictions for a benchmark \(\Lambda \) value in the LL contact interaction model and benchmark \(M_\mathrm{S}\) value in the GRW ADD model. The distribution bin width is constant in \(\log (m_{\ell \ell })\). The ratio is presented with the total systematic uncertainty overlaid as a band

6 Event yields

In the CI search, six broad dilepton mass bins are used in the search region from 400 GeV to 4500 GeV. For the ADD search region, a single dilepton mass bin is employed in the range 1900–4500 GeV, where the lower mass boundary is optimized based on the strongest expected exclusion limit.

The dielectron (dimuon) channel event yields are presented in Table 2 (Table 3) for the CI search and both channels are presented in Table 4 for the ADD LED search. Dilepton mass distributions for data and the predicted background are shown in Fig. 1 for both channels, along with a few benchmark CI and ADD signals overlaid.

Table 2 Expected and observed event yields in the dielectron channel. The predicted yields are shown for SM background as well as for SM+CI for several CI signal scenarios. The quoted errors consist of both the statistical and systematic uncertainties added in quadrature
Table 3 Expected and observed event yields in the dimuon channel. The predicted yields are shown for SM background as well as for SM+CI for several CI signal scenarios. The quoted errors consist of both the statistical and systematic uncertainties added in quadrature
Table 4 Expected and observed event yields in the dielectron (second column) and dimuon (third column) channels in the ADD search for large extra dimensions. The expected yields for the SM plus two GRW ADD parameter points are also shown. The quoted errors consist of both the statistical and systematic uncertainties added in quadrature

The dilepton invariant mass is commonly used as the discriminating variable for a CI search. However, the lepton decay angle also has high discriminating power from DY events in certain cases such as the left-right model. Therefore, the dilepton decay angle, \(\theta ^*\), is also used as a discriminating variable in the CI search. The angle \(\theta ^*\) is defined in the Collins–Soper (CS) frame [40], which is constructed with the \(z\)-axis bisecting the angle between the two incoming parton momenta, and the \(x\)-axis perpendicular to the incoming parton momentum plane. As the incoming parton information from \(pp\) collisions is unknown, the direction of the dilepton system is taken to be the direction of the incoming quark (as opposed to anti-quark). This introduces a dilution of any asymmetry in the \(\cos \theta ^*\) distribution (leading to derived angular variables being described as “uncorrected”). The angle \(\theta ^*\) is then taken as the angle between this \(z\)-axis and the outgoing negatively charged lepton, using the formula

$$\begin{aligned} \small \cos \theta ^* = \frac{p_z(\ell ^+\ell ^-)}{|p_z(\ell ^+\ell ^-)|} \frac{2(p_1^+p_2^- - p_1^-p_2^+)}{m(\ell ^+\ell ^-) \sqrt{m(\ell ^+\ell ^-)^2 + p_T(\ell ^+\ell ^-)^2}}~, \end{aligned}$$

where \(p_n^\pm \) denotes \(\frac{1}{\sqrt{2}} (E\pm p_z)\) and \(n\) = \(1\) or \(2\) corresponds to the negatively charged or positively charged leptons, respectively. From this angle, a forward-backward asymmetry, which is sensitive to the chiral structure of the interaction, is defined as follows:

$$\begin{aligned} A_\mathrm{FB} = \frac{N_\mathrm{F} - N_\mathrm{B}}{N_\mathrm{F} + N_\mathrm{B}}, \end{aligned}$$

where \(N_\mathrm{F}\) (\(N_\mathrm{B}\)) is the number of events with \(\cos \theta ^*\) greater (smaller) than zero. The discrimination between CI+SM and the SM-only background is due to the couplings of the CI model, which predicts a larger \(A_\mathrm{{FB}}\) than the SM background for the CI signal in the left-left and right-right model, and an equally large but opposite-sign \(A_\mathrm{{FB}}\) for the left-right model. If a CI signal were present in nature this would therefore lead to a modest increase in the total measured \(A_\mathrm{{FB}}\) as a function of dilepton mass for the left-left and right-right model, and a substantial decrease in the measured \(A_\mathrm{{FB}}\) for the left-right model. Therefore in the CI search, each dilepton mass bin is further divided into forward and backward events for the statistical interpretation of the results. Figures 2 and 3 present the data and background for \(\cos \theta ^*\) and \(A_\mathrm{{FB}}\) as a function of dilepton mass, respectively, in both channels. These distributions also display CI signal predictions.

Fig. 2
figure 2

Reconstructed \(\cos \theta ^*\) distributions for data and the SM background estimate in the dielectron (top) and dimuon (bottom) channels. Results are shown for the contact interaction signal region for dilepton masses between 400 GeV and 4500 GeV. Also shown are the predictions for a benchmark \(\Lambda \) value in the LR contact interaction model. The ratio is presented with the total systematic uncertainty overlaid as a band

Fig. 3
figure 3

Reconstructed \(A_\mathrm{FB}\) distributions for data and the SM background estimate as a function of dielectron (top) and dimuon (bottom) mass. Also shown are the predictions of different benchmark \(\Lambda \) values for the LL and LR contact interaction model (the RR model is very similar to the LL case). The ratio displays the background-subtracted data (\(\Delta \)) divided by the total uncertainty (\(\sigma \)) in each bin

Good agreement is observed between the data and the background model in both the dilepton mass and \(A_\mathrm{FB}\) distributions.

7 Systematic uncertainties

The total background estimate is normalized by scaling to data in the dilepton mass region 80–120 GeV. This protects the analysis against mass-independent systematic uncertainties (such as the luminosity uncertainty), as any constant scale factor cancels. However, mass-dependent systematic uncertainties affect the shape of the discriminating variables and are therefore considered as nuisance parameters in the statistical interpretation.

Experimental uncertainties originate from the following sources: lepton trigger and reconstruction efficiencies, lepton energy and momentum scale and resolution, lepton charge misidentification, multi-jet & \(W\)+jets background estimate (in the \(ee\) channel), beam energy scale, and MC statistics.

It is important to control the lepton momentum uncertainty, as mismodelling of the resolution could result in a broad signal-like excess (or deficit) in the dilepton mass distribution. The muon momentum resolution depends critically on the quality of the MS chamber alignment. Resolution uncertainties are determined from dedicated data-taking periods with no magnetic field in the MS and from studies of muon tracks passing through the overlap region between chambers in the small and large sectors of the MS where the independent track momenta reconstructed from these adjacent sectors can be compared directly. The electron momentum uncertainty is negligible.

Another important experimental uncertainty is the charge misidentification which can arise from two main sources: track curvature and “trident” events. The latter occurs when a hard bremsstrahlung is emitted by a high-momentum lepton, and a subsequent photon conversion gives rise to a high-momentum track with a charge opposite to that of the initial lepton, but which is selected erroneously. To study the trident source of charge misidentification, dedicated MC samples were generated with the amount of detector material varied by up to 20 % of a radiation length. To study the track curvature source, various investigations were carried out wherein additional charge misidentification is injected into the simulation to ascertain its effect, and the ID track resolution in \(q/p\) is varied to assess the probability of a change of charge sign. As these systematic uncertainty studies found a negligible change in charge misidentification, a conservative uncertainty of 20 % with respect to the measured charge misidentification rate in Drell–Yan MC simulation was applied. For the dielectron channel, the charge misidentification systematic uncertainty can be as large as 3 %. For the dimuon channel, this is covered by the resolution uncertainty and is negligible.

The uncertainty on the data-driven estimate of the combined multi-jet & \(W\)+jets background is assessed by comparing complementary estimation methods (giving a maximum deviation of 18 % from the nominal method) and variations of the real-electron contamination suppression requirements in the nominal method (resulting in deviations of up to 5 %). The addition of these effects in quadrature gives a total systematic uncertainty on the data-driven estimate of 20 %. A detailed description of this procedure is given in Ref. [1].

A systematic uncertainty on the LHC beam energy of 0.65 % [41] is assessed for both the signal and background processes.

The statistical uncertainty of the MC samples is included as a systematic uncertainty for both the signal and the background. This includes the fit uncertainty due to the high-mass extrapolation of the top-quarks background, which is described in Sect. 5.

The theoretical uncertainties are the variations among the PDF eigenvector sets, the effect of PDF choice, the PDF \(\alpha _S\) scale, the EW higher-order corrections, the photon-induced contributions, and the DY cross-section uncertainty. The effect of these uncertainties on the background yield are taken into account with a standard procedure where event weights are used to create systematically shifted distributions, which are then used as nuisance parameters in the statistical interpretation. However, for the signal yields one does not want to introduce a bias via the specific theoretical uncertainty choices, and therefore these are only taken into account by the effect that they have on the signal acceptance times efficiency. This effect was found to be negligible in all cases except for the PDF variation in the ADD search, where an additional uncertainty of 6 % (3 %) is included in the dielectron (dimuon) channel as a nuisance parameter in the statistical interpretation. For the CI search, systematic uncertainties are taken into account as a function of dilepton mass for forward and backward events separately, to account for any variation in the uncertainty which might affect the expected asymmetry. For example, the largest systematic uncertainty in this search is the background PDF variation, which has an effect in the dielectron (dimuon) channel of 11 % (12 %) at a mass of 2 TeV. When separated into forward and backward regions at the same dielectron (dimuon) mass, this uncertainty is 10 % (8.5 %) and 16 % (15 %), respectively. Likewise the PI uncertainty in the dielectron (dimuon) channel of 12 % (9.5 %) at a mass of 2 TeV, becomes 10 % (7.5 %) and 16 % (13 %), when separated into forward and backward regions, respectively. The other sources of systematic uncertainty were found to not have a strong dependence between forward and backward events. The different sources of PDF uncertainty are assessed by utilizing the MSTW2008NNLO PDF error set (90 % C.L.) and by following the procedure detailed in Ref. [1, 42]. The uncertainty due to the choice of PDF is investigated by comparing the central values of various PDFs, namely MSTW2008NNLO, CT10NNLO [43], NNPDF2.3 [44], ABM11 [45], and HERAPDF1.5 [46]. All except for ABM11 are found to be within the MSTW2008NNLO 90 % C.L. uncertainty, and so the variation from ABM11 with respect to the MSTW2008NNLO central value, outside of the MSTW2008NNLO 90 % C.L. uncertainty, is taken as a separate systematic uncertainty due to PDF choice. VRAP [47] is used to assess the \(\alpha _S\) systematic uncertainty, along with scale uncertainties which are estimated by varying the nominal renormalization and factorization scales simultaneously by a factor of two. A study to ascertain the photon-induced background estimate uncertainty was performed in Ref. [1], and found that the nominal MRST2004QED PDF gives an upper estimate of the PI contribution. Varying the assumed quark masses showed that the lower bound of this estimate gives fairly small PI contributions. Therefore the PI background estimate is assigned a conservative uncertainty of 100 %. A uniform uncertainty of 4 % due to the uncertainty on the Z/\(\gamma ^*\) NNLO cross-section (using MSTW2008NNLO 90 % C.L.) in the normalization region was determined in Ref. [1] and is applied to signal event yields due to the normalization procedure. The variation due to the cross-section uncertainty in the other background MC samples was found to be negligible. All systematic uncertainties are treated as uncorrelated, and a summary of the systematic uncertainties at dilepton masses of 1 and 2 TeV is presented in Table 5.

Table 5 Summary of the systematic uncertainties taken into account for the total expected number of events. Values are provided at \(m_{\ell \ell }\) = 1 TeV (2 TeV) to give representative estimates relevant to this search. The PDF variation values shown for signal are based on CI. For the ADD signal they are uniform at 6 % and 3 % in the dielectron and dimuon channels, respectively. Signal systematic uncertainties are assessed as a function of the corresponding parameter of interest but are not found to vary greatly. N/A indicates that the uncertainty is not applicable

8 Statistical interpretation

A Bayesian approach is used for the statistical interpretation of the results, using a uniform positive prior as a function of the parameter of interest to quantify any observed excess. In the absence of a signal, 95 % C.L. lower exclusion limits are set on that parameter. The total number of expected events \(\mu \) in each search region can be expressed as

$$\begin{aligned} \mu = n_\mathrm{s}(\Theta ,\overline{\Omega }) + n_\mathrm{b}(\overline{\Omega }), \end{aligned}$$

where \(n_{s}(\Theta ,\overline{\Omega })\) is the number of events predicted by the CI or ADD signal for a particular choice of model parameter \(\Theta \). The quantity \(n_{b}(\overline{\Omega })\) is the total number of background events, and in both cases \(\overline{\Omega }\) represents the set of Gaussian nuisance parameters that account for systematic uncertainties on the number of respective signal and background events. The parameter \(\Theta \) corresponds to a choice of contact interaction scale \(\Lambda \) and interference parameter \(\eta _{ij}\) in the case of the CI interpretation, and a choice of string scale \(M_\mathrm{S}\) and specific formalism (GRW, Hewett, or HLZ) in the case of the ADD interpretation.

The likelihood of observing \(n\) events given the new physics parameter \(\Theta \) and nuisance parameters \(\overline{\Omega }\) is then the product of Poisson probabilities for each mass–\(\cos \theta ^*\) bin \(k\):

$$\begin{aligned} \mathcal{{L}} (n \mid \Theta ,\overline{\Omega }) = \displaystyle \prod _{l=1}^{N_\mathrm{{channel}}}\displaystyle \prod _{k=1}^{N_\mathrm{{bin}}}\frac{\mu _{lk}^{n_{lk}}e^{-\mu _{lk}}}{n_{lk}!}, \end{aligned}$$

where \(n_{lk}\) is the number of events observed in data, and \(\mu _{lk}\) is the total number of expected events (signal plus background), both in mass–\(\cos \theta ^*\) bin \(k\) and channel \(l\) (where the channel can be dielectron or dimuon). According to Bayes’ theorem, the posterior probability for the parameter \(\Theta \), given \(n\) observed events, is then

$$\begin{aligned} \mathcal{P} (\Theta \mid n) = \frac{1}{\mathcal{Z}} \mathcal{L}_\mathcal{M} (n \mid \Theta ) P(\Theta ), \end{aligned}$$

where \(\mathcal{Z}\) is a normalization constant and the marginalized likelihood \(\mathcal{L}_\mathcal{M}\) corresponds to the likelihood after all nuisance parameters are integrated out. This integration is performed assuming that the nuisance parameters are correlated across all dilepton mass–\(\cos \theta ^*\) bins. The nuisance parameters that are treated as correlated between both channels are: PDF uncertainties, EW corrections, photon-induced, beam energy, and normalization. All other sources are treated as uncorrelated. Table 5 shows which nuisance parameters are taken into account for the signal and background expectations. The prior probability \(P(\Theta )\) is chosen to be uniform and positive in either \(1/\Lambda ^2\) or \(1/\Lambda ^4\) for the CI analysis, and either \(1/M_\mathrm{S}^4\) or \(1/M_\mathrm{S}^8\) for the ADD analysis. These choices are motivated by the form of Eqs. (1) and (2), to give the reader a sense of how the interplay in these forms can affect the result. The 95 % C.L. limit is then obtained by finding the value \(\Theta _\mathrm{lim}\) satisfying \(\int _0^{\Theta _\mathrm{lim}} \mathcal{P} (\Theta \mid n) \, d\Theta = 0.95\), where \(\Theta \) is chosen to be \(1/\Lambda ^2\), \(1/\Lambda ^4\), \(1/M_\mathrm{S}^4\) or \(1/M_\mathrm{S}^8\).

The calculations are performed with the Bayesian Analysis Toolkit [48], which uses a Markov Chain Monte Carlo technique to integrate over the nuisance parameters. For each physics model, 1000 pseudo-experiments (PEs) are run to obtain an adequate SM-only expected distribution; the PE with the median parameter of interest value provides the expected limit, with \(\pm 1\,\sigma \) and \(\pm 2\,\sigma \) intervals also obtained from this set of 1000 PEs correspondingly. In order to quantify the consistency between the data and the background expectation, the likelihood ratio is computed for the signal-plus-background and background-only hypotheses, where the signal-plus-background likelihood (given the prior) is evaluated at the \(\Theta \) value that maximizes the likelihood. The distribution of negative log-likelihood-ratio (LLR) values is then used to compute the \(p\)-value by calculating the fraction of PEs that have a more signal-like LLR value than the observed LLR value in data. The \(p\)-value is the probability of observing an excess, at least as signal-like as the one observed in data, given that only background exists.

9 Results

Good agreement is observed between the data and expected background yields. The most significant deviation from the expected background is seen in the dimuon channel for the CI search, with a \(p\)-value of 8 % in the LL model with destructive interference given the 1/\(\Lambda ^2\) prior. In the ADD search, the most significant excess is also observed in the dimuon channel, with a \(p\)-value of 6 % in the GRW formalism for the 1/\(M_\mathrm{S}^{4}\) prior. In neither case is the deviation significant. The expected and observed 95 % C.L. lower exclusion limits are set on the parameter of interest in each search, with the resulting limits for the CI and ADD search presented in Tables 6 and 7 respectively, including conversions to other formalisms. These results are also displayed graphically in Fig. 4 for the CI search given the 1/\(\Lambda ^2\) prior and Fig. 5 for the ADD search given the 1/\(M_\mathrm{S}^8\) prior. In the case of the ADD interpretation, the limits obtained with a prior uniform and positive in signal cross-section are found to be consistent with those obtained with the uniform positive 1/\(M_\mathrm{S}^8\) prior.

Fig. 4
figure 4

Summary of 95 % C.L. lower exclusion limits on \(\Lambda \) for the combined dilepton contact interaction search, using a uniform positive prior in 1/\(\Lambda ^2\). Previous ATLAS search results [17, 49] are also presented for comparison. Exclusion limits were previously only set on the LL model

Fig. 5
figure 5

Summary of 95 % C.L. lower exclusion limits on \(M_\mathrm{S}\) for the combined dilepton ADD large extra dimensions search, using a uniform positive prior in 1/\(M_\mathrm{S}^8\). Previous ATLAS search results [17] are also presented for comparison. Exclusion limits were not previously set on the HLZ \(n=2\) ADD model

Table 6 Expected and observed 95 % C.L. lower exclusion limits on \(\Lambda \) for the LL, LR, and RR contact interaction search using a uniform positive prior in 1/\(\Lambda ^2\) or 1/\(\Lambda ^4\). The dielectron, dimuon, and combined dilepton channel limits are shown for both the constructive and destructive interference cases
Table 7 Expected and observed 95 % C.L. lower exclusion limits on \(M_\mathrm{S}\), using a uniform positive prior in \(1/M_\mathrm{S}^4\) or \(1/M_\mathrm{S}^8\). The dielectron, dimuon, and combined dilepton channel limits are shown for ADD signal in the GRW, Hewett and HLZ formalisms

For the ADD search results, the similar expected and observed exclusion limits within the separate channels are due to the small number of expected SM background events, which arise from the high mass threshold chosen for that search. This leads a large fraction of the PEs to return a result of zero expected events, and the median value of the ensemble (taken as the expected limit) to therefore also be zero expected events. For the combined dilepton channel, the total number of expected SM background events is large enough that a wider range of limits is obtained in the ensemble of PEs and the slight data deficit translates into stronger observed limits than expected.

10 Conclusions

A search for non-resonant new phenomena in the dilepton channel has been carried out using the 2012 LHC proton–proton collision dataset of 20 fb\(^{-1}\) at \(\sqrt{s} = 8\) TeV. This study builds upon previous ATLAS searches, using both dilepton invariant mass and the lepton \(\cos \theta ^*\) distribution (and by proxy \(A_\mathrm{FB}\)) as search variables. No significant deviations from the Standard Model predictions are observed and lower limits are placed on the scale of contact interactions and large extra dimensions. The most restrictive 95 % C.L. limits are obtained by combining the dielectron and dimuon channels, yielding \(\Lambda > 26.3\) TeV for the left-right contact interaction model with constructive interference and a prior flat in 1/\(\Lambda ^2\), and \(M_\mathrm{S} > 5.0\) TeV for the HLZ \(n=3\) ADD model with a prior flat in 1/\(M_\mathrm{S}^4\).