1 Introduction

The Large Hadron Collider (LHC) has opened up a new kinematic regime to test perturbative QCD (pQCD) using measurements of jet production. Next-to-leading-order QCD predictions for inclusive jet and dijet cross sections have been found to describe the data at the highest measured energies [1–7]. However, purely fixed-order calculations are expected to describe the data poorly wherever higher-order corrections to a given observable are important. In such cases, higher orders in perturbation theory must be resummed; this resummation is typically performed either in terms of \({\ln (1/x)}\), where \({x}\) is Bjorken-\(x\) (the Balitsky–Fadin–Kuraev–Lipatov (BFKL) approach [8–11]), or in terms of \({\ln (Q^2)}\), where \({Q^2}\) is the virtuality of the interaction (the Dokshitzer–Gribov–Lipatov–Altarelli–Parisi (DGLAP) approach [12–14]). These resummations provide approximations that are most valid in phase-space regions for which the resummed terms provide a dominant contribution to the observable. Such a situation exists in dijet topologies when the two jets have a large rapidity separation or when a veto is applied to additional jet activity in the rapidity interval bounded by the dijet system [15]. In these regions of phase-space, higher-order corrections proportional to the rapidity separation and the logarithm of the scalar average of the transverse momenta of the dijets become increasingly important: these must be summed to all orders to obtain accurate theoretical predictions.

When studying these phase-space regions, a particularly interesting observable is the gap fraction, \({f\left( {Q_{0}} \right) }\), defined as \({f\left( {Q_{0}} \right) = \sigma _{\mathrm {jj}}\left( {Q_{0}} \right) /\sigma _{\mathrm {jj}}}\), where \({\sigma _{\mathrm {jj}}}\) is the inclusive dijet cross section and \({\sigma _{\mathrm {jj}}\left( {Q_{0}} \right) }\) is the cross section for dijet production in the absence of jets with transverse momentum greater than \({Q_{0}}\) in the rapidity interval bounded by the dijet system. The variable \({Q_{0}}\) is referred to as the veto scale. In the limit of large rapidity separation, \({\Delta y}\), between the jet centroids, the gap fraction is expected to be sensitive to BFKL dynamics [16, 17]. Alternatively, when the scalar average of the transverse momenta of the dijets, \({\overline{p_{\mathrm{T}}}}\), is much larger than the veto scale, the effects of wide-angle soft gluon radiation may become important [18–20]. Finally, dijet production via \({t}\)-channel colour-singlet exchange [21] is expected to provide an increasingly important contribution to the total dijet cross section when both of these limits are approached simultaneously. The mean number of jets above the veto scale in the rapidity interval between the dijets is presented as an alternative measurement of hard jet emissions in the rapidity interval.
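As a concrete illustration of this definition, the following minimal sketch computes the gap fraction in bins of \({\Delta y}\) by counting events with no jet above \({Q_{0}}\) in the rapidity interval; the event dictionary keys `dy` and `veto_jet_pts` are hypothetical names for illustration only, not part of any real analysis framework.

```python
import numpy as np

def gap_fraction(events, dy_edges, Q0=20.0):
    """Gap fraction f(Q0) in bins of Delta-y: the fraction of dijet events
    with no jet above Q0 (in GeV) in the rapidity interval bounded by the
    two leading jets.  Each event is a dict with hypothetical keys 'dy'
    (dijet rapidity separation) and 'veto_jet_pts' (transverse momenta of
    any jets lying between the two leading jets in rapidity)."""
    dy = np.array([e["dy"] for e in events])
    is_gap = np.array([all(pt < Q0 for pt in e["veto_jet_pts"]) for e in events])
    n_all, _ = np.histogram(dy, bins=dy_edges)
    n_gap, _ = np.histogram(dy[is_gap], bins=dy_edges)
    # f(Q0) = sigma_jj(Q0) / sigma_jj, computed bin by bin with an empty-bin guard
    return np.divide(n_gap, n_all, out=np.zeros(len(n_all)), where=n_all > 0)
```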

A complementary probe of higher-order QCD effects can be made by studying the azimuthal angle between the jets in the dijet system, \({\Delta \phi }\). A purely \({2\rightarrow 2}\) partonic scatter produces final-state partons that are back-to-back in azimuthal angle. Any additional quark or gluon emission alters the balance between the partons and produces an azimuthal decorrelation, the predicted magnitude of which differs between fixed-order calculations, BFKL-inspired resummations and DGLAP-inspired resummations [22]. In particular, the azimuthal decorrelation is expected to increase with increasing rapidity separation if BFKL effects are present [23, 24]. To discriminate between DGLAP-like and BFKL-like behaviour, the azimuthal angular moments \({\langle \cos \left( n \left( \pi - \Delta \phi \right) \right) \rangle }\), where \({n}\) is an integer and the angled brackets indicate the profiled mean over all events, have been proposed [23–25]. In addition, taking the ratio of different angular moments is predicted to enhance BFKL effects [24, 26, 27].
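The moments themselves are straightforward to compute from an event sample; a minimal sketch with toy input follows. Note the identity \(\cos \left( 2\left( \pi - \Delta \phi \right) \right) = \cos \left( 2 \Delta \phi \right) \), so the \(n=2\) moment is the \(\langle \cos \left( 2 \Delta \phi \right) \rangle \) appearing in the ratio discussed later.

```python
import numpy as np

def azimuthal_moments(dphi, n_values=(1, 2)):
    """Profiled means <cos(n(pi - dphi))> over an event sample, where
    dphi is an array of azimuthal angles between the two leading jets.
    Since cos(2(pi - dphi)) = cos(2 dphi), the n = 2 moment equals the
    <cos(2 dphi)> used in the moment ratio."""
    dphi = np.asarray(dphi)
    return {n: np.mean(np.cos(n * (np.pi - dphi))) for n in n_values}

# Toy usage: a sample concentrated near the back-to-back limit.
rng = np.random.default_rng(seed=0)
moments = azimuthal_moments(rng.uniform(np.pi / 2, np.pi, size=10000))
ratio = moments[2] / moments[1]  # ratio predicted to enhance BFKL effects
```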

Previous measurements have been made of dijet production for which a strict veto, of order of \({\Lambda _\text {QCD}}\), was imposed on the emission of additional radiation in the inter-jet region, so-called “rapidity gap” events, at HERA [28–30] and at the Tevatron [31–35]. At the LHC, measurements of forward rapidity gaps and dijet production have been made by ATLAS [36, 37], while ratios of exclusive-to-inclusive dijet cross sections have been measured at CMS [38]. Azimuthal decorrelations for central dijets have also been measured at the LHC by ATLAS [39] and CMS [40] and before that by D0 at the Tevatron [41]. Finally, a previous study of azimuthal angular decorrelations for widely separated dijets was made by D0 at the Tevatron [42].

This paper presents measurements of the gap fraction and the mean number of jets in the rapidity interval as functions of both the dijet rapidity separation and the scalar average of the transverse momenta of the dijets. Measurements of the first azimuthal angular moment, the ratio of the first two moments and the double-differential cross sections as functions of \({\Delta \phi }\) and \({\Delta y}\) are also presented, both for an inclusive dijet sample and for events where a jet veto is imposed. Previous results are extended out to a dijet rapidity separation of \({{\Delta y} = 8}\) as well as to dijet transverse momenta up to \({{\overline{p_{\mathrm{T}}}} = 1.5\,\mathrm{TeV}}\), the effective kinematic limits of the ATLAS detector for \({pp}\) collisions at a centre-of-mass energy of \(\sqrt{s}=7\,\mathrm{TeV}\). The measurements are obtained using the full \({pp}\) collision datasets recorded during 2010 and 2011, corresponding to integrated luminosities of \({36.1\pm 1.3}\,\mathrm{pb}^{-1}\) and \({4.5\pm 0.1}\,\mathrm{fb}^{-1}\), respectively [43]. The two datasets are used in complementary areas of phase-space: the data collected during 2010 are used in the full rapidity range covered by the detector, probing large rapidity separations, with a veto scale of \(20\,\mathrm{GeV}\), while the data collected during 2011 are used in a restricted rapidity range with a veto scale of \(30\,\mathrm{GeV}\) but can access higher values of \({\overline{p_{\mathrm{T}}}}\).

The content of the paper is as follows. Section 2 describes the ATLAS detector; Sect. 3 details the Monte Carlo simulation samples used. Jet reconstruction and event selection are presented in Sects. 4 and 5 respectively. The correction for detector effects is described in Sect. 6 and the systematic uncertainties on the measurement are discussed in Sect. 7. Section 8 discusses the theoretical predictions before the results are presented in Sect. 9. Finally, the conclusions are given in Sect. 10.

2 The ATLAS detector

ATLAS [44] is a general-purpose detector surrounding one of the interaction points of the LHC. The main detector components relevant to this analysis are the inner tracking detector and the calorimeters; in addition, the minimum bias trigger scintillators (MBTS) are used for selecting events during early data taking. The inner tracking detector covers the pseudorapidity range \({{|{\eta } |} <2.5}\) and has full coverage in azimuthal angle. There are three main components to the inner tracker. In order, moving outwards from the beam-pipe, these are the silicon pixel detector, the silicon microstrip detector and the straw-tube transition radiation tracker. These components are arranged in concentric layers and immersed in a \(2\,\mathrm{T}\) magnetic field provided by the superconducting solenoid magnet.

The calorimeter is also divided into sub-detectors, providing overall coverage for \({{|{\eta } |} <4.9}\). The electromagnetic calorimeter, covering the region \({{|{\eta } |} <3.2}\), is a high-granularity sampling detector in which the active medium is liquid argon (LAr) interspaced with layers of lead absorber. The hadronic calorimeters are divided into three sections: a tile scintillator/steel calorimeter is used in both the barrel (\({{|{\eta } |} <1.0}\)) and extended barrel cylinders (\({0.8<{|{\eta } |} <1.7}\)) while the hadronic endcap (\({1.5<{|{\eta } |} <3.2}\)) consists of LAr/copper calorimeter modules. The forward calorimeter measures both electromagnetic and hadronic energy in the range \({3.2<{|{\eta } |} <4.9}\) using LAr/copper and LAr/tungsten modules.

The MBTS system consists of 32 scintillator counters, organised into two disks, one on each side of the detector. They are located in front of the end-cap calorimeter cryostats and cover the region \({2.1<{|{\eta } |} <3.8}\).

The online trigger selection used in this analysis employs the minimum bias and calorimeter jet triggers [45]. The minimum bias triggers are only available at the hardware level, while the calorimeter triggers have both hardware and software levels. The Level-1 (L1) hardware-based trigger provides a fast, but low-granularity, reconstruction of energy deposited in towers in the calorimeter; the Level-2 (L2) software implements a simple jet reconstruction algorithm in a window around the region triggered at L1; and finally the Event Filter (EF) performs a more detailed jet reconstruction procedure taking information from the entirety of the detector. The efficiency of jet triggers is determined using a bootstrap method, starting from the fully efficient MBTS trigger [45]. Between March and August of 2010, only L1 information was used to select events; both the L1 and L2 stages were used for the remainder of the 2010 data-taking period and all three levels were required for data taken during 2011.

3 Monte Carlo event simulation

Simulated proton–proton collisions at \({\sqrt{s} = 7\,\mathrm{TeV}}\) were generated using the pythia 6.4 [46] program. These were used only to derive systematic uncertainties and to correct for detector effects; for this purpose they are compared against uncorrected data. Additional samples used to compare theoretical predictions to the data are described in Sect. 8.

The pythia program implements leading-order (LO) QCD matrix elements for \({2\rightarrow 2}\) processes followed by \({p_{\mathrm{T}}}\)-ordered parton showers and the Lund string hadronisation model. The underlying event in pythia is modelled by multiple-parton interactions interleaved with the initial-state parton shower.

The events were generated using the MRST LO* parton distribution functions (PDFs) [47, 48]. Samples simulating the data-taking conditions of 2010 (2011) used version 6.423 (6.425) of the generator, together with the ATLAS AMBT1 [49] (Perugia 2011 [50]) underlying-event tune. For the samples simulating the 2011 data-taking conditions, additional \(pp\) collisions were overlaid onto the hard scatter in the correct proportions to replicate the pileup conditions in the data. The final-state particles were passed through a detailed geant4 [51] simulation of the ATLAS detector [52] before being reconstructed using the same software used to process the data.

4 Jet reconstruction

The collision events selected by the ATLAS trigger system were fully reconstructed offline. Energy deposits in the calorimeter left by electromagnetic and hadronic showers were calibrated to the electromagnetic (EM) scale. Three-dimensional topological clusters (“topoclusters”) [53] were constructed from seed calorimeter cells according to an iterative procedure designed to suppress electronic noise [54]. Each of these was then treated as a massless particle with direction given by its energy-weighted barycentre. The topoclusters were then passed as input to the FastJet [55] implementation of the \({\mathrm{anti}\hbox {-}{k_t}}\) jet algorithm [56] with distance parameter \({R=0.6}\) and full four-momentum recombination.
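For illustration, a minimal sketch of this clustering step is given below, assuming the FastJet Python bindings are available (the exact interface may differ between FastJet versions); the input four-vectors are placeholders standing in for calibrated topoclusters.

```python
import fastjet  # assumes the FastJet Python bindings are installed

# Anti-kt jet definition with distance parameter R = 0.6; the default
# E-scheme recombination corresponds to full four-momentum recombination.
jet_def = fastjet.JetDefinition(fastjet.antikt_algorithm, 0.6)

# Each topocluster is treated as a massless particle; the (px, py, pz, E)
# values below are placeholders for the energy-weighted barycentres.
clusters = [fastjet.PseudoJet(10.0, 0.0, 25.0, 26.93),
            fastjet.PseudoJet(-9.5, 1.0, 24.0, 25.83)]

cluster_seq = fastjet.ClusterSequence(clusters, jet_def)
jets = sorted(cluster_seq.inclusive_jets(), key=lambda j: -j.pt())
```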

The jets built by the \({\mathrm{anti}\hbox {-}{k_t}}\) algorithm were then calibrated in a multi-step procedure. Additional energy arising from “in-time pileup” (simultaneous \(pp\) collisions within a single bunch crossing) was subtracted using a correction derived from data. Each event was required to have at least one primary vertex reconstructed from two or more tracks, each with \({{p_{\mathrm{T}}} > 400\,\mathrm{MeV}}\); the primary vertex with the highest \({\sum {p_{\mathrm{T}}^2}}\) of associated tracks was identified as the origin of the hard scatter. The jet position was recalibrated to point to this hard-scatter primary vertex, rather than to the geometric centre of the detector. A series of \({p_{\mathrm{T}}}\)- and \({\eta }\)-dependent energy correction factors derived from simulated events were used to correct for the response of the detector to jets. For the data collected during 2011, an additional calibration step was applied: energy contributions from “out-of-time pileup” (residual electronic effects from previous \(pp\) collisions), which were usually negative, were corrected for using an offset correction derived from simulation.

A final in situ calibration, using \({Z}\)+jet balance, \({\gamma }\)+jet balance and multi-jet balance, was then applied to correct for residual differences in jet response between the simulation and data. The calibration procedure is described in more detail elsewhere [57, 58].

5 Event selection

The measurements were performed using only the data from specific runs and run periods in which the detector, trigger and reconstructed physics objects satisfied data-quality selection criteria. Beam background was rejected by requiring at least one primary vertex in each event, while selection requirements were applied to the hard-scatter vertex to minimise contamination from pileup. For data collected during 2010, the event was required to have only one primary vertex with five or more associated tracks; the proportion of such events was 93 % in the early low-luminosity runs, falling to 21 % in the high-luminosity runs at the end of the year. For data collected during 2011, the hard-scatter vertex was required to have at least three associated tracks. Jets arising from pileup were rejected using the jet vertex fraction (JVF). The JVF takes all tracks matched to the jet of interest and measures the ratio of the \({\sum {p_{\mathrm{T}}}}\) of tracks originating from the hard-scatter vertex to the \({\sum {p_{\mathrm{T}}}}\) of all tracks matched to the jet. Only jets with \({\text {JVF}>0.75}\) were used in this analysis; all other jets were considered to have arisen from pileup and were therefore ignored. Due to the limited coverage of the tracking detectors, the JVF is only available for jets satisfying \({{|y|} < 2.4}\), which limits the rapidity acceptance for the 2011 data.
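A minimal sketch of the JVF calculation for a single jet is given below; the track-matching and vertex association are assumed to have been done upstream, and the input numbers are illustrative only.

```python
def jet_vertex_fraction(track_pts, track_from_hs):
    """JVF for a single jet: scalar-summed pT of matched tracks from the
    hard-scatter vertex divided by the summed pT of all matched tracks.
    Returns None when the jet has no matched tracks (e.g. |y| > 2.4)."""
    total = sum(track_pts)
    if total == 0:
        return None
    hs_sum = sum(pt for pt, from_hs in zip(track_pts, track_from_hs) if from_hs)
    return hs_sum / total

# Keep the jet only if JVF > 0.75; otherwise treat it as pileup (toy numbers).
jvf = jet_vertex_fraction([12.0, 5.0, 2.0], [True, True, False])
keep_jet = jvf is not None and jvf > 0.75
```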

Due to the high instantaneous luminosity reached by the LHC, only high-threshold jet triggers remained unprescaled throughout the data-taking period in question; jets with transverse momentum below the threshold of the lowest unprescaled trigger were therefore only recorded using prescaled triggers. For data collected during 2011, \({\overline{p_{\mathrm{T}}}}\) was used to determine the most appropriate trigger for each event: among all triggers determined to be fully efficient at the particular \({\overline{p_{\mathrm{T}}}}\) in question, the least-prescaled one was selected, and only events passing this trigger were considered.
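Schematically, this per-event trigger choice can be sketched as follows, with a hypothetical menu of (name, plateau, prescale) tuples standing in for the real trigger configuration.

```python
def select_trigger(pt_bar, triggers):
    """For one 2011 event, pick the trigger to use given the event's
    pT-bar.  `triggers` is a hypothetical menu of (name, plateau_pt,
    prescale) tuples; among those fully efficient at this pT-bar, the
    least-prescaled one is chosen.  Returns None if none qualifies."""
    efficient = [t for t in triggers if t[1] <= pt_bar]
    return min(efficient, key=lambda t: t[2]) if efficient else None

# Toy menu: the event is kept only if the selected trigger actually fired.
menu = [("EF_j30", 45.0, 1000), ("EF_j100", 130.0, 1)]
chosen = select_trigger(150.0, menu)  # -> ("EF_j100", 130.0, 1)
```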

For the data collected during 2010, it was necessary to combine triggers from the central region (\({{|{\eta } |} \le 3.2}\)) and the forward region (\({3.1 < {|{\eta } |} \le 4.9}\)) of the detector, due to the large \({\Delta y}\) span under consideration. In each of these regions, efficiency curves as a function of jet transverse momentum were calculated for each trigger on a per-jet basis, rather than the per-event basis described above. The triggers were ordered according to their prescales and a lookup table was created, showing the point at which each trigger reached a plateau of 99 % efficiency.

An appropriate trigger was chosen for each of the two leading (highest transverse momentum) jets in each event; this was the lowest-prescale trigger to have reached its efficiency plateau at the relevant \({p_{\mathrm{T}}}\) and \({|y|}\). The dijet event was then accepted if the event satisfied the trigger appropriate to the leading jet, the subleading jet or both. This procedure maximised the event acceptance, since the random factor inherent in the trigger prescale meant that some events could be accepted based on the properties of the subleading jet even when the appropriate trigger for the leading jet had not fired. To combine overlapping triggers with different prescales, the procedure detailed in Ref. [59] was followed. In some less well-instrumented or malfunctioning regions of the detector, the per-jet trigger efficiency plateau fell below the usual 99 % point. This introduced a measurable trigger inefficiency, which was corrected for by weighting events containing jets in these regions by the inverse of the efficiency.

Jets were required to have transverse momentum \({p_{\mathrm{T}}} > 20 (30)\,\mathrm{GeV}\) for data collected during 2010 (2011), thus ensuring that they remained in a region for which the jet energy scale had been evaluated (see Sect. 4). Jets were restricted in rapidity to \({{|y|} < 4.4}\) for data collected in 2010, with a stricter requirement of \({{|y|} < 2.4}\) applied in 2011 to ensure that the JVF could be determined for all jets. The two leading jets satisfying these criteria were then identified as the dijet system of interest. The event was rejected if the transverse momentum of the leading jet was below \(60\,\mathrm{GeV}\) or if that of the subleading jet was below \(50\,\mathrm{GeV}\). For the data collected during 2011, a minimum rapidity separation, \({{\Delta y} \ge 1}\), was required to enhance the physics of interest. The veto scale, \({Q_{0}}\), was set to \(20\,(30)\,\mathrm{GeV}\) for data collected in 2010 (2011), matching the jet transverse momentum threshold.
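The kinematic selection above can be summarised in a short sketch; the jet records are reduced to illustrative (pT, y) pairs and the function is not taken from any real analysis code.

```python
def select_dijet(jets, year=2010):
    """Apply the kinematic selection to a pT-ordered list of (pt, y)
    jets (pt in GeV); the tuple layout is illustrative.  Returns the
    (leading, subleading) dijet system, or None if the event fails."""
    pt_min, y_max = (20.0, 4.4) if year == 2010 else (30.0, 2.4)
    good = [(pt, y) for pt, y in jets if pt > pt_min and abs(y) < y_max]
    if len(good) < 2:
        return None
    (pt1, y1), (pt2, y2) = good[0], good[1]
    if pt1 < 60.0 or pt2 < 50.0:
        return None  # leading / subleading pT thresholds
    if year == 2011 and abs(y1 - y2) < 1.0:
        return None  # 2011 only: minimum rapidity separation of one unit
    return (pt1, y1), (pt2, y2)
```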

Jet cleaning criteria [60, 61] were developed to reject fake jets: those arising from cosmic rays, beam halo or detector noise. These criteria also removed jets that were badly measured because they fell into poorly instrumented regions. Events collected in 2010 (2011) were rejected if they contained any jet with transverse momentum \({{p_{\mathrm{T}}} > 20 (30)\,\mathrm{GeV}}\) that failed these cleaning cuts. This requirement was also applied to the simulated samples where appropriate.

Additionally, a problem developed in the LAr calorimeter during 2011 running, resulting in a region in which energies were not properly recorded. As a result, a veto was applied to events with at least one jet with \({{p_{\mathrm{T}}} > 30\,\mathrm{GeV}}\) falling in the vicinity of this region during the affected data-taking periods. This effect was replicated in the relevant simulation samples, which were reweighted to the data to ensure that an identical proportion of such events was included.

In total, 1188583 events were accepted from the data collected in 2010, with 852030 of these being gap events: those with no additional jets above the veto scale in the rapidity interval between the dijets. For data collected during 2011, 1411676 events were accepted, with 938086 of these being gap events. Data from 2010 and from 2011 were compared after being analysed separately and were found to agree to within the experimental uncertainties in a region kinematically accessible with both datasets. Figure 1 shows the comparison between detector-level data and the pythia 6.4 simulation in dijet events. The normalised number of events is presented as a function of \({\Delta y}\) in Fig. 1a and of \({\overline{p_{\mathrm{T}}}}\) in Fig. 1b. In both cases, and in all similar distributions, the pythia 6.4 event generator and geant4 detector simulation give a fair description of the uncorrected data.

Fig. 1

Comparison between uncorrected data (black points) and detector level pythia 6.4 Monte Carlo events (solid line). Statistical errors on the data are shown. The normalised distribution of events is presented as a function of a \({\Delta y}\) and b \({\overline{p_{\mathrm{T}}}}\). The ratio of the pythia 6.4 prediction to the data is shown in the bottom panel

6 Correction for detector effects

Before comparing to theoretical predictions, the data are corrected for all experimental effects so that they correspond to the particle-level final state. This comprises all stable particles, defined as those with a proper lifetime longer than \(10\,\mathrm{ps}\), including muons and neutrinos from decaying hadrons [62]. The correction for detector resolutions and inefficiencies is made by unfolding the measured distributions using the Bayesian procedure [63] implemented in the RooUnfold framework [64].

Bayesian unfolding entails using simulated events to calculate a transfer matrix which encodes bin-to-bin migrations between particle-level distributions and the equivalent reconstructed distributions at detector level. A series of bin transition probabilities are obtained from the matrix, and Bayes’ theorem is used to calculate the corresponding inverse probabilities; the process is then repeated iteratively. In this paper, the unfolding is performed using the pythia 6.4 samples described in Sect. 3 and the number of iterations is set to two throughout, as this was found to be sufficient to achieve convergence.
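As an illustration of how such an unfolding might be set up, a minimal one-dimensional sketch using RooUnfold through PyROOT is given below; the binning, histogram names and event loop are placeholders, and it assumes a ROOT installation with the RooUnfold library available.

```python
import ROOT  # assumes PyROOT plus a RooUnfold build for this ROOT version
ROOT.gSystem.Load("libRooUnfold")

nbins, x_lo, x_hi = 10, 0.0, 8.0                      # e.g. a Delta-y binning
h_measured_data = ROOT.TH1D("meas", "", nbins, x_lo, x_hi)

# Transfer matrix ("response"): filled from simulated events for which both
# the particle-level (truth) and detector-level (measured) values are known.
response = ROOT.RooUnfoldResponse(nbins, x_lo, x_hi)
for truth, measured, reconstructed in []:  # placeholder event loop
    if reconstructed:
        response.Fill(measured, truth)     # matched truth/reco pair
    else:
        response.Miss(truth)               # truth entry lost to inefficiency

# Two iterations of iterative Bayesian unfolding, as used in this paper.
unfold = ROOT.RooUnfoldBayes(response, h_measured_data, 2)
h_unfolded = unfold.Hreco()
```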

As the results shown here are constructed from multi-dimensional distributions, the correlations between these dimensions must also be taken into account when unfolding. Each distribution is unfolded in two or three dimensions, with these dimensions being relevant combinations of \({\Delta \phi }\), \({\Delta y}\), \({\overline{p_{\mathrm{T}}}}\), \({\cos \left( \pi - \Delta \phi \right) }\), \({\cos \left( 2 \Delta \phi \right) }\) and the classification of the event as gap or non-gap. The transfer matrices are therefore four-dimensional or six-dimensional, rather than the usual two-dimensional case, which allows the effect of all possible bin migrations to be evaluated.

The statistical uncertainties are estimated by performing pseudo-experiments [65]. For each pseudo-experiment, every event in data is assigned a weight drawn from a Poisson distribution with unit mean, and these weighted events are used to build a series of one thousand replicas of each distribution. Each of these replicas is unfolded, and the root-mean-square spread around the nominal value is taken as the statistical error on the unfolded result.
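A minimal sketch of the pseudo-experiment machinery follows, with the unfolding step represented by a generic `fill` function (here a simple histogram); all names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def pseudo_experiment_error(values, fill, n_replicas=1000):
    """RMS spread of an observable over Poisson-fluctuated replicas.
    `values` holds the per-event quantities; `fill` maps (weights,
    values) to the observable, standing in for the binned-and-unfolded
    distribution of the real analysis."""
    nominal = fill(np.ones(len(values)), values)
    replicas = [fill(rng.poisson(lam=1.0, size=len(values)), values)
                for _ in range(n_replicas)]
    return np.sqrt(np.mean((np.asarray(replicas) - nominal) ** 2, axis=0))

# Toy usage: statistical error on a histogram of Delta-y values.
dy, edges = rng.uniform(0, 8, size=5000), np.linspace(0, 8, 9)
err = pseudo_experiment_error(
    dy, lambda w, v: np.histogram(v, bins=edges, weights=w)[0])
```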

Possible bias arising from mismodelling of the distributions considered here is evaluated by performing a self-consistency check using pythia 6.4 events. The pythia 6.4 simulation is reweighted on an event-by-event basis using a three-dimensional function chosen such that the reweighted sample approximates the uncorrected detector-level data. For the pythia sample simulating the 2010 data-taking conditions, the reweighting is carried out as a function of \({\Delta y}\), \({\Delta \phi }\) and the highest \({p_{\mathrm{T}}}\) among jets in the rapidity interval; for the sample simulating the 2011 conditions, \({\overline{p_{\mathrm{T}}}}\) is used instead of \({\Delta y}\). This reweighted detector-level pythia 6.4 sample is then unfolded using the original transfer matrix and the result is compared with the particle-level spectrum, which was itself implicitly modified through the event-by-event reweighting. Any remaining difference between these distributions is taken as a systematic uncertainty associated with the unfolding procedure.

7 Systematic uncertainties

For the most part, the dominant systematic uncertainty on these measurements is the one coming from the jet energy scale (JES) calibration. This uncertainty was determined using a combination of in situ calibration techniques, as detailed earlier, test-beam data and Monte Carlo modelling [57, 58]. It comprises 13 independent components for data taken in 2010 and 64 components for the 2011 data. The uncertainty components fully account for the differences in jet calibration discussed in Sect. 4, thus ensuring that data collected in the two years are fully compatible within their uncertainties.

For each component, all jet energies and transverse momenta are shifted up or down by one standard deviation of the uncertainty and the shifted jets are then passed through the full analysis chain. The measured distributions are unfolded and compared to the nominal distribution; the difference between these is taken as the uncertainty for the component in question. These fractional differences are then combined in quadrature, since the components are uncorrelated, to compute the jet energy scale uncertainty.
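Schematically, and assuming the nominal and shifted unfolded distributions are available as NumPy arrays, the quadrature combination might look as follows (a sketch, not the actual analysis code).

```python
import numpy as np

def jes_uncertainty(nominal, shifted_up, shifted_down):
    """Quadrature combination of independent JES components.  The inputs
    are the nominal unfolded distribution and, per component, the
    distributions unfolded after shifting all jets up or down by one
    standard deviation of that component (NumPy arrays, one bin per entry)."""
    frac_up = np.sqrt(sum(((u - nominal) / nominal) ** 2 for u in shifted_up))
    frac_down = np.sqrt(sum(((d - nominal) / nominal) ** 2 for d in shifted_down))
    return frac_up, frac_down  # fractional uncertainty per bin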

The uncertainty on the energy resolution of jets (JER) is derived in situ, using dijet balance techniques and the bisector method; it is then cross-checked through comparison with simulation [66]. Jet angular resolutions are estimated using simulated events and cross-checked using in situ techniques, where good agreement is observed with the simulation. These resolution uncertainties are propagated through the unfolding procedure by smearing the energy or angle of each reconstructed jet in each simulated event by a Gaussian function, with its width given by the quadratic difference between the nominal resolution and the resolution after shifting by the resolution uncertainty. This procedure is repeated one thousand times for each jet, to remove the effects of statistical fluctuations. The resulting smeared events are used to calculate a modified transfer matrix incorporating the resolution uncertainty; this matrix is then used to unfold the data. The ratio of this distribution to the distribution unfolded using the nominal transfer matrix is taken as a systematic uncertainty.
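The smearing step might be sketched as follows, with all inputs illustrative (absolute resolutions in GeV); the repeated draws per jet mirror the thousand-fold repetition described above.

```python
import numpy as np

rng = np.random.default_rng(seed=2)

def smear_jet_energy(e_reco, sigma_nominal, sigma_shifted, n_repeat=1000):
    """Smear one reconstructed jet energy by a Gaussian whose width is
    the quadratic difference between the shifted and nominal absolute
    resolutions (GeV); repeating n_repeat times per jet washes out
    statistical fluctuations, as described in the text."""
    width = np.sqrt(max(sigma_shifted ** 2 - sigma_nominal ** 2, 0.0))
    return e_reco + rng.normal(0.0, width, size=n_repeat)

smeared = smear_jet_energy(100.0, sigma_nominal=8.0, sigma_shifted=9.0)
```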

The trigger efficiency correction which is applied to events with jets falling into poorly measured detector regions also has an associated systematic uncertainty. This is determined by increasing or decreasing the measured inefficiencies by an absolute shift of 10 %, with a maximum efficiency of 100 %. The full correction procedure is then carried out using these new correction factors and the difference between this and the nominal distribution is taken as a systematic uncertainty on the measurement.

The effect of statistical fluctuations in the samples used to derive these uncertainties is also estimated by performing pseudo-experiments. Each event in the sample is assigned a weight drawn from a Poisson distribution with unit mean for each pseudo-experiment and these weighted events are used to build a series of one thousand replicas of the transfer matrices. These replicas are then used to unfold the nominal data sample; the root-mean-square spread around the nominal value provides the systematic error due to the limited statistical precision of the Monte Carlo samples used.

For each of the systematic variations considered, the pseudo-experiment approach applied to the data is used to determine statistical uncertainties on each distribution and correlations between bins. Each fractional uncertainty is smoothed to remove these statistical fluctuations before the uncertainties are combined. To do this, each systematic component is rebinned until every bin shows a statistically significant deviation from the nominal value. The rebinned distribution is then smoothed using a Gaussian kernel and the smoothed function is evaluated at each of the original bin centres. The overall fractional uncertainty is then obtained by summing the individual sources in quadrature.
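A minimal sketch of the Gaussian-kernel smoothing is given below; the rebinning is assumed to have been performed already, and the kernel width is a free parameter of this illustration rather than a value taken from the analysis.

```python
import numpy as np

def smooth_systematic(bin_centres, fractional_unc, kernel_width):
    """Gaussian-kernel smoothing of a rebinned fractional uncertainty,
    evaluated back at the original bin centres.  The rebinning step is
    assumed to have been applied already."""
    x = np.asarray(bin_centres, dtype=float)
    y = np.asarray(fractional_unc, dtype=float)
    smoothed = np.empty_like(y)
    for i, xi in enumerate(x):
        w = np.exp(-0.5 * ((x - xi) / kernel_width) ** 2)  # Gaussian weights
        smoothed[i] = np.sum(w * y) / np.sum(w)
    return smoothed
```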

The uncertainty on the unfolding procedure, estimated as described in Sect. 6, is also important in some distributions. An additional uncertainty on the luminosity calibration is not included here: it amounts to \(3.5\,\%\) for the cross sections and cancels in all other distributions.

Other sources of uncertainty were examined, found to be negligible and therefore ignored. Specifically, residual pileup contributions from soft-scatter vertices were studied by dividing the data into two subsamples coming from high-luminosity and low-luminosity runs. The disagreement between these subsamples was found to be negligible and hence no separate systematic uncertainty was assigned. The effect of varying the cut applied on the JVF was also studied and was found to produce differences in the detector-level distributions. However, such differences were well-reproduced by the pythia 6.4 sample and the resulting deviations after unfolding were much smaller than other uncertainties. Accordingly, no systematic uncertainty was assigned here either.

Figure 2 shows the summary of systematic uncertainties for two sample distributions: Fig. 2a for the gap fraction as a function of \({\Delta y}\) and Fig. 2b for the \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distribution as a function of \({\overline{p_{\mathrm{T}}}}\).

Fig. 2

Summary of systematic uncertainties on a the gap fraction as a function of \({\Delta y}\) and b the \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distribution as a function of \({\overline{p_{\mathrm{T}}}}\). Here the systematic uncertainties from the jet energy scale (dark dashes), Monte Carlo statistical precision (dark dots), jet energy resolution (dark dashed-dots), unfolding (light dashes), jet \({\phi }\) resolution (light dots) and from residual trigger inefficiencies (light dashed-dots) are shown. The total systematic uncertainty (light cross-hatched area) is also shown

8 Theoretical predictions

Two state-of-the-art theoretical predictions, namely High Energy Jets (HEJ) [16, 67] and the powheg Box [68–70], are considered in this paper.

hej provides a leading-logarithmic (LL) calculation of the perturbative terms that dominate the production of multi-jet events when the jets span a large range in rapidity [16, 67, 71]. This formalism resums logarithms relevant in the Mueller–Navelet [72] limit, and incorporates a contribution from all final states with at least two hard jets. The purely partonic multi-jet output from hej can also be interfaced to the ariadne parton shower framework [73] to evolve the prediction to the hadron-level final state [74]. The ariadne program is based on the colour-dipole cascade model [75] in which gluon emissions are modelled as radiation from colour-connected partons, and provides soft and collinear radiation down to the hadronic scale, using pythia 6.4 for hadronisation. This accounts for radiation in the rapidity region outside the multi-jet system modelled by hej and represents a contribution from small-\(x\), BFKL-like, logarithmic terms.

The powheg Box (version r2169) provides a full next-to-leading-order dijet calculation and is interfaced to either pythia 8  [76] (AU2 tune with \(\alpha _\text {s}\) matching for the ISR [77]) or herwig  [78] (AUET1 tune [79]) to provide all-order resummation of soft and collinear emissions using the parton shower approximation. The advantage over the simple \({2\rightarrow 2}\) matrix elements is that the emission of an additional third hard parton is calculated exactly in pQCD, allowing observables that depend on the third jet to be calculated with good accuracy [80].

The prediction provided by powheg uses the DGLAP formalism, while that provided by hej is based on BFKL. For all theoretical predictions, events were generated using the CT10 PDF set [81]. The orthogonal error sets provided as part of the CT10 PDF set were used in order to evaluate the uncertainty inherent in the PDF, at the 68 % confidence level, following the CTEQ prescription [82]. The default choice for the renormalisation and factorisation scales was the transverse momentum of the leading parton in each event. The uncertainty due to higher-order corrections was estimated for the powheg prediction by increasing and decreasing the scale by a factor of two and taking the envelope of these variations. For the hej predictions, an envelope of nineteen scale variations was considered. These were constructed by varying each scale upwards and downwards by factors of 2 and \(\sqrt{2}\), but excluding those cases where the ratio between the two scale factors was greater than two.
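The count of nineteen variations can be verified with a short sketch; writing each scale factor as a power of \(\sqrt{2}\) turns the ratio condition into an exact integer comparison, avoiding floating-point ambiguity at the boundary.

```python
from itertools import product

# Scale factors 2^(k/2) for k = -2..2, i.e. {1/2, 1/sqrt(2), 1, sqrt(2), 2};
# |k_r - k_f| <= 2 is equivalent to the ratio of factors being at most two.
exponents = range(-2, 3)
variations = [(2 ** (k_r / 2), 2 ** (k_f / 2))
              for k_r, k_f in product(exponents, exponents)
              if abs(k_r - k_f) <= 2]
assert len(variations) == 19  # the nineteen variations quoted in the text
```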

The scale uncertainty and PDF uncertainties were combined in quadrature to construct an overall uncertainty for each prediction. The PDF uncertainties are small across all of the phase-space regions considered in this paper and hence the predominant contribution to the uncertainty comes from the scale uncertainty. Theoretical uncertainties on the hej+ariadne prediction are not currently calculable and are not shown here. The range covered by the central values of the powheg+pythia 8 and powheg+herwig predictions gives an estimate of the uncertainty inherent in the parton shower matching procedure. This range, together with the uncertainty band on the powheg+pythia 8 prediction, can be considered together as a total theoretical uncertainty on the NLO+DGLAP prediction that can be compared to the predictions given by hej and hej+ariadne.

Finally, in order to allow comparisons against the fully corrected data distributions presented in this paper, the partons from hej or the final-state particles from hej+ariadne, powheg+pythia 8 and powheg+herwig were clustered together using the same jet algorithm and parameters as for the data.

9 Results and discussion

The fully corrected data are compared to the theoretical predictions from powheg and hej described in Sect. 8. The powheg prediction is presented after parton showering, hadronisation and underlying-event simulation with either pythia 8 or herwig. As the uncertainties on these two powheg predictions are highly correlated, uncertainties are only shown on the powheg+pythia 8 prediction, with only the central value of the powheg+herwig prediction presented. Two hej curves are presented: one a pure parton-level prediction and the second after interfacing with the ariadne parton shower. The central value of the hej+ariadne prediction is shown, together with the statistical uncertainty on this prediction.

9.1 Gap fraction and mean jet multiplicity

Figures 3 and 4 show the gap fraction and the number of jets in the rapidity gap, respectively, as functions of \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\). Naïvely, it is expected from pQCD that the number of events passing the jet veto should be exponentially suppressed as a function of \({\Delta y}\) and \({\ln {\left( \overline{p_{\mathrm{T}}}/Q_{0}\right) }}\) due to the exchange of colour in the \({t}\)-channel [19]. However, non-exponential behaviour may become apparent in the tails of these distributions, as the steeply falling parton distribution functions can reduce the probability of additional quark and gluon radiation from the dijet system and so increase the gap fraction [15]. This can be understood by considering the behaviour at extreme values of \({\Delta y}\) or \({\overline{p_{\mathrm{T}}}}\), when all of the collision energy is used to create the dijet pair and little is available for additional radiation. As the gap fraction is expected to be a smooth function, it must begin increasing at some point so as to reach unity when this kinematic limit is reached. Furthermore, since the cross section for QCD colour-singlet exchange increases with jet separation [21], any contribution from such processes would also lead to an increase in the gap fraction at large \({\Delta y}\).

Fig. 3

The measured gap fraction (black dots) as a function of a \({\Delta y}\) and b \({\overline{p_{\mathrm{T}}}}\). The inner error bars represent statistical uncertainty while the outer error bars represent the quadrature sum of the systematic and statistical uncertainties. For comparison, the predictions from parton-level hej (light-shaded cross-hatched band), hej+ariadne (mid-shaded dotted band), powheg+pythia 8 (dark-shaded hatched band) and powheg+herwig (dotted line) are also included. The ratio of the theory predictions to the data is shown in the bottom panel

Fig. 4

The mean number of jets above the veto threshold in the rapidity interval bounded by the dijet system measured in data as a function of a \({\Delta y}\) and b \({\overline{p_{\mathrm{T}}}}\). For comparison, the hej, hej+ariadne, powheg+pythia 8 and powheg+herwig predictions are presented in the same way as Fig. 3

The data do indeed show exponential behaviour in Fig. 3 at low values of \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\), but deviate from purely exponential behaviour at the highest values of \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\), with the gap fraction reaching a plateau in both distributions. For the \({\overline{p_{\mathrm{T}}}}\) distribution, this plateau is qualitatively reproduced by all the predictions considered here, even those which do not provide good overall agreement with the data. The plateau observed in data for the \({\Delta y}\) distribution is not, however, as prominent in any of the theoretical predictions, which all continue to fall as \({\Delta y}\) increases. A similar excess was observed in previous experiments [28–35] and was attributed to colour-singlet exchange effects. However, here the spread of theoretical predictions is too large to allow definite conclusions to be drawn and improved calculations are needed before a quantitative statement can be made.

In the high-\({\Delta y}\) region, both powheg predictions slightly underestimate the gap fraction and hence overestimate the mean jet multiplicity in the rapidity interval. Partonic hej slightly overestimates the gap fraction for intermediate values of \({\Delta y}\). Interfacing hej to ariadne improves the description of the data across the \({\Delta y}\) spectrum.

powheg+pythia 8, which resums soft and collinear emissions through the parton shower approximation, provides a good description of the gap fraction and the mean jet multiplicity distributions as a function of \({\overline{p_{\mathrm{T}}}}\). On the other hand, the powheg+herwig model, which also provides a similar resummation, consistently predicts too much jet activity across the \({\overline{p_{\mathrm{T}}}}\) range. Conversely, hej, which does not attempt to resum these soft and collinear terms, provides a poor description of the data in the large \({\ln {\left( \overline{p_{\mathrm{T}}}/Q_{0}\right) }}\) limit. Significantly improved agreement with the data is seen when interfacing hej to the ariadne parton shower model, which performs a resummation of these terms. In fact, the prediction from hej+ariadne is similar to that from powheg+pythia 8 for most values of \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\).

9.2 Azimuthal decorrelations

Figure 5 shows the \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) and \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distributions, as functions of \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\), for inclusive dijet events. For the azimuthal moments, \({\langle \cos \left( n \left( \pi - \Delta \phi \right) \right) \rangle }\), a decrease (increase) in azimuthal correlation manifests as a decrease (increase) in the azimuthal moment. As the dijets deviate from a back-to-back topology, the second azimuthal moment falls more rapidly than the first (in the region \({{\Delta \phi } > \pi /2}\) where the majority of events lie). The ratio \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) is, therefore, expected to show a similar, but more pronounced, dependence on azimuthal correlation to that seen in the moments.

Fig. 5

The measured a, b \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) and c, d \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distributions as a function of a, c \({\Delta y}\) and b, d \({\overline{p_{\mathrm{T}}}}\). For comparison, the hej, hej+ariadne, powheg+pythia 8 and powheg+herwig predictions are presented in the same way as Fig. 3

The data show the expected qualitative behaviour of a decrease in azimuthal correlation with increasing \({\Delta y}\) and increase in azimuthal correlation with increasing \({\overline{p_{\mathrm{T}}}}\). Both powheg predictions underestimate the degree of azimuthal correlation except in the high-\({\overline{p_{\mathrm{T}}}}\) region, while hej predicts too much azimuthal correlation. In both cases, the changing degree of correlation with \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\) is, for the most part, well described by the predictions. The largest differences between the predictions and the data are seen at high \({\Delta y}\) and low \({\overline{p_{\mathrm{T}}}}\).

Additionally, it can be seen that the separation between the theoretical predictions for the ratio \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) is significantly greater than for the \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distribution alone, considering the uncertainties on these predictions. The ratio therefore gives enhanced discrimination between the DGLAP-like powheg and BFKL-like hej predictions, as anticipated by theoretical calculations [83]. Here, neither hej nor powheg provides good agreement with the data. The hej+ariadne prediction, however, gives a good description of the data both at low \({\overline{p_{\mathrm{T}}}}\) and as a function of \({\Delta y}\).

Figure 6 shows the corresponding \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) and \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distributions for events that pass the veto requirement on additional jet activity in the rapidity interval bounded by the dijet system. In this case, with the jet veto suppressing additional quark and gluon radiation, the spectra show the opposite behaviour, namely a slight increase in correlation with \({\Delta y}\), which now agrees with the rise seen in the \({\overline{p_{\mathrm{T}}}}\) distribution. This can be explained by considering that as \({\Delta y}\) or \({\overline{p_{\mathrm{T}}}}\) increase, the veto requirement imposes an increasingly back-to-back topology on the dijet system. The spread of theoretical predictions is again large in each distribution, with the powheg predictions having too much decorrelation and hej predicting too little decorrelation.

Fig. 6

The measured a, b \({\langle \cos \left( \pi - \Delta \phi \right) \rangle }\) and c, d \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) distributions, for gap events as a function of a, c \({\Delta y}\) and b, d \({\overline{p_{\mathrm{T}}}}\). The veto scale is \({Q_{0}}\) = 20 \((30)\,\mathrm{GeV}\) for data collected during 2010 (2011). For comparison, the hej, hej+ariadne, powheg+pythia 8 and powheg+herwig predictions are presented in the same way as Fig. 3

The use of the ariadne parton shower again brings hej into better agreement with the data, although the agreement is not as good as in the inclusive case. The best agreement is given by powheg+pythia 8, especially in the highest \({\overline{p_{\mathrm{T}}}}\) bins. hej agrees less well with the data than in the inclusive case, showing that this region of widely separated hard jets without additional radiation is not well reproduced by the hej calculation. A quantitative statement about the degree of agreement between hej+ariadne and the data cannot, however, be made in the absence of theoretical uncertainties on this calculation.

Figures 7 and 8 show the double-differential dijet cross sections as functions of \({\Delta \phi }\) and \({\Delta y}\) for inclusive and gap events respectively. The predictions from powheg+pythia 8 and powheg+herwig provide a good overall description of the measured cross sections, within the experimental and theoretical uncertainties, with the only notable deviations occurring at high \({\Delta \phi }\) in the lowest \({\Delta y}\) bins. This is in agreement with the observations in previous ATLAS studies [1]. hej underestimates the cross section seen in data throughout the \({\Delta y}\) range, although it provides a good description of the overall shape. This underestimate is noticeably enhanced when only gap events are considered, a regime far from the wide-angle, hard-emission limit for which the underlying resummation procedure is valid.

Fig. 7

The measured double-differential cross sections (black points) as a function of \({\Delta \phi }\) for eight slices in \({\Delta y}\). For comparison, the hej, hej+ariadne, powheg+pythia 8 and powheg+herwig predictions are presented in the same way as Fig. 3. In a the absolute comparison is shown, while in b the ratios of the predictions to the data are shown

Fig. 8

The measured double-differential cross sections as a function of \({\Delta \phi }\) for eight slices in \({\Delta y}\), for gap events. The veto scale is \({{Q_{0}} = 20\,\mathrm{GeV}}\). For comparison, the hej, hej+ariadne, powheg+pythia 8 and powheg+herwig predictions are presented in the same way as Fig. 3. In a the absolute comparison is shown, while in b the ratios of the predictions to the data are shown

10 Summary

Theoretical predictions based on perturbative QCD are tested by studying dijet events in extreme regions of phase space. Measurements of the gap fraction, as a function of both the rapidity separation and the average dijet transverse momentum, together with the azimuthal decorrelation are presented as functions of \({\Delta y}\) and \({\overline{p_{\mathrm{T}}}}\), extending previous studies up to eight rapidity units in \({\Delta y}\) and \(1.5\,\mathrm{TeV}\) in \({\overline{p_{\mathrm{T}}}}\). The measurements are used to investigate the predicted breakdown of DGLAP evolution and the appearance of BFKL effects by comparing the data to the all-order resummed leading-logarithmic calculations of hej and the full next-to-leading-order calculations of powheg. The full data sample collected with the ATLAS detector in \(7\,\mathrm{TeV}\) \(pp\) collisions at the LHC is used, corresponding to integrated luminosities of \({36.1}\,\mathrm{pb}^{-1}\) and \({4.5}\,\mathrm{fb}^{-1}\) for data collected during 2010 and 2011 respectively.

The data show the expected behaviour of a reduction of gap events, or equivalently, an increase in jet activity, for large values of \({\overline{p_{\mathrm{T}}}}\) and \({\Delta y}\), together with an associated rise in the number of jets in the rapidity interval. The azimuthal moments show an increase in correlation with increasing \({\overline{p_{\mathrm{T}}}}\) and an increase in (de)correlation with increasing \({\Delta y}\) for gap (all) events. The expected increase in cross section with \({\Delta \phi }\) is also seen.

The powheg+pythia 8 prediction provides a reasonable description of the data in most distributions, but shows disagreement in some areas of phase-space, particularly for the inclusive azimuthal distributions in the limit of large \({\Delta y}\) or small \({\overline{p_{\mathrm{T}}}/Q_{0}}\). When the herwig parton shower is used instead of pythia 8, the agreement with data worsens as herwig predicts too many jets above the veto scale. The partonic hej prediction provides a poor description of the data in most of the distributions presented, with the exception of the gap fraction and jet multiplicity distributions as a function of \({\Delta y}\). The addition of the ariadne parton shower, which accounts for some of the soft and collinear terms ignored in the hej approximation, brings the prediction closer to powheg+pythia 8.

No single theoretical prediction is able to simultaneously describe the data over the full phase-space region considered here; in general, however, the best agreement is given by powheg+pythia 8 and hej+ariadne. The variable best able to discriminate between the DGLAP-like prediction from powheg+pythia 8 and the BFKL-like prediction from hej is \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\). Here, it can clearly be seen that neither of these predictions describes the data accurately as a function of either \({\Delta y}\) or \({\overline{p_{\mathrm{T}}}}\). It should be noted, however, that when the inclusive event sample is considered, the hej+ariadne model, a combination of BFKL-like parton dynamics with the colour-dipole cascade model, provides a good description of \({\langle \cos \left( 2 \Delta \phi \right) \rangle / \langle \cos \left( \pi - \Delta \phi \right) \rangle }\) in the large \({\Delta y}\) and small \({\overline{p_{\mathrm{T}}}}\) regions, where the powheg models show some divergence from the data.

In most of the phase-space regions presented, the experimental uncertainty is smaller than the spread of theoretical predictions. These disparities between predictions represent a genuine difference in the modelling of the underlying physics, and the data can therefore provide a crucial input for constraining parton-shower models in the future, particularly in the case of QCD radiation between widely separated or high transverse momentum dijets. Improved theoretical predictions are essential before any conclusions can be drawn about the presence or otherwise of BFKL effects or colour-singlet exchange in these data.