1 Introduction

In high-energy proton-proton collisions, an interaction with large momentum transfer between two partons may lead to the production of a pair of jets with large transverse momenta \(p_{\mathrm {T}}\). Dijet production at the LHC [1,2,3,4,5,6,7,8,9,10,11,12] is generally well described by perturbative quantum chromodynamics (pQCD) calculations based on the Dokshitzer–Gribov–Lipatov–Altarelli–Parisi (DGLAP) evolution equations [13,14,15]. The DGLAP equations govern the emission of additional softer partons, ordered in transverse momentum \(k_\mathrm {T}\) with respect to the jets axes. However, when the two jets are separated by a large interval in pseudorapidity (\(\eta \)), an alternative pQCD evolution based on the Balitsky–Fadin–Kuraev–Lipatov (BFKL) equations [16,17,18] is expected to describe the data better [19]. In the BFKL approach, the emission of additional partons is ordered in \(\eta \sim \ln (1/x)\), where x is the fractional momentum carried by the radiated parton.

The events considered in this study are pp collisions where two jets are produced with a large rapidity gap between them. The absence of particles between the jets is reminiscent of a diffractive process [20], in which a color-singlet exchange (CSE) takes place between the interacting partons. In diffractive processes, such an exchange is described in terms of the pomeron, a combination of gluons in a color-singlet state. However, the absolute value of the four–momentum squared exchanged in standard diffractive events (less than a few \(\,\text {GeV} ^2\)) is much smaller than that in the events considered here. Such events can be understood in a BFKL-inspired approach in terms of the exchange of a color-singlet gluon ladder (Fig. 1), as first discussed by Mueller and Tang in Ref. [21] and further developed in Refs. [22,23,24]. Jet-gap-jet events in proton–proton collisions may be affected by additional scatterings among the spectator partons, which can destroy the original rapidity gap. Such a contribution is typically described by a non-perturbative quantity, the so-called gap survival probability, which quantifies the fraction of events where the rapidity gap is not destroyed by interactions between spectator partons [19].

Fig. 1
figure 1

Schematic diagram of a dijet event with a rapidity gap between the jets (jet-gap-jet event). The gap is defined as the absence of charged particle tracks above a certain \(p_{\mathrm {T}}\) threshold

Jet-gap-jet events were first observed in \(\mathrm {p}\overline{\text {p}} \) collisions at the Tevatron by D0 [25,26,27] and CDF [28,29,30], and in \(\text {e}^\pm \mathrm {p}\) collisions at HERA [31, 32]. At the Tevatron, the fraction of dijet events produced through CSE was found to be \(\sim \)1% at \(\sqrt{s}=1.8\) \(\,\text {TeV}\), a factor of 2–3 less than at \(\sqrt{s}=0.63\) \(\,\text {TeV}\). This paper presents the first observation of jet-gap-jet events at the LHC, and the measurement of the CSE fraction at \(\sqrt{s}=7\) \(\,\text {TeV}\), using events with two leading jets of \(p_{\mathrm {T}} ^{\text {jet}}>40\) \(\,\text {GeV}\) and \(1.5<|\eta ^{\text {jet}} |<4.7\), reconstructed in opposite ends of the CMS detector. The CSE signal is extracted from the distribution of the charged-particle multiplicity in the central region \(|\eta |<1\) between the jets, for particles with \(p_{\mathrm {T}} >0.2\) \(\,\text {GeV}\). The CSE fraction is studied as a function of the pseudorapidity separation \(\varDelta \eta _\mathrm {jj}\) between the jets, and of the \(p_{\mathrm {T}}\) of the second-leading jet, as done by the D0 experiment [27].

The data used for this measurement correspond to an integrated luminosity of 8\(\,\text {pb}^{-1}\) and were recorded with the CMS detector in the year 2010, when the LHC operated at \(\sqrt{s}=7\) \(\,\text {TeV}\) with low probability of overlapping \(\mathrm {p}\mathrm {p}\) interactions (pileup).

2 The CMS detector and event reconstruction

The central feature of the CMS apparatus is a superconducting solenoid of 6\(\text {\,m}\) internal diameter. Within the field volume are the silicon pixel and strip tracker, the crystal electromagnetic calorimeter (ECAL), and the brass and scintillator hadronic calorimeter (HCAL). Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid.

The silicon tracker measures charged particles within the pseudorapidity range \(|\eta | < 2.5\). It consists of 1440 silicon pixel and 15,148 silicon strip detector modules. For nonisolated particles of \(1< p_{\mathrm {T}} < 10\,\text {GeV} \) and \(|\eta | < 1.4\), the track resolutions are typically 1.5% in \(p_{\mathrm {T}}\) and 25–90 (45–150)\(\,\mu \text {m}\) in transverse (longitudinal) impact parameter. The silicon tracker provides the primary vertex position with \(\sim \)15\(\,\mu \text {m}\) resolution for jet events of the type considered in this analysis [33].

In the region \(| \eta |< 1.74\), the HCAL cells have widths of 0.087 in both \(\eta \) and azimuth (\(\varphi \), in radians). In the \(\eta \)-\(\varphi \) plane, and for \(|\eta |< 1.48\), the HCAL cells map onto 5\({}\times {}\)5 ECAL crystal arrays to form calorimeter towers projecting radially outwards from the nominal interaction point. At larger values of \(| \eta |\), the size of the towers increases and the matching ECAL arrays contain fewer crystals. In addition to the barrel and endcap detectors, CMS has extensive forward calorimetry. The forward component of the hadron calorimeter \((2.9<|\eta |<5.2)\) consists of steel absorbers with embedded radiation-hard quartz fibers, providing fast collection of Cherenkov light.

A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [34].

The first level of the CMS trigger system [35], composed of custom hardware processors, uses information from the calorimeters and muon detectors to select the most interesting events in a fixed time interval of less than 3.2\(\,\mu \text {s}\). The high-level trigger processor farm further decreases the event rate from around 100\(\text {\,kHz}\) to around 400\(\text {\,Hz}\), before data storage.

Tracks are reconstructed with the standard iterative algorithm of CMS, which is based on a combinatorial track finder that uses information from the silicon tracker. To reduce the misidentification rate, tracks are required to pass standard CMS quality criteria, usually referred to as ’high-purity’ criteria [33]. These place requirements on the number of hits, the \(\chi ^2\) of the track fit, and the degree of compatibility with the hypothesis that the track originates from a vertex reconstructed with the pixel detector. The requirements are functions of the track \(p_{\mathrm {T}}\) and \(\eta \), as well as the number of layers with a hit. A more detailed discussion on the combinatorial track finder algorithm and the high-purity track definition can be found in Ref. [33].

The jets are reconstructed using the infrared- and collinear-safe anti-\(k_{\mathrm {T}}\) algorithm [36, 37], with a distance parameter \(R=0.5\), starting from the particles identified with the particle-flow method [38]. The key feature of the anti-\(k_{\mathrm {T}}\) algorithm is the resilience of the jet boundary with respect to soft radiation. This leads to cone-shaped hard jets. Soft jets tend to have more complicated shapes. The jet momentum is determined as the vector sum of all particle momenta in the jet, and is found in the simulation to be within 5 to 10% of the true hadron-level momentum over the whole \(p_{\mathrm {T}} ^{\text {jet}}\) spectrum and detector acceptance. When combining information from the entire detector, the jet energy resolution for jets with \(p_{\mathrm {T}} ^{\text {jet}} =40\) \(\,\text {GeV}\)  (200\(\,\text {GeV}\)) is about 12% (7%) for \(|\eta ^{\text {jet}} |<0.5\) and about 10% for \(4< |\eta ^{\text {jet}} | < 4.5\) [39]. Jet energy corrections are derived from the simulation, and are confirmed with in situ measurements of the energy balance in dijet and photon+jet events [40]. No jet energy corrections related to the removal of pileup contributions [41] are required for the jets studied in this analysis.

3 Monte Carlo simulation

The simulation of inclusive dijet events is performed using the pythia 6.422 Monte Carlo (MC) event generator [42]. pythia 6 is based on the leading order (LO) DGLAP evolution equations combined with a leading-logarithmic (LL) resummation of soft gluon emission in the parton shower, and uses the Lund string fragmentation model [43] for hadronization. The underlying event in pythia 6 includes particles produced in the fragmentation of minijets from multiple parton interactions (MPI), initial- and final-state radiation, as well as proton remnants. The events were simulated using the Z2* tune [44], which was developed to reproduce the CMS underlying event data at center-of-mass energies up to 7\(\,\text {TeV}\). pythia 6 models the production of diffractive dijets (leading to a final state with a gap-jet-jet topology) and of central diffractive and exclusive dijets (leading to a gap-jet-jet-gap final-state). However, it does not directly generate the jet-gap-jet topology considered here unless a fluctuation in the radiation and hadronization of the parton showers in inclusive dijet production randomly leads to suppressed hadronic activity between the jets.

Jet-gap-jet events are simulated with the default tune of the herwig 6.520 generator [45] (switching on CSE production, and switching off all other processes). The herwig 6 generator simulates events with hard color-singlet exchange between two partons according to the model by Mueller and Tang [21], which is based on simplified (LL) BFKL calculations. The hadronization process in herwig is based on cluster fragmentation: at the end of the perturbative parton evolution, clusters are built and then decayed into the final-state hadrons. The herwig 6 generator does not include any modeling of MPI; they are instead simulated with the jimmy package [46]. For simplicity, unless stated otherwise, by herwig 6 we herafter refer to the combination of this MC generator with jimmy. The herwig 6 generator predicts a decrease of the CSE fraction with increasing \(p_{\mathrm {T}}\) of the jets, but the Tevatron data show instead the opposite trend [25, 28]. In the present analysis, the events generated with herwig 6 are reweighted with an exponential function, \(\exp (\mathrm {b} \, p_{\mathrm {T}} ^{\text {jet2}}\)) with \(\mathrm {b}=0.01\,\text {GeV} {}^{-1}\), to ensure that the CMS data are reproduced. In the following, this sample of reweighted herwig events will be referred to as the herwig 6 sample.

Both pythia 6 and herwig 6 use the CTEQ6L1 parametrization of the proton parton distribution functions [47]. The simulated events are processed and reconstructed in the same manner as the collision data. A detailed MC simulation of the CMS detector response is performed with the Geant4 toolkit [48].

4 Data samples and dijet event selection

Three non-overlapping samples of dijet events are used, corresponding to the following three \(p_{\mathrm {T}} ^{\text {jet}}\) ranges, defined in terms of the \(p_{\mathrm {T}}\) of the second leading jet in the dijet system, \(p_{\mathrm {T}} ^{\text {jet2}}\): 40–60, 60–100, and 100–200\(\,\text {GeV}\). The first two samples were selected online with dijet triggers with 15 and 30\(\,\text {GeV}\) thresholds on the uncorrected jet \(p_{\mathrm {T}} \), respectively, while the third sample was collected with a single jet trigger with uncorrected jet \(p_{\mathrm {T}}\) threshold of 70\(\,\text {GeV}\). This selection maximizes the amount of dijet events for the analysis and ensures high dijet reconstruction efficiency. The triggers for the first two samples were heavily prescaled. The three samples correspond to integrated luminosities of 48, 410, and 8320\(\,\text {nb}^{-1}\), respectively. The mean number of inelastic \(\mathrm {p}\mathrm {p}\) interactions per bunch crossing (pileup) in each of the three samples is 1.16, 1.17, and 1.60, respectively.

The following conditions are imposed offline on all samples:

  • events are required to contain at least two jets that pass the standard CMS quality criteria [49];

  • the number of primary vertices with more than zero degrees of freedom in the event, as defined in [33], is required to be 0 or 1;

  • a primary vertex, if present, is required to be within a longitudinal distance \(|z | < 24\) cm from the nominal interaction point;

  • events with long horizontal sections of the pixel tracker traversed by charged particles parallel to the beam (beam-scraping events) are rejected using a dedicated algorithm [50].

In order to allow for a sufficiently wide rapidity gap between the jets, the following conditions are further imposed on the jets:

  • the two leading jets are required to be in the range \(1.5<|\eta ^\text {jet} |< 4.7\);

  • the two leading jets are required to be in opposite hemispheres: \(\eta ^\mathrm {jet1} \, \eta ^\text {jet2} < 0\).

Fig. 2
figure 2

Schematic picture of a jet-gap-jet event in the \(\varphi \) vs. \(\eta \) plane. The circles indicate the two jets reconstructed on each side of the detector, while the dots represent the remaining hadronic activity in the event. The shaded area corresponds to the region of the potential rapidity gap, in which the charged-particle multiplicity is measured (the so-called gap region)

The single- or zero-vertex requirement rejects most of the events with pileup interactions, which can hide an existing rapidity gap. At the same time, it may reject dijet events in which one true primary vertex is wrongly reconstructed as two or more; however, the probability of such badly reconstructed vertices has been checked with the pythia 6 Z2* and herwig 6 simulations and found to be negligible. Selecting events with no reconstructed vertices increases the acceptance for signal events in which the two jets are produced outside the tracker coverage. Such events are estimated from the data to contribute about 10% of all CSE events. According to the simulations the residual fraction of pileup events in the sample is negligible.

There are 6196, 8197, and 9591 events that satisfy the above selection criteria in the \(p_{\mathrm {T}} ^\text {jet2}= 40\)–60, 60–100, and 100–200\(\,\text {GeV}\) jet samples, respectively.

5 Jet-gap-jet events

The charged-particle multiplicity (\(N_\text {tracks}\)) in the gap region between the two leading jets (the shaded area in Fig. 2) is used to discriminate between CSE and non-CSE events. The \(N_\text {tracks}\) variable is defined as the number of reconstructed particles with \(p_{\mathrm {T}} >0.2\) \(\,\text {GeV}\) in the interval \(|\eta | < 1\). Tracks are required to have a measured \(p_{\mathrm {T}}\) with relative uncertainty smaller than 10% (\(\sigma _{p_{\mathrm {T}}}/p_{\mathrm {T}} < 10\%\)), which reduces the contribution of tracks from secondary interactions. The chosen \(\eta \) range ensures a high track reconstruction efficiency and, at the same time, is wide enough to suppress most of the background events with smaller gaps produced via non-CSE fluctuations.

The separation between the jet axes corresponds to at least three units of \(\eta \) (for jets with \(|\eta ^\text {jet} |> 1.5\) and \(\eta ^\mathrm {jet1} \, \eta ^\text {jet2} < 0\)), the minimum gap width typically used in studies of diffractive interactions. For the majority of the events the gap region is far from the edges of jets, which reduces the contamination of soft radiation from the jet shower evolution.

Figure 3 shows the measured \(N_\text {tracks}\) distribution in different \(p^\text {jet2}_\mathrm {T}\) bins. In each \(p_{\mathrm {T}} ^\text {jet2}\) bin, the pythia 6 distribution is normalized to the integral of the number of events measured for \(N_\text {tracks}>3\), and the herwig 6 predictions are normalized to the number of events with \(N_\text {tracks}=0\) measured in the data. The data are satisfactorily described by the pythia 6 simulation, with the exception of the lowest multiplicity bins, in which a large excess of events is observed, consistent with a contribution from CSE events. This excess is well described by the reweighted herwig 6 generator, as seen in the data/MC ratio plots.

The leading and the second-leading jet \(p_{\mathrm {T}}\) spectra for events with no tracks reconstructed in the gap region \(|\eta |<1\) are presented in Fig. 4. The data, plotted in bins of \(p^\text {jet2}_\mathrm {T}\), are reproduced by the normalized herwig 6 CSE events. A very small contribution from pythia 6 events can be explained by fluctuations in the hadronization of (non-CSE) inclusive dijet events, with no particles or only neutral particles produced inside the gap region. Figure 5 shows the distributions of the azimuthal angle \(\varDelta \varphi ^\mathrm {jet1,2}\) between the jets (left), and of the ratio of the second-leading jet \(p_{\mathrm {T}}\) to the leading jet \(p_{\mathrm {T}} \), \(p_{\mathrm {T}} ^\text {jet2}/p_{\mathrm {T}} ^\mathrm {jet1}\) (right). The data, shown separately for events with no tracks and with more than three tracks reconstructed in the \(|\eta |<1\) region, are well described by the normalized simulations, which are dominated by CSE (herwig 6) and non-CSE (pythia 6) events, respectively. The peaks in the distributions at \(\varDelta \varphi ^\mathrm {jet1,2}=\pi \) and \(p_{\mathrm {T}} ^\text {jet2}/p_{\mathrm {T}} ^\mathrm {jet1}=1\) are narrower for events with no tracks, reflecting the fact that the CSE dijets are more balanced in azimuthal angle and momentum than the non-CSE ones, because of the extra radiation in the latter.

Fig. 3
figure 3

Distribution, uncorrected for detector effects, of the number of central tracks between the two leading jets in events with \(p_{\mathrm {T}} ^\text {jet2}\) = 40–60 (top left), 60–100 (top right), and 100–200 (bottom) \(\,\text {GeV}\), compared to the predictions of pythia 6 (inclusive dijets) and herwig 6 (CSE jet-gap-jet events). The pythia 6 and herwig 6 samples are normalized to the number of events measured for \(N_\text {tracks}>3\) and \(N_\text {tracks}=0\), respectively. Beneath each plot the ratio of the data yield to the sum of the normalized herwig 6 and pythia 6 predictions is shown. The vertical error bars indicate the statistical uncertainties

In order to quantify the contribution from CSE events, we measure the CSE fraction, \(f_\mathrm {CSE}\), defined as

$$\begin{aligned} f_\mathrm {CSE}=\frac{N_\text {events}^\mathrm {F}-N_\text {non-CSE}^\mathrm {F}}{N_\text {events}}, \end{aligned}$$
(1)

where \(N_\text {events}^\mathrm {F}\) is the number of events in the first bins of the multiplicity distribution (\(N_\text {tracks}<2\) or 3, as explained later in this section), \(N_\text {non-CSE}^\mathrm {F}\) is the estimated number of events in these bins originating from non-CSE events, and \(N_\text {events}\) is the total number of events considered. The \(f_\mathrm {CSE}\) fraction defined in this way is not sensitive to the trigger efficiencies and jet reconstruction uncertainties as they cancel in the ratio. While the extraction of \(N_\text {events}^\mathrm {F}\) and \(N_\text {events}\) is straightforward (event counting), the estimation of \(N_\text {non-CSE}^\mathrm {F}\) requires modeling of the non-CSE contributions, for which two data-driven approaches are considered.

Fig. 4
figure 4

Transverse momentum distributions, uncorrected for detector effects, of the leading jet (left) and the second-leading jet (right) in three dijet samples with \(p_{\mathrm {T}} ^\text {jet2} =\) 40–60, 60–100, and 100–200 \(\,\text {GeV}\) (from top to bottom) after all selections, for events with no tracks reconstructed in the gap region \(|\eta |<1\), compared to predictions of pythia 6 (inclusive dijets) and herwig 6 (CSE jet-gap-jet events), normalized as in Fig. 3. The error bars indicate the statistical uncertainties

Fig. 5
figure 5

Distributions, uncorrected for detector effects, of the azimuthal angle \(\varDelta \varphi ^\mathrm {jet1,2}\) between the two leading jets (left) and the ratio \(p_{\mathrm {T}} ^\text {jet2}/p_{\mathrm {T}} ^\mathrm {jet1}\) of the second-leading jet \(p_{\mathrm {T}}\) to the leading jet \(p_{\mathrm {T}}\) (right) for events after all selections, with no tracks (\(N_\text {tracks} = 0\), full circles) or more than three tracks (\(N_\text {tracks}> 3\), open circles) reconstructed in the \(|\eta |<1\) region, compared with the MC predictions. The distributions are summed over the three \(p^\text {jet2}_\mathrm {T}\) bins used in the analysis and normalized to unity for shape comparison

In the first approach, the shape of the \(N_\text {tracks}\) distribution for background events is obtained from a sample in which the two leading jets are produced on the same side of the CMS detector (same side, or SS, sample, with jets satisfying the selection \(|\eta ^\text {jet} |> 1.5\) and \(\eta ^\mathrm {jet1} \, \eta ^\text {jet2} > 0\)). For the nominal sample defined in Sect. 4 (opposite side, or OS, sample, with two jets produced on opposite sides of the CMS detector), the gap region \(|\eta | < 1\) mainly contains particles originating from the hard scattering, while for the SS sample it is dominated by particles originating from the underlying event. This difference is reflected in the \(N_\text {tracks}\) distributions: whereas the shapes of the distributions are similar for the SS and OS samples, the mean \(N_\text {tracks}\) value in the SS sample is slightly lower. In order to minimize the difference between the average \(N_\text {tracks}\) values of the two samples, the gap region for the SS sample is enlarged to \(|\eta | < 1.2\), in agreement with the range reported by the CDF Collaboration [30]. The adjusted multiplicity distribution in the SS sample (Fig. 6 left) is normalized to the one in the OS sample for \(N_\text {tracks} > 3\), and the number of events in the first bins is taken as an estimate of the background.

Fig. 6
figure 6

Distribution, uncorrected for detector effects, of the number of central tracks in opposite-side (OS) dijet events (black circles) with \(p_{\mathrm {T}} ^\text {jet2}\) = 40–60 (top), 60–100 (middle), and 100–200 \(\,\text {GeV}\) (bottom), plotted (left) together with the \(N_\text {tracks}\) distribution of same-side (SS) dijet events (blue circles), and fitted to a NBD function (right)

The second method is based on the fit of the \(N_\text {tracks}\) distribution with a negative binomial distribution (NBD), which was first used to describe charged-particle multiplicity distributions by the UA5 Collaboration [51] at energies up to \(\sqrt{s} = 546\) \(\,\text {GeV}\). Later, it was observed that the NBD fit reproduces less well the tails of the particle multiplicity at higher center-of-mass energies (deviations were reported at \(\sqrt{s} = 900\) \(\,\text {GeV}\) by UA5, and later at Tevatron and LHC energies [26, 52, 53]). This issue is largely avoided when one restricts the NBD fit to the region around the mean of the distribution. The fit used in this analysis starts at \(N_\text {tracks} = 3\), where the CSE signal to background ratio is expected to be negligible, and ends at \(N_\text {tracks} = 35\), slightly above the maximum of the distribution. The extrapolation of the fit to the first multiplicity bins provides an estimate of the non-CSE background. The results of the NBD fits are shown in Fig. 6 (right). To check the performance of the method, the fit is repeated on the SS sample in the range \(3 \le N_\text {tracks} \le 35\). The extrapolation of the fit to the \(N_\text {tracks} < 3\) region agrees with the number of events observed in the SS sample data, which confirms the validity of this approach.

The numbers of background events obtained with the two methods described above agree within statistical uncertainties, with the results of the NBD fit being slightly lower. Since the SS method cannot be used to estimate the background in bins of \(\varDelta \eta _\mathrm {jj}\) between the jets (because of the smaller \(\varDelta \eta _\mathrm {jj}\) values than in the OS sample), the NBD fit is chosen as the main background determination method in this analysis. The method involving the SS sample is used as a systematic check, as discussed in the next section. The non-CSE background contributes about 10–15% of the events in the \(0\mathrm {th}\) bin of the multiplicity distribution, about 25–35% in the first two multiplicity bins, and about 40–60% when the signal is integrated over the first three multiplicity bins.

Fig. 7
figure 7

Background-subtracted central track multiplicity distributions, uncorrected for detector effects, in the three bins of \(p_{\mathrm {T}} ^\text {jet2}\), compared to the herwig 6 predictions without underlying event simulation (“no MPI”), normalized as in Fig. 3. The background is estimated from the NBD fit to the data in the \(3 \le N_\text {tracks} \le 35\) range, extrapolated to the lowest multiplicity bins

Figure 7 shows the track multiplicity distribution in the three bins of \(p_{\mathrm {T}} ^\text {jet2}\) after subtracting the non-CSE background. A clear excess in the lowest bins is observed over a flat continuum, in agreement with the normalized predictions from a herwig 6 subsample with jet-gap-jet events only (no additional MPI); the jet-gap-jet events with additional MPI producing tracks in the rapidity gap are part of the background subtracted from the track multiplicity distributions, and are not included in the figure. In the region of the excess (CSE signal region), most events are in the \(0\text {th}\) bin, with smaller contributions from events with one or two tracks reconstructed in the gap region. These tracks originate from the jets but are reconstructed outside of the jet cone, and their contribution is larger in the highest \(p_{\mathrm {T}} ^\text {jet2}\) bin, for which jets tend to have a higher multiplicity and to be produced more centrally (closer to the gap). We use the \(N_\text {tracks}<2\) region to extract the CSE signal in the lowest and medium \(p_{\mathrm {T}} ^\text {jet2}\) bins, and the \(N_\text {tracks}<3\) region to extract the CSE signal in the highest \(p_{\mathrm {T}} ^\text {jet2}\) bin.

The CSE fractions are obtained from the data using Eq. (1), with the different terms in this formula uncorrected for detector effects. No unfolding of the data is necessary since the effects of resolution and migration of the dijet variables cancel in the \(f_\mathrm {CSE}\) ratio. In addition, the number of jet–gap–jet events extracted in the numerator of Eq. (1) does not depend on the track reconstruction efficiency; the latter only influences the non-CSE background count, which is subtracted from the data. Studies with simulated events show that the results do not change, within uncertainties, if the hadron-level variables are used. For the latter, stable particles (with lifetime \(\tau \) such that \(c\tau > 10\) \(\text {\,mm}\)) are used both for the jet reconstruction and for the extraction of the \(N_\text {tracks}\) variable.

6 Systematic uncertainties

The systematic uncertainties in the \(f_\mathrm {CSE}\) extraction are estimated by modifying the selection criteria and the analysis procedure. The following sources of systematic uncertainty are taken into account:

  • Jet energy scale (JES)  The \(p_{\mathrm {T}}\) of each jet in an event is varied up and down according to the formula \(p_{\mathrm {T}} ^\text {jet, new}= p_{\mathrm {T}} ^{\text {jet}} \pm \mathrm {u}(p_{\mathrm {T}} ^{\text {jet}}, \eta ^{\text {jet}})\), where \(\mathrm {u}(p_{\mathrm {T}} ^{\text {jet}}, \eta ^{\text {jet}})\) is the JES uncertainty, which increases at lower (higher) values of \(p_{\mathrm {T}} ^{\text {jet}}\) (\(\eta ^{\text {jet}}\)) [49]. After changing the \(p_{\mathrm {T}}\) of the jets, they are reordered in \(p_{\mathrm {T}} ^\text {jet, new}\), and the analysis is repeated using the two highest \(p_{\mathrm {T}} ^\text {jet, new}\) jets. The average difference of the results obtained for the positive and negative variations relative to the nominal result is taken as an estimate of the uncertainty associated with the JES.

  • Track quality  The track multiplicity distributions are redetermined after relaxing the track quality criteria [33], in order to study the effect of variations in the track finding algorithm. The symmetrized difference between the results obtained with the relaxed and nominal conditions is taken as an estimate of the uncertainty.

  • Background subtraction  The number of background events in the first bins of the \(N_\text {tracks}\) distribution is estimated from data, based on the SS sample introduced in Sect. 5. The symmetrized difference of the results with respect to those found with the nominal method, based on the NBD fit, is taken as an estimate of the corresponding uncertainty. For the measurement of \(f_\mathrm {CSE}\) as a function of \(\varDelta \eta _\mathrm {jj}\) in bins of \(p_{\mathrm {T}} ^\text {jet2}\), the average uncertainty in the \(p_{\mathrm {T}} ^\text {jet2}\) bin is used in each \(\varDelta \eta _\mathrm {jj}\) bin.

Table 1 Percent systematic (individual, and total) and statistical uncertainties of the CSE fraction in the three bins of \(p_{\mathrm {T}} ^\text {jet2}\)

The total systematic uncertainty is calculated as the quadratic sum of the individual contributions. The effect of each systematic source and the total systematic uncertainty are also given in Table 1, for each of the \(p_{\mathrm {T}} ^\text {jet2}\) bins. In this analysis, the systematic uncertainties are smaller than the statistical ones.

As a check of the sensitivity of the results to the definition of the hadronic activity in the gap region, the track multiplicity distributions are redetermined after increasing the lower limit of the track \(p_{\mathrm {T}}\) from 0.2 to 0.25\(\,\text {GeV}\). The results agree within a few percent with the nominal ones, implying no strong dependence on the hadronic activity definition. This observation is in accordance with the results of the D0 experiment [27] using calorimeter towers, in which consistent values of the \(f_\mathrm {CSE}\) fraction were obtained for tower transverse energy thresholds of 0.15, 0.2 and 0.25\(\,\text {GeV}\). Likewise, in the CDF analysis [29] consistent results were obtained based on track multiplicities (\(p_{\mathrm {T}} >0.3\) \(\,\text {GeV}\)) and calorimeter tower multiplicities (\(E_{\mathrm {T}} >0.2\) \(\,\text {GeV}\)). In the present analysis, neutral particles are not included in the multiplicity calculation because of the relatively high transverse energy thresholds required above calorimeter noise, about 0.5\(\,\text {GeV}\) for photons and 2\(\,\text {GeV}\) for neutral hadrons, compared to the much lower 0.2\(\,\text {GeV}\) value for charged tracks.

7 Results

Table 2 Measured values of \(f_\mathrm {CSE}\) as a function of \(p_{\mathrm {T}} ^\text {jet2}\). The first and second uncertainties correspond to the statistical and systematic components, respectively. The mean values of \(p_{\mathrm {T}} ^\text {jet2}\) in the bin are also given
Fig. 8
figure 8

Fraction of dijet events with a central gap (\(f_\mathrm {CSE}\)) as a function of \(p_{\mathrm {T}} ^\text {jet2}\) at \(\sqrt{s}=7\) \(\,\text {TeV}\), compared to the D0 [27] and CDF [29, 30] results at \(\sqrt{s}=0.63\) and 1.8\(\,\text {TeV}\). The details of the jet selections are given in the legend. The results are plotted at the mean value of \(p_{\mathrm {T}} ^\text {jet2}\) in the bin. The inner and outer error bars represent the statistical, and the statistical and systematic uncertainties added in quadrature, respectively

The values of the \(f_\mathrm {CSE}\) fraction, measured as explained in Sect. 5 in three bins of \(p_{\mathrm {T}} ^\text {jet2}\), are given in Table 2. Figure 8 presents the extracted \(f_\mathrm {CSE}\) values as a function of \(p_{\mathrm {T}} ^\text {jet2}\), compared to the results of the D0 [27] and CDF [29, 30] experiments obtained in similar \(\mathrm {p}\overline{\text {p}} \) analyses at \(\sqrt{s}=0.63\) and 1.8 \(\,\text {TeV}\). All the measurements are based on the same pseudorapidity range for the gap region, but differ in the selection of jets. D0 and CDF use the cone jet reconstruction algorithm with size parameter \(R = 0.7\), and select jets in the regions \(1.9<|\eta ^{\text {jet}} |<4.1\), and \(1.8<|\eta ^{\text {jet}} |<3.5\), respectively. The latter difference only minimally affects the comparison with the CMS results, as the measured \(f_\mathrm {CSE}\) fractions at 0.63 and 1.8\(\,\text {TeV}\) depend only weakly on the gap size. At all the three collision energies \(f_\mathrm {CSE}\) increases with \(p_{\mathrm {T}} ^\text {jet2}\). This reflects the fact that the cross section for dijet events with a gap decreases with \(p_{\mathrm {T}} ^\text {jet2}\) less rapidly than the inclusive dijet cross section does. In addition, a decrease of the gap fraction with increasing \(\sqrt{s}\) is observed. The value of \(f_\mathrm {CSE}\) measured for \(40<p_{\mathrm {T}} ^\text {jet2}<60\) \(\,\text {GeV}\) at \(\sqrt{s}=7\) \(\,\text {TeV}\) is about a factor of two lower than those measured for the same \(p_{\mathrm {T}} ^\text {jet2}\) at \(\sqrt{s}=1.8\) \(\,\text {TeV}\). This behavior is in agreement with observations by D0 and CDF, which reported that the jet-gap-jet fraction decreases by a factor of \(2.5 \pm 0.9\) [27] and \(3.4 \pm 1.2\) [30], respectively, when \(\sqrt{s}\) increases from 0.63 to 1.8\(\,\text {TeV}\). The decrease of \(f_\mathrm {CSE}\) with increasing energy can be ascribed to a stronger contribution from rescattering processes, in which the interactions between spectator partons destroy the rapidity gap [19, 54]. As a consequence, the gap survival probability factor \(|S |^2\) is expected to decrease with collision energy. Although no explicit predictions for \(|S |^2\) currently exist for jet-gap-jet production at \(\sqrt{s}=7\) \(\,\text {TeV}\), a suppression factor of about 2, for \(\sqrt{s}\) increasing from 1.8 to 7\(\,\text {TeV}\), is predicted for central exclusive production [55, 56].

Fig. 9
figure 9

Fraction of dijet events with a central gap (\(f_\mathrm {CSE}\)) as a function of \(p_{\mathrm {T}} ^\text {jet2}\) at \(\sqrt{s}=7\) \(\,\text {TeV}\), compared to the predictions of the Mueller and Tang (MT) model [21], and of the Ekstedt, Enberg, and Ingelman (EEI) model [22, 23] with three different treatments of the gap survival probability factor \(|S |^2\), as described in the text. The results are plotted at the mean value of \(p_{\mathrm {T}} ^\text {jet2}\) in the bin. The inner and outer error bars represent the statistical, and the statistical and systematic uncertainties added in quadrature, respectively

Figure 9 shows the comparison of the present results with the BFKL-based theoretical calculations of the Mueller and Tang (MT), and Ekstedt, Enberg and Ingelman (EEI) models. The gap fractions are plotted relative to the standard LO QCD dijet production rates, calculated with pythia 6 (using tune Z2* for MT, and the default settings with color reconnection features turned off for EEI). The MT model [21] prediction is based on the LL BFKL evolution in the asymptotic limit of large rapidity separations between the jets, and is obtained with herwig 6 (as described in Sect. 3, without reweighting of the \(p_{\mathrm {T}} ^\text {jet2}\) dependence) for pure jet-gap-jet events (no simulation of MPI). The MT prediction does not reproduce the increase of \(f_\mathrm {CSE}\) with \(p_{\mathrm {T}} ^\text {jet2}\), as already observed for the 1.8\(\,\text {TeV}\) data [22]; it also underestimates the \(f_\mathrm {CSE}\) fractions measured at 7\(\,\text {TeV}\). The EEI predictions [23] are based on the model of Ref. [22] extended to the present energy. The model includes the dominant next-to-LL corrections to the BFKL evolution of the parton-level cross section, as well as the effect of rescattering processes. For the latter, three approaches are considered, in which gap survival probability is either assumed to be a constant factor, or is partially or fully simulated using Monte Carlo models, to take into account its dependence on the variables \(p_{\mathrm {T}} ^\text {jet2}\) and \(\varDelta \eta _\mathrm {jj}\). In the first approach, the BFKL cross section is scaled by a constant factor corresponding to a gap survival probability value of \(|S |^2=0.7\%\) (magenta long-dashed curve in Fig. 9), in order to match the data. Alternatively, the activity originating from perturbative gluons is modeled in terms of initial- and final-state parton showers, MPI and hadronization processes, as implemented in pythia 6. The remaining nonperturbative interactions are simulated either by an additional gap survival probability factor of \(|S |^2=1.5\%\) (green dotted line in Fig. 9), or by soft color interactions (SCI, red dashed line in Fig. 9) where a color exchange with negligible momentum transfer occurs between parton clusters [23].

As can be seen in Fig. 9, the EEI model with \(|S |^2=0.7\%\), and that with MPI and \(|S |^2=1.5\%\) reproduce the \(p_{\mathrm {T}} ^\text {jet2}\) dependence of the \(f_\mathrm {CSE}\) fraction in the data. The EEI model with MPI and SCI correctly predicts the amount of jet-gap-jet events in the first two \(p_{\mathrm {T}} ^\text {jet2}\) bins, but tends to be lower than the data at higher \(p_{\mathrm {T}} ^\text {jet2}\). The dip in the prediction around \(p_{\mathrm {T}} ^\text {jet2}=80\) \(\,\text {GeV}\) is a result of using the SCI model in conjunction with final state showering, and is a feature of the model rather than a statistical fluctuation.

Table 3 Measured values of the fraction of dijet events with a central gap (\(f_\mathrm {CSE}\)) as a function of the pseudorapidity separation between the jets (\(\varDelta \eta _\mathrm {jj}\)) in bins of \(p_{\mathrm {T}} ^\text {jet2}\). The columns in the table correspond to \(p_{\mathrm {T}} ^\text {jet2}\) bins and the rows to \(\varDelta \eta _\mathrm {jj}\) bins. The first and second errors correspond to the statistical and systematic uncertainties, respectively. The mean values of \(\varDelta \eta _\mathrm {jj}\) in the bin are also given

The dependence of the \(f_\mathrm {CSE}\) fraction on the size of \(\varDelta \eta _\mathrm {jj}\) is studied for each \(p_{\mathrm {T}} ^\text {jet2}\) sample in three bins of \(\varDelta \eta _\mathrm {jj}\) = 3–4, 4–5, and 5–7. The measured values of the \(f_\mathrm {CSE}\) fractions are listed in Table 3, and plotted in Fig. 10. The fraction of jet-gap-jet events increases with \(\varDelta \eta _\mathrm {jj}\), and varies from 0.3 to 1.2%, and from 0.8 to 2%, in the lowest and the highest \(p_{\mathrm {T}} ^\text {jet2}\) bins, respectively. Figure 10 also shows the comparison of the data with the predictions of the MT and EEI models. The MT model predicts a flat dependence of \(f_\mathrm {CSE}\) with \(\varDelta \eta _\mathrm {jj}\), and underestimates the measured jet-gap-jet fractions except for the lowest (\(p_{\mathrm {T}} ^\text {jet2}\), \(\varDelta \eta _\mathrm {jj}\)) bin for which the agreement is good. The EEI model with the \(|S |^2=0.7\%\) factor, as well as that with MPI plus \(|S |^2=1.5\%\) predict a decrease of \(f_\mathrm {CSE}\) with \(\varDelta \eta _\mathrm {jj}\), and are at variance with the data. Conversely, the EEI model with MPI plus soft color interactions satisfactorily reproduces the rise of \(f_\mathrm {CSE}\) with \(\varDelta \eta _\mathrm {jj}\) in all \(p_{\mathrm {T}} ^\text {jet2}\) bins.

Fig. 10
figure 10

Fraction of dijet events with a central gap (\(f_\mathrm {CSE}\)) as a function of \(\varDelta \eta _\mathrm {jj}\) at \(\sqrt{s}=7\) \(\,\text {TeV}\) in three different \(p_{\mathrm {T}} ^\text {jet2}\) ranges, compared to the predictions of the Mueller and Tang (MT) model [21], and of the Ekstedt, Enberg, and Ingelman (EEI) model [22, 23] with three different treatments of the gap survival probability factor \(|S |^2\), as described in the text. The results are plotted at the mean value of \(\varDelta \eta _\mathrm {jj}\) in the bin. Inner and outer error bars correspond to the statistical, and the statistical and systematic uncertainties added in quadrature, respectively

8 Summary

Events with a large rapidity gap between the two leading jets have been measured for the first time at the LHC, for jets with transverse momentum \(p_{\mathrm {T}} ^{\text {jet}}>40\) \(\,\text {GeV}\) and pseudorapidity \(1.5<|\eta ^{\text {jet}} |<4.7\), reconstructed in opposite ends of the detector. The number of dijet events with no particles with \(p_{\mathrm {T}} >0.2\) \(\,\text {GeV}\) in the region \(|\eta |<1\) is severely underestimated by pythia 6 (tune Z2*). herwig 6 predictions, which include a contribution from color singlet exchange (CSE), based on the leading logarithmic Balitsky–Fadin–Kuraev–Lipatov (BFKL) evolution equations, are needed to reproduce the type of dijet topologies selected in our analysis. The fraction of selected dijet events with such a rapidity gap has been measured as a function of the second-leading jet transverse momentum (\(p_{\mathrm {T}} ^\text {jet2}\)) and as a function of the size of the pseudorapidity interval between the jets, \(\varDelta \eta _\mathrm {jj}\). The \(f_\mathrm {CSE}\) fraction rises with \(p_{\mathrm {T}} ^\text {jet2}\) (from 0.6 to 1%) and with \(\varDelta \eta _\mathrm {jj}\) (from 0.3 to 1.2% for \(40<p_{\mathrm {T}} ^\text {jet2}<60\) \(\,\text {GeV}\), from 0.5 to 0.9% for \(60<p_{\mathrm {T}} ^\text {jet2}<100\) \(\,\text {GeV}\), and from 0.8 to 2% for \(100<p_{\mathrm {T}} ^\text {jet2}<200\) \(\,\text {GeV}\)).

The measured CSE fractions have been compared to the results of the D0 and CDF experiments at a center-of-mass energies of 0.63 and 1.8\(\,\text {TeV}\). A factor of two decrease of the CSE fraction measured at \(\sqrt{s} = 7\) \(\,\text {TeV}\) with respect to that at \(\sqrt{s} = 1.8\) \(\,\text {TeV}\) is observed. Such a behavior is consistent with the decrease seen in the Tevatron data when \(\sqrt{s}\) rises from 0.63 to 1.8\(\,\text {TeV}\), and with theoretical expectations for the \(\sqrt{s}\) dependence of the rapidity gap survival probability.

The data are also compared to theoretical perturbative quantum chromodynamics calculations based on the BFKL evolution equations complemented with different estimates of the non-perturbative gap survival probability. The next-to-leading-logarithmic BFKL calculations of Ekstedt, Enberg and Ingelman, with three different implementations of the soft rescattering processes, describe many features of the data, but none of the implementations is able to simultaneously describe all the features of the measurement.