1 Introduction

Jets are a key element in many analyses of the data collected by the experiments at the Large Hadron Collider (LHC) [1]. The jet calibration procedure should correctly determine the jet energy scale and additionally the best possible energy and angular resolution should be achieved. Good jet reconstruction and calibration facilitates the identification of known resonances that decay to hadronic jets, as well as the search for new particles. A complication, at the high luminosities encountered by the ATLAS detector [2], is that multiple interactions can contribute to the detector signals associated with a single bunch-crossing (pile-up). These interactions, which are mostly soft, have to be separated from the hard interaction that is of interest.

Pile-up contributes to the detector signals from the collision environment, and is especially important for higher-intensity operations of the LHC. One contribution arises from particle emissions produced by the additional proton–proton (\(pp\)) collisions occurring in the same bunch crossing as the hard-scatter interaction (in-time pile-up). Further pile-up influences on the signal are from signal remnants in the ATLAS calorimeters from the energy deposits in other bunch crossings (out-of-time pile-up).

In Run 1 of the LHC, the ATLAS experiment used either solely the calorimeter or solely the tracker to reconstruct hadronic jets and soft particle activity. The vast majority of analyses utilised jets that were built from topological clusters of calorimeter cells (topo-clusters) [3]. These jets were then calibrated to the particle level using a jet energy scale (JES) correction factor [4,5,6,7]. For the final Run 1 jet calibration, this correction factor also took into account the tracks associated with the jet, as this was found to greatly improve the jet resolution [4]. ‘Particle flow’ introduces an alternative approach, in which measurements from both the tracker and the calorimeter are combined to form the signals, which ideally represent individual particles. The energy deposited in the calorimeter by all the charged particles is removed. Jet reconstruction is then performed on an ensemble of ‘particle flow objects’ consisting of the remaining calorimeter energy and tracks which are matched to the hard interaction.

The chief advantages of integrating tracking and calorimetric information into one hadronic reconstruction step are as follows:

  • The design of the ATLAS detector [8] specifies a calorimeter energy resolution for single charged pions in the centre of the detector of

    $$\begin{aligned} \frac{\sigma (E)}{E}=\frac{50{\%}}{\sqrt{E}}\oplus 3.4{\%} \oplus \frac{1{\%}}{E}, \end{aligned}$$
    (1)

    while the design inverse transverse momentum resolution for the tracker is

    $$\begin{aligned} \sigma \left( \frac{1}{p_{\mathrm{T}}}\right) \cdot p_{\mathrm{T}} =0.036{\%} \cdot p_{\mathrm{T}} \oplus 1.3{\%}, \end{aligned}$$
    (2)

    where energies and transverse momenta are measured in GeV. Thus for low-energy charged particles, the momentum resolution of the tracker is significantly better than the energy resolution of the calorimeter. Furthermore, the acceptance of the detector is extended to softer particles, as tracks are reconstructed for charged particles with a minimum transverse momentum \(\,p_{\mathrm{T}} >400\,\mathrm{MeV}\), whose energy deposits often do not pass the noise thresholds required to seed topo-clusters [9].

  • The angular resolution of a single charged particle, reconstructed using the tracker is much better than that of the calorimeter.

  • Low-\(p_{\mathrm{T}}\) charged particles originating within a hadronic jet are swept out of the jet cone by the magnetic field by the time they reach the calorimeter. By using the track’s azimuthal coordinateFootnote 1 at the perigee, these particles are clustered into the jet.

  • When a track is reconstructed, one can ascertain whether it is associated with a vertex, and if so the vertex from which it originates. Therefore, in the presence of multiple in-time pile-up interactions, the effect of additional particles on the hard-scatter interaction signal can be mitigated by rejecting signals originating from pile-up vertices.Footnote 2

The capabilities of the tracker in reconstructing charged particles are complemented by the calorimeter’s ability to reconstruct both the charged and neutral particles. At high energies, the calorimeter’s energy resolution is superior to the tracker’s momentum resolution. Thus a combination of the two subsystems is preferred for optimal event reconstruction. Outside the geometrical acceptance of the tracker, only the calorimeter information is available. Hence, in the forward region the topo-clusters alone are used as inputs to the particle flow jet reconstruction.

However, particle flow introduces a complication. For any particle whose track measurement ought to be used, it is necessary to correctly identify its signal in the calorimeter, to avoid double-counting its energy in the reconstruction. In the particle flow algorithm described herein, a Boolean decision is made as to whether to use the tracker or calorimeter measurement. If a particle’s track measurement is to be used, the corresponding energy must be subtracted from the calorimeter measurement. The ability to accurately subtract all of a single particle’s energy, without removing any energy deposited by any other particle, forms the key performance criterion upon which the algorithm is optimised.

Particle flow algorithms were pioneered in the ALEPH experiment at LEP [10]. They have also been used in the H1 [11], ZEUS [12, 13] and DELPHI [14] experiments. Subsequently, they were used for the reconstruction of hadronic \(\tau \)-lepton decays in the CDF [15], D0 [16] and ATLAS [17] experiments. In the CMS experiment at the LHC, large gains in the performance of the reconstruction of hadronic jets and \(\tau \) leptons have been seen from the use of particle flow algorithms [18,19,20]. Particle flow is a key ingredient in the design of detectors for the planned International Linear Collider [21] and the proposed calorimeters are being optimised for its use [22]. While the ATLAS calorimeter already measures jet energies precisely [6], it is desirable to explore the extent to which particle flow is able to further improve the ATLAS hadronic-jet reconstruction, in particular in the presence of pile-up interactions.

This paper is organised as follows. A description of the detector is given in Sect. 2, the Monte Carlo (MC) simulated event samples and the dataset used are described in Sects. 3 and 4, while Sect. 5 outlines the relevant properties of topo-clusters. The particle flow algorithm is described in Sect. 6. Section 7 details the algorithm’s performance in energy subtraction at the level of individual particles in a variety of cases, starting from a single pion through to dijet events. The building and calibration of reconstructed jets is covered in Sect. 8. The improvement in jet energy and angular resolution is shown in Sect. 9 and the sensitivity to pile-up is detailed in Sect. 10. A comparison between data and MC simulation is shown in Sect. 11 before the conclusions are presented in Sect. 12.

2 ATLAS detector

The ATLAS experiment features a multi-purpose detector designed to precisely measure jets, leptons and photons produced in the \(pp\) collisions at the LHC. From the inside out, the detector consists of a tracking system called the inner detector (ID), surrounded by electromagnetic (EM) sampling calorimeters. These are in turn surrounded by hadronic sampling calorimeters and an air-core toroid muon spectrometer (MS). A detailed description of the ATLAS detector can be found in Ref. [2].

The high-granularity silicon pixel detector covers the vertex region and typically provides three measurements per track. It is followed by the silicon microstrip tracker which usually provides eight hits, corresponding to four two-dimensional measurement points, per track. These silicon detectors are complemented by the transition radiation tracker, which enables radially extended track reconstruction up to \(\,|{\eta }| = 2.0\). The ID is immersed in a 2 T axial magnetic field and can reconstruct tracks within the pseudorapidity range \(\,|{\eta }|<2.5\). For tracks with transverse momentum \(\,p_{\mathrm{T}} <100\,\mathrm{GeV}\), the fractional inverse momentum resolution \(\sigma ({1/p_{\mathrm{T}}}) \cdot p_{\mathrm{T}} \) measured using 2012 data, ranges from approximately 2–12% depending on pseudorapidity and \(p_{\mathrm{T}}\)  [23].

The calorimeters provide hermetic azimuthal coverage in the range \(\,|{\eta }|<4.9\). The detailed structure of the calorimeters within the tracker acceptance strongly influences the development of the shower subtraction algorithm described in this paper. In the central barrel region of the detector, a high-granularity liquid-argon (LAr) electromagnetic calorimeter with lead absorbers is surrounded by a hadronic sampling calorimeter (Tile) with steel absorbers and active scintillator tiles. The same LAr technology is used in the calorimeter endcaps, with fine granularity and lead absorbers for the EM endcap (EMEC), while the hadronic endcap (HEC) utilises copper absorbers with reduced granularity. The solid angle coverage is completed with forward copper/LAr and tungsten/LAr calorimeter modules (FCal) optimised for electromagnetic and hadronic measurements respectively. Figure 1 shows the physical location of the different calorimeters. To achieve a high spatial resolution, the calorimeter cells are arranged in a projective geometry with fine segmentation in \(\phi \) and \(\eta \). Additionally, each of the calorimeters is longitudinally segmented into multiple layers, capturing the shower development in depth. In the region \(\,|{\eta }| < 1.8\), a presampler detector is used to correct for the energy lost by electrons and photons upstream of the calorimeter. The presampler consists of an active LAr layer of thickness 1.1 cm (0.5 cm) in the barrel (endcap) region. The granularity of all the calorimeter layers within the tracker acceptance is given in Table 1.

The EM calorimeter is over 22 radiation lengths in depth, ensuring that there is little leakage of EM showers into the hadronic calorimeter. The total depth of the complete calorimeter is over 9 interaction lengths in the barrel and over 10 interaction lengths in the endcap, such that good containment of hadronic showers is obtained. Signals in the MS are used to correct the jet energy if the hadronic shower is not completely contained. In both the EM and Tile calorimeters, most of the absorber material is in the second layer. In the hadronic endcap, the material is more evenly spread between the layers.

Fig. 1
figure 1

Cut-away view of the ATLAS calorimeter system

Table 1 The granularity in \(\Delta \eta \times \Delta \phi \) of all the different ATLAS calorimeter layers relevant to the tracking coverage of the inner detector

The muon spectrometer surrounds the calorimeters and is based on three large air-core toroid superconducting magnets with eight coils each. The field integral of the toroids ranges from 2.0 to 6.0 Tm across most of the detector. It includes a system of precision tracking chambers and fast detectors for triggering.

3 Simulated event samples

A variety of MC samples are used in the optimisation and performance evaluation of the particle flow algorithm. The simplest samples consist of a single charged pion generated with a uniform spectrum in the logarithm of the generated pion energy and in the generated \(\eta \). Dijet samples generated with Pythia 8 (v8.160) [24, 25], with parameter values set to the ATLAS AU2 tune [26] and the CT10 parton distribution functions (PDF) set [27], form the main samples used to derive the jet energy scale and determine the jet energy resolution in simulation. The dijet samples are generated with a series of jet \(p_{\mathrm{T}}\) thresholds applied to the leading jet, reconstructed from all stable final-state particles excluding muons and neutrinos, using the anti-\(k_t\) algorithm [28] with radius parameter 0.6 using FastJet (v3.0.3) [29, 30].

For comparison with collision data, \(Z\rightarrow {\mu {\mu }}\) events are generated with Powheg-Box (r1556) [31] using the CT10 PDF and are showered with Pythia 8, with the ATLAS AU2 tune. Additionally, top quark pair production is simulated with MC@NLO  (v4.03) [32, 33] using the CT10 PDF set, interfaced with Herwig  (v6.520) [34] for parton showering, and the underlying event is modelled by Jimmy  (v4.31) [35]. The top quark samples are normalised using the cross-section calculated at next-to-next-to-leading order (NNLO) in QCD including resummation of next-to-next-to-leading logarithmic soft gluon terms with top++2.0 [36,37,38,39,40,41,42,43], assuming a top quark mass of 172.5 GeV. Single-top-quark production processes contributing to the distributions shown are also simulated, but their contributions are negligible.

3.1 Detector simulation and pile-up modelling

All samples are simulated using Geant4  [44] within the ATLAS simulation framework [45] and are reconstructed using the noise threshold criteria used in 2012 data-taking [3]. Single-pion samples are simulated without pile-up, while dijet samples are simulated under three conditions: with no pile-up; with pile-up conditions similar to those in the 2012 data; and with a mean number of interactions per bunch crossing \(\langle \mu \rangle = 40\), where \(\mu \) follows a Poisson distribution. In 2012, the mean value of \(\mu \) was 20.7 and the actual number of interactions per bunch crossing ranged from around 10 to 35 depending on the luminosity. The bunch spacing was 50 ns. When compared to data, the MC samples are reweighted to have the same distribution of \(\mu \) as present in the data. In all the samples simulated including pile-up, effects from both the same bunch crossing and previous/subsequent crossings are simulated by overlaying additional generated minimum-bias events on the hard-scatter event prior to reconstruction. The minimum-bias samples are generated using Pythia 8 with the ATLAS AM2 tune [46] and the MSTW2009 PDF set [47], and are simulated using the same software as the hard-scatter event.

3.2 Truth calorimeter energy and tracking information

For some samples the full Geant4 hit information [44] is retained for each calorimeter cell such that the true amount of hadronic and electromagnetic energy deposited by each generated particle is known. Only the measurable hadronic and electromagnetic energy deposits are counted, while the energy lost due to nuclear capture and particles escaping from the detector is not included. For a given charged pion the sum of these hits in a given cluster i originating from this particle is denoted by \(E^{\mathrm{clus}\,i}_{\mathrm{true},\,\pi }\).

Reconstructed topo-cluster energy is assigned to a given truth particle according to the proportion of Geant4 hits supplied to that topo-cluster by that particle. Using the Geant4 hit information in the inner detector a track is matched to a generated particle based on the fraction of hits on the track which originate from that particle [48].

4 Data sample

Data acquired during the period from March to December 2012 with the LHC operating at a pp centre-of-mass energy of 8 TeV are used to evaluate the level of agreement between data and Monte Carlo simulation of different outputs of the algorithm. Two samples with a looser preselection of events are reconstructed using the particle flow algorithm. A tighter selection is then used to evaluate its performance.

First, a \(Z \rightarrow {\mu }{\mu }\) enhanced sample is extracted from the 2012 dataset by selecting events containing two reconstructed muons [49], each with \(\,p_{\mathrm{T}} > 25\,\mathrm{GeV}\) and \(\,|{\eta }|<2.4\), where the invariant mass of the dimuon pair is greater than 55 GeV , and the \(p_{\mathrm{T}}\) of the dimuon pair is greater than 30 GeV.

Similarly, a sample enhanced in \({t{\bar{t}}}\rightarrow {b{\bar{b}}}{q{\bar{q}}}{\mu }{\nu }\) events is obtained from events with an isolated muon and at least one hadronic jet which is required to be identified as a jet containing b-hadrons (b-jet). Events are selected that pass single-muon triggers and include one reconstructed muon satisfying \(\,p_{\mathrm{T}} > 25\,\mathrm{GeV}\), \(\,\,|{\eta }|<2.4\), for which the sum of additional track momenta in a cone of size \(\Delta R = 0.2\) around the muon track is less than 1.8 GeV. Additionally, a reconstructed calorimeter jet is required to be present with \(\,p_{\mathrm{T}} > 30\,\mathrm{GeV}\), \(\,|{\eta }|<2.5\), and pass the 70% working point of the MV1 b-tagging algorithm [50].

For both datasets, all ATLAS subdetectors are required to be operational with good data quality. Each dataset corresponds to an integrated luminosity of 20.2 fb\(^{-1}\). To remove events suffering from significant electronic noise issues, cosmic rays or beam background, the analysis excludes events that contain calorimeter jets with \(\,p_{\mathrm{T}} > 20\,\mathrm{GeV}\) which fail to satisfy the ‘looser’ ATLAS jet quality criteria [51, 52].

5 Topological clusters

The lateral and longitudinal segmentation of the calorimeters permits three-dimensional reconstruction of particle showers, implemented in the topological clustering algorithm [3]. Topo-clusters of calorimeter cells are seeded by cells whose absolute energy measurements |E| exceed the expected noise by four times its standard deviation. The expected noise includes both electronic noise and the average contribution from pile-up, which depends on the run conditions. The topo-clusters are then expanded both laterally and longitudinally in two steps, first by iteratively adding all adjacent cells with absolute energies two standard deviations above noise, and finally adding all cells neighbouring the previous set. A splitting step follows, separating at most two local energy maxima into separate topo-clusters. Together with the ID tracks, these topo-clusters form the basic inputs to the particle flow algorithm.

The topological clustering algorithm employed in ATLAS is not designed to separate energy deposits from different particles, but rather to separate continuous energy showers of different nature, i.e. electromagnetic and hadronic, and also to suppress noise. The cluster-seeding threshold in the topo-clustering algorithm results in a large fraction of low-energy particles being unable to seed their own clusters. For example, in the central barrel \(\sim \)25% of 1 GeV charged pions do not seed their own cluster [9].

While the granularity, noise thresholds and employed technologies vary across the different ATLAS calorimeters, they are initially calibrated to the electromagnetic scale (EM scale) to give the same response for electromagnetic showers from electrons or photons. Hadronic interactions produce responses that are lower than the EM scale, by amounts depending on where the showers develop. To account for this, the mean ratio of the energy deposited by a particle to the momentum of the particle is determined based on the position of the particle’s shower in the detector, as described in Sect. 6.4.

A local cluster (LC) weighting scheme is used to calibrate hadronic clusters to the correct scale [3]. Further development is needed to combine this with particle flow; therefore, in this work the topo-clusters used in the particle flow algorithm are calibrated at the EM scale.

6 Particle flow algorithm

A cell-based energy subtraction algorithm is employed to remove overlaps between the momentum and energy measurements made in the inner detector and calorimeters, respectively. Tracking and calorimetric information is combined for the reconstruction of hadronic jets and soft activity (additional hadronic recoil below the threshold used in jet reconstruction) in the event. The reconstruction of the soft activity is important for the calculation of the missing transverse momentum in the event [53], whose magnitude is denoted by \(E_{\mathrm{T}}^{\mathrm{miss}}\).

The particle flow algorithm provides a list of tracks and a list of topo-clusters containing both the unmodified topo-clusters and a set of new topo-clusters resulting from the energy subtraction procedure. This algorithm is sketched in Fig. 2. First, well-measured tracks are selected following the criteria discussed in Sect. 6.2. The algorithm then attempts to match each track to a single topo-cluster in the calorimeter (Sect. 6.3). The expected energy in the calorimeter, deposited by the particle that also created the track, is computed based on the topo-cluster position and the track momentum (Sect. 6.4). It is relatively common for a single particle to deposit energy in multiple topo-clusters. For each track/topo-cluster system, the algorithm evaluates the probability that the particle energy was deposited in more than one topo-cluster. On this basis it decides if it is necessary to add more topo-clusters to the track/topo-cluster system to recover the full shower energy (Sect. 6.5). The expected energy deposited in the calorimeter by the particle that produced the track is subtracted cell by cell from the set of matched topo-clusters (Sect. 6.6). Finally, if the remaining energy in the system is consistent with the expected shower fluctuations of a single particle’s signal, the topo-cluster remnants are removed (Sect. 6.7).

This procedure is applied to tracks sorted in descending \(p_{\mathrm{T}}\)-order, firstly to the cases where only a single topo-cluster is matched to the track, and then to the other selected tracks. This methodology is illustrated in Fig. 3.

Fig. 2
figure 2

A flow chart of how the particle flow algorithm proceeds, starting with track selection and continuing until the energy associated with the selected tracks has been removed from the calorimeter. At the end, charged particles, topo-clusters which have not been modified by the algorithm, and remnants of topo-clusters which have had part of their energy removed remain

Fig. 3
figure 3

Idealised examples of how the algorithm is designed to deal with several different cases. The red cells are those which have energy from the \(\pi ^+\), the green cells energy from the photons from the \(\pi ^0\) decay, the dotted lines represent the original topo-cluster boundaries with those outlined in blue having been matched by the algorithm to the \(\pi ^+\), while those in black are yet to be selected. The different layers in the electromagnetic calorimeter (Presampler, EMB1, EMB2, EMB3) are indicated. In this sketch only the first two layers of the Tile calorimeter are shown (TileBar0 and TileBar1)

Details about each step of the procedure are given in the rest of this section. After some general discussion of the properties of topo-clusters in the calorimeter, the energy subtraction procedure for each track is described. The procedure is accompanied by illustrations of performance metrics used to validate the configuration of the algorithm. The samples used for the validation are single-pion and dijet MC samples without pile-up, as described in the previous section. Charged pions dominate the charged component of the jet, which on average makes up two-thirds of the visible jet energy [54, 55]. Another quarter of the jet energy is contributed by photons from neutral hadron decays, and the remainder is carried by neutral hadrons that reach the calorimeter. Because the majority of tracks are generated by charged pions [56], particularly at low \(p_{\mathrm{T}}\), the pion mass hypothesis is assumed for all tracks used by the particle flow algorithm to reconstruct jets. Likewise the energy subtraction is based on the calorimeter’s response to charged pions.

In the following sections, the values for the parameter set and the performance obtained for the 2012 dataset are discussed. These parameter values are not necessarily the product of a full optimisation, but it has been checked that the performance is not easily improved by variations of these choices. Details of the optimisation are beyond the scope of the paper.

6.1 Containment of showers within a single topo-cluster

The performance of the particle flow algorithm, especially the shower subtraction procedure, strongly relies on the topological clustering algorithm. Hence, it is important to quantify the extent to which the clustering algorithm distinguishes individual particles’ showers and how often it splits a single particle’s shower into more than one topo-cluster. The different configurations of topo-clusters containing energy from a given single pion are classified using two variables.

For a given topo-cluster i, the fraction of the particle’s true energy contained in the topo-cluster (see Sect. 3.2), with respect to the total true energy deposited by the particle in all clustered cells, is defined as

$$\begin{aligned} \varepsilon ^{\mathrm{clus}}_i = \frac{E^{\mathrm{clus}\,i}_{\mathrm{true},\,\pi }}{E^\mathrm{all topo-clusters}_{\mathrm{true},\,\pi }}, \end{aligned}$$
(3)

where \({E^{\mathrm{clus}\,i}_{\mathrm{true},\,\pi }}\) is the true energy deposited in topo-cluster i by the generated particle under consideration and \({E^\mathrm{all topo-clusters}_{\mathrm{true},\,\pi }}\) is the true energy deposited in all topo-clusters by that truth particle. For each particle, the topo-cluster with the highest value of \(\varepsilon ^{\mathrm{clus}}_i\) is designated the leading topo-cluster, for which \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}} = \varepsilon ^{\mathrm{clus}}_i \). The minimum number of topo-clusters needed to capture at least 90% of the particle’s true energy, i.e. such that \(\sum _{i=0}^{n}\varepsilon ^{\mathrm{clus}}_i > 90{\%}\), is denoted by \(n_\mathrm{clus}^{90}\).

Topo-clusters can contain contributions from multiple particles, affecting the ability of the subtraction algorithm to separate the energy deposits of different particles. The purity \(\rho ^{\mathrm{clus}}_i\) for a topo-cluster i is defined as the fraction of true energy within the topo-cluster which originates from the particle of interest:

$$\begin{aligned} \rho ^{\mathrm{clus}}_i = \frac{E^{\mathrm{clus}\,i}_{\mathrm{true},\,\pi } }{E^{\mathrm{clus}\,i}_\mathrm{true, all particles}}. \end{aligned}$$
(4)

For the leading topo-cluster, defined by having the highest \(\varepsilon ^{\mathrm{clus}}_i\), the purity value is denoted by \(\rho ^{\mathrm{clus}}_{\mathrm{lead}}\).

Only charged particles depositing significant energy (at least 20% of their true energy) in clustered cells are considered in the following plots, as in these cases there is significant energy in the calorimeter to remove. This also avoids the case where insufficient energy is present in any cell to form a cluster, which happens frequently for very low-energy particles [3].

Figure 3 illustrates how the subtraction procedure is designed to deal with cases of different complexity. Four different scenarios are shown covering cases where the charged pion deposits its energy in one cluster, in two clusters, and where there is a nearby neutral pion which either deposits its energy in a separate cluster or the same cluster as the charged pion.

Several distributions are plotted for the dijet sample in which the energy of the leading jet, measured at truth level, is in the range \(20< p_{\mathrm{T}}^{\mathrm{lead}} < 500\,\mathrm{GeV}\). The distribution of \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}}\) is shown in Fig. 4 for different \(\,p_{\mathrm{T}}^{\mathrm{true}} \) and \(\eta ^{\mathrm{true}} \) bins. It can be seen that \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}}\) decreases as the \(p_{\mathrm{T}}\) of the particle increases and very little dependence on \(\eta \) is observed. Figure  5 shows the distribution of \(n_\mathrm{clus}^{90}\). As expected, \(n_\mathrm{clus}^{90}\) increases with particle \(p_{\mathrm{T}}\). It is particularly interesting to know the fraction of particles for which at least 90% of the true energy is contained in a single topo-cluster (\(n_\mathrm{clus}^{90}=1\)) and this is shown in Fig. 6. Lastly, Fig. 7 shows the distribution of \(\rho ^{\mathrm{clus}}_{\mathrm{lead}}\). This decreases as \(p_{\mathrm{T}}^{\mathrm{true}}\) increases and has little dependence on \(|\eta ^\mathrm {true}|\).

Fig. 4
figure 4

Distribution of the fraction of the total true energy in the leading topo-cluster, \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}}\), for charged pions which deposit significant energy (20% of the particle’s energy) in the clustered cells for three different \(\,p_{\mathrm{T}}^{\mathrm{true}} \) bins in three \(|\eta ^{\mathrm{true}} |\) regions. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

Fig. 5
figure 5

Distributions of the number of topo-clusters required to contain \(> 90{\%}\) of the true deposited energy of a single charged pion which deposits significant energy (20% of the particle’s energy) in the clustered cells. The distributions are shown for three \(\,p_{\mathrm{T}}^{\mathrm{true}} \) bins in three \(|\eta ^{\mathrm{true}} |\) regions. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

Fig. 6
figure 6

The probability that a single topo-cluster contains \(> 90{\%}\) of the true deposited energy of a single charged pion, which deposits significant energy (20% of the particle’s energy) in the clustered cells. The distributions are shown as a function of \(\,p_{\mathrm{T}}^{\mathrm{true}} \) in three \(|\eta ^{\mathrm{true}} |\) regions. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

Fig. 7
figure 7

The purity \(\rho ^{\mathrm{clus}}_{\mathrm{lead}}\), defined for a selected charged pion as the fractional contribution of the chosen particle to the total true energy in the leading topo-cluster, shown for pions with \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}}\) >50%. Distributions are shown for several \(\,p_{\mathrm{T}}^{\mathrm{true}} \) bins and in three \(|\eta ^{\mathrm{true}} |\) regions. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

For more than \({60}{\%}\) of particles with \(1< p_{\mathrm{T}}^{\mathrm{true}} < 2\,\mathrm{GeV}\), the shower is entirely contained within a single topo-cluster (\(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}} \sim 1\)). This fraction falls rapidly with particle \(p_{\mathrm{T}}\), reaching \(\sim {25}{\%}\) for particles in the range \(5<p_{\mathrm{T}}^{\mathrm{true}} <10\,\mathrm{GeV}\). For particles with \(\,p_{\mathrm{T}}^{\mathrm{true}} <2\,\mathrm{GeV}\), \(90{\%}\) of the particle energy can be captured within two topo-clusters in \(\sim {95}{\%}\) of cases. The topo-cluster purity also falls as the pion \(p_{\mathrm{T}}\) increases, with the target particle only contributing between 38 and 45% of the topo-cluster energy when \(5<p_{\mathrm{T}}^{\mathrm{true}} <10\,\mathrm{GeV}\). This is in part due to the tendency for high-\(p_{\mathrm{T}}\) particles to be produced in dense jets, while softer particles from the underlying event tend to be isolated from nearby activity.

In general, the subtraction of the hadronic shower is easier for cases with topo-clusters with high \(\rho ^{\mathrm{clus}}_i\), and high \(\varepsilon ^{\mathrm{clus}}_i\), since in this configuration the topo-cluster ing algorithm has separated out the contributions from different particles.

6.2 Track selection

Tracks are selected which pass stringent quality criteria: at least nine hits in the silicon detectors are required, and tracks must have no missing Pixel hits when such hits would be expected [57]. This selection is designed such that the number of badly measured tracks is minimised and is referred to as ‘tight selection’. No selection cuts are made on the association to the hard scatter vertex at this stage Additionally, tracks are required to be within \(\,|{\eta }|<2.5\) and have \(\,p_{\mathrm{T}} > 0.5\,\mathrm{GeV}\). These criteria remain efficient for tracks from particles which are expected to deposit energy below the threshold needed to seed a topo-cluster or particles that do not reach the calorimeter. Including additional tracks by reducing the \(p_{\mathrm{T}}\) requirement to \(0.4\,\mathrm{GeV}\) leads to a substantial increase in computing time without any corresponding improvement in jet resolution. This is due to their small contribution to the total jet \(p_{\mathrm{T}}\).

Tracks with \(\,p_{\mathrm{T}} \) > \(40\,\mathrm{GeV}\) are excluded from the algorithm, as such energetic particles are often poorly isolated from nearby activity, compromising the accurate removal of the calorimeter energy associated with the track. In such cases, with the current subtraction scheme, there is no advantage in using the tracker measurement. This requirement was tuned both by monitoring the effectiveness of the energy subtraction using the true energy deposited in dijet MC events, and by measuring the jet resolution in MC simulation. The majority of tracks in jets with \(p_{\mathrm{T}}\) between 40 and 60 GeV have \(p_{\mathrm{T}}\) below 40 GeV, as shown later in Sect. 11.

In addition, any tracks matched to candidate electrons [58] or muons [49], without any isolation requirements, identified with medium quality criteria, are not selected and therefore are not considered for subtraction, as the algorithm is optimised for the subtraction of hadronic showers. The energy deposited in the calorimeter by electrons and muons is hence taken into account in the particle flow algorithm and any resulting topo-clusters are generally left unsubtracted.

Figure 8 shows the charged-pion track reconstruction efficiency, for the tracks selected with the criteria described above, as a function of \(\eta ^{\mathrm{true}} \) and \(\,p_{\mathrm{T}}^{\mathrm{true}} \) in the dijet MC sample, with leading jets in the range \(20<p_{\mathrm{T}}^{\mathrm{lead}} <1000\,\mathrm{GeV}\) and with similar pile-up to that in the 2012 data. The Monte Carlo generator information is used to match the reconstructed tracks to the generated particles [48]. The application of the tight quality criteria substantially reduces the rate of poorly measured tracks, as shown in Fig. 9. Additionally, using the above selection, the fraction of combinatorial fake tracks arising from combining ID hits from different particles is negligible [48].

Fig. 8
figure 8

The track reconstruction efficiency for charged pions after applying the tight quality selection criteria to the tracks. Subfigure (a) shows the efficiency for 1–2 GeV, 2–5 GeV and 5–10 GeV particles as a function of \(\eta \), while (b) shows the track reconstruction efficiency as a function of \(p_{\mathrm{T}}\) in three \(\,|{\eta }|\) bins. A simulated dijet sample is used, with similar pile-up to that in the 2012 data, and for which \(20< p_{\mathrm{T}}^{\mathrm{lead}} < 1000\,\mathrm{GeV}\). The statistical uncertainties in the number of MC simulated events are shown in a darker shading

Fig. 9
figure 9

The difference between the reconstructed \(p_{\mathrm{T}}\) of the track from a charged pion and the particle’s true \(p_{\mathrm{T}}\) for two bins in truth particle \(p_{\mathrm{T}}\) and \(\,|{\eta }|\), determined in dijet MC simulation with similar pile-up to that in the 2012 data. The shaded bands represent the statistical uncertainty. The tails in the residuals are substantially diminished upon the application of the more stringent silicon detector hit requirements. A simulated dijet sample with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < 1000\,\mathrm{GeV}\) is used, and the statistical uncertainties in the number of MC simulated events are shown as a hatched band

6.3 Matching tracks to topo-clusters

To remove the calorimeter energy where a particle has formed a single topo-cluster, the algorithm first attempts to match each selected track to one topo-cluster. The distances \(\Delta \phi \) and \(\Delta \eta \) between the barycentre of the topo-cluster and the track, extrapolated to the second layer of the EM calorimeter, are computed for each topo-cluster. The topo-clusters are ranked based on the distance metric

$$\begin{aligned} \Delta R' = \sqrt{ \left( \frac{\Delta \phi }{\sigma _\phi } \right) ^ 2+ \left( \frac{\Delta \eta }{\sigma _\eta } \right) ^2 }, \end{aligned}$$
(5)

where \(\sigma _\eta \) and \(\sigma _\phi \) represent the angular topo-cluster widths, computed as the standard deviation of the displacements of the topo-cluster ’s constituent cells in \(\eta \) and \(\phi \) with respect to the topo-cluster barycentre. This accounts for the spatial extent of the topo-clusters, which may contain energy deposits from multiple particles.

The distributions of \(\sigma _\eta \) and \(\sigma _\phi \) for single-particle samples are shown in Fig. 10. The structure seen in these distributions is related to the calorimeter geometry. Each calorimeter layer has a different cell granularity in both dimensions, and this sets the minimum topo-cluster size. In particular, the granularity is significantly finer in the electromagnetic calorimeter, thus particles that primarily deposit their energy in either the electromagnetic and hadronic calorimeters form distinct populations. High-energy showers typically spread over more cells, broadening the corresponding topo-clusters. If the computed value of \(\sigma _{\eta }\) or \(\sigma _{\phi }\) is smaller than 0.05, it is set to 0.05.

Fig. 10
figure 10

The distribution of \(\sigma _\eta \) and \(\sigma _\phi \), for charged pions, in three different regions of the detector for three particle \(p_{\mathrm{T}}\) ranges. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

A preliminary selection of topo-clusters to be matched to the tracks is performed by requiring that \(E^{\mathrm{clus}}/p^{\mathrm{trk}} >0.1\), where \(E^{\mathrm{clus}} \) is the energy of the topo-cluster and \(p^{\mathrm{trk}}\) is the track momentum. The distribution of \(E^{\mathrm{clus}}/p^{\mathrm{trk}} \) for the topo-cluster with at least 90% of the true energy from the particle matched to the track – the “correct” one to match to – and for the closest other topo-cluster in \(\Delta R'\) is shown in Fig. 11. For very soft particles, it is common that the closest other topo-cluster carries \(E^{\mathrm{clus}}/p^{\mathrm{trk}}\) comparable to (although smaller than) the correct topo-cluster. About 10% of incorrect topo-clusters are rejected by the \(E^{\mathrm{clus}}/p^{\mathrm{trk}} \) cut for particles with \(1<p_{\mathrm{T}} <2\,\mathrm{GeV}\). The difference in \(E^{\mathrm{clus}}/p^{\mathrm{trk}} \) becomes much more pronounced for particles with \(\,p_{\mathrm{T}} >5\,\mathrm{GeV}\), for which there is a very clear separation between the correct and incorrect topo-cluster matches, resulting in a 30–40% rejection rate for the incorrect topo-clusters. This is because at lower \(\,p_{\mathrm{T}} \) clusters come from both signal and electronic or pile-up noise. Furthermore, the particle \(\,p_{\mathrm{T}} \) spectrum is peaked towards lower values, and thus higher-\(\,p_{\mathrm{T}} \) topo-clusters are rarer. The \(E^{\mathrm{clus}}/p^{\mathrm{trk}} >0.1\) requirement rejects the correct cluster for far less than 1% of particles.

Fig. 11
figure 11

The distributions of \(E^{\mathrm{clus}}/p^{\mathrm{trk}} \) for the topo-cluster with \(> 90{\%}\) of the true energy of the particle and the closest other topo-cluster in \(\Delta R'\). The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band. A track is only used for energy subtraction if a topo-cluster is found inside a cone of \(\Delta R' = 1.64\) for which \(E^{\mathrm{clus}}/p^{\mathrm{trk}} > 0.1\), as indicated by the vertical dashed line

Next, an attempt is made to match the track to one of the preselected topo-clusters using the distance metric \(\Delta R'\) defined in Eq. 5. The distribution of \(\Delta R'\) between the track and the topo-cluster with \(> 90{\%}\) of the truth particle energy and to the closest other preselected topo-cluster is shown in Fig. 12 for the dijet MC sample. From this figure, it is seen that the correct topo-cluster almost always lies at a small \(\Delta R'\) relative to other clusters. Hence, the closest preselected topo-cluster in \(\Delta R' \) is taken to be the matched topo-cluster. This criterion selects the correct topo-cluster with a high probability, succeeding for virtually all particles with \(\,p_{\mathrm{T}} >5\,\mathrm{GeV}\). If no preselected topo-cluster is found in a cone of size \(\Delta R' =1.64\), it is assumed that this particle did not form a topo-cluster in the calorimeter. In such cases the track is retained in the list of tracks and no subtraction is performed. The numerical value corresponds to a one-sided Gaussian confidence interval of 95%, and has not been optimised. However, as seen in Fig. 12, this cone size almost always includes the correct topo-cluster, while rejecting the bulk of incorrect clusters.

Fig. 12
figure 12

The distributions of \(\Delta R' \) for the topo-cluster with \(>90{\%}\) of the true energy of the particle and the closest other topo-cluster, both satisfying \(E^{\mathrm{clus}}/p^{\mathrm{trk}} > 0.1\). The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band. A track is only used for energy subtraction if a topo-cluster is found with \(E^{\mathrm{clus}}/p^{\mathrm{trk}} > 0.1\) inside a cone of \(\Delta R' < 1.64\), as indicated by the vertical dashed line

6.4 Evaluation of the expected deposited particle energy through \(\langle E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \rangle \) determination

It is necessary to know how much energy a particle with measured momentum \(p^{\mathrm{trk}}\) deposits on average, given by \(\langle E_{\mathrm{dep}} \rangle = p^{\mathrm{trk}} \,\langle E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \rangle \), in order to correctly subtract the energy from the calorimeter for a particle whose track has been reconstructed. The expectation value \(\langle E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \rangle \) (which is also a measure of the mean response) is determined using single-particle samples without pile-up by summing the energies of topo-clusters in a \(\Delta R\) cone of size 0.4 around the track position, extrapolated to the second layer of the EM calorimeter. This cone size is large enough to entirely capture the energy of the majority of particle showers. This is also sufficient in dijet events, as demonstrated in Fig. 13, where one might expect the clusters to be broader due to the presence of other particles. The subscript ‘ref’ is used here and in the following to indicate \(E^{\mathrm{clus}}/p^{\mathrm{trk}}\) values determined from single-pion samples.

Variations in \(\langle E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \rangle \) due to detector geometry and shower development are captured by binning the measurement in the \(\,p_{\mathrm{T}} \) and \(\eta \) of the track as well as the layer of highest energy density (LHED), defined in the next section. The LHED is also used to determine the order in which cells are subtracted in subsequent stages of the algorithm.

The spread of the expected energy deposition, denoted by \(\sigma (E_{\mathrm{dep}}) \), is determined from the standard deviation of the \(E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}}\) distribution in single-pion samples. It is used in order to quantify the consistency of the measured \(E^{\mathrm{clus}}/p^{\mathrm{trk}} \) with the expectation from \(\langle E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \rangle \) in both the split-shower recovery (Sect. 6.5) and remnant removal (Sect. 6.7).

Fig. 13
figure 13

The cone size \(\Delta R\) around the extrapolated track required to encompass both the leading and sub-leading topo-clusters, for \(\pi ^{\pm }\) when \(< {70}{\%}\) of their true deposited energy in topo-clusters is contained in the leading topo-cluster, but \(> 90{\%}\) of the energy is contained in the two leading topo-clusters. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

6.4.1 Layer of highest energy density

The dense electromagnetic shower core has a well-defined ellipsoidal shape in \(\eta \)\(\phi \). It is therefore desirable to locate this core, such that the energy subtraction may be performed first in this region before progressing to the less regular shower periphery. The LHED is taken to be the layer which shows the largest rate of increase in energy density, as a function of the number of interaction lengths from the front face of the calorimeter. This is determined as follows:

  • The energy density is calculated for the jth cell in the ith layer of the calorimeter as

    $$\begin{aligned} \rho _{ij} = \frac{E_{ij}}{V_{ij}} \left( \mathrm{GeV}/X_{0}^{3}\right) , \end{aligned}$$
    (6)

    with \(E_{ij}\) being the energy in and \(V_{ij}\) the volume of the cell expressed in radiation lengths. The energy measured in the Presampler is added to that of the first layer in the EM calorimeter. In addition, the Tile and HEC calorimeters are treated as single layers. Thus, the procedure takes into account four layers – three in the EM calorimeter and one in the hadronic calorimeter. Only cells in the topo-clusters matched to the track under consideration are used.

  • Cells are then weighted based on their proximity to the extrapolated track position in the layer, favouring cells that are closer to the track and hence more likely to contain energy from the selected particle. The weight for each cell, \(w_{ij}\), is computed from the integral over the cell area in \(\eta \)\(\phi \) of a Gaussian distribution centred on the extrapolated track position with a width in \(\Delta R\) of 0.035, similar to the Molière radius of the LAr calorimeter.

  • A weighted average energy density for each layer is calculated as

    $$\begin{aligned} \langle \rho ' \rangle _{i} = \sum _{j} w_{ij} \rho _{ij}. \end{aligned}$$
    (7)
  • Finally, the rate of increase in \(\langle \rho '\rangle _i\) in each layer is determined. Taking \(d_i\) to be the depth of layer i in interaction lengths, the rate of increase is defined as

    $$\begin{aligned} \Delta \rho '_{i} = \frac{ \langle \rho '\rangle _i - \langle \rho '\rangle _{i-1} }{d_i - d_{i-1}}, \end{aligned}$$
    (8)

    where the values \(\langle \rho '\rangle _0=0\) and \(d_0=0\) are assigned, and the first calorimeter layer has the index \(i=1\).

The layer for which \(\Delta \rho '\) is maximal is identified as the LHED.

6.5 Recovering split showers

Particles do not always deposit all their energy in a single topo-cluster, as seen in Fig. 5. Clearly, handling the multiple topo-cluster case is crucial, particularly the two topo-cluster case, which is very common. The next stages of the algorithm are therefore firstly to determine if the shower is split across several clusters, and then to add further clusters for consideration when this is the case.

The discriminant used to distinguish the single and multiple topo-cluster cases is the significance of the difference between the expected energy and that of the matched topo-cluster (defined using the algorithm in Sect. 6.3),

$$\begin{aligned} S(E^{\mathrm{clus}}) = \frac{E^{\mathrm{clus}}- \langle E_{\mathrm{dep}} \rangle }{\sigma (E_{\mathrm{dep}})}. \end{aligned}$$
(9)

The distribution of \(S(E^{\mathrm{clus}})\) is shown in Fig. 14 for two categories of matched topo-clusters: those with \(\varepsilon ^{\mathrm{clus}}_i > 90{\%}\) and those with \(\varepsilon ^{\mathrm{clus}}_i < {70}{\%}\). A clear difference is observed between the \(S(E^{\mathrm{clus}})\) distributions for the two categories, demonstrating the separation between showers that are and are not contained in a single cluster. More than 90% of clusters with \(\varepsilon ^{\mathrm{clus}}_i > 90{\%}\) have \(S(E^{\mathrm{clus}})>-1\). Based on this observation a split shower recovery procedure is run if \(S(E^{\mathrm{clus}})<-1\): topo-clusters within a cone of \(\Delta R =0.2\) around the track position extrapolated to the second EM calorimeter layer are considered to be matched to the track. As can be seen in the figure, the split shower recovery procedure is typically run 50% of the time when \(\varepsilon ^{\mathrm{clus}}_{\mathrm{matched}} < {70}{\%}\). The full set of matched clusters is then considered when the energy is subtracted from the calorimeter.

Fig. 14
figure 14

The significance of the difference between the energy of the matched topo-cluster and the expected deposited energy \(\langle E_{\mathrm{dep}} \rangle \) and that of the matched topo-cluster, for \(\pi ^{\pm }\) when \(< {70}{\%}\) and \(> 90{\%}\) of the true deposited energy in topo-clusters is contained in the matched topo-cluster for different \(p_{\mathrm{T}}^{\mathrm{true}}\) and \(|\eta ^{\mathrm{true}} |\) ranges. The vertical line indicates the value below which additional topo-clusters are matched to the track for cell subtraction. Subfigures af indicate that a single cluster is considered \((93,95,95,94,95,91)\,\%\) of the time when \(\varepsilon ^{\mathrm{clus}}_{\mathrm{matched}} > 90{\%}\); while additional topo-clusters are considered \((49,39,46,56,52,60)\,\%\) of the time when \(\varepsilon ^{\mathrm{clus}}_{\mathrm{matched}} < {70}{\%}\). The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

6.6 Cell-by-cell subtraction

Once a set of topo-clusters corresponding to the track has been selected, the subtraction step is executed. If \(\langle E_{\mathrm{dep}} \rangle \) exceeds the total energy of the set of matched topo-clusters, then the topo-clusters are simply removed. Otherwise, subtraction is performed cell by cell.

Starting from the extrapolated track position in the LHED, a parameterised shower shape is used to map out the most likely energy density profile in each layer. This profile is determined from a single \(\pi ^{\pm }\) MC sample and is dependent on the track momentum and pseudorapidity, as well as on the LHED for the set of considered topo-clusters. Rings are formed in \(\eta-\phi\) space around the extrapolated track. The rings are just wide enough to always contain at least one calorimeter cell, independently of the extrapolated position, and are confined to a single calorimeter layer. Rings within a single layer are equally spaced in radius. The average energy density in each ring is then computed, and the rings are ranked in descending order of energy density, irrespective of which layer each ring is in. Subtraction starts from the ring with the highest energy density (the innermost ring of the LHED) and proceeds successively to the lower-density rings. If the energy in the cells in the current ring is less than the remaining energy required to reach \(\langle E_{\mathrm{dep}} \rangle \), these cells are simply removed and the energy still to be subtracted is reduced by the total energy of the ring. If instead the ring has more energy than is still to be removed, each cell in the ring is scaled down in energy by the fraction needed to reach the expected energy from the particle, then the process halts. Figure 15 shows a cartoon of how this subtraction works, removing cells in different rings from different layers until the expected energy deposit is reached.

Fig. 15
figure 15

An idealised example of how the cell-by-cell subtraction works. Cells in two adjacent calorimeter layers (EMB2 and EMB3) are shown in grey if they are not in clusters, red if they belong to a \(\pi ^+\) cluster and in green if contributed by a \(\pi ^0\) meson. Rings are placed around the extrapolated track (represented by a star) and then the cells in these are removed ring by ring starting with the centre of the shower (a), where the expected energy density is highest and moving outwards, and between layers. This sequence of ring subtraction is shown in subfigures (a) through (g). The final ring contains more energy than the expected energy, hence this is only partially subtracted (g), indicated by a lighter shading

6.7 Remnant removal

If the energy remaining in the set of cells and/or topo-clusters that survive the energy subtraction is consistent with the width of the \(E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \) distribution, specifically if this energy is less than \(1.5 \sigma (E_{\mathrm{dep}}) \), it is assumed that the topo-cluster system was produced by a single particle. The remnant energy therefore originates purely from shower fluctuations and so the energy in the remaining cells is removed. Conversely, if the remaining energy is above this threshold, the remnant topo-cluster (s) are retained – it being likely that multiple particles deposited energy in the vicinity. Figure 16 shows how this criterion is able to separate cases where the matched topo-cluster has true deposited energy only from a single particle from those where there are multiple contributing particles.

After this final step, the set of selected tracks and the remaining topo-clusters in the calorimeter together should ideally represent the reconstructed event with no double counting of energy between the subdetectors.

Fig. 16
figure 16

The significance of the difference between the energy of the matched topo-cluster and the expected deposited energy \(\langle E_{\mathrm{dep}} \rangle \) for \(\pi ^{\pm }\) with either \(< {70}{\%}\) or \(> 90{\%}\) of the total true energy in the matched topo-cluster originating from the \(\pi ^{\pm }\) for different \(p_{\mathrm{T}}^{\mathrm{true}}\) and \(|\eta ^{\mathrm{true}} |\) ranges. The vertical line indicates the value below which the remnant topo-cluster is removed, as it is assumed that in this case no other particles contribute to the topo-cluster. Subfigures a–(f indicate that when \(\rho ^{\mathrm{clus}}_{\mathrm{matched}} > 90{\%}\) the remnant is successfully removed \((91,89,94,89,91,88)\,\%\) of the time; while when \(\rho ^{\mathrm{clus}}_{\mathrm{matched}} < {70}{\%}\) the remnant is retained \((81,80,76,84,83,91)\,\%\) of the time. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

7 Performance of the subtraction algorithm at truth level

The performance of each step of the particle flow algorithm is evaluated exploiting the detailed energy information at truth level available in Monte Carlo generated events. For these studies a dijet sample with leading truth jet \(p_{\mathrm{T}}\) between 20 and 500 GeV without pile-up is used.

7.1 Track–cluster matching performance

Initially, the algorithm attempts to match the track to a single topo-cluster containing the full particle energy. Figure 17 shows the fraction of tracks whose matched cluster has \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}} >90{\%}\) or \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}} >50{\%}\). When almost all of the deposited energy is contained within a single topo-cluster, the probability to match a track to this topo-cluster (matching probability) is above 90% in all \(\eta \) regions, for particles with \(\,p_{\mathrm{T}} > 2\,\mathrm{GeV}\). The matching probability falls to between 70 and \(90{\%}\) when up to half the particle’s energy is permitted to fall in other topo-clusters. Due to changes in the calorimeter geometry, the splitting rate and hence the matching probability vary significantly for particles in different pseudorapidity regions. In particular, the larger cell size at higher \(\,|{\eta }|\) enhances the likelihood of capturing soft particle showers in a single topo-cluster, as seen in Figs. 4 and 5, which results in the matching efficiency increasing at low \(p_{\mathrm{T}}\) for \(\,|{\eta }|>2\).

Fig. 17
figure 17

The probability to match the track to the leading topo-cluster (a) when \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}} > 90{\%}\) and (b) when \(\varepsilon ^{\mathrm{clus}}_{\mathrm{lead}} > 50{\%}\). The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

7.2 Split-shower recovery performance

Frequently, a particle’s energy is not completely contained within the single best-match topo-cluster, in which case the split shower recovery procedure is applied. The effectiveness of the recovery can be judged based on whether the procedure is correctly triggered, and on the extent to which the energy subtraction is improved by its execution.

Figure 18 shows the fraction \(\varepsilon ^\mathrm{clus}_\mathrm{matched}\) of the true deposited energy contained within the matched topo-cluster, separately for cases where the split shower recovery procedure is and is not triggered, as determined by the criteria described in Sect. 6.5. In the cases where the split shower recovery procedure is not run, \(\varepsilon ^\mathrm{clus}_\mathrm{matched}\) is found to be high, confirming that the comparison of topo-cluster energy and \(\langle E^{\mathrm{clus}}_{\mathrm{ref}}/p^{\mathrm{trk}}_{\mathrm{ref}} \rangle \) is successfully identifying good topo-cluster matches. Conversely, the split shower recovery procedure is activated when \(\varepsilon ^\mathrm{clus}_\mathrm{matched}\) is low, particularly for higher-\(p_{\mathrm{T}}\) particles, which are expected to split their energy between multiple topo-clusters more often. Furthermore, as the particle \(p_{\mathrm{T}}\) rises, the width of the calorimeter response distribution decreases, making it easier to distinguish the different cases.

Fig. 18
figure 18

The fraction of the true energy of a given particle contained within the initially matched topo-cluster for particles where the split shower recovery procedure is run (SSR run) and where it is not (No SSR). For cases where most of the energy is contained in the initially matched topo-cluster the procedure is less likely to be run. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

Figure 19 shows the fraction \(f^\mathrm{clus}_\mathrm{sub}\) of the true deposited energy of the pions considered for subtraction, in the set of clusters matched to the track, as a function of true \(p_{\mathrm{T}}\). For particles with \(\,p_{\mathrm{T}} >20\,\mathrm{GeV}\), with split shower recovery active, \(f^\mathrm{clus}_\mathrm{sub}\) is greater than \(90{\%}\) on average. The subtraction algorithm misses more energy for softer showers, which are harder to capture completely. While \(f^\mathrm{clus}_\mathrm{sub}\) could be increased by simply attempting recovery more frequently, expanding the topo-cluster matching procedure in this fashion increases the risk of incorrectly subtracting neutral energy; hence the split shower recovery procedure cannot be applied indiscriminately. The settings used in the studies presented in this paper are a reasonable compromise between these two cases.

Fig. 19
figure 19

The fraction of the true energy of a given particle considered in the subtraction procedure \(f^\mathrm{clus}_\mathrm{sub}\) after the inclusion of the split shower recovery algorithm. The data are taken from a dijet sample without pile-up with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < {500}\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown as a hatched band

7.3 Accuracy of cell subtraction

The cell subtraction procedure removes the expected calorimeter energy contribution based on the track properties. It is instructive to identify the energy that is incorrectly subtracted from the calorimeter, to properly understand and optimise the performance of the algorithm.

Truth particles are assigned reconstructed energy in topo-clusters as described in Sect. 3.2, and then classified depending on whether or not a track was reconstructed for the particle. The reconstructed energy assigned to each particle is computed both before subtraction and after the subtraction has been performed, using the remaining cells. In the ideal case, the subtraction should remove all the energy in the calorimeter assigned to stable truth particles which have reconstructed tracks, and should not remove any energy assigned to other particles. The total transverse momentum of clusters associated with particles in a truth jet where a track was reconstructed before (after) subtraction is defined as \(p^{\pm }_{\mathrm{T,pre-sub}} (p^{\pm }_{\mathrm{T,post-sub}})\). Similarly, the transverse momentum of clusters associated with the other particles in a truth jet, neutral particles and those that did not create selected, reconstructed tracks, before (after) subtraction as \(p^{0}_{\mathrm{T,pre-sub}} (p^{0}_{\mathrm{T,post-sub}})\). The corresponding transverse momentum fractions are defined as \(f^{\pm } = p^{\pm }_{\mathrm{T,pre-sub}}/p_{\mathrm{T}} ^{\mathrm{jet, true}}\) (\(f^0 = p^{0}_{\mathrm{T,pre-sub}}/p_{\mathrm{T}} ^{\mathrm{jet, true}}\)).

Three measures are established, to quantify the degree to which the energy is incorrectly subtracted. The incorrectly subtracted fractions for the two classes of particles are:

$$\begin{aligned} R^{\pm } = \frac{p^{\pm }_{\mathrm{T,post-sub}}}{p_{\mathrm{T}} ^\mathrm{jet, true}} \end{aligned}$$
(10)

and

$$\begin{aligned} R^0 = \frac{p^0_\mathrm{T,pre-sub} - p^0_\mathrm{T,post-sub}}{p_{\mathrm{T}} ^\mathrm{jet, true}}, \end{aligned}$$
(11)

such that \(R^{\pm }\) corresponds to the fraction of surviving momentum associated with particles where the track measurement is used, which should have been removed, while \(R^0\) gives the fraction of momentum removed that should have been retained as it is associated with particles where the calorimeter measurement is being used. These two variables are combined into the confusion term

$$\begin{aligned} C = R^{\pm }-R^0, \end{aligned}$$
(12)

which is equivalent to the net effect of both mistakes on the final jet transverse momentum, as there is a potential cancellation between the two effects. An ideal subtraction algorithm would give zero for all three quantities.

Figure 20 shows the fractions associated with the different classes of particle, before and after the subtraction algorithm has been executed for jets with a true energy in the range 40–60 GeV. The confusion term is also shown, multiplied by the jet energy scale factor that would be applied to these reconstructed jets, such that its magnitude (\(C \times \mathrm{JES}\)) is directly comparable to the reconstructed jet resolution.

Clearly, the subtraction does not perform perfectly, but most of the correct energy is removed – the mean value of the confusion is \({-1}{\%}\), with an RMS of 7.6%. The slight bias towards negative values suggests that the subtraction algorithm is more likely to remove additional neutral energy rather than to miss charged energy and the RMS gives an indication of the contribution from this confusion to the overall jet resolution.

Figure 21 shows \(C \times \mathrm{JES}\) as a function of \(p_{\mathrm{T}}\). The mean value of the JES weighted confusion remains close to zero and always within \(\pm {1.5}{\%}\), showing that on average the algorithm removes the correct amount of energy from the calorimeter. The RMS decreases with increasing \(p_{\mathrm{T}}\). This is due to a combination of the particle \(p_{\mathrm{T}}\) spectrum becoming harder, such that the efficiency of matching to the correct cluster increases; the increasing difficulty of subtracting the hadronic showers in the denser environments of high-\(p_{\mathrm{T}}\) jets; and the fact that no subtraction is performed for tracks above 40 GeV, resulting in the fraction of the jet considered for subtraction decreasing with increasing jet \(p_{\mathrm{T}}\).

Fig. 20
figure 20

The fractions of the jet calorimeter energy that have been incorrectly subtracted by the cell subtraction algorithm, for jets with \(40< p_{\mathrm{T}}^{\mathrm{true}} < 60\,\mathrm{GeV}\) and \(\,|{\eta }|<1.0\) in dijet MC simulation without pile-up. The statistical uncertainty is indicated by the hatched bands. Subfigure (a) shows the fraction of jet transverse momentum carried by reconstructed tracks before subtraction \(f^{\pm }\) (hashed) and the corresponding fraction after subtraction \(R^{\pm }\) (solid); b shows the fraction of jet transverse momentum carried by particles without reconstructed tracks before subtraction \(f^0\) (hashed) and the corresponding fraction after subtraction \(R^0\) (solid); and c shows the confusion \(C = R^{\pm }-R^0\), scaled up by the jet energy scale, derived as discussed in Sect. 8

Fig. 21
figure 21

As a function of the jet \(p_{\mathrm{T}}\), subfigure a shows the mean of the confusion term \(C = R^{\pm }-R^0\), scaled up by the jet energy scale, derived as discussed in Sect. 8, and (b) shows the RMS of this distribution. The error bars denote the statistical uncertainty. The MC samples used do not include pile-up

7.4 Visualising the subtraction

For a concrete demonstration of successes and failures of the subtraction algorithm, it is instructive to look at a specific event in the calorimeter. Figure 22 illustrates the action of the algorithm in the second layer of the EM calorimeter, where the majority of low-energy showers are contained. The focus is on a region where a 30 GeV truth jet is present. In general, the subtraction works well in the absence of pile-up, as the two topo-clusters inside the jet radius with energy mainly associated with charged particles at truth level are entirely removed. Nevertheless, examples can be seen where small mistakes are made. For example, the algorithm additionally removes some cells containing neutral-particle energy from the topo-cluster just above the track at \((\eta ,\phi )=(0.0,1.8)\).

The figure also shows the same event, overlaid with pile-up corresponding to \(\mu =40\). Pile-up contributions are identified by subtracting the energy reconstructed without pile-up and are illustrated in blue. The pile-up supplies many more energy deposits and tracks within the region under scrutiny. However, the subtraction continues to function effectively, removing energy in the vicinity of pile-up tracks and hence the post-subtraction cell distribution more closely resembles that without pile-up, especially inside the jet radius. Because tracks classified as originating from pile-up are ignored in jet reconstruction (see Sect. 8), the jet energy after subtraction is mainly contaminated by neutral pile-up contributions.

Fig. 22
figure 22

A graphical display of the second layer of the EM calorimeter focusing on a 30 GeV truth jet, outlined by the ellipse. Asterisks indicate the positions of tracks extrapolated to the calorimeter, while blue framed rectangles mark the cells clustered into topo-clusters. The colour purple (dark) is used to indicate those tracks that are selected for particle flow jet reconstruction, i.e. those matched to the nominal hard-scatter primary vertex (see Sect. 8) and clustered into the jet based on their momenta expressed at the perigee. Other tracks are shown in orange (light). Red and green boxes indicate reconstructed cell energies associated with truth particles where tracks have and have not been reconstructed. Subfigures (a) and (b) show the event without pile-up. Subfigures (c) and (d) show the same event with pile-up overlaid. Pile-up energy in the calorimeter is indicated by blue boxes

8 Jet reconstruction and calibration

Improved jet performance is the primary goal of using particle flow reconstruction. Particle flow jets are reconstructed using the anti-\(k_t\) algorithm with radius parameter 0.4. The inputs to jet reconstruction are the ensemble of positive energy topo-clusters surviving the energy subtraction step and the selected tracks that are matched to the hard-scatter primary vertex. These tracks are selected by requiring \(|z_0\sin \theta | < 2\,\mathrm{mm}\), where \(z_0\) is the distance of closest approach of the track to the hard-scatter primary vertex along the z-axis. This criterion retains the tracks from the hard scatter, while removing a large fraction of the tracks (and their associated calorimeter energy) from pile-up interactions [59]. Prior to jet-finding, the topo-cluster \(\eta \) and \(\phi \) are recomputed with respect to the hard-scatter primary vertex position, rather than the detector origin.

Calorimeter jets are similarly reconstructed using the anti-\(k_t\) algorithm with radius parameter 0.4, but take as input topo-clusters calibrated at the LC-scale, uncorrected for the primary vertex position. For the purposes of jet calibration, truth jets are formed from stable final-state particles excluding muons and neutrinos, using the anti-\(k_t\) algorithm with radius parameter 0.4.Footnote 3

8.1 Overview of particle flow jet calibration

Calibration of these jets closely follows the scheme used for standard calorimeter jets described in Refs. [4,5,6,7] and is carried out over the range \(20< p_{\mathrm{T}} < 1500\,\mathrm{GeV}\). The reconstructed jets are first corrected for pile-up contamination using the jet ghost-area subtraction method [60, 61]. This is described in Sect. 8.2. A numerical inversion [6] based on Monte Carlo events (see Sect. 8.3) restores the jet response to match the average response at particle level. Additional fluctuations in jet response are corrected for using a global sequential correction process [4], which is detailed in Sect. 8.4. No in situ correction to data is applied in the context of these studies; however, the degree of agreement between data and MC simulation is checked using the \(p_{\mathrm{T}}\) balance of jets against a Z boson decaying to two muons.

The features of particle flow jet calibration that differ from the calibration of calorimeter jets are discussed below, and results from the different stages of the jet calibration are shown.

8.2 Area-based pile-up correction

The calorimeter jet pile-up correction uses a transverse energy density \(\rho \) calculated from topo-clusters using \(k_\mathrm {T}\) jets [62, 63], for a correction of the form of \(\rho \) multiplied by the area of the jet [61]. For particle flow jets, the transverse energy density therefore needs to be computed using charged and neutral particle flow objects to correctly account for the differences in the jet constituents. As discussed above, the tracks associated to pile-up vertices are omitted, eliminating a large fraction of the energy deposits from charged particles from pile-up interactions. The jet-area subtraction therefore corrects for the impact of charged underlying-event hadrons, charged particles from out-of-time interactions, and more importantly, neutral particles from pile-up interactions. This correction is evaluated prior to calibration of the jet energy scale. Figure 23 shows the distribution of the median transverse energy density \(\rho \) in dijet MC events for particle flow objects and for topo-clusters. The topo-cluster \(\rho \) is calculated with the ensemble of clusters, calibrated either at the EM scale or LC scale, and the particle flow jets use topo-clusters calibrated at the EM scale.

The LC-scale energy density is larger than the EM-scale energy density due to the application of the cell weights to calibrate cells to the hadronic scale. Compared to the EM- and LC-scale energy densities, \(\rho \) has a lower per-event value for particle flow jets in 2012 conditions, due to the reduced pile-up contribution. The removal of the charged particle flow objects that are not associated with the hard-scatter primary vertex more than compensates for the higher energy scale for charged hadrons from the underlying event.

Fig. 23
figure 23

The distribution of the median transverse energy density \(\rho \) in dijet MC simulated events with similar pile-up to that measured in the 2012 data for different jet constituents

8.3 Monte Carlo numerical inversion

Figure 24 shows the energy response \(R=E_\mathrm{reco}/E_\mathrm{truth}\) prior to the MC-based jet energy scale correction. The same numerical procedure as described in Ref. [6] is applied and successfully corrects for the hadron response, at a similar level to that observed in Ref. [6].

Fig. 24
figure 24

The response \(E_\mathrm{reco}/E_\mathrm{true}\) of anti-\(k_t\) particle flow jets with radius parameter 0.4 from MC dijet samples with no pile-up, as a function of the jet \(\eta \), measured prior to calibration, and binned in the energy of the matched truth jet

8.4 Global sequential correction

The numerical inversion calibration restores the average reconstructed jet energy to the mean value of the truth jet energy, accounting for variations in the jet response due to the jet energy and pseudorapidity. However, other jet characteristics such as the flavour of the originating parton and the composition of the hadrons created in jet fragmentation may cause further differences in the response. A global sequential correction [4] that makes use of additional observables adapts the jet energy calibration to account for such variations, thereby improving the jet resolution without changing the scale. The variables used for particle flow jets are not the same as those used for calorimeter jets, as tracks have already been used in the calculation of the energy of the jet constituents.

As the name implies, the corrections corresponding to each variable are applied consecutively. Three variables are used as inputs to the correction:

  1. 1.

    the fraction of the jet energy measured from constituent tracks (charged fraction), i.e. those tracks associated to the jet;

  2. 2.

    the fraction of jet energy measured in the third EM calorimeter layer;

  3. 3.

    the fraction of jet energy measured in the first Tile calorimeter layer.

The first of these variables allows the degree of under-calibrated signal, due to the lower energy deposit of hadrons in the hadronic calorimeter, to be determined. The calorimeter-layer energy fractions allow corrections to be made for the energy lost in dead material between the LAr and Tile calorimeters.

8.5 In situ validation of JES

A full in situ calibration and evaluation of the uncertainties on the JES [64] is not performed for these studies. However, to confirm that the ATLAS MC simulation describes the particle flow jet characteristics well enough to reproduce the jet energy scale in data, similar methods are used to validate the jet calibration. A sample of \(Z \rightarrow {\mu {\mu }}\) events with a jet balancing the Z boson is used for the validation. A preselection is made using the criteria described in Sect. 4. The particle flow algorithm is run on these events and further requirements, discussed below, are applied. The jet with the highest \(p_{\mathrm{T}}\) (\(\mathrm{j1}\)) and the reconstructed Z boson are required to be well separated in azimuthal angle, \(\Delta \phi >\pi -0.3\). Events with any additional jet within \(\,|{\eta }|<4.5\), with \(\,p_{\mathrm{T}} ^{\,\mathrm{j2}} > 20\,\mathrm{GeV}\) or \(\,p_{\mathrm{T}} ^{\,\mathrm{j2}}>0.1 p_{\mathrm{T}} ^{\,\mathrm{j1}}\), are vetoed, where \(\mathrm{j2}\) denotes the jet with the second highest \(p_{\mathrm{T}}\). In Fig. 25, the mean value of the ratio of the transverse momentum of the jet to that of the Z boson is shown for data and MC simulation for jets with \(\,|{\eta }|<1\). The mean value is determined using a Gaussian fit to the distribution in bins of the Z-boson \(p_{\mathrm{T}}\). The double-ratio of data to MC simulation is also shown. The simulation reproduces the data to within 2%, and in general is consistent with the data points within statistical uncertainties. At high \(p_{\mathrm{T}}\) the data/MC ratio is expected to tend towards that of the calorimeter jets [6, 7], as a large fraction of the jet’s energy is carried by particles above the cut made on the track momentum. For \(p_{\mathrm{T}}\) > 200 GeV it is observed that the jet energy scale of calorimeter jets in data is typically 0.5% below that in simulation.

Fig. 25
figure 25

The mean value of the ratio of the transverse momentum of a jet to that of the reconstructed Z boson decaying to \({\mu {\mu }}\). The uncertainties shown are statistical

9 Resolution of jets in Monte Carlo simulation

The largest expected benefit from using the particle flow reconstruction as input to jet-finding is an improvement of the jet energy and angular resolution for low-\(p_{\mathrm{T}}\) jets. In this section, the jet resolution achieved with particle flow methods is compared with that attained using standard calorimeter jet reconstruction.

9.1 Transverse momentum resolution

In Fig. 26, the relative jet transverse momentum resolution for particle flow and calorimeter jets is shown as a function of jet transverse momentum for jets in the pseudorapidity range \(\,|{\eta }|<1.0\), and as a function of \(\,|{\eta }|\) for jets with \(40<p_{\mathrm{T}} <60\,\mathrm{GeV}\). Particle flow jets are calibrated using the procedures described in Sect. 8, while calorimeter jets use the more detailed procedure described in Refs. [4,5,6,7]. The particle flow jets perform better than calorimeter jets at transverse momenta of up to \(90\,\mathrm{GeV}\) in the central region, benefiting from the improved scale for low-\(p_{\mathrm{T}}\) hadrons and intrinsic pile-up suppression (elaborated on in Sect. 10). However, at high transverse momenta, the particle flow reconstruction performs slightly worse than the standard reconstruction. This is due to two effects. The dense core of a jet poses a challenge to tracking algorithms, causing the tracking efficiency and accuracy to degrade in high-\(p_{\mathrm{T}}\) jets. Furthermore, the close proximity of the showers within the jet increases the degree of confusion during the cell subtraction stage. To counteract this, the track \(p_{\mathrm{T}}\) used for particle flow reconstruction is required to be \(< 40\,\mathrm{GeV}\) for the 2012 data. Alternative solutions, such as smoothly disabling the algorithm for individual tracks as the particle environment becomes more dense, are expected to restore the particle flow jet performance to match that of the calorimeter jets at high energies. The benefits of particle flow also diminish toward the more forward regions as the cell granularity decreases, as shown in Fig. 26b

In Fig. 27, the underlying distributions of the ratio of reconstructed to true \(p_{\mathrm{T}}\) are shown for two different jet \(p_{\mathrm{T}}\) bins. This demonstrates that the particle flow algorithm does not introduce significant tails in the response at either low or high \(p_{\mathrm{T}}\). The low-side tail visible in Fig. 27b is present in both calorimeter and particle flow jets and is caused by dead material and inactive detector regions.

Fig. 26
figure 26

The jet transverse momentum resolution as determined in dijet MC events for calorimeter jets and particle flow jets. Subfigure (a) shows the resolution as a function of \(p_{\mathrm{T}}\) for jets with \(\,|{\eta }|<1.0\,\) and (b) shows the resolution as a function of \(\,|{\eta }|\,\) for jets with \(40<p_{\mathrm{T}} <60\,\mathrm{GeV}\). Simulated pile-up conditions are similar to the data-taking in 2012. To quantify the difference in resolution between particle flow and calorimeter jets, the lower figure shows the square root of the difference of the squares of the resolution for the two classes of jets

Fig. 27
figure 27

The jet transverse momentum response distribution as determined in dijet MC events for calorimeter jets and particle flow jets. Two different \(p_{\mathrm{T}}\) bins are shown; a \(40<p_{\mathrm{T}} <50\,\mathrm{GeV}\) and b \(120<p_{\mathrm{T}} <130\,\mathrm{GeV}\). Simulated pile-up conditions are similar to the data-taking in 2012

9.2 Angular resolution of jets

Besides improving the \(p_{\mathrm{T}}\) resolution of jets, the particle flow algorithm is expected to improve the angular (\(\eta , \phi \)) resolution of jets. This is due to three different effects. Firstly, usage of tracks to measure charged particles results in a much better angular resolution for individual particles than that obtained using topo-clusters, because the tracker’s angular resolution is far superior to that of the calorimeter. Secondly, the track four-momentum can be determined at the perigee, before the charged particles have been spread out by the magnetic field, thereby improving the \(\phi \) resolution for the jet. Thirdly, the suppression of charged pile-up particles should also reduce mismeasurements of the jet direction.

Figure 28 shows the angular resolution in \(\eta \) and \(\phi \) as a function of the reconstructed jet transverse momentum for particle flow and calorimeter jets. It is determined from the standard deviation of a Gaussian fit over \(\pm 1.5\sigma \) to the difference between the \(\eta \) and \(\phi \) values for the reconstructed and matched truth (\(\Delta R <0.3\)) jets in the central region. At low \(p_{\mathrm{T}}\), where the three effects described above are expected to be more important, significant improvements are seen in both the \(\eta \) and \(\phi \) resolutions. It is interesting to note that for particle flow jets the \(\eta \) and \(\phi \) resolutions are similar, while for calorimeter jets the \(\phi \) resolution is worse due to the aforementioned effect of the magnetic field on charged particles.

Fig. 28
figure 28

The angular resolution, a in \(\eta \) and b in \(\phi \), as a function of the jet \(p_{\mathrm{T}}\), determined in dijet MC simulation by fitting Gaussian functions to the difference between the reconstructed and truth quantities. Conditions are similar to the data-taking in 2012

10 Effect of pile-up on the jet resolution and rejection of pile-up jets

At the design luminosity of the LHC, and even in 2012 data-taking conditions, in- and out-of-time pile-up contribute significantly to the signals measured in the ATLAS detector, increasing the fluctuations in jet energy measurements. The pile-up suppression inherent in the particle flow reconstruction and the calibration of charged particles through the use of tracks significantly mitigates the degradation of jet resolution from pile-up and eliminates jets reconstructed from pile-up deposits, making the particle flow method a powerful tool, especially as the LHC luminosity increases.

10.1 Pile-up jet rate

In the presence of pile-up, jets can arise from particles not produced in the hard-scatter interaction. These jets are here referred to as ‘fake jets’. Figure 29a shows the fake-jet rate as a function of the jet \(\eta \) for particle flow jets compared to calorimeter jets with and without track-based pile-up suppression  [65]. These rates are evaluated using a dijet MC sample overlaid with simulated minimum-bias events approximating the data-taking conditions in 2012. The jet vertex fraction (JVF) is defined as the ratio of two scalar sums of track momenta: the numerator is the scalar sum of the \(p_{\mathrm{T}}\) of tracks that originate from the hard-scatter primary vertex and are associated with the jet; the denominator is the scalar sum of the transverse momenta of all tracks associated with that jet.Footnote 4 Within the tracker coverage of \(\,|{\eta }|<2.5\), the fake rate for particle flow jets drops by an order of magnitude compared to the standard calorimeter jets. The small increase in the rate of particle flow fake jets around \(1.0< \,|{\eta }| < 1.2\) is related to the worse performance of the particle flow algorithm in the transition region between the barrel and extended barrel of the Tile calorimeter, which is significantly affected by pile-up contributions [3].

For \(\,|{\eta }| > 2.5\), the jets are virtually identical, and hence the fake rate shows no differences. This rejection rate is comparable to that achieved using the JVF discriminant, which can likewise only be applied within the tracker coverage. Here, the comparison is made to a \(|\mathrm{JVF}|\) threshold of 0.25 for calorimeter jets, which is not as powerful as the particle flow fake-jet rate reduction. The inefficiency of the particle flow jet-finding is negligible, as can be seen from Fig. 29b. In contrast, the inefficiency generated by requiring \(|\mathrm{JVF}|>0.25\) is clearly visible (it should be noted that in 2012 JVF cuts were only applied to calorimeter jets up to a \(p_{\mathrm{T}}\) of 50 GeV). Below 30 GeV, the jet resolution causes some reconstructed jets to fall below the jet reconstruction energy threshold so these values are not shown.

Fig. 29
figure 29

In the presence of pile-up, ‘fake jets’ can arise from particles not produced in the hard-scatter interaction. Subfigure a shows the number of fake jets (jets not matched to truth jets with \(\,p_{\mathrm{T}} > 4\,\mathrm{GeV}\) within \(\Delta R <0.4\)) and b the efficiency of reconstructing a hard-scatter jet (reconstructed jet found within \(\Delta R <0.4\) with \(\,p_{\mathrm{T}} > 15\,\mathrm{GeV}\)) in dijet MC events. Simulated pile-up conditions are similar to the data-taking in 2012

A more detailed study of the pile-up jet rates is carried out in a \(Z\rightarrow {\mu {\mu }}\) sample, both in data and MC simulation, by isolating several phase-space regions that are enriched in hard-scatter or pile-up jets. A preselection is made using the criteria described in Sect. 4. The particle flow algorithm is run on these events and further requirements are applied: events are selected with two isolated muons, each with \(\,p_{\mathrm{T}} > 25\,\mathrm{GeV}\), with invariant mass \(80< m_{{\mu {\mu }}} < 100\,\mathrm{GeV}\) and \(\,p_{\mathrm{T}} ^{{\mu {\mu }}} > 32\,\mathrm{GeV}\), ensuring that the boson recoils against hadronic activity. Figure 30 displays two regions of phase space: one opposite the recoiling boson, where large amounts of hard-scatter jet activity are expected, and one off-axis, which is more sensitive to pile-up jet activity.

Fig. 30
figure 30

A diagram displaying the regions of r\(\phi \) phase space which are expected to be dominated by hard-scatter jets (opposite in the r\(\phi \) plane to the \(Z \rightarrow {\mu {\mu }}\) decay) and where there is greater sensitivity to pile-up jet activity (perpendicular to the \(Z \rightarrow {\mu {\mu }}\) decay)

Figure 31 shows the average number of jets with \(\,p_{\mathrm{T}} > 20\,\mathrm{GeV}\) in the hard-scatter-enriched region for different \(\,|{\eta }|\) ranges as a function of the number of primary vertices. The distributions are stable for particle flow jets and for calorimeter jets with \(|\mathrm{JVF}|>0.25\) as a function the number of primary vertices in all \(\,|{\eta }|\) regions. The only exception is in the \(2.0<\,|{\eta }|<2.5\) region, where in Fig. 29 a slight increase in the jet fake rate is visible for jet pseudorapidities very close to the tracker boundary. This is due to the jet area collecting charged-particle pile-up contributions that are outside the ID acceptance. If the JVF cut is not applied to the calorimeter jets, the jet multiplicity grows with increasing pile-up. Figure 32 shows that in the pile-up-enriched selection, the particle flow and calorimeter jets with a JVF selection still show no dependence on the number of reconstructed vertices in all \(\,|{\eta }|\) regions. The observed difference between data and MC simulation for both jet collections is due to a poor modelling of this region of phase space. These distributions establish the high stability of particle flow jets in varying pile-up conditions.

Fig. 31
figure 31

The average number of jets per event, for jets with \(\,p_{\mathrm{T}} > 20\,\mathrm{GeV}\), as a function of the number of primary vertices in the \(Z\rightarrow {\mu {\mu }}\) samples. The distributions are shown in three different \(\,|{\eta }|\) regions for particle flow jets, calorimeter jets and calorimeter jets with an additional cut on JVF. The jets are selected in a region of \(\phi \) opposite the Z boson’s direction, \(\Delta \phi (Z,\mathrm{jet}) > 3\pi /4\), which is enriched in hard-scatter jets. The statistical uncertainties in the number of events are shown as a hatched band

Fig. 32
figure 32

The average number of jets per event, for jets with \(\,p_{\mathrm{T}} > 20\,\mathrm{GeV}\), as a function of the number of primary vertices in the \(Z\rightarrow {\mu {\mu }}\) samples. The distributions are shown in three different \(\,|{\eta }|\) regions for particle flow jets, calorimeter jets and calorimeter jets with an additional cut on JVF. The jets are selected in a region of \(\phi \) perpendicular to the Z boson’s direction, \(\pi /4< \Delta \phi (Z,\mathrm{jet}) < 3\pi /4\), which is enriched in pile-up jets. The statistical uncertainties in the number of events are shown as a hatched band

10.2 Pile-up effects on jet energy resolution

In addition to simply suppressing jets from pile-up, the particle flow procedure reduces the fluctuations in the jet energy measurements due to pile-up contributions. This is demonstrated by Fig. 33, which compares the jet energy resolution for particle flow and calorimeter jets with and without pile-up. Even in the absence of pile-up, the particle flow jets have a better resolution at \(p_{\mathrm{T}}\) values below 50 GeV. With pile-up conditions similar to those in the 2012 data, the cross-over point is at \(\,p_{\mathrm{T}} =90\,\mathrm{GeV}\), indicating that particle flow reconstruction alleviates a significant contribution from pile-up even for fairly energetic jets. The direct effect of pile-up can be seen in the lower panel, where the difference in quadrature between the resolutions with and without pile-up is shown. The origin of the increase in the resolution with pile-up is discussed in detail in Ref. [6]. It is shown that additional energy deposits are the primary cause of the degradation of the calorimeter jet resolution. This effect is mitigated by the particle flow algorithm in two ways. Firstly, the subtraction of topo-clusters formed by charged particles from pile-up vertices prior to jet-finding eliminates a major source of fluctuations. Secondly, the increase in the constituent scale of hard-scatter jets from the use of calibrated tracks, rather than energy clusters in the calorimeter, amplifies the signal, effectively suppressing the contribution from pile-up. This second mechanism is found to be the main contributing factor.

Fig. 33
figure 33

The resolutions of calorimeter and particle flow jets determined as a function of \(\,p_{\mathrm{T}} \) in MC dijet simulation, compared with no pile-up and conditions similar to those in the 2012 data. The quadratic difference in the resolution with and without pile-up is shown in the lower panel for LC + JES (blue) and particle flow (black) jets. The data are taken from dijet samples, with and without pileup, with \(20< p_{\mathrm{T}}^{\mathrm{lead}} < 500\,\mathrm{GeV}\) and the statistical uncertainties on the number of MC simulated events are shown

For 40 GeV jets, the total jet resolution without pile-up is 10%. Referring back to Fig. 20c, confusion contributes \(\sim {8}{\%}\) to the jet resolution in the absence of pile-up. Since the terms are combined in quadrature, confusion contributes significantly to the overall jet resolution, although it does not totally dominate. While additional confusion can be caused by the presence of pile-up particles, the net effect is that pile-up affects the resolution of particle flow jets less than that of calorimeter jets.

11 Comparison of data and Monte Carlo simulation

It is crucial that the quantities used by the particle flow reconstruction are accurately described by the ATLAS detector simulation. In this section, particle flow jet properties are compared for \(Z \rightarrow {\mu {\mu }}\) and \(t{\bar{t}}\) events in data and MC simulation. Various observables are validated, from low-level jet characteristics to derived observables relevant to physics analyses.

11.1 Individual jet properties

A sample of jets is selected in \(Z \rightarrow {\mu {\mu }}\) events, as in Sect. 8, and used for a comparison between data and MC simulation. As the subtraction takes place at the cell level, the energy subtracted from each layer of the calorimeter demonstrates how well the subtraction procedure is modelled. To determine the energy before subtraction the particle flow jets are matched to jets formed solely from topo-clusters at the electromagnetic scale. A similar selection to that used to evaluate the jet energy scale is used. The leading jet is required to be opposite a reconstructed Z boson decaying to two muons with \(\Delta \phi >\pi -0.4\). The \(p_{\mathrm{T}}\) of the reconstructed boson is required to be above 32 GeV and the reconstructed jets must have \(40<p_{\mathrm{T}} <60\,\mathrm{GeV}\). Figures 34 and 35 show the properties of central jets. The MC simulation describes the data reasonably well for the jet track multiplicity, fraction of the jet \(p_{\mathrm{T}}\) carried by tracks as well as the amount of subtracted or surviving energy in each layer of the EM barrel. Similar levels of agreement are observed for jets in the endcap regions of the detector.

Fig. 34
figure 34

Comparison of jet track properties, for a selection of jets with \(40< p_{\mathrm{T}} < 60\,\mathrm{GeV}\) and \(\,|{\eta }|<0.6\), selected in \(Z\rightarrow {\mu {\mu }}\) events from collision data and MC simulation. The simulated samples are normalised to the number of events in data. The following distributions are shown: a the charged fraction, i.e. the fractional jet \(p_{\mathrm{T}}\) carried by reconstructed tracks; b the number of tracks in the jet that originate from the nominal hard-scatter primary vertex; c the transverse momentum of the leading track in the jet; d the transverse momenta of all tracks in the jet weighted by the track \(p_{\mathrm{T}}\) and normalised to the number of jets, illustrating the transverse momentum flow from particles of different \(p_{\mathrm{T}}\). The distribution is shown both for tracks satisfying the hard-scatter primary vertex association criteria and forming the jet as well as the additional tracks within \(\Delta R =0.4\) of the jet failing to satisfy the hard-scatter primary vertex association criteria. The darker shaded bands represent the statistical uncertainties

Fig. 35
figure 35

Comparison of the fractions of jet energy removed from a single layer of the electromagnetic calorimeter relative to the total energy of the constituents of the matched calorimeter jet \(E_{\text {Calo}}^{\text {constit.}}\) (left) and retained relative to the total energy of the constituents of the particle flow jet \(E_{\text {Pflow}}^{\text {constit.}}\) (right) by the cell subtraction algorithm in different layers of the EM barrel, for a selection of jets with \(40< p_{\mathrm{T}} < 60\,\mathrm{GeV}\) and \(\,|{\eta }|<0.6\), selected in \(Z\rightarrow {\mu {\mu }}\) events from collision data and MC simulation. The simulated samples are normalised to the number of events in data. The darker shaded bands represent the statistical uncertainties

11.2 Event-level observables

Finally, the particle flow performance is examined in a sample of selected \(t{\bar{t}}\) events; a sample triggered by a single-muon trigger with a single offline reconstructed muon is used. At least four jets with \(\,p_{\mathrm{T}} > 25\,\mathrm{GeV}\) and \(\,|{\eta }|<2.0\) are required and two of these are required to have been b-tagged using the MV1 algorithm and have \(\,p_{\mathrm{T}} > 35\) and 30 GeV.Footnote 5 This selects a 95% pure sample of \(t{\bar{t}}\) events. The event \(E_{\mathrm{T}}^{\mathrm{miss}}\) is reconstructed from the vector sum of the calibrated jets with \(\,p_{\mathrm{T}} > 20\,\mathrm{GeV}\), the muon and all remaining tracks associated with the hard-scatter primary vertex but not associated with these objects. This is then used to form the transverse mass variable defined by \(m_{\mathrm{T}} = \sqrt{2 p_{\mathrm{T}} ^{\mu } E_{\mathrm{T}}^{\mathrm{miss}} (1-\cos (\Delta \phi (\mu ,E_{\mathrm{T}}^{\mathrm{miss}})))}\). The invariant mass of the two leading non-b-tagged jets, \(m_{\mathrm{jj}}\), forms a hadronic W candidate, while the invariant masses of each of the two b-tagged jets and these two non-b-tagged jets form two hadronic top quark candidates, \(m_{\mathrm{jj}b}\).

Figure 36 compares the data with MC simulation for these three variables; \(m_{\mathrm{T}}, m_{\mathrm{jj}}\) and \(m_{\mathrm{jj}b}\). The MC simulation describes the data very well in all three distributions. Figure 37 shows the \(m_{\mathrm{jj}}\) distribution for particle flow jets compared to the distribution obtained from the same selection applied to calorimeter jets (with \(|\mathrm{JVF}|>0.25\)). For the calorimeter jet selection, the \(E_{\mathrm{T}}^{\mathrm{miss}}\) is reconstructed from the muon, jets, photons and remaining unassociated clusters [53]. The two selections are applied separately; hence the exact numbers of events in the plots differ. The particle flow reconstruction provides a good measure and narrower width of the peak for both low and high \(\,p_{\mathrm{T}} ^{\mathrm{jj}}\). Gaussian fits to the data in the range \(65< m_{\mathrm{jj}} < 95\,\mathrm{GeV}\) give widths of \((13.8 \pm 0.4) \mathrm{GeV}\) and \((16.2 \pm 0.6) \mathrm{GeV}\) for particle flow reconstruction and that based on calorimeter jets, respectively, for \(\,p_{\mathrm{T}} ^{\mathrm{jj}} < 80\,\mathrm{GeV}\). For \(\,p_{\mathrm{T}} ^{\mathrm{jj}} > 80\,\mathrm{GeV}\), the widths were found to be \((11.2 \pm 0.2)\,\mathrm{GeV}\) and \((11.9 \pm 0.3)\,\mathrm{GeV}\), respectively. At very high values of \(\,p_{\mathrm{T}} ^{W}\), the gains would further diminish (see Fig. 26).

Fig. 36
figure 36

Comparison of the distributions of mass variables computed with particle flow jets between collision data and the MC simulation for a \(t{\bar{t}}\) event selection. The darker shaded bands and the errors on the collision data show the statistical uncertainties

Fig. 37
figure 37

Comparison between the \(m_\mathrm {jj}\) distributions measured using particle flow jets and calorimeter jets with a JVF selection in data. The sample is split into those events where the reconstructed W candidate has \(\,p_{\mathrm{T}} ^{\mathrm{jj}} < 80\,\mathrm{GeV}\) and \(\,p_{\mathrm{T}} ^{\mathrm{jj}} > 80\,\mathrm{GeV}\). The errors shown are purely statistical

12 Conclusions

The particle flow algorithm used by the ATLAS Collaboration for 20.2 fb\(^{-1}\) of pp collisions at 8 TeV at the LHC is presented. This algorithm aims to accurately subtract energy deposited by tracks in the calorimeter, exploiting the good calorimeter granularity and longitudinal segmentation. Use of particle flow leads to improved energy and angular resolution of jets compared to techniques that only use the calorimeter in the central region of the detector.

In 2012 data-taking conditions, the transverse momentum resolution of particle flow jets calibrated with a global sequential correction is superior up to \(\,p_{\mathrm{T}} \sim 90\,\mathrm{GeV}\) for \(\,|{\eta }|<1.0\). For a representative jet \(\,p_{\mathrm{T}}^{\mathrm{true}} \) of 30 GeV, the resolution is improved from the 17.5% resolution of calorimeter jets with local cluster weighting calibration to 14%. Jet angular resolutions are improved across the entire \(p_{\mathrm{T}}\) spectrum, with \(\sigma (\eta )\) and \(\sigma (\phi )\) decreasing from 0.03 to 0.02 and 0.05 to 0.02, respectively, for a jet \(p_{\mathrm{T}}\) of 30 GeV.

Rejection of charged particles from pile-up interactions in jet reconstruction leads to substantially better jet resolution and to the suppression of jets due to pile-up interactions by an order of magnitude within the tracker acceptance, with negligible inefficiency for jets from the hard-scatter interaction. This outperforms a purely track-based jet pile-up discriminant typically used in ATLAS analyses, which achieves similar pile-up suppression at the cost of about one percent in hard-scatter jet efficiency.

The algorithm therefore achieves a better performance for hadronic observables such as reconstructed resonant particle masses.

Studies which compare data with MC simulation demonstrate that jet properties used for energy measurement and calibration are modelled well by the ATLAS simulation, both before and after application of the particle flow algorithm. This translates to good agreement between data and simulation for derived physics observables, such as invariant masses of combinations of jets.

The algorithm has been integrated into the ATLAS software framework for Run 2 of the LHC. As demonstrated, it is robust against pile-up and should therefore perform well under the conditions encountered in Run 2.