1 Introduction

Hadronic jets are observed in large momentum-transfer interactions. They are theoretically interpreted to arise when partons—quarks (q) and gluons (g)—are emitted in collision events of subatomic particles. Partons then evolve into hadronic jets in a two-step process. The first can be described by perturbation theory and gives rise to a parton shower, the second is non-perturbative and is responsible for the hadronisation. The internal structure of a jet is expected to depend primarily on the type of parton it originated from, with some residual dependence on the quark production and fragmentation process. For instance, due to the different colour factors in ggg and qqg vertices, gluons lead to more parton radiation and therefore gluon-initiated jets are expected to be broader than quark-initiated jets.

For jets defined using cone or k t algorithms [1, 2], jet shapes, i.e. the normalised transverse momentum flow as a function of the distance to the jet axis [3], have been traditionally used as a means of understanding the evolution of partons into hadrons in e + e , ep and hadron colliders [411]. It is experimentally observed that jets in e + e and ep are narrower than those observed in \(p\bar{p}\) and pp collisions and this is interpreted as a result of the different admixtures of quark and gluon jets present in these different types of interactions [12]. Furthermore, at high momentum transfer, where fragmentation effects are less relevant, jet shapes have been found to be in qualitative agreement with next-to-leading-order (NLO) QCD predictions and in quantitative agreement with those including leading logarithm corrections [13]. Jet shapes have also been proposed as a tool for studies of substructure or in searches for new phenomena in final states with highly boosted particles [1417].

Due to the mass of the b-quark, jets originating from a b-quark (hereafter called b-jets) are expected to be broader than light-quark jets, including charm jets, hereafter called light jets. This expectation is supported by observations by the CDF collaboration in Ref. [18], where a comparison is presented between jet shapes in a b-jet enriched sample with a purity of roughly 25 % and an inclusive sample where no distinction is made between the flavours.

This paper presents the first measurement of b-jet shapes in top pair events. The \(t\bar{t}\) final states are a source of b-jets, as the top quark decays almost exclusively via tWb. While the dilepton channel, where both W bosons decay to leptons, is a very pure source of b-jets, the single-lepton channel contains b-jets and light jets, the latter originating from the dominant \(W^{+} \rightarrow u\bar{d}, c\bar{s}\) decays and their charge conjugates. A comparison of the light- and b-jet shapes measured in the \(t\bar {t}\) decays improves the CDF measurement discussed above, as the jet purity achieved using \(t\bar{t}\) events is much higher. In addition, these measurements could be used to improve the modelling of jets in \(t\bar{t}\) production Monte Carlo (MC) models in a new kinematic regime.

This paper is organised as follows. In Sect. 2 the ATLAS detector is described, while Sect. 3 is dedicated to the MC samples used in the analysis. In Sects. 4 and 5, the physics object and event selection for both the dilepton and single-lepton \(t\bar{t}\) samples is presented. Section 6 is devoted to the description of both the b-jet and light-jet samples obtained in the single-lepton final state. The differential and the integrated shape distributions of these jets are derived in Sect. 7. In Sect. 8 the results on the average values of the jet shape variables at the detector level are presented, including those for the b-jets in the dilepton channel. Results corrected for detector effects are presented in Sect. 9. In Sect. 10 the systematic uncertainties are discussed, and Sect. 11 contains a discussion of the results. Finally, Sect. 12 includes the summary and conclusions.

2 The ATLAS detector

The ATLAS detector [19] is a multi-purpose particle physics detector with a forward-backward symmetric cylindrical geometryFootnote 1 and a solid angle coverage of almost 4π.

The inner tracking system covers the pseudorapidity range |η|<2.5, and consists of a silicon pixel detector, a silicon microstrip detector, and, for |η|<2.0, a transition radiation tracker. The inner detector (ID) is surrounded by a thin superconducting solenoid providing a 2 T magnetic field along the beam direction. A high-granularity liquid-argon sampling electromagnetic calorimeter covers the region |η|<3.2. An iron/scintillator tile hadronic calorimeter provides coverage in the range |η|<1.7. The endcap and forward regions, spanning 1.5<|η|<4.9, are instrumented with liquid-argon calorimeters for electromagnetic and hadronic measurements. The muon spectrometer surrounds the calorimeters. It consists of three large air-core superconducting toroid systems and separate trigger and high-precision tracking chambers providing accurate muon tracking for |η|<2.7.

The trigger system [20] has three consecutive levels: level 1 (L1), level 2 (L2) and the event filter (EF). The L1 triggers are hardware-based and use coarse detector information to identify regions of interest, whereas the L2 triggers are based on fast software-based online data reconstruction algorithms. Finally, the EF triggers use offline data reconstruction algorithms. For this analysis, the relevant triggers select events with at least one electron or muon.

3 Monte Carlo samples

Monte Carlo generators are used in which \(t\bar{t}\) production is implemented with matrix elements calculated up to NLO accuracy. The generated events are then passed through a detailed Geant4 simulation [21, 22] of the ATLAS detector. The baseline MC samples used here are produced with the MC@NLO [23] or Powheg [24] generators for the matrix element calculation; the parton shower and hadronisation processes are implemented with Herwig [25] using the cluster hadronisation model [26] and CTEQ6.6 [27] parton distribution functions (PDFs). Multi-parton interactions are simulated using Jimmy [28] with the AUET1 tune [29]. This MC generator package has been used for the description of the \(t\bar{t}\) final states for ATLAS measurements of the cross section [30, 31] and studies of the kinematics [32].

Additional MC samples are used to check the hadronisation model dependence of the jet shapes. They are based on Powheg+Pythia [24, 33], with the MRST2007LO* PDFs [34]. The AcerMC generator [35] interfaced to Pythia with the Perugia 2010 tune [36] for parton showering and hadronisation is also used for comparison. Here the parton showers are ordered by transverse momentum and the hadronisation proceeds through the Lund string fragmentation scheme [37]. The underlying event and other soft effects are simulated by Pythia with the AMBT1 tune [38]. Comparisons of different event generators show that jet shapes in top-quark decays show little sensitivity to initial-state radiation effects, different PDF choices or underlying-event effects. They are more sensitive to details of the parton shower and the fragmentation scheme.

Samples of events including W and Z bosons produced in association with light- and heavy-flavour jets are generated using the Alpgen [39] generator with the CTEQ6L PDFs [40], and interfaced with Herwig and Jimmy. The same generator is used for the diboson backgrounds, WW, WZ and ZZ, while MC@NLO is used for the simulation of the single-top backgrounds, including the t- and s-channels as well as the Wt-channel.

The MC-simulated samples are normalised to the corresponding cross sections. The \(t\bar{t}\) signal is normalised to the cross section calculated at approximate next-to-next-to-leading order (NNLO) using the Hathor package [41], while for the single-top production cross section, the calculations in Refs. [4244] are used. The W+jets and Z+jets cross sections are taken from Alpgen [39] with additional NNLO K-factors as given in Ref. [45].

The simulated events are weighted such that the distribution of the number of interactions per bunch crossing in the simulated samples matches that of the data. Finally, additional correction factors are applied to take into account the different object efficiencies in data and simulation. The scale factors used for these corrections typically differ from unity by 1 % for electrons and muons, and by a few percent for b-tagging.

4 Physics object selection

Electron candidates are reconstructed from energy deposits in the calorimeter that are associated with tracks reconstructed in the ID. The candidates must pass a tight selection [46], which uses calorimeter and tracking variables as well as transition radiation for |η|<2.0, and are required to have transverse momentum p T>25 GeV and |η|<2.47. Electrons in the transition region between the barrel and endcap calorimeters, 1.37<|η|<1.52, are not considered.

Muon candidates are reconstructed by searching for track segments in different layers of the muon spectrometer. These segments are combined and matched with tracks found in the ID. The candidates are refitted using the complete track information from both detector systems and are required to have a good fit and to satisfy p T>20 GeV and |η|<2.5.

Electron and muon candidates are required to be isolated to reduce backgrounds arising from jets and to suppress the selection of leptons from heavy-flavour semileptonic decays. For electron candidates, the transverse energy deposited in the calorimeter and which is not associated with the electron itself (\(E^{\mathrm{iso}}_{\mathrm{T}}\)) is summed in a cone in ηϕ space of radiusFootnote 2 ΔR=0.2 around the electron. The \(E^{\mathrm{iso}}_{\mathrm{T}}\) value is required to be less than 3.5 GeV. For muon candidates, both the corresponding calorimeter isolation \(E^{\mathrm{iso}}_{\mathrm{T}}\) and the analogous track isolation transverse momentum (\(p^{\mathrm{iso}}_{\mathrm{T}}\)) must be less than 4 GeV in a cone of ΔR=0.3. The track isolation is calculated from the scalar sum of the transverse momenta of tracks with p T>1 GeV, excluding the muon.

Muon candidates arising from cosmic rays are rejected by removing candidate pairs that are back-to-back in the transverse plane and that have transverse impact parameter relative to the beam axis |d 0|>0.5 mm.

Jets are reconstructed with the anti-k t algorithm [47, 48] with radius parameter R=0.4. This choice for the radius has been used in measurements of the top-quark mass [49] and also in multi-jet cross-section measurements [50]. The inputs to the jet algorithm are topological clusters of calorimeter cells. These clusters are seeded by calorimeter cells with energy |E cell|>4σ, where σ is the cell-by-cell RMS of the noise (electronics plus pileup). Neighbouring cells are added if |E cell|>2σ and clusters are formed through an iterative procedure [51]. In a final step, all remaining neighbouring cells are added to the cluster.

The baseline calibration for these clusters calculates their energy using the electromagnetic energy scale [54]. This is established using test-beam measurements for electrons and muons in the electromagnetic and hadronic calorimeters [5153]. Effects due to the differing response to electromagnetic and hadronic showers, energy losses in the dead material, shower leakage, as well as inefficiencies in energy clustering and jet reconstruction are also taken into account. This is done by matching calorimeter jets with MC particle jets in bins of η and E, and supplemented by in situ calibration methods such as jet momentum imbalance in Z/γ +1 jet events. This is called the Jet Energy Scale (JES) calibration, thoroughly discussed in Ref. [54]. The JES uncertainty contains an extra term for b-quark jets, as the jet response is different for b-jets and light jets because they have different particle composition. References [50] and [55] contain more details on the JES and a discussion of its uncertainties.

Jets that overlap with a selected electron are removed if they are closer than ΔR=0.2, while if a jet is closer than ΔR=0.4 to a muon, the muon is removed.

The primary vertex is defined as the pp interaction vertex with the largest \(\sum_{i} p_{\mathrm{T}i}^{2}\), where the sum runs over the tracks with p T>150 MeV associated with the vertex.

Jets are identified as candidates for having originated from a b-quark (b-tagged) by an algorithm based on a neural-network approach, as discussed in Sect. 6.

The reconstruction of the direction and magnitude (\(E_{\mathrm {T}}^{\mathrm{miss}}\)) of the missing transverse momentum is described in Ref. [56] and begins with the vector sum of the transverse momenta of all jets with p T>20 GeV and |η|<4.5. The transverse momenta of electron candidates are added. The contributions from all muon candidates and from all calorimeter clusters not belonging to a reconstructed object are also included.

5 Event selection

Two samples of events are selected: a dilepton sample, where both W bosons decay to leptons (e, μ, including leptonic τ decays), and a single-lepton sample, where one W boson decays to leptons and the other to a \(q\bar{q}'\) pair, giving rise to two more jets (see Fig. 1). The selection criteria follow those in Ref. [30] for the single-lepton sample and Ref. [31] for the dilepton sample. Events are triggered by inclusive high-p T electron or muon EF triggers. The trigger thresholds are 18 GeV for muons and 20 GeV for electrons. The dataset used for the analysis corresponds to the first half of the data collected in 2011, with a centre-of-mass energy \(\sqrt{s} = 7 \ \mbox{TeV}\) and an integrated luminosity of 1.8 fb−1. This data-taking period is characterised by an instantaneous luminosity smaller than 1.5×1033 cm−2 s−1, for which the mean number of interactions per bunch crossing is less than six. To reject the non-collision background, the primary vertex is required to have at least four tracks, each with p T>150 MeV, associated with it. Pile-up effects are therefore small and have been taken into account as a systematic uncertainty.

Fig. 1
figure 1

Example LO Feynman diagrams for \(gg\rightarrow t\bar{t}\) in the dilepton (left) and single-lepton (right) decay modes

5.1 Dilepton sample

In the dilepton sample, events are required to have two charged leptons and \(E_{\mathrm{T}}^{\mathrm{miss}}\) from the leptonic W-boson decays to a neutrino and an electron or muon. The offline lepton selection requires two isolated leptons (e or μ) with opposite charge and with transverse momenta p T(e)>25 GeV, where p T(e)=E clustersin(θ track), E cluster being the cluster energy and θ track the track polar angle, and p T(μ)>20 GeV. At least one of the selected leptons has to match the corresponding trigger object.

Events are further filtered by requiring at least two jets with p T>25 GeV and |η|<2.5 in the event. In addition, at least one of the selected jets has to be tagged as a b-jet, as discussed in the next section. The whole event is rejected if a jet is identified as an out-of-time signal or as noise in the calorimeter.

The missing transverse momentum requirement is \(E_{\mathrm {T}}^{\mathrm{miss}} > 60 \ \mbox{GeV}\) for the ee and μμ channels. For the channel, H T is required to be greater than 130 GeV, where H T is the scalar sum of the p T of all muons, electrons and jets. To reject the Drell–Yan lepton pair background in the ee and μμ channels, the lepton pair is required to have an invariant mass m ℓℓ greater than 15 GeV and to lie outside of a Z-boson mass window, rejecting all events where the two-lepton invariant mass satisfies |m ℓℓ m Z |<10 GeV.

The selected sample consists of 95 % \(t\bar{t}\) events, but also backgrounds from the final states W+jets and Z+jets, where the gauge bosons decay to leptons. All backgrounds, with the exception of multi-jet production, have been estimated using MC samples. The multi-jet background has been estimated using the jet–electron method [60]. This method relies on the identification of jets which, due to their high electromagnetic energy fraction, can fake electron candidates. The jet–electron method is applied with some modifications to the muon channel as well. The normalisation is estimated using a binned likelihood fit to the \(E_{\mathrm{T}}^{\mathrm{miss}} \) distribution. The results are summarised in Table 1.

Table 1 The expected composition of the dilepton sample. Fractions are relative to the total number of expected events. ‘Other EW’ corresponds to the W+jets and diboson (WW, WZ and ZZ) contributions

5.2 Single-lepton sample

In this case, the event is required to have exactly one isolated lepton with p T>25 GeV for electrons and p T>20 GeV for muons. To account for the neutrino in the leptonic W decay, \(E_{\mathrm {T}}^{\mathrm{miss}}\) is required to be greater than 35 GeV in the electron channel and greater than 20 GeV in the muon channel. The \(E_{\mathrm {T}}^{\mathrm{miss}}\) resolution is below 10 GeV [56]. Furthermore, the transverse massFootnote 3 (m T) is required to be greater than 25 GeV in the e-channel and to satisfy the condition \(E_{\mathrm{T}}^{\mathrm{miss}}+m_{\mathrm{T}} > 60 \ \mbox{GeV}\) in the μ-channel.

The jet selection requires at least four jets (p T>25 GeV and |η|<2.5) in the final state, and at least one of them has to be tagged as a b-jet. The fraction of \(t\bar{t}\) events in the sample is 77 %; the main background contributions for the single-lepton channel have been studied as in the previous case, and are summarised in Table 2. As in the dileptonic case, the multi-jet background has been estimated using the jet–electron method.

Table 2 The expected composition of the single-lepton sample. Fractions are relative to the total number of expected events. In this case ‘Other EW’ includes Z+jets and diboson processes

6 Jet sample definition

Jets reconstructed in the single-lepton and dilepton samples are now subdivided into b-jet and light-jet samples. In order to avoid contributions from non-primary collisions, it is required that the jet vertex fraction (JVF) be greater than 0.75. After summing the scalar p T of all tracks in a jet, the JVF is defined as the fraction of the total scalar p T that belongs to tracks originating from the primary vertex. This makes the average jet multiplicity independent of the number of pp interaction vertices. This selection is not applied to jets with no associated tracks. Also, to reduce the impact of pileup on the jets, the p T threshold has been raised to 30 GeV.

Jets whose axes are closer than ΔR=0.8, which is twice the jet radius, to some other jet in the event are not considered. This is done to avoid possible overlaps between the jet cones, which would bias the shape measurement. These configurations are typical in boosted W bosons, leading to light jets which are not well separated. The resulting ΔR distributions for any pair of b-jets or light jets are approximately constant between 0.8 and π and exhibit an exponential fall-off between π and the endpoint of the distribution.

6.1 b-jet samples

To select b-jets, a neural-network algorithm, which relies on the reconstruction of secondary vertices and impact parameter information in the three spatial dimensions, is used. The reconstruction of the secondary decay vertices makes use of an iterative Kalman-filter algorithm [61] which relies on the hypothesis that the bcX decay chains lie in a straight line originally taken to be parallel to the jet axis. The working point of the algorithm is chosen to maximise the purity of the sample. It corresponds to a b-tagging efficiency of 57 % for jets originating from b-quarks in simulated \(t\bar{t}\) events, and a u,d,s-quark jet rejection factor of about 400, as well as a c-jet rejection factor of about 10 [62, 63]. The resulting number of b-jets selected in the dilepton (single-lepton) sample is 2279 (16735). A second working point with a b-tagging efficiency of 70 % is also used in order to evaluate the dependence of the measured jet shapes on b-tagging.

Figure 2 shows the b-tagged jet transverse momentum distributions for the single-lepton and dilepton channels. The p T distributions for the b-jets in both the dilepton and single-lepton samples show a similar behaviour, since they come mainly from top-quark decays. This is well described by the MC expectations from the MC@NLO generator coupled to Herwig. In the dilepton sample the signal-to-background ratio is found to be greater than in the single-lepton sample, as it is quantitatively shown in Tables 1 and 2.

Fig. 2
figure 2

The p T distributions for b-tagged jets in the single-lepton (top) and dilepton (bottom) samples along with the sample composition expectations

6.2 Light-quark jet sample

The hadronic decays \(W\rightarrow q\bar{q}'\) are a clean source of light-quark jets, as gluons and b-jets are highly suppressed; the former because gluons would originate in radiative corrections of order \(\mathcal{O}(\alpha_{s})\), and the latter because of the smallness of the CKM matrix elements |V ub | and |V cb |. To select the light-jet sample, the jet pair in the event which has the invariant mass closest to the W-boson mass is selected. Both jets are also required to be non-tagged by the b-tagging algorithm. The number of jets satisfying these criteria is 7158. Figure 3 shows the transverse momentum distribution of these jets together with the invariant mass of the dijet system. As expected, the p T distribution of the light jets from W-boson decays exhibits a stronger fall-off than that for the b-jets. This dependence is again well described by the MC simulations in the jet p T region used in this analysis. Agreement between the invariant mass distributions for observed and simulated events is good, in particular in the region close to the W-boson mass.

Fig. 3
figure 3

The distribution of light-jet p T (top) and of the invariant mass of light-jet pairs (bottom) along with the sample composition expectations. The latter shows a peak at the W mass, whose width is determined by the dijet mass resolution

6.3 Jet purities

To estimate the actual number of b-jets and light jets in each of the samples, the MC simulation is used by analysing the information at generator level. For b-jets, a matching to a b-hadron is performed within a radius ΔR=0.3. For light jets, the jet is required not to have a b-hadron within ΔR=0.3 of the jet axis. Additionally, to distinguish light quarks and c-quarks from gluons, the MC parton with highest p T within the cone of the reconstructed jet is required to be a (u,d,c or s)-quark. The purity p is then defined as

$$\begin{aligned} p = \sum_{k}\alpha_{k}p_{k}; \quad p_{k} = 1-\frac{N_{\mathrm {f}}^{(k)}}{N_{\mathrm{T}}^{(k)}} \end{aligned}$$
(1)

where α k is the fraction of events in the k-th MC sample (signal or background), given in Tables 1 and 2 and \(N_{\mathrm{f}}^{(k)}\), \(N_{\mathrm{T}}^{(k)}\) are the number of fakes (jets not assigned to the correct flavour, e.g. charm jets in the b-jet sample), and the total number of jets in a given sample, respectively. The purity in the multi-jet background is determined using Pythia MC samples.

In the single-lepton channel, the resulting purity of the b-jet sample is \(p^{(\mathrm{s})}_{b}=(88.5 \pm5.7)~\%\), while the purity of the light-jet sample is found to be \(p^{(\mathrm{s})}_{\mathrm{l}}= (66.2 \pm4.1)~\%\), as shown in Table 3. The uncertainty on the purity arises from the uncertainties on the signal and background fractions in each sample. The charm content in the light-jet sample is found to be 16 %, with the remaining 50 % ascribed to u,d and s.

Table 3 Purity estimation for b-jets and light jets in the single-lepton channel. The uncertainty on the purity arises from the uncertainties in the signal and background fractions

MC studies indicate that the contamination of the b-jet sample is dominated by charm-jet fakes and that the gluon contamination is about 0.7 %. For the light-jet sample, the fraction of gluon fakes amounts to 19 %, while the b-jet fakes correspond to 15 %.

In the dilepton channel, a similar calculation yields the purity of the b-jet sample to be \(p^{(\mathrm{d})}_{b} = (99.3^{+0.7}_{-6.5})~\%\) as shown in Table 4. Thus, the b-jet sample purity achieved using \(t\bar{t}\) final states is much higher than that obtained in inclusive b-jet measurements at the Tevatron [18] or the LHC [55].

Table 4 Purity estimation for b-jets in the dilepton channel. The uncertainty on the purity arises from the uncertainties in the signal and background fractions

7 Jet shapes in the single-lepton channel

For the jet shape calculation, locally calibrated topological clusters are used [54, 57, 58]. In this procedure, effects due to calorimeter response, leakage, and losses in the dead material upstream of the calorimeter are taken into account separately for electromagnetic and hadronic clusters [59].

The differential jet shape ρ(r) in an annulus of inner radius rΔr/2 and outer radius r+Δr/2 from the axis of a given jet is defined as

$$\begin{aligned} \rho(r) = \frac{1}{\varDelta r}\frac{p_{\mathrm{T}}(r-\varDelta r/2,r+\varDelta r/2)}{p_{\mathrm{T}}(0,R)} \end{aligned}$$
(2)

Here, Δr=0.04 is the width of the annulus; r, such that Δr/2≤rRΔr/2, is the distance to the jet axis in the η-ϕ plane, and p T(r 1,r 2) is the scalar sum of the p T of the jet constituents with radii between r 1 and r 2.

Some distributions of ρ(r) are shown in Fig. 4 for the b-jet sample selected in the single-lepton channel. There is a marked peak at zero energy deposit, which indicates that energy is concentrated around relatively few particles. As r increases, the distributions of ρ(r) are concentrated at smaller values because of the relatively low energy density at the periphery of the jets. Both effects are well reproduced by the MC generators.

Fig. 4
figure 4

Distribution of R=0.4 b-jets in the single-lepton channel as a function of the differential jet shapes ρ(r) for different values of r

The analogous ρ(r) distributions for light jets are shown in Fig. 5. The gross features are similar to those previously discussed for b-jets, but for small values of r, the ρ(r) distributions for light jets are somewhat flatter than those for b-jets.

Fig. 5
figure 5

Distribution of R=0.4 light jets in the single-lepton channel as a function of the differential jet shapes ρ(r) for different values of r

The integrated jet shape in a cone of radius rR around the jet axis is defined as the cumulative distribution for ρ(r), i.e.

$$\begin{aligned} \varPsi(r) = \frac{p_{\mathrm{T}}(0,r)}{p_{\mathrm{T}}(0,R)}; \quad r \leq R \end{aligned}$$
(3)

which satisfies Ψ(r=R)=1. Figure 6 (Fig. 7) shows distributions of the integrated jet shapes for b-jets (light jets) in the single-lepton sample. These figures show the inclusive (i.e. not binned in either η or p T) ρ(r) and Ψ(r) distributions for fixed values of r. Jet shapes are only mildly dependent on pseudorapidity, while they strongly depend on the transverse momentum. This behaviour has been verified in previous analyses [511]. This is illustrated in Figs. 8 and 9, which show the energy fraction in the outer half of the cone as a function of p T and |η|. For this reason, all the data presented in the following are binned in five p T regions with p T<150 GeV, where the statistical uncertainty is small enough. In the following, only the average values of these distributions are presented:

$$\begin{aligned} &\bigl\langle \rho(r)\bigr\rangle = \frac{1}{\varDelta r}\frac {1}{N_{\mathrm {jets}}}\sum _{\mathrm{jets}}\frac{p_{\mathrm{T}}(r-\varDelta r/2,r+\varDelta r/2)}{p_{\mathrm{T}}(0,R)} \end{aligned}$$
(4)
$$\begin{aligned} &\bigl\langle \varPsi(r)\bigr\rangle = \frac{1}{N_{\mathrm {jets}}}\sum _{\mathrm {jets}}\frac{p_{\mathrm{T}}(0,r)}{p_{\mathrm{T}}(0,R)} \end{aligned}$$
(5)

where the sum is performed over all jets of a given sample, light jets (l) or b-jets (b) and N jets is the number of jets in the sample.

Fig. 6
figure 6

Distribution of R=0.4 b-jets in the single-lepton channel as a function of the integrated jet shapes Ψ(r) for different values of r

Fig. 7
figure 7

Distribution of R=0.4 light jets in the single-lepton channel as a function of the integrated jet shapes Ψ(r) for different values of r

Fig. 8
figure 8

Dependence of the b-jet (top) and light-jet (bottom) shapes on the jet transverse momentum. This dependence is quantified by plotting the mean value 〈1−Ψ(r=0.2)〉 (the fraction of energy in the outer half of the jet cone) as a function of p T for jets in the single-lepton sample

Fig. 9
figure 9

Dependence of the b-jet (top) and light-jet (bottom) shape on the jet pseudorapidity. This dependence is quantified by plotting the mean value 〈1−Ψ(r=0.2)〉 (the fraction of energy in the outer half of the jet cone) as a function of |η| for jets in the single-lepton sample

8 Results at the detector level

In the following, the detector-level results for the average values 〈ρ(r)〉 and 〈Ψ(r)〉 as a function of the jet internal radius r, are presented. A comparison has been made between b-jet shapes obtained in both the dilepton and single-lepton samples, and it is found that they are consistent with each other within the uncertainties. Thus the samples are merged. In Fig. 10, the distributions for the average values of the differential jet shapes are shown for each p T bin, along with a comparison with the expectations from the simulated samples described in Sect. 3. There is a small but clear difference between light- and b-jet differential shapes, the former lying above (below) the latter for smaller (larger) values of r. These differences are more visible at low transverse momentum. In Fig. 11, the average integrated jet shapes 〈Ψ(r)〉 are shown for both the light jets and b-jets, and compared to the MC expectations discussed earlier. Similar comments apply here: The values of 〈Ψ(r)〉 are consistently larger for light jets than for b-jets for small values of r, while they tend to merge as rR since, by definition, Ψ(R)=1.

Fig. 10
figure 10

Average values of the differential jet shapes 〈ρ(r)〉 for light jets (triangles) and b-jets (squares), with Δr=0.04, as a function of r at the detector level, compared to MC@NLO+Herwig and Powheg+Pythia event generators. The uncertainties shown for data are only statistical

Fig. 11
figure 11

Average values of the integrated jet shapes 〈Ψ(r)〉 for light jets (triangles) and b-jets (squares), with Δr=0.04, as a function of r at the detector level, compared to MC@NLO+Herwig and Powheg+Pythia event generators. The uncertainties shown for data are only statistical

9 Unfolding to particle level

In order to correct the data for acceptance and detector effects, thus enabling comparisons with different models and other experiments, an unfolding procedure is followed. The method used to correct the measurements based on topological clusters to the particle level relies on a bin-by-bin correction. Correction factors F(r) are calculated separately for differential, 〈ρ(r)〉, and integrated, 〈Ψ(r)〉, jet shapes in both the light- and b-jet samples. For differential (ρ) and integrated jet shapes (Ψ), they are defined as the ratio of the particle-level quantity to the detector-level quantity as described by the MC simulations discussed in Sect. 3, i.e.

$$\begin{aligned} &F^{\rho}_{\mathrm{l},b}(r) = \frac{\langle\rho(r)_{\mathrm {l},b}\rangle _{\mathrm{MC,part}}}{\langle\rho(r)_{\mathrm{l},b}\rangle _{\mathrm {MC,det}}} \end{aligned}$$
(6)
$$\begin{aligned} &F^{\varPsi}_{\mathrm{l},b}(r) = \frac{\langle\varPsi(r)_{\mathrm {l},b}\rangle _{\mathrm{MC,part}}}{\langle\varPsi(r)_{\mathrm{l},b}\rangle _{\mathrm{MC,det}}} \end{aligned}$$
(7)

While the detector-level MC includes the background sources described before, the particle-level jets are built using all particles in the signal sample with an average lifetime above 10−11 s, excluding muons and neutrinos. The results have only a small sensitivity to the inclusion or not of muons and neutrinos, as well as to the background estimation. For particle-level b-jets, a b-hadron with p T>5 GeV is required to be closer than ΔR=0.3 from the jet axis, while for light jets, a selection equivalent to that for the detector-level jets is applied, selecting the non-b-jet pair with invariant mass closest to m W . The same kinematic selection criteria are applied to these particle-level jets as for the reconstructed jets, namely p T>25 GeV, |η|<2.5 and ΔR>0.8 to avoid jet–jet overlaps.

A Bayesian iterative unfolding approach [64] is used as a cross-check. The RooUnfold software [65] is used by providing the jet-by-jet information on the jet shapes, in the p T intervals defined above. This method takes into account bin-by-bin migrations in the ρ(r) and Ψ(r) distributions for fixed values of r. The results of the bin-by-bin and the Bayesian unfolding procedures agree at the 2 % level.

As an additional check of the stability of the unfolding procedure, the directly unfolded integrated jet shapes are compared with those obtained from integrating the unfolded differential distributions. The results agree to better than 1 %. These results are reassuring since the differential and integrated jet shapes are subject to migration and resolution effects in different ways. Both quantities are also subject to bin-to-bin correlations. For the differential measurement, the correlations arise from the common normalisation. They increase with the jet transverse momentum, varying from 25 % to 50 % at their maximum, which is reached for neighbouring bins at low r. The correlations for the integrated measurement are greater and their maximum varies from 60 % to 75 % as the jet p T increases.

10 Systematic uncertainties

The main sources of systematic uncertainty are described below.

  • The energy of individual clusters inside the jet is varied according to studies using isolated tracks [67], parameterising the uncertainty on the calorimeter energy measurements as a function of the cluster p T. The impact on the differential jet shape increases from 2 % to 10 % as the edge of the jet cone is approached.

  • The coordinates η, ϕ of the clusters are smeared using a Gaussian distribution with an RMS width of 5 mrad accounting for small differences in the cluster position between data and Monte Carlo [66]. This smearing has an effect on the jet shape which is smaller than 2 %.

  • An uncertainty arising from the amount of passive material in the detector is derived using the algorithm described in Ref. [66] as a result of the studies carried out in Ref. [67]. Low-energy clusters (E<2.5 GeV) are removed from the reconstruction according to a probability function given by \(\mathcal{P}(E=0)\times\mathrm{e}^{-2E}\), where \(\mathcal{P}(E = 0)\) is the measured probability (28 %) of a charged particle track to be associated with a zero energy deposit in the calorimeter and E is the cluster energy in GeV. As a result, approximately 6 % of the total number of clusters are discarded. The impact of this cluster-removing algorithm on the measured jet shapes is smaller than 2 %.

  • As a further cross-check an unfolding of the track-based jet shapes to the particle level has also been performed. The differences from those obtained using calorimetric measurements are of a similar scale to the ones discussed for the cluster energy, angular smearing and dead material.

  • An uncertainty arising from the jet energy calibration (JES) is taken into account by varying the jet energy scale in the range 2 % to 8 % of the measured value, depending on the jet p T and η. This variation is different for light jets and b-jets since they have a different particle composition.

  • The jet energy resolution is also taken into account by smearing the jet p T using a Gaussian centred at unity and with standard deviation σ r [68]. The impact on the measured jet shapes is about 5 %.

  • The uncertainty due to the JVF requirement is estimated by comparing the jet shapes with and without this requirement. The uncertainty is smaller than 1 %.

  • An uncertainty is also assigned to take pile-up effects into account. This is done by calculating the differences between samples where the number of pp interaction vertices is smaller (larger) than five and the total sample. The impact on the differential jet shapes varies from 2 % to 10 % as r increases.

  • An additional uncertainty due to the unfolding method is determined by comparing the correction factors obtained with three different MC samples, Powheg+Pythia, Powheg+Jimmy and AcerMC [35] with the Perugia 2010 tune [36], to the nominal correction factors from the MC@NLO sample. The uncertainty is defined as the maximum deviation of these three unfolding results, and it varies from 1 % to 8 %.

Additional systematic uncertainties associated with details of the analysis such as the working point of the b-tagging algorithm and the ΔR>0.8 cut between jets, as well as those related to physics object reconstruction efficiencies and variations in the background normalisation are found to be negligible. All sources of systematic uncertainty are propagated through the unfolding procedure. The resulting systematic uncertainties on each differential or integrated shape are added in quadrature. In the case of differential jet shapes, the uncertainty varies from 1 % to 20 % in each p T bin as r increases, while the uncertainty for the integrated shapes decreases from 10 % to 0 % as one approaches the edge of the jet cone, where r=R.

11 Discussion of the results

The results at the particle level are presented, together with the total uncertainties arising from statistical and systematic effects. The averaged differential jet shapes 〈ρ(r)〉 are shown in the even-numbered Figs. 1220 as a function of r and in bins of p T, while numerical results are presented in the odd-numbered Tables 513. The observation made at the detector level in Sect. 8 that b-jets are broader than light jets is strengthened after unfolding because it also corrects the light-jet sample for purity effects. Similarly, the odd-numbered Figs. 1321 show the integrated shapes 〈Ψ(r)〉 as a function of r and in bins of p T for light jets and b-jets. Numerical results are presented in the even-numbered Tables 614. As before, the observation is made that b-jets have a wider energy distribution inside the jet cone than light jets, as it can be seen that 〈Ψ b 〉<〈Ψ l〉 for low p T and small r.

Fig. 12
figure 12

Differential jet shapes 〈ρ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 30 GeV<p T<40 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 13
figure 13

Integrated jet shapes 〈Ψ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 30 GeV<p T<40 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 14
figure 14

Differential jet shapes 〈ρ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 40 GeV<p T<50 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 15
figure 15

Integrated jet shapes 〈Ψ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 40 GeV<p T<50 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 16
figure 16

Differential jet shapes 〈ρ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 50 GeV<p T<70 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 17
figure 17

Integrated jet shapes 〈Ψ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 50 GeV<p T<70 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 18
figure 18

Differential jet shapes 〈ρ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 70 GeV<p T<100 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 19
figure 19

Integrated jet shapes 〈Ψ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 70 GeV<p T<100 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 20
figure 20

Differential jet shapes 〈ρ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 100 GeV<p T<150 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Fig. 21
figure 21

Integrated jet shapes 〈Ψ(r)〉 as a function of the radius r for light jets (triangles) and b-jets (squares). The data are compared to MC@NLO+Herwig and Powheg+Pythia event generators for 100 GeV<p T<150 GeV. The uncertainties shown include statistical and systematic sources, added in quadrature

Table 5 Unfolded values for 〈ρ(r)〉, together with statistical and systematic uncertainties for 30 GeV<p T<40 GeV
Table 6 Unfolded values for 〈Ψ(r)〉, together with statistical and systematic uncertainties for 30 GeV<p T<40 GeV
Table 7 Unfolded values for 〈ρ(r)〉, together with statistical and systematic uncertainties for 40 GeV<p T<50 GeV
Table 8 Unfolded values for 〈Ψ(r)〉, together with statistical and systematic uncertainties for 40 GeV<p T<50 GeV
Table 9 Unfolded values for 〈ρ(r)〉, together with statistical and systematic uncertainties for 50 GeV<p T<70 GeV
Table 10 Unfolded values for 〈Ψ(r)〉, together with statistical and systematic uncertainties for 50 GeV<p T<70 GeV
Table 11 Unfolded values for 〈ρ(r)〉, together with statistical and systematic uncertainties for 70 GeV<p T<100 GeV
Table 12 Unfolded values for 〈Ψ(r)〉, together with statistical and systematic uncertainties for 70 GeV<p T<100 GeV
Table 13 Unfolded values for 〈ρ(r)〉, together with statistical and systematic uncertainties for 100 GeV<p T<150 GeV
Table 14 Unfolded values for 〈Ψ(r)〉, together with statistical and systematic uncertainties for 100 GeV<p T<150 GeV

These observations are in agreement with the MC calculations, where top-quark pair-production cross sections are implemented using matrix elements calculated to NLO accuracy, which are then supplemented by angular- or transverse momentum-ordered parton showers. Within this context, both MC@NLO and Powheg+Pythia give a good description of the data, as illustrated in Figs. 1221.

Comparisons with other MC approaches have been made (see Fig. 22). The Perugia 2011 tune, coupled to Alpgen+Pythia, Powheg+Pythia and AcerMC+Pythia, has been compared to the data, and found to be slightly disfavoured. The AcerMC generator [35] coupled to Pythia for the parton shower and with the Perugia 2010 tune [36] gives a somewhat better description of the data, as does the Alpgen [39] generator coupled to Herwig.

Fig. 22
figure 22

Comparison of the \(t\bar{t}\) differential jet shape data for 50 GeV<p T<70 GeV with several MC event generators. As stated in the text, AcerMC [35] coupled to Pythia [33] with the A Pro and A CR Pro tunes [69, 70] give the best description of the data, while the Perugia 2011 [36] tunes are found to be slightly disfavoured. Alpgen+Jimmy [28, 39] provides an intermediate description.

AcerMC coupled to Tune A Pro [69, 70] is found to give the best description of the data within the tunes investigated. Colour reconnection effects, as implemented in Tune A CR Pro [69, 70] have a small impact on this observable, compared to the systematic uncertainties.

Since jet shapes are dependent on the method chosen to match parton showers to the matrix-element calculations and, to a lesser extent, on the fragmentation and underlying-event modelling, the measurements presented here provide valuable inputs to constrain present and future MC models of colour radiation in \(t\bar{t}\) final states.

MC generators predict jet shapes to depend on the hard scattering process. MC studies were carried out and it was found that inclusive b-jet shapes, obtained from the underlying hard processes \(gg\to b\bar {b}\) and gbgb with gluon splitting \(g\to b\bar{b}\) included in the subsequent parton shower, are wider than those obtained in the \(t\bar{t}\) final states. The differences are interpreted as due to the different colour flows in the two different final states i.e. \(t\bar{t}\) and inclusive multi-jet production. Similar differences are also found for light-jet shapes, with jets generated in inclusive multi-jet samples being wider than those from W-boson decays in top-quark pair-production.

12 Summary

The structure of jets in \(t\bar{t}\) final states has been studied in both the dilepton and single-lepton modes using the ATLAS detector at the LHC. The first sample proves to be a very clean and copious source of b-jets, as the top-quark decays predominantly via tWb. The second is also a clean source of light jets produced in the hadronic decays of one of the W bosons in the final state. The differences between the b-quark and light-quark jets obtained in this environment have been studied in terms of the differential jet shapes ρ(r) and integrated jet shapes Ψ(r). These variables exhibit a marked (mild) dependence on the jet transverse momentum (pseudorapidity).

The results show that the mean value 〈Ψ(r)〉 is smaller for b-jets than for light jets in the region where it is possible to distinguish them, i.e. for low values of the jet internal radius r. This means that b-jets are broader than light-quark jets, and therefore the cores of light jets have a larger energy density than those of b-jets. The jet shapes are well reproduced by current MC generators for both light and b-jets.