1 Introduction

The top quark is the heaviest known fundamental particle and is unique in many respects. In the Standard Model, its large mass derives from a Yukawa coupling to the Higgs boson [1, 2] close to unity. Thus it plays a critical role in the quantum corrections to the electroweak Higgs potential and possible vacuum instability at high energies (see Ref. [3] for a review). Because of its large mass, the top quark has a lifetime shorter than the typical time scale of hadronisation of coloured quarks to hadrons. Hence, the properties of the top quark can be investigated unaffected from non-perturbative effects occuring in hadronic bound states. However, the hadronisation of the quarks and gluons constituting the jets from the decay products of the top quark introduces an unavoidable sensitivity of the measured top-quark mass on non-perturbative effects. The top-quark mass \(m_{t}\), is also an essential parameter in high-precision fits to electroweak observables [4].

The top-quark mass can be determined from decay channels involving hadronic and leptonic decays of the intermediate \(W\) boson. For the recent world-average top-quark mass value [5], the highest precision [615] comes from measurements using the lepton plus jets final state in the decay of top–antitop pairs (\(t\bar{t}\)). This channel has a substantial branching fraction and allows a relatively unambiguous assignment of jets to partons from the \(t\bar{t}\) decay. Such events are selected using the lepton and neutrino from the decay of a \(W\) boson from one member of the top–antitop pair.

Events in which the top–antitop quark pair decays into a fully hadronic final state constitute both the largest branching fraction and a complementary final state for the determination of the top-quark mass. The fully hadronic decay mode has been used in Refs. [10, 11] to measure the top-quark mass from \(t\bar{t}\) pairs. This decay mode is used in this analysis to measure the top-quark mass from \(t\bar{t}\) pairs produced in proton–proton collisions provided by the LHC, and observed by the ATLAS detector. The major background to this final state, with orders of magnitude larger cross section, is multijet production from proton–proton collisions other than \(t\bar{t}\) pairs. Particular experimental attention is required to precisely estimate and control this large background. This analysis employs a data-driven method to form a multijet background prediction. Selected data events are divided into several disjoint regions using two uncorrelated observables, such that \(t\bar{t}\) events accumulate only in one of these regions. The background is derived from the other regions, determining both the shape and normalisation of the background distribution in the signal region.

As the top-quark mass is calculated from the measured energy and momentum of reconstructed jets, an accurate understanding of energy and momentum measurements is essential. The dependence of the measured top-quark mass on the jet energy measurement uncertainty is reduced by exploiting the fact that two of the three jets originate from the \(W\) boson produced in the top-quark decay and that the \(W\)-boson mass is known very precisely. The analysis presented in this paper uses the observable \(R_{3/2}= m_{jjj}/m_{jj}\) to achieve a cancellation of systematic effects common to the masses of the reconstructed top quark (\(m_{jjj}\)) and associated \(W\) boson (\(m_{jj}\)).

2 The ATLAS detector

The ATLAS detector [16] at the LHC covers nearly the entire solid angle around the collision point. The inner detector (ID), which is located closest to the interaction point, provides charged-particle tracking in the range of \(|\eta | < 2.5\) where \(\eta \) is the pseudorapidity.Footnote 1 The ID comprises a high-granularity silicon pixel detector, a silicon microstrip tracker and a transition radiation tracker, and is surrounded by a thin superconducting solenoid providing a magnetic field of \(2\) T. The electromagnetic and hadronic calorimeters are located outside the solenoid and cover the pseudorapidity range \(|\eta | < 4.9\). Within the region \(|\eta |<3.2\), electromagnetic calorimetry is provided by barrel and endcap lead/liquid-argon (LAr) sampling calorimeters. Hadronic energy measurements are provided by a steel/scintillator tile calorimeter in the central region and copper/LAr calorimeters in the endcaps. The forward regions are instrumented with copper/LAr and tungsten/LAr calorimeters, optimised for electromagnetic and hadronic energy measurements, respectively. The calorimeter system is surrounded by a muon spectrometer, comprising separate trigger and high-precision tracking chambers. They measure the deflection of muons in a magnetic field with a field integral up to \(8\) Tm, generated by one barrel and two endcap superconducting air-core toroids.

A three-level trigger system is used. The first-level trigger is implemented in hardware and uses a subset of detector information to reduce the event rate to a design value of at most \(75\) kHz. This is followed by two software-based trigger levels, which together reduce the event rate to a few hundred Hz.

The energy scale and resolution of the electromagnetic and hadronic calorimeter systems [17] as well as the performance of the tracking detector for tagging jets from bottom quarks through the displaced decay vertices of \(b\)-flavoured hadrons [1820] are of major importance for the precision of this measurement. Jet energies measured by the electromagnetic and hadronic calorimeters are adjusted using correction factors, obtained from an in situ calibration [17], which depend on pseudorapidity (\(\eta \)) and transverse momentum (\(p_{\mathrm {T}}\)).

3 Data, simulation, event selection and reconstruction

3.1 Data and simulation

This measurement uses data recorded by the ATLAS detector during 2011 from \(7\mathrm{\,TeV}\) proton–proton collisions corresponding to an integrated luminosity of \(4.6\,\text{ fb }^{-1}\) [21]. Events were generated using Monte Carlo (MC) programs in order to investigate systematic uncertainties, to correct for systematic effects, and to generate template distributions used for fitting the top-quark mass. A fast simulation of the ATLAS detector response, which is based on full simulation of the tracking detectors and on parameterisations for the electromagnetic and hadronic calorimeter showers [22], was applied to the generated events. For systematic studies a smaller sample of events was processed by a full Geant4 [23] simulation of the ATLAS detector [24]. The agreement between parameterised and full simulation was verified in detail, as described in Ref. [22]. The remaining differences are small and accounted for by a systematic uncertainty. All simulated events were subject to the same selection criteria and reconstructed using the same algorithms applied to data. To generate \(t\bar{t}\) events, the MC program Powheg-box [25, 26] was employed, which incorporates a theoretical calculation in next-to-leading-order (NLO) accuracy in the strong coupling \(\alpha _S\), with NLO parton distribution functions (PDFs) CT10 [27]. The generated partons are showered and hadronised by Pythia [28]. Adjustable parameters of Pythia are fixed to the values obtained in the Perugia 2011C (P2011C) tune [29]. Signal events were generated assuming seven different top-quark mass values from \(165.0\) to \(180.0\mathrm{\,GeV}\) in steps of \(2.5\mathrm{\,GeV}\), with the largest sample at \(172.5\mathrm{\,GeV}\). In addition to the hard collisions leading to the \(t\bar{t}\) signal, soft scattering processes between the remnants of the protons can take place. Such processes underlying the signal events are also modelled by Pythia using the tuned parameters from Perugia 2011C. Multiple soft proton–proton collisions can take place between different protons in the same bunch crossing (in-time pile-up) or arise from collisions in preceding or subsequent bunch crossings (out-of-time pile-up) due to the time sensitivity of the detector being longer than the time between bunch crossings. Such multiple inelastic interactions were also generated by Pythia, and are reweighted in the simulation to match the distribution of the number of interactions per bunch crossing measured in the data. This number of interactions ranges from 3 to 17, with an average of \(8.7\).

For studies of systematic uncertainties an additional, large sample of signal events was generated at \(172.5\mathrm{\,GeV}\), using Powheg-box and Pythia with the Perugia 2012 tune.

3.2 Event selection

Table 1 Summary of event selection requirements for signal events

A jet-based trigger is used in which the jets are reconstructed in the online trigger system [30]. This jet reconstruction executes the anti-\(k_t\) jet algorithm [31] with a radius parameter of \(0.4\) using clusters of energy deposition in adjacent calorimeter cells (topological clusters) [32, 33]. At least five jets with a nominal \(p_{\mathrm {T}}\) threshold of \(30\mathrm{\,GeV}\) are required to trigger and record an event.

Events are selected according to the requirements listed in Table 1 and detailed in the following. Only events with a well-reconstructed primary vertex formed by at least five tracks with \(p_{\mathrm {T}}>400\mathrm{\,MeV}\) per track are considered for the analysis, where the primary vertex is the reconstructed vertex with the highest summed \(p_{\mathrm {T}}^2\) of associated tracks. Similar to the online trigger system, jets are reconstructed offline by the anti-\(k_t\) jet algorithm with a radius parameter of \(0.4\) using topological clusters. The jet energies are calibrated following Refs. [3436]. For the parameterised simulation a dedicated jet energy calibration is used which is obtained in the same manner as for the full simulation. To ensure that events selected by the trigger are on the plateau of the efficiency curve, only events which have at least five jets, each with \(p_{\mathrm {T}}>55\mathrm{\,GeV}\), and \(\Delta R > 0.6\) Footnote 2 between every pair of jets with \(p_{\mathrm {T}}>30\mathrm{\,GeV}\) are considered. The measured trigger efficiency of \(90\,\%\) agrees with the expectation from simulation to within 5 %. This remaining difference is considered as a source of systematic uncertainty in Sect. 6.

A signal event is required to have at least six jets. Only jets in the central part of the calorimeter (\(|\eta |<2.5\)) and with \(p_{\mathrm {T}}> 30\mathrm{\,GeV}\) are considered for the \(t\bar{t}\) mass analysis, but for the background determination the sixth leading jet has a looser requirement of \(p_{\mathrm {T}}> 25\mathrm{\,GeV}\). For a jet to be considered, at least 75 % of its summed track \(p_{\mathrm {T}}\) must be due to tracks coming from the primary vertex (jet vertex fraction JVF \(\,>0.75\)). Jets in an event are rejected if an identified electron is closer than \(\Delta R=0.2\).

Events with identified isolated electrons with \(E_{\mathrm {T}}>25\mathrm{\,GeV}\) or muons with \(p_{\mathrm {T}}>20\mathrm{\,GeV}\) are rejected. Details of the lepton identification are given in Refs. [37, 38]. Events are kept for further analysis when at most two of the four leading transverse momentum jets are identified as \(b\)-tagged jets by a neural network trained on decay vertex properties. The neural network provides an identification efficiency of 70 % for jets from \(b\)-quarks, a rejection factor of about 130 for jets arising from light partons, and a factor of about 5 for jets arising from \(c\)-quarks [39]. In the signal region, exactly two of the four leading transverse momentum jets are required to be \(b\)-tagged by the neural network. Events with mismeasured jet energies or with potential leptonic decays that include neutrinos are removed by requiring a missing transverse momentum significance \(E_{\mathrm {T}}^{\mathrm {miss}}[\mathrm{GeV}]/\sqrt{H_{\mathrm {T}}[\mathrm{GeV}]}\) of less than 3. Here \(H_{\mathrm {T}}\) is the scalar sum of the transverse momenta of all selected jets in the event. The \(E_{\mathrm {T}}^{\mathrm {miss}}\) is obtained as in Ref. [15] as the magnitude of the negative vectorial sum of calorimeter energy deposits projected onto the transverse plane, plus the transverse momenta of identified muons measured by the tracking detector and muon spectrometer. Measured energy deposits in the calorimeters are corrected according to the identified object (high-\(p_{\mathrm {T}}\) jet, photon, electron, muon); otherwise energy deposits are calibrated with the local hadronic calibration scheme detailed in Ref. [40]. The contribution from multijet background events is reduced by using the centrality \(\mathcal{C}\) of the signal events, which is different from the value in multijet events due to the large top-quark mass. Events are required to have \(\mathcal{C}> 0.6\), with

$$\begin{aligned} \mathcal{C} = \frac{\sum _j^{\mathrm {jets}}{E_{{\mathrm {T}},j}}}{\sqrt{\left( \sum _j^{\mathrm {jets}}p_j\right) ^2}}\ \ , \end{aligned}$$
(1)

where \(E_{{\mathrm {T}},j}\) is the scalar transverse energy and \(p_j=(E_j,\mathbf {p}_j)\) the four-momentum of the \(j^{\mathrm {th}}\) selected jet, and the sum is over all selected jets.

3.3 Reconstruction

Fig. 1
figure 1

Comparison of the distribution of the unnormalised logarithmic likelihood for the reconstruction of fully hadronic \(t\bar{t}\) events in the data with expectations for a top-quark mass value of \(172.5\mathrm{\,GeV}\). The graph in the lower inset shows the ratio of data to the sum of \(t\bar{t}\) MC signal and the modelled multijet background (see Sect. 4). The error bars indicate the statistical uncertainty of the data. The shaded bands show the statistical and systematic (see Sect. 6) uncertainty on the expected signal and background distributions

In each selected event, a fully hadronic \(t\bar{t}\) final state is reconstructed using the six or more jets. In order to achieve this, the jets in data are assigned to the decay partons expected from the decay of the top quark and the related intermediate \(W\) boson, assuming a leading-order decay. Exploiting the knowledge of the precisely known mass of the \(W\) boson and the Breit–Wigner lineshapes of the top quark and the \(W\) boson decay, a kinematic fit [41] based on a likelihood function similar to the one described in Ref. [15] assists in establishing the assignment of reconstructed jets to partons. The fit is performed maximising the logarithmic likelihood, defined as the product of Breit–Wigner distributions for the two top-quark and \(W\) boson masses, and MC derived transfer functions for each of the six jets. The Breit–Wigner lineshape functions use the world-average values of the \(W\) boson mass (\(80.4\mathrm{\,GeV}\)) and decay width (\(2.1\mathrm{\,GeV}\)) from Ref. [42]. The masses of the top quark and antiquark are assumed to be equal for the Breit–Wigner lineshape and free to float in the fit. The top decay width is kept fixed at \(1.3\mathrm{\,GeV}\), corresponding to a top-quark mass of \(172.5\mathrm{\,GeV}\). The energies of the partons are transferred to the measured jet energies by transfer functions derived from simulation and parameterised by superpositions of two Gaussian functions. It is required, furthermore, that the fit assigns the \(b\) and \(\overline{b}\) quarks from the \(t\bar{t}\) decay to any two of the four leading jets. Maximising the logarithmic likelihood establishes the best assignment of reconstructed jets to partons from the \(t\bar{t}\) decay. Figure 1 shows the distribution of the unnormalised logarithmic likelihood value obtained per event and compared with the Monte Carlo prediction of the \(t\bar{t}\) signal added to the modelled multijet background (see Sect. 4). The prediction is in good agreement with the shape of the distribution. Requiring the logarithmic likelihood value to be greater than \(-45\) removes events which yield a low probability under a \(t\bar{t}\) decay hypothesis. The cut rejects about 47 % of the multijet background events, while 79 % of the fully hadronically decaying \(t\bar{t}\) events pass the cut.

After applying the above selection requirements and performing the \(t\bar{t}\) reconstruction \(15 551\) events remain in the signal region for the measurement of the top-quark mass (see Table 2). The expected fraction of \(t\bar{t}\) events in this region without any restriction on \(R_{3/2}\) is about \(17\,\%\), corresponding to a selection efficiency of \(\approx \)0.5 %.

4 Modelling of multijet background

The multijet background contribution is large and cannot be removed completely from any distribution used to measure the top-quark mass in the fully hadronic final state. Currently only leading-order theory calculations for final states with up to six parton are available in MC generator programs. Therefore, the multijet background is determined from the data.

For this approach, selected events are divided into six regions (\(A\)\(F\)) by using two observables with minimal correlation: the number of \(b\)-tagged jets and the transverse momentum of the sixth leading jet, \(p_{\mathrm {T}}^{\mathrm {6th\ jet}}\). The correlation in \(t\bar{t}\) events is estimated in simulation to be \(\rho =0.009\). The six regions, defined by three bins of the number of \(b\)-tagged jets and two ranges in \(p_{\mathrm {T}}^{\mathrm {6th\ jet}}\), are detailed in Table 2. Region \(F\), which is the signal region, i.e. two \(b\)-tagged jets with \(p_{\mathrm {T}}^{\mathrm {6th\ jet}}>30\mathrm{\,GeV}\), contains the largest fraction of \(t\bar{t}\) events in addition to multijet background events.

Table 2 Event yields for the six regions, defined by the number of \(b\)-tagged jets and the transverse momentum of the sixth leading jet \(p_{\mathrm {T}}^{\mathrm {6th\ jet}}\), are listed for data and \(t\bar{t}\) simulation assuming \(m_{t}=172.5\mathrm{\,GeV}\) with statistical uncertainty. The \(t\bar{t}\) fractions are derived from the observed numbers of events and their statistical uncertainties
Fig. 2
figure 2

Distributions of (left) dijet mass \(m_{jj}\), (middle) three-jet mass \(m_{jjj}\), and (right) ratio of three-jet mass to dijet mass \(R_{3/2}\), measured in data and compared to expectations after applying all analysis event selection criteria (i.e. for region \(F\)). The shape and normalisation of the multijet background distributions (green shaded histograms) are calculated using Eq. (4). The distributions for the \(t\bar{t}\) events (white histograms) are taken from the MC simulation using a top-quark mass value of \(172.5\mathrm{\,GeV}\). The insets under the distributions show the ratio of data to the summed contributions of \(t\bar{t}\) MC signal and modelled multijet background (see Sect. 4). The error bars represent the statistical uncertainties on the data. The shaded bands show the statistical and systematic (see Sect. 6) uncertainty on the expected signal and background distributions

Regions \(A\) through \(E\) are depleted in \(t\bar{t}\) events, but enhanced in multijet background events. The data yields in these regions (\(N_R^{\mathrm {obs}}\), \(R=A, \ldots , E\)) and the expected number of \(t\bar{t}\) events from MC simulation, \(N_R^{\mathrm {sig}}\), using \(m_t=172.5\mathrm{\,GeV}\) are listed in Table 2. The table also quotes the derived fraction \(N_R^{\mathrm {sig}}/N_R^{\mathrm {obs}}\) of \(t\bar{t}\) events in the respective region. The \(t\bar{t}\) event fraction in each region other than \(F\) is accounted for by subtracting from data, \(N_R^{\mathrm {obs}}\), the number of \(t\bar{t}\) events predicted by the MC simulation, \(N_R^{\mathrm {sig}}\), for a top-quark mass value of \(175\mathrm{\,GeV}\):

$$\begin{aligned} N_R^{\mathrm {bkg}} = N_R^{\mathrm {obs}} - N_R^{\mathrm {sig}} \end{aligned}$$
(2)

for region \(R=A, \ldots , E\). Due to the small \(t\bar{t}\) fractions in region \(A\) to \(E\), the top-quark mass value chosen in the simulation used for this subtraction procedure marginally affects the value of \(m_{t}\) measured in this analysis. Therefore, the value of \(m_{t}\) closest to the measured value (see Sect. 5) is used in the simulation for subtraction. The small dependence on the \(t\bar{t}\) MC simulation introduced by this subtraction is accounted for by a systematic uncertainty (see Sect. 6.2).

Given the tiny correlation of \(0.9\,\%\) predicted by MC simulation studies for the two observables used to define the regions, the total number of multijet background events, \(N_F^{\mathrm {bkg}}\), in region \(F\) can be estimated by cross-multiplication, for example, from the ratio of the number of events in region \(B\) to region \(A\) scaled by the number of events in region \(E\). To obtain the distribution of multijet background events, \(N_F^{\mathrm {bkg}}(x)\), for any given observable \(x\) (e.g. \(R_{3/2}\)) to the distribution in region \(F\) either of the following formulae can be used:

$$\begin{aligned} N_F^{\mathrm {bkg}}(x) = N_E^{\mathrm {bkg}}\cdot \frac{N_B^{\mathrm {bkg}}(x)}{N_A^{\mathrm {bkg}}}\qquad {\mathrm {or}} \nonumber \\ N_F^{\mathrm {bkg}}(x) = N_E^{\mathrm {bkg}}\cdot \frac{N_D^{\mathrm {bkg}}(x)}{N_C^{\mathrm {bkg}}} ,{} \end{aligned}$$
(3)

hence

$$\begin{aligned} N_F^{\mathrm {bkg}}(x)=\frac{N_E^{\mathrm {bkg}}}{2} \cdot \left( \frac{N_B^{\mathrm {bkg}}(x)}{N_A^{\mathrm {bkg}}}+\frac{N_D^{\mathrm {bkg}}(x)}{N_C^{\mathrm {bkg}}}\right) . \end{aligned}$$
(4)

Here, \(N_B^{\mathrm {bkg}}(x)\) and \(N_D^{\mathrm {bkg}}(x)\) define the shape of the distributions for an observable \(x\), while the appropriate normalisation is achieved by scaling with the total number of events (\(N_A^{\mathrm {bkg}}\), \(N_C^{\mathrm {bkg}}\), \(N_E^{\mathrm {bkg}}\)) in the respective region. Equation (4) is used to determine the multijet background while Eqs. (3) are used to estimate the systematic uncertainties on the modelled background (see Sect. 6.2).

Figure 2 shows the distributions of the dijet mass, the three-jet mass, and their ratio, \(R_{3/2}=m_{jjj}/m_{jj}\), after applying the event selection and jet assignments detailed in Sect. 3. In calculating \(R_{3/2}\) values for an event, \(m_{jjj}\) of both top-quark candidates and \(m_{jj}\) of the related \(W\) boson candidate are considered. Superimposed in Fig. 2 is the sum of the distributions for the \(t\bar{t}\) events obtained from MC simulation using \(m_t=172.5\mathrm{\,GeV}\) plus the multijet background estimated using Eq. (4). The distributions of the ratios of data to the sum of the signal MC events plus background model seen in Fig. 2 show that the data-driven approach yields a reliable model of the multijet background.

5 Top-quark mass measurement

The top-quark mass is obtained from a binned likelihood fit to the \(R_{3/2}\) distribution shown in Fig. 2. As noted above, two values of \(R_{3/2}\) are contributed by each event, reconstructed separately from the top and antitop-quark candidates. Because equal masses are assumed for the Breit–Wigner lineshapes for the top quark and antiquark in the kinematic fit for the jet assignments, the two values are correlated at the level of approximately 60 % according to MC simulation. This is corrected for in the statistical treatment described below. Templates are created for both the simulated top-quark contribution to the \(R_{3/2}\) distribution and the modelled background distribution. The top-quark contribution is parameterised by the sum of a Gaussian function and a Landau function which account, respectively, for the correctly reconstructed top-quark events and for the combinatorial background due to mis-assignment of jets to partons (see Sect. 3). This description involves six parameters.

Fig. 3
figure 3

Templates for the \(R_{3/2}\) distribution for \(t\bar{t}\) MC simulation using top-quark mass values of \(170.0,\ 175.0\) and \(180.0\mathrm{\,GeV}\), respectively. For each top-quark mass, the \(R_{3/2}\) distribution is fitted by the sum (black solid) of a Gaussian (red dashed) and Landau (blue dotted) function. Superimposed (orange cross-hatched) are the templates obtained from a combined fit of all \(R_{3/2}\) distributions using a linear dependence of parameters of the Gaussian and Landau functions on the top-quark mass value. The insets under the distributions show the difference Fit\(-\)MC between the combined fit and the simulated \(R_{3/2}\) histogram normalised to the statistical uncertainty \(\sigma \) of the corresponding \(R_{3/2}\) bin

A two-step approach is used to obtain an \(m_{t}\)-dependent representation of the templates. Firstly, the \(R_{3/2}\) distribution from each of the seven simulation samples of different \(m_{t}\) is fitted separately to determine the six parameters for each template mass. This yields a good description of the \(R_{3/2}\) distributions per chosen \(m_t\) (see Fig. 3). MC simulation has shown that each of the six parameters of the Gaussian and Landau functions depend linearly on the input top-quark mass. Secondly, from the parameter values obtained by these separate fits, initial values for offsets and slopes of the linear \(m_t\) dependencies are derived and then used as inputs to a combined, simultaneous fit to all seven \(R_{3/2}\) distributions. In total 12 parameters are determined by the combined fit, which yields a \(\chi ^2\) per number of degrees of freedom (ndf) of \(\chi ^2/\mathrm {ndf}=298/282=1.06\). Both the individual and the combined fit results are shown for three of the seven \(m_t\) values in Fig. 3.

The modelled multijet background, obtained using Eq. (4), is parameterised by a Gaussian function plus a linear function, thus involving five parameters. The resulting fit to data is shown in Fig. 4 and yields \(\chi ^2/\)ndf\(=40/36=1.08\). The shape of the fitted parameterisation is assumed to be independent of the top-quark mass while the normalisation is obtained from fitting to the data distribution. Any residual dependence of this parameterisation on the top-quark mass is accounted for by a systematic uncertainty (see Sect. 6).

The \(R_{3/2}\) distribution is fitted for the top-quark mass using the templates for both the top-quark signal and the modelled multijet background distribution described above. Defining the likelihood function as a product of Poisson probabilities

$$\begin{aligned} \mathcal {L}(R_{3/2}|m_{t})= \prod _{j}^{\mathrm {bins}} \left( \frac{\lambda _{j}^{N_{F,j}^{\mathrm {obs}}}}{N_{F,j}^{\mathrm {obs}}!} \right) \exp ({-\lambda _{j}}), \end{aligned}$$
(5)

a binned likelihood fit is applied. For the \(R_{3/2, j}\), i.e. the \(j^{\mathrm {th}}\) bin of the \(R_{3/2}\) distribution, \(N_{F,j}^{\mathrm {obs}}\equiv N_F^{\mathrm {obs}}(R_{3/2, j})\) and \(\lambda _{j}\) are the observed and expected number of events in that bin. Here, the expected number of events in a bin is given by the sum of \(t\bar{t}\) events \(N_{F,j}^{\mathrm {sig}}(m_{t})\), as derived from the signal templates, and multijet background events \(N_{F,j}^{\mathrm {bkg}}\equiv N_F^{\mathrm {bkg}}(R_{3/2, j})\),

$$\begin{aligned} \lambda _{j} = (1-f_{\mathrm {bkg}}) N_{F,j}^{\mathrm {sig}}(m_{t}) + f_{\mathrm {bkg}} N_{F,j}^{\mathrm {bkg}}, \end{aligned}$$
(6)

where \(f_{\mathrm {bkg}}\) is the fraction of multijet background events, which is determined by the fit.

Equation (5) is maximised with respect to \(m_{t}\) and \(f_{\mathrm {bkg}}\) for \(R_{3/2}\) values between \(1.5\) and \(3.6\), taking the normalisation from data, yielding

$$\begin{aligned} m_{t} = 175.06 \pm 1.35\;\text{(stat.) }\mathrm{\,GeV}\end{aligned}$$
(7)

for a background fraction of \(f_{\mathrm {bkg}}=0.72 \pm 0.01\) and \(\chi ^2/\)ndf \(=48/39=1.23\). The difference between the fitted background fraction and the value quoted in Sect. 3.3 is due to the restricted \(R_{3/2}\) range used in the fit. The result of this fit is shown in Fig. 5. The \(\chi ^2/\)ndf value is enlarged by the statistical correlation between the two \(R_{3/2}\) values from each event. Its impact has been incorporated in the quoted statistical uncertaintyFootnote 3 of Eq. (7) as follows.

Fig. 4
figure 4

Distribution of \(R_{3/2}\) for multijet background events according to the data-driven prescription of Eq. (4), normalised to unit integral. The parameterisation of the distribution by the sum of a Gaussian function and a linear function is superimposed

Fig. 5
figure 5

Result of the fit of Eq. (5) (solid black) to the measured \(R_{3/2}\) distribution. The red dotted curve shows the contribution from top-quark events and corresponds to the black curve in Fig. 3; the green dashed line is the modelled multijet background

Fig. 6
figure 6

Expected statistical uncertainty on the top-quark mass obtained from 5000 pseudo-experiments using \(t\bar{t}\) MC simulation events assuming \(m_{t}=175\mathrm{\,GeV}\) and neglecting correlations between the two \(R_{3/2}\) values per event

The statistical uncertainty of the fit is studied by performing pseudo-experiments, where 5000 pseudo-datasets of \(R_{3/2}\) values, each statistically equivalent to the data, are assembled from values randomly picked from signal and background histogramsFootnote 4. They are obtained from \(t\bar{t}\) MC simulationFootnote 5 generated for \(m_{t}=175\mathrm{\,GeV}\), and from the multijet background estimate, detailed in Sect. 4, respectively. Pseudo-datasets are created from two-dimensional histograms for the full MC sample of \(R_{3/2}\) from the top-quark candidate versus \(R_{3/2}\) of the top-antiquark candidate in an event, thereby accounting for the \(60\,\%\) correlation. Similarly, one-dimensional histograms are used to produce pseudo-datasets which do not include the correlations. The top quark mass and its statistical uncertainty are evaluated for each pseudo-dataset, using the likelihood fit of Eq. (5)

The expected statistical uncertainty of the fit when neglecting the correlation is shown in Fig. 6. A fit of a Gaussian function to the output of the 5000 pseudo-experiments yields an expected statistical uncertainty of \(1.19\pm 0.08\mathrm{\,GeV}\), which agrees with the observed statistical uncertainty of \(1.15\mathrm{\,GeV}\).

The same procedure with 5000 pseudo-datasets is applied to each of the seven top-quark mass values used for MC simulation, considering the correlation of the \(R_{3/2}\) values for the top quark and antiquark candidates in an event. Distributions of the pull values for the 5000 pseudo-datasets are derived, where the pull is the difference between the fitted, \(m_{t}^{\mathrm {fit}}\), and input, \(m_{t}^{\mathrm {inp}}\), top-quark mass values divided by the statistical uncertainty, \(\sigma ^{\mathrm {fit}}\), of the fit; \(\mathrm {pull}=(m_{t}^{\mathrm {fit}}-m_{t}^{\mathrm {inp}})/\sigma ^{\mathrm {fit}}\). The pull distribution for an unbiased measurement has a mean of zero and a standard deviation of unity. For this measurement no dependence of the pull mean on \(m_{t}^{\mathrm {inp}}\) is observed. An average pull mean value corresponding to \(m_{t}^{\mathrm {fit}}-m_{t}^{\mathrm {inp}} = -0.23 \pm 0.14\mathrm{\,GeV}\) and an average pull width of \(1.175 \pm 0.027\) are obtained. The bias in the width of the pull is due to the statistical correlation. To correct for this bias, the observed statistical uncertainty of \(1.15\mathrm{\,GeV}\) is scaled by 1.175 to yield the statistical uncertainty of \(1.35\mathrm{\,GeV}\) quoted in Eq. (7). The bias indicated by the non-zero mean value of the pull distribution is corrected for in the above quoted result. The uncertainty of the pull mean value is considered as part of the systematic uncertainty related to the calibration of this measurement method.

6 Systematic uncertainties

A large number of potential sources of systematic uncertainty were evaluated. They can be categorised as uncertainties due to: (i) the modelling of the \(t\bar{t}\) events in the MC simulation, (ii) the modelling of the multijet background by the data-driven approach, (iii) the correction and calibration of the energies of the reconstructed jets, the jet reconstruction and the \(b\)-quark identification efficiency. These are described in detail in Sects. 6.16.3. In general, for every investigated source of systematic uncertainty the likelihood fit of Eq. (5) for the top-quark mass is repeated with a modified parameter. Any change of the measured top-quark mass is assigned as the systematic uncertainty due to this source. The total systematic uncertainty arises from adding all individual contributions in quadrature. Table 3 lists the individual contributions and their combination. The largest systematic uncertainties are due to the jet and \(b\)-jet energy scales and the hadronisation modelling.

6.1 Signal modelling

All systematic uncertainties related to the modelling of \(t\bar{t}\) events and the lineshape of the top-quark mass distribution are investigated using 5000 data sets, created by the resampling technique described in Sect. 5 by randomly selecting \(R_{3/2}\) values from a distribution of \(t\bar{t}\) MC simulation events generated with a shifted value for the relevant parameter as detailed below. In Table 3, the difference between the mean values obtained with shifted and with default parameter values, from 5000 pseudo-experiments each, is quoted for the investigated sources of systematic uncertainty.

Table 3 Compilation of investigated systematic uncertainties on the determined top-quark mass reported in Sect. 5. The three parts of the table correspond to uncertainties in the \(t\bar{t}\) and multijet background modelling, and uncertainties in the jet measurements

Method calibration Our particular choice of signal parameterisation functions and the adopted linear dependence of the parameters of these functions on the top-quark mass value can affect the reconstructed top-quark mass. This uncertainty is estimated from the differences between the fitted and the input top-quark mass value when determining the \(t\bar{t}\) template for each of the seven simulation samples separately. The average of the absolute differences is \(0.23\mathrm{\,GeV}\) and also accounts for the average shift of the pull distributions.

The shapes of the templates for \(t\bar{t}\) and multijet background events can be affected by statistical uncertainties of either simulated events (signal templates) or data (background templates). This is assessed by creating 1000 new sets of templates by letting the standard templates fluctuate within their statistical uncertainties. The top-quark mass values obtained with these new templates are found to have an RMS spread of \(0.42\mathrm{\,GeV}\).

The larger of 0.23 and \(0.42\mathrm{\,GeV}\) is assigned as a systematic uncertainty for the method calibration.

Trigger Studies of the trigger efficiency close to the threshold region reveal a 5 % difference between data and MC simulation. The impact of this deviation is evaluated by reweighting the efficiency for triggering MC simulation events to match the efficiency observed in data as a function of the transverse momentum of the fifth leading jet. The observed change in the measured top-quark mass is \(0.01\mathrm{\,GeV}\).

Signal MC generator The impact of the choice of Powheg-box as the signal MC generator is evaluated by generating \(t\bar{t}\) events at \(m_{t}=\) \(172.5\mathrm{\,GeV}\) using either Powheg-box or MC@NLO [44, 45], each with Herwig [46] for the modelling of the parton shower and the hadronisation. The full difference in the top-quark mass values of \(0.30\mathrm{\,GeV}\) found from using Powheg or MC@NLO to determine the signal templates is quoted as the systematic uncertainty.

Hadronisation Potential systematic uncertainties due to our choice of parton shower and hadronisation model are assessed by using Powheg \(t\bar{t}\) events with parton shower and hadronisation performed by either Pythia with the Perugia P2012 tune or by Herwig Footnote 6 and Jimmy with the ATLAS AUET2 tune [47]. The full difference in the top-quark mass values of \(0.50\mathrm{\,GeV}\) between these two samples is ascribed to the uncertainty due to parton shower and hadronisation modelling.

Fast simulation The \(t\bar{t}\) MC simulation events for all seven \(m_{t}\) mass values are processed by a fast simulation of the ATLAS detector [22, 48]. For \(m_{t}=172.5\mathrm{\,GeV}\) an additional \(t\bar{t}\) MC simulation sample is created using the full simulation of the ATLAS detector. The systematic uncertainty of \(0.24\mathrm{\,GeV}\) is estimated from the difference of \(0.24\pm 0.30\;\text{(stat.) }\mathrm{\,GeV}\) between the top-quark masses obtained by performing pseudo-experiments on either the fast or the full MC simulation sample.

Colour reconnection Consequences of reconnection of colour flux lines between the partons are estimated with Powheg-box and Pythia by comparing simulated \(t\bar{t}\) events based on the Perugia 2012 tune including colour reconnection (CR) and the Perugia 2012 loCR tune [29], which uses a lower colour reconnection strength than the default tune. The full difference of \(0.22\mathrm{\,GeV}\) in measured top-quark mass between these two samples is attributed to the uncertainty from colour reconnection.

Underlying event The potential uncertainty due to the choice of a particular model to simulate underlying events is evaluated by considering events simulated using Powheg-box and Pythia based on the Perugia 2012 tune and comparing to events based on the Perugia 2012 mpiHi tune [29], which has an increased rate of jets from multi-parton interactions. Both tunes use the same parameters for the modelling of colour reconnection and both predict similar activity in the plane transverse to the leading charged particle. The samples used for colour reconnection uncertainties are based on different values for these parameters. The full difference between the fitted mass values of \(0.08\mathrm{\,GeV}\) is taken as the systematic uncertainty.

Initial- and final-state QCD radiation The impact from additional jets due to initial- and final-state QCD radiation, ISR and FSR, respectively, on the top-quark mass measurement is analysed with dedicated \(t\bar{t}\) event samples generated with the leading-order generator AcerMC [49]. Parton showering and hadronisation are performed by Pythia using the Perugia 2011C tune. Tunable parameters that control the parton shower strength are varied up and down in these samples in a range for which the simulated radiation in \(t\bar{t}\) events is compatible with the results found from an investigation of additional jets in \(t\bar{t}\) events [50]. Half of the full difference between the measured top-quark masses from these two samples is taken as the systematic uncertainty, which is \(0.22\mathrm{\,GeV}\).

Proton–parton distribution function The \(t\bar{t}\) event samples were generated using CT10 PDF. The uncertainties in these PDFs are specified by 26 pairs of additional PDF sets provided by the CTEQ group [51]. The effect of the PDF uncertainties on the \(t\bar{t}\) templates is derived from samples generated using MC@NLO with Herwig for hadronisation. For every additional PDF set, the simulated events are reweighted by the ratio of the varied PDF to the central PDF. Signal templates are constructed for each of these 26 pairs of sets. Using these templates, pseudo-experiments are performed per pair of PDF sets but using the same events for the up and down variations within every pair to alleviate the effects of the statistical fluctuations. Half of the sum in quadrature of the difference within each of the 26 pairs is assigned as the systematic uncertainty derived from the CTEQ PDF. Additionally, the \(t\bar{t}\) event samples are also reweighted to the central PDF set of either MSTW2008 [52] or NNPDF23 [53]. The final systematic uncertainty due to PDF is the sum in quadrature of these three contributions, which yields \(0.09\mathrm{\,GeV}\).

Pile-up The consequences of additional proton–proton interactions on the top-quark mass measurement are investigated by repeating the full analysis separately as a function of the number of reconstructed collision vertices, \(n_{\mathrm {vtx}}\), and as a function of the average number, \(\langle \mu \rangle \), of inelastic proton–proton interactions per bunch crossing. This is in addition to the effects already accounted for in the corresponding jet energy scale. The data sample is split into disjoint subsamples of \(n_{\mathrm {vtx}}\le 5\), \(5<n_{\mathrm {vtx}}\le 7\), and \(7<n_{\mathrm {vtx}}\), or into subsamples of \(\langle \mu \rangle \le 6\), \(6<\langle \mu \rangle \le 10\), and \(10<\langle \mu \rangle \). In each of these subsamples the full analysis for the top-quark mass measurement is repeated, giving per-subsample variations, \(\Delta m_{t}\). Within large statistical uncertainties, data and MC simulation agree. The effect of any residual differences between data and simulation is included by scaling \(\Delta m_{t}\) with the absolute difference between the \(n_{\mathrm {vtx}}\) distribution in data and simulation, each normalised to unit integral. The scaled \(\Delta m_{t}\) obtained for each of the three subsamples are summed, yielding \(0.02\mathrm{\,GeV}\). The same procedure is applied to the \(\Delta m_{t}\) from the subsamples of the \(\langle \mu \rangle \) distribution, yielding \(0.01\mathrm{\,GeV}\). The two sums, derived from the \(n_{\mathrm {vtx}}\) and for \(\langle \mu \rangle \) distributions, are then added in quadrature to estimate the systematic uncertainty on the top-quark mass measurement of \(0.02\mathrm{\,GeV}\).

6.2 Background modelling

Each of the prescriptions in Eq. (3) yields an independent estimate of the multijet background to the \(t\bar{t}\) events. Employing these separately distinguishes different contributions from background processes and accounts for conceivable correlations between the distribution \(N_F^{\mathrm {bkg}}(x)\) and the multiplicity of the \(b\)-tagged jets. In particular, the regions \(C\) and \(D\), where one jet is \(b\)-tagged, accumulate background from single top-quark production while suppressing contributions from \(W\) \(+\) jets processes. The regions \(A\) and \(B\), where no jets are \(b\)-tagged, are essentially free from \(t\bar{t}\) events and, hence, insensitive to systematic uncertainties from the subtraction of residual \(t\bar{t}\) contributions (see Eq. (2)). The average of the absolute shifts on \(m_{t}\) when using either of the prescriptions in Eq. (3) separately is taken as symmetric uncertainty on the background modelling, which amounts to \(0.35\mathrm{\,GeV}\).

6.3 Jet measurement

Systematic uncertainties due to measuring jets are listed in Table 3 and detailed in the following.

Jet energy scale The relative jet energy scale uncertainty varies between about 1 and \(3\,\%\) depending on the \(p_{\mathrm {T}}\) and \(\eta \) of the jet. This was investigated in detail in Refs. [17, 34, 35], which prescribe 21 components of uncertainty, including a proper treatment of the correlations between the individual sources. The 21 components involve nuisance parameters from different in situ techniques applied to evaluate residual jet energy scale correction factors which account for differences between data and MC simulation. They originate from the calibration method, the calorimeter response, the detector simulation and the specific choice of parameters in the physics model employed by the MC event generator. Further sources of uncertainty are related to the extrapolation to the high-\(p_{\mathrm {T}}\) region, to the intercalibration of jets at large pseudorapidity with central jets and to the pile-up. Topology-dependent uncertainties arising from the relative numbers of jets initiated by gluons and light quarks are included as well as uncertainties on the response to jets with nearby hadronic activity. The 21 components are considered uncorrelated. After repeating the top-quark mass measurement separately for each component, the variation in the top-quark mass value obtained from the up and down variation of each nuisance parameter is symmetrised. The individual symmetrised contributions are added in quadrature to estimate the overall \(\Delta m_{t}\) due to jet energy scale uncertainty of \(0.51\mathrm{\,GeV}\).

Table 4 lists the individual systematic uncertainty components related to the energy measurements of jets combined into different categories according to the type of source and correlations (see Ref. [34]).

Table 4 Individual contributions to the systematic uncertainty of the top-quark mass due to uncertainties on the jet energy scale listed in Table 3

Relative \(b\) -jet energy scale The relative \(b\)-jet energy scale accounts for the remaining differences between an inclusive jets sample and jets originating from bottom quarks after the global jet energy scale is determined. It is estimated by choosing different fragmentation models. An extra uncertainty, ranging between \(1.8\) and \(0.7\,\%\), and decreasing as jet \(p_{\mathrm {T}}\) increases, is assigned to each \(b\)-jet to account for the difference between jets containing \(b\)-flavoured hadrons and the inclusive jet sample. This uncertainty is derived from MC simulation studies and validated by comparison with data (see Ref. [36] for details). For the spectrum of jets selected in this analysis the average uncertainty is less than \(1.2\,\%\). The systematic uncertainty on \(m_t\) due to the relative \(b\)-jet energy scale is \(0.62\mathrm{\,GeV}\).

Jet energy resolution The impact of a residual difference between the jet energy resolution in data and MC simulation is accounted for by smearing the energy of each reconstructed jet in the simulation by a Gaussian function before applying the event selection requirements (see Ref. [54] for details). The top-quark mass measurement is repeated using the smeared jet energies yielding a variation of \(0.01\mathrm{\,GeV}\), which is symmetrised and assigned as a systematic uncertainty.

Jet reconstruction efficiency The jet reconstruction efficiency was found in Ref. [17] to differ in data and MC simulation by no more than \(\pm 2\,\%\). This residual difference is applied as a variation by randomly removing jets from the simulated events before applying the event selection criteria. The variation of \(0.01\mathrm{\,GeV}\) found by repeating the top-quark mass measurement employing this modified MC simulation sample is taken as a systematic uncertainty.

\(b\) -tagging efficiency and mistag rate The efficiency for tagging \(b\)-quark jets as well as the \(c\)-quark and light-quark (\(u\), \(d\), \(s\)) jet mistag rate in simulation are corrected to data by scale factors [19, 39]. The uncertainty of this correction is propagated to the measured top-quark mass by varying these scale factors by one standard deviation about their central values, which depend on the \(p_{\mathrm {T}}\) and the \(\eta \) of the jet, and on the underlying quark flavour. The variations in the top-quark mass are added in quadrature to assess the systematic uncertainty from this source, which yields \(0.17\mathrm{\,GeV}\).

Soft contribution to missing energy Measured energy deposits in the calorimeter which are not associated with a high-\(p_{\mathrm {T}}\) jet, photon, electron, or muon, stem mostly from low-\(p_{\mathrm {T}}\) particles. These energy deposits are calibrated using the local hadronic calibration scheme [40]. An uncertainty of \(0.02\mathrm{\,GeV}\) on the top-quark mass due to this assumption is derived by scaling the soft contributions within their uncertainties.

Jet vertex fraction scale factor uncertainty The difference in JVF between data and MC simulation is corrected by applying scale factors. These scale factors, varied according to their uncertainty, are applied to MC simulation events as a function of the \(p_{\mathrm {T}}\) of a jet. The resulting variation in the measured top-quark mass amounts to \(0.02\mathrm{\,GeV}\).

7 Comparison with alternative analysis

The result of this measurement is compared with an independent measurement based on essentially the same selection described in Sect. 3. For this independent measurement, however, entirely different methods are chosen for alleviating the effects due to uncertainties from the jet energy measurement and for modelling the multijet background. Applying a simultaneous two-dimensional fit to the \(W\) boson and top-quark masses unfolds the dependency of the top-quark mass on a global jet scale factor. Thus systematic uncertainties affecting the jet scale factor are mostly removed from the uncertainties in the measured top-quark mass; however, this gives rise to increased statistical uncertainty (see also Ref. [15]).

In the independent alternative measurement, the multijet background is modelled using an event mixing procedure. Here, events with six or more jets are composed from events with exactly five jets, two of which are \(b\)-tagged, merged with the sixth and subsequent leading jets from events of an independent inclusive jet sample. Kinematic similarity of the two events to be mixed is ensured by requiring the similarity of the transverse momenta of both the leading jets in the two events and also of the fifth leading jets. Evaluation of the systematic uncertainties described in Sect. 6 was performed for this independent analysis. This investigation showed that the alternative analysis and the main analysis have similar sensitivities to the top-quark mass. The alternative analysis has yielded a top-quark mass value and a total statistical uncertainty of \(m_{t} = 174.7 \pm 1.4\ (\mathrm {stat.+JSF}) \mathrm{\,GeV}\) with a global jet scale factor of \(\mathrm {JSF} = 1.013 \pm 0.008\, (\mathrm {stat.})\), in good agreement with the results presented in Sects. 5 and 6.

8 Summary

In a data set corresponding to \(4.6\,\text{ fb }^{-1}\) of proton–proton collisions collected by the ATLAS experiment at the LHC at \(\sqrt{s}=7\mathrm{\,TeV}\), events consistent with \(t\bar{t}\) pairs decaying into a fully hadronic final state were selected. A kinematic likelihood fit was employed to assign reconstructed jets to the partons expected from the leading-order hadronic decay of the intermediate \(t\bar{t}\) state. To reduce the sensitivity of the analysis to the energy scale of jets, the ratio \(R_{3/2}\) of the three-jet mass to the dijet mass was calculated. The three-jet mass calculation combines all jets from a top-quark decay, and the dijet mass is computed with the two jets from the hadronically decaying \(W\) boson. The multijet background was determined by dividing the event sample into six disjoint sets according to the number of \(b\)-tagged jets and the \(p_{\mathrm {T}}\) of the sixth jet. The background in the region of interest is then estimated by cross-multiplication. Fitting the \(R_{3/2}\) distribution for the top-quark mass yields

$$\begin{aligned} m_{t} = 175.1 \pm 1.4\;\text{(stat.) }\pm 1.2\;\text{(syst.) }\mathrm{\,GeV}\end{aligned}$$
(8)

with a measured fraction of background events \(f_{\mathrm {bkg}}=0.72\pm 0.01\). The systematic uncertainties are dominated by the residual uncertainties from the jet energy scale for all jets and, specifically, for \(b\)-quark jets and by the uncertainties from hadronisation modelling. The total uncertainty is \(1.8\mathrm{\,GeV}\). This result has a precision similar to, and within uncertainties fully agrees with, the top-quark mass measured from the fully hadronic final state by other experiments [10, 11] and the result measured in the lepton plus jets final state and published previously by ATLAS [15].