1 Introduction

The measurement of top quark pair (\(t\bar{t}\)) production in association with one or more jets containing b-hadrons (henceforth referred to as b-jets) is important in providing a detailed understanding of quantum chromodynamics (QCD). The most accurate theoretical predictions for these processes are fixed-order calculations at next-to-leading order (NLO) accuracy [13] in perturbative QCD (pQCD), which have been matched to a parton shower [46]. These calculations have significant uncertainties from missing higher-order terms [7, 8], making direct experimental measurements of this process desirable. The measurement of such cross-sections in fiducial phase-spaces, defined to correspond as closely as possible to the acceptance of the ATLAS detector, can be compared to theoretical predictions using the same fiducial requirements. This minimises theoretical extrapolations to phase-space regions that are not experimentally measurable.

Moreover, following the discovery of the Higgs boson [9, 10], the Standard Model prediction for the top quark Yukawa coupling can be tested via a measurement of the \(t\bar{t}H\) associated production cross-section. Due to the large Higgs branching ratio to b-quarks, the \(t\bar{t}H\rightarrow t\bar{t}b\bar{b}\) channel is promising, but suffers from a large and poorly constrained background of events with top pairs and additional b-jets from QCD processes [1113].

Measurements of \(t\bar{t}\) production with additional heavy-flavour jets have been performed by ATLAS at \(\sqrt{s}= 7\) TeV [14] and CMS at \(\sqrt{s}\) = 8 TeV [15, 16]. The ATLAS measurement reported a ratio of heavy flavour to all jets produced in association with a \(t\bar{t}\) pair where heavy flavour includes both bottom jets as well as charm jets. The CMS measurement is a fiducial measurement of events with two leptons and four or more jets, of which at least two are identified as containing a b-hadron.

This paper presents measurements of fiducial cross-sections for \(t\bar{t}\) production in association with one or two additional b-jets. Because the top quark decays almost exclusively to a b-quark and a W boson, these processes have three or four b-jets in the final state. The particle-level objects are required to be within the detector acceptance of \(|\eta |< 2.5\), where \(\eta \) is the pseudorapidity.Footnote 1 The jets are required to have transverse momenta above 20 GeV and the electrons and muons to have transverse momenta above 25 GeV. The lepton-plus-jets and dilepton (\(e \mu \)) channelsFootnote 2 are used to perform two measurements of the cross-section for the production of \(t\bar{t}\) events with at least one additional b-jet. In both cases, the signal cross-section is extracted from a fit to a multivariate discriminant used to identify b-tagged jets [17]. The lepton-plus-jets channel has a higher acceptance times branching ratio, but suffers from a significant background of events in which the W boson decays to a c- and a light quark.

Two analysis techniques are used in the dilepton channel (ee, \(\mu \mu \) and \(e \mu \)) to measure a cross-section for the production of \(t\bar{t}\) events with two additional b-jets. The first, referred to as the cut-based analysis, applies very tight selection criteria including a requirement of four b-tagged jets. This analysis results in a high signal-to-background ratio and relies on the Monte Carlo (MC) estimates of the background, including the \(t\bar{t}\) background with additional jets containing c-quarks (c-jets) or only light quarks and gluons (light jets). The second applies a looser selection and extracts the signal cross-section from a fit to the distribution of a multivariate b-jet identification discriminant. This second method, referred to as the fit-based analysis, confirms the validity of the background predictions used in the cut-based approach, and offers a measurement of the ratio of cross-sections for events with two additional b-jets and all events with two additional jets.

The fiducial measurements are made considering both electroweak (e.g. from Z boson decays) and QCD production of the additional b-quarks as signal. In order to compare to NLO pQCD theory predictions, the measurements are also presented after subtracting the electroweak processes, \(t\bar{t}V\) (V corresponding to a W or Z boson) and \(t\bar{t}H\).

The paper is organised as follows. First, the definitions of the fiducial regions are given in Sect. 2. The ATLAS detector is briefly described in Sect. 3, followed in Sect. 4 by a description of the data and simulated samples used. Section 5 describes the reconstruction of physics objects in the detector and presents the event selection used. The sources of systematic uncertainties affecting the measurements are described in Sect. 6. Section 7 describes the analysis techniques used to extract the cross-sections and their uncertainties. The final cross-sections are presented in Sect. 8 and compared to recent theoretical predictions. Finally, Sect. 9 gives brief conclusions.

2 Measurement definition

This section details the particle-level fiducial phase-space definitions. Particle-level object definitions that are common to all measurements are described in Sect. 2.1. The particle-level event selection is then discussed in Sect. 2.2, describing first the fiducial selection used to define the cross-section, and then, where relevant, the selection used to define the templates that are fit to the data.

2.1 Particle-level object definitions

The particle-level definition of objects is based on particles with a proper lifetime \(\tau _{\mathrm {particle}} > 3\times 10^{-11}\) s. The definitions used here follow very closely previous ATLAS \(t\bar{t}\) fiducial definitions [18]. Fiducial requirements are placed only on jets and charged leptons.

Electrons and muons: Prompt electrons and muons, i.e. those that are not hadron decay products, are considered for the fiducial lepton definition. Electrons and muons are dressed by adding to the lepton the four-vector momenta of photons within a cone of size \(\Delta R=0.1\) around it. Leptons are required to have \(p_{\text {T}} > 25\) GeV and \(|\eta | < 2.5\).

Jets: Jets are obtained by clustering all stable particles, except the leptons, dressed with their associated photons, and neutrinos that are not hadron decay products, using the anti-\(k_t\) algorithm [1921] with a radius parameter \(R=0.4\). Particles from the underlying event are included in this definition, whereas particles from additional inelastic proton–proton collisions (pile-up) are not included. The products of hadronically decaying \(\tau \) leptons are thus included within jets. Photons that were used in the definition of the dressed leptons are excluded from the jet clustering. Particle jets are required to have \(p_{\text {T}} > 20\) GeV and \(|\eta | < 2.5\). The \(p_{\text {T}}\) threshold for particle-level jets is optimised to reduce the uncertainty of the measurement; it is chosen to be lower than the 25 GeV threshold used for reconstructed jets (see Sect. 5), as jets with a true \(p_{\text {T}}\) just below the reconstruction threshold may satisfy the event selection requirement due to the jet energy resolution. This effect is enhanced by the steeply falling \(p_{\text {T}}\) spectra for the additional jets. A similar choice is not necessary for electrons and muons due to their better energy resolution.

Jet flavour identification: A jet is defined as a b-jet by its association with one or more b-hadrons with \(p_{\text {T}} >\) 5 GeV. To perform the matching between b-hadrons and jets, the magnitudes of the four-momenta of b-hadrons are first scaled to a negligible value (in order to not alter normal jet reconstruction), and then the modified b-hadron four-momenta are included in the list of stable particle four-momenta upon which the jet clustering algorithm is run, a procedure known as ghost-matching [22]. If a jet contains a b-hadron after this re-clustering, it is identified as a b-jet; similarly, if a jet contains no b-hadron but is ghost-matched to a c-hadron with \(p_{\text {T}} > 5\) GeV, it is identified as a c-jet. All other jets are considered light-flavour jets.

Overlap between objects: In order to ensure isolation of all objects considered, events are rejected if any of the jets satisfying the fiducial requirements lie within \(\Delta R=0.4\) of a dressed, prompt lepton.

2.2 Fiducial event selection

The fiducial object definitions given above are used to classify events as signal or background. This is described in Sect. 2.2.1. Section 2.2.2 defines the templates used in the fit-based measurements.

2.2.1 Signal event selection

The signal definitions are related to the fiducial definition of either a lepton-plus-jets or a dilepton \(t\bar{t}\) decay topology with at least one or at least two extra jets. The classification is based on the number of leptons and the number and flavour of the jets passing the fiducial object selection. Cross-section measurements are reported in the following three fiducial phase-spaces:

  • \(\user2{ttb}\) lepton-plus-jets refers to \(t\bar{t}\) events with exactly one lepton and at least five jets, of which at least three are b-jets;

  • \(\user2{ttb}\) \(\user2{e\,\mu} \) refers to \(t\bar{t}\) events with one electron, one muon, and at least three b-jets;

  • \(\user2{ttbb}\) dilepton refers to \(t\bar{t}\) events with two leptons and least four b-jets.

For the ttbb fiducial region, additional requirements are placed on the invariant mass of the lepton pair. For all flavours of lepton pairs, the invariant mass of the two leptons (\(m_{\ell \ell }\)) must be above 15 GeV. In events with same-flavour leptons, \(m_{\ell \ell }\) must also satisfy \(|m_{\ell \ell } - m_{Z} | > 10\) GeV, where \(m_{Z}\) is the mass of the Z boson. Table 1 summarises the fiducial definition of all three phase-spaces.

Table 1 Summary of the three sets of fiducial selection criteria employed for the ttb and ttbb cross-section measurements. The jet–lepton isolation (\(\Delta R_{\ell ,j}\)) requires \(\Delta R > 0.4\) between any of the jets and the leptons

2.2.2 Template definitions

The measurements based on fits determine the signal and background contributions using templates of the b-tagging discriminant for the various categories of events. Because b-jets, c-jets and light jets give different distributions for the discriminant, the non-signal \(t\bar{t}\) events are split according to the flavour of the additional jet(s) in the event.

In particular, the ttb analyses define the signal template (\(ttb\)) using the same requirements on the jets as used for the cross-section definition, and similar templates are defined for c-jets (\(ttc\)) and light jets (\(ttl\)). With two additional jets, the ttbb fit-based measurement has a larger number of possible flavour combinations. The templates of different combinations are merged if they have similar shapes and if they are produced through similar processes. This results in four templates: \(ttbb\), \(ttbX\), \(ttcX\) and \(ttlX\).

In addition, because the lepton kinematics do not significantly affect the distributions of the b-jet discriminant, the dilepton fit measurements do not include the lepton requirements in the template definitions. For these analyses, a correction for the fiducial acceptance of the leptons thus needs to be applied (\(f_{\mathrm {fid}}\)). The ttb lepton-plus-jets analysis uses the same lepton requirements in defining the templates as are used for the signal definition.

Table 2 shows the complete set of criteria used in the fiducial definitions of the various templates. For the lepton-plus-jets analysis, contributions from \(W \rightarrow cq\) (\(q = s, d\)) decays where the c-hadron is matched to one of the fiducial jets are included in the \(ttc\) template; this contribution is found to dominate over that from \(t\bar{t}\) with additional heavy flavour.

The ttbb cut-based measurement does not make use of templates for fitting. Events are considered as signal if they meet the definition of ttbb in Sect. 2.2.1; all other \(t\bar{t}\) events are considered background.

Table 2 Particle-level definitions used to classify selected \(t\bar{t}\) events into templates for the likelihood fits. The categories depend on the number of jets and number of b- and c-jets within the fiducial region

3 ATLAS detector

The ATLAS detector [23] at the LHC covers nearly the entire solid angle around the collision point. It consists of an inner tracking detector surrounded by a thin superconducting solenoid, electromagnetic and hadronic calorimeters, and a muon spectrometer incorporating three large superconducting toroid magnets. The inner-detector system (ID) is immersed in a 2T axial magnetic field and provides charged-particle tracking in the range \(|\eta | < 2.5\).

A high-granularity silicon pixel detector covers the vertex region and typically provides three measurements per track, the first hit being normally in the innermost layer. This pixel detector is important for the reconstruction of displaced vertices used to identify jets containing heavy-flavour hadrons. It is followed by a silicon microstrip tracker, which has four layers in the barrel region. These silicon detectors are complemented by a transition radiation tracker, which enables radially extended track reconstruction up to \(|\eta | = 2.0\). The transition radiation tracker also provides electron identification information based on the fraction of hits (typically 30 in total) above a higher energy-deposit threshold corresponding to transition radiation. The ID reconstructs vertices with spatial resolution better than 0.1 mm in the direction longitudinal to the beam for vertices with more than ten tracks.

The calorimeter system covers the pseudorapidity range \(|\eta | < 4.9\). Within the region \(|\eta |< 3.2\), electromagnetic calorimetry is provided by barrel and endcap high-granularity lead/liquid-argon (LAr) electromagnetic calorimeters, with an additional thin LAr presampler covering \(|\eta | < 1.8\), to correct for energy loss in material upstream of the calorimeters. Hadronic calorimetry is provided by a steel/scintillating-tile calorimeter, segmented into three barrel structures within \(|\eta | < 1.7\), and two copper/LAr hadronic endcap calorimeters. The solid angle coverage is completed with forward copper/LAr and tungsten/LAr calorimeter modules optimised for electromagnetic and hadronic measurements respectively.

The muon spectrometer (MS) comprises separate trigger and high-precision tracking chambers measuring the deflection of muons in a magnetic field generated by superconducting air-core toroids. The precision chamber system covers the region \(|\eta | < 2.7\) with three layers of monitored drift tubes, complemented by cathode strip chambers in the forward region, where the background is highest. The muon trigger system covers the range \(|\eta | < 2.4\) with resistive-plate chambers in the barrel, and thin-gap chambers in the endcap regions.

A three-level trigger system is used to select interesting events [24]. The Level-1 trigger is implemented in hardware and uses a subset of detector information to reduce the event rate to a design value of at most 75 kHz. This is followed by two software-based trigger levels which together reduce the event rate to about 400 Hz.

4 Data samples and MC simulations

4.1 Data samples

The results are based on proton–proton collision data collected with the ATLAS experiment at the LHC at a centre-of-mass energy of \(\sqrt{s} = 8\) TeV in 2012. Only events collected under stable beam conditions with all relevant detector subsystems operational are used. Events are selected using single-lepton triggers with \(p_{\text {T}} \) thresholds of 24 or 60 GeV for electrons and 24 or 36 GeV for muons. The triggers with the lower \(p_{\text {T}} \) threshold include isolation requirements on the candidate lepton in order to reduce the trigger rate to an acceptable level. The total integrated luminosity available for the analyses is \(20.3~\text{ fb }^{-1}\).

4.2 Signal and background modelling

The sample composition for all analyses is dominated by \(t\bar{t}\) events. Contributions from other processes arise from W+jets, Z+jets, single top (t-channel, Wt and s-channel), dibosons (WWWZZZ) and events with one or more non-prompt or fake leptons from decays of hadrons. In these measurements, \(t\bar{t}V\)(where V corresponds to a W or Z boson) and \(t\bar{t}H\) events that pass the fiducial selection are considered as part of the signal. Results with those processes removed are also provided to allow direct comparison to theory predictions at NLO in pQCD matched to parton showers (see Sect. 4.4). All backgrounds are modelled using MC simulation except for the non-prompt or fake lepton background, which is obtained from data for the \(ttb\) lepton-plus-jets and \(ttb\) \(e\mu \) analyses, as described below.

\({\varvec{t\bar{t}}}\): The nominal sample used to model \(t\bar{t}\) events was generated using the PowhegBox (version 1, r2330) NLO generator [2527], with the NLO CT10 parton distribution function (PDF) [28] assuming a top quark mass of 172.5 GeV. It was interfaced to Pythia 6.427 [29] with the CTEQ6L1 [30] PDF and the Perugia2011C [31] settings for the tunable parameters (hereafter referred to as tune). The hdamp parameter of PowhegBox, which controls the \(p_{\text {T}}\) of the first additional emission beyond the Born configuration, was set to \(m_{\mathrm {top}}=172.5\) GeV. The main effect of this is to regulate the high-\(p_{\text {T}}\) emission against which the \(t\bar{t}\) system recoils. In Figs. 1 and  2, tables of event yields, and comparison to predictions, the \(t\bar{t}\) sample is normalised to the theoretical calculation of \(253^{+13}_{-15}\) pb performed at next-to-next-to leading order (NNLO) in QCD that includes resummation of next-to-next-to-leading logarithmic (NNLL) soft gluon terms with Top++2.0 [3237]. The quoted uncertainty includes the scale uncertainty and the uncertainties from PDF and \(\alpha _\mathrm{S}\) choices.

\({{\varvec{t\bar{t}V}}}\): The samples of \(t\bar{t}V\) with up to one additional parton were generated with the MadGraph v5 generator (v1.3.33) [38] with the CTEQ6L1 PDF set. Pythia 6.426 with the AUET2B tune [39] was used for showering. The top quark production and decay was performed in MadGraph and \(t\bar{t} + Z/\gamma ^{*}\) interference was included. The \(t\bar{t}V\) samples are normalised to the NLO cross-section predictions [40, 41].

\({{\varvec{t\bar{t}H}}}\): The \(t\bar{t}H\) process was simulated using NLO matrix elements for \(pp\rightarrow t\bar{t}H\) provided by the HELAC-Oneloop package [42], interfaced to Pythia 8.175 [43] through PowhegBox [27], also known as the Powhel approach [44]. The matrix-element calculation was performed using the CT10 PDF set and the parton shower used the AU2CT10 tune [45]. The sample is normalised to the NLO cross-section prediction and uses the SM values for the Higgs boson branching ratios [46].

\(\varvec{W/Z}\) +jets: Samples of W+jets and \(Z/\gamma ^{*}\)+jets were generated using the Alpgen v2.14 [47] leading-order (LO) generator and the CTEQ6L1 PDF set [48]. Parton shower and fragmentation were modelled with Pythia 6.426 [29]. To avoid double-counting of partonic configurations generated by both the matrix-element calculation and the parton-shower evolution, a parton–jet matching scheme (“MLM matching”) [49] was employed. The W / Z+jets samples were generated with up to five additional partons, separately for production in association with b-quarks, c-quarks and light quarks. The overlap between events with heavy-flavour quarks obtained from the matrix element and the parton showers was removed using a scheme based on angular separation between the heavy quarks. The W / Z+jets backgrounds are normalised to the inclusive NNLO theoretical cross-section [50]. In the dilepton channel, a data-driven method is used to validate the Z+jets normalisation. A region enriched in Z+jets events is defined by inverting the requirement \(|m_{ee / \mu \mu }-91 \mathrm\,{\mathrm{GeV}}|>10\) GeV. The data are found to agree with the prediction in all lepton channels.

Dibosons: Samples of WW / WZ / ZZ+jets were generated using Alpgen v2.14 [47]. Parton shower and fragmentation were modelled with Herwig 6.520 [51]. Sherpa 1.4.3 [5255] samples including massive b- and c-quarks with up to three additional partons were used to cover the WZ channel with the Z decaying to hadrons, which was not taken into account in the Alpgen samples. All diboson samples are normalised to their NLO theoretical cross-sections [56, 57] as calculated with MCFM [58]; the NLO PDF set MSTW2008 was used for all decay channels.

Single top: Background samples of single top quarks corresponding to the t-channel, s-channel and Wt production mechanisms were generated with PowhegBox (version 1, r2330) [2527] using the CT10 PDF set [28]. All samples were interfaced to Pythia 6.426 [29] with the CTEQ6L1 set of parton distribution functions and the Perugia2011C tune. In the dilepton channels, only the Wt process is considered. Overlaps between the \(t\bar{t}\) and Wt final states were removed according to the inclusive Diagram Removal scheme [59]. The single-top-quark samples are normalised to the approximate NNLO theoretical cross-sections [6062] using the MSTW2008 NNLO PDF set.

All event generators using Herwig 6.520 [51] were also interfaced to Jimmy v4.31 [63] to simulate the underlying event. The samples that used Herwig or Pythia for showering and hadronisation were interfaced to Photos [64] for modelling of the QED final-state radiation and Tauola [65] for modelling the decays of \(\tau \) leptons. The \(t\bar{t}H\) sample was interfaced to Photos++. All samples were simulated taking into account the effects of multiple pp interactions based on the pile-up conditions in the 2012 data. The pile-up interactions are modelled by overlaying simulated hits from events with exactly one inelastic (signal) collision per bunch crossing with hits from minimum-bias events that are produced with Pythia 8.160 using the A2M tune [45] and the MSTW2008 LO PDF [66]. Finally the samples were processed through a simulation [67] of the detector geometry and response using Geant4 [68]. All simulated samples were processed through the same reconstruction software as the data. Simulated events are corrected so that the object identification efficiencies, energy scales and energy resolutions match those determined in data control samples. The alternative \(t\bar{t}\) samples described in Sect. 6.3, used for evaluating systematic uncertainties, were instead processed with the ATLFAST-II [67] simulation. This employs a parameterisation of the response of the electromagnetic and hadronic calorimeters, and Geant4 for the other detector components. The nominal \(t\bar{t}\) sample is processed with both Geant4 and ATLFAST-II; the latter is used when calculating the generator uncertainties.

Table 3 provides a summary of basic settings of the MC samples used in the analysis. The alternative \(t\bar{t}\) samples used to evaluate the \(t\bar{t}\) generator uncertainties are described in Sect. 6.3.

Table 3 Summary of the Monte Carlo event generators used in the analyses. Generators used only for evaluating systematic uncertainties are not included

4.3 Backgrounds with fake or non-prompt leptons

Events with fewer prompt leptons than required may satisfy the selection criteria if one or more jets are mis-identified as isolated leptons, or if the jets include leptonic decays of hadrons which then satisfy lepton identification and isolation requirements. Such cases are referred to as fake leptons.

In the lepton-plus-jets channel, this background is estimated from data using the so-called matrix method [69]. A sample enhanced in fake leptons is selected by removing the lepton isolation requirements and, for electrons, loosening the identification criteria (these requirements are detailed in Sect. 5.1). Next, the efficiency for these “loose” leptons to satisfy the tight criteria is measured in data, separately for prompt and for fake leptons. For prompt leptons it is taken from a sample of Z boson decays, while for fake leptons it is estimated from events with low missing transverse momentum or high lepton impact parameter. With this information the number of fake leptons satisfying the tight criteria can be calculated.

In the ttb \(e \mu \) analysis, this background is estimated from data using events where the two leptons have electrical charges with the same sign. Processes which contain two prompt leptons with the same sign, such as \(t\bar{t}W\), and cases of lepton charge mis-identification, are subtracted from the same-sign data using MC simulation. In the ttbb measurements, the background is less important, as the higher jet multiplicity requirement means fewer additional jets available to be mis-identified as leptons. In this case the background is estimated from the simulation samples described above.

4.4 Predictions for \(t\bar{t}\) with additional heavy flavour

The measured fiducial cross-sections are compared to a set of theory predictions obtained with the generators shown in Table 4. In each case the fiducial phase-space cuts are applied using Rivet 2.2.1 [70].

Two generators are used which employ NLO \(t\bar{t}b\bar{b}\) matrix elements with the top quarks being produced on-shell. A MadGraph5_aMC@NLO sample was generated in the massive 4-flavour scheme (4FS), using two different functional forms for the renormalisation and factorisation scales: \(\mu =m_{\mathrm {top}}^{1/2}\left( p_\mathrm{T}(b)p_\mathrm{T}(\bar{b})\right) ^{1/4}\) (the BDDP [1] form), and \(\mu =\frac{1}{4}H_\mathrm{T}=\frac{1}{4} \sum _{i} \sqrt{m_i^2+p_{\mathrm {T},i}^2}\), where the sum runs over all final-state particles. A Powhel sample was generated as described in Ref. [4], with the top quark mass set to 173.2 GeV. The renormalisation and factorisation scales were set to \(\mu =\frac{1}{2}H_\mathrm{T}\), with the sum in this case running over all final-state particles in the underlying Born configuration. In contrast to MadGraph5_aMC@NLO, this sample employed the 5-flavour scheme (5FS), which unlike the 4FS treats b-quarks as being massless and contains a resummation of logarithmically enhanced terms from collinear \(g\rightarrow b\bar{b}\) splittings [71]. In order to regularise the divergence associated with gluon splitting into a pair of massless \(b\bar{b}\) quarks, the transverse momentum of each b-quark, and the invariant mass of the \(b\bar{b}\) pair, were all required to be greater than 2 GeV. This implies that the 5FS calculation does not cover the entire phase-space measured by the ttb analyses. However, the missing events, in which a second b-quark is produced with \(p_{\text {T}}\) below 2 GeV, or two b-quarks have invariant mass below 2 GeV, are expected to contribute only a small amount to the fiducial cross-section. The prediction for the ttbb fiducial cross-section is unaffected by the generator cuts. Both the MadGraph5_aMC@NLO and Powhel samples used Pythia 8.205 [72] with the Monash tune [73] for the parton shower.

The cross-sections are also compared to predictions in which the additional b-quarks are not present in the matrix-element calculation and are only created in the parton shower. The PowhegBox sample is the same one used for the nominal \(t\bar{t}\) prediction, described in Sect. 4.2. A merged sample containing a \(t\bar{t}\) final state with up to three additional partons (b, c, or light) was generated with MadGraph5 interfaced to Pythia 6.427 with the Perugia2011C [31] tune. Finally, in order to assess the effect of the different descriptions of the \(g\rightarrow b\bar{b}\) splitting in the parton shower, a sample consisting of LO \(t\bar{t}\) matrix elements was generated with Pythia 8.205 [72] using the ATTBAR tune [74]. The inclusive cross-section of the sample was normalised to the NNLO + NNLL result [3237]. Pythia 8 offers several options for modelling \(g\rightarrow b\bar{b}\) splittings in the final-state parton showers, which may be accessed by varying the Timeshower:weightGluonToQuark (wgtq) parameter [75]. Differences between the models arise by neglecting (wgtq5) or retaining (wgtq3, wgtq6) the mass-dependent terms in the \(g\rightarrow b\bar{b}\) splitting kernels. Differences also arise with respect to the treatment of the high-\(m_{b\bar{b}}\) region, with specific models giving an enhanced or suppressed \(g\rightarrow b\bar{b}\) rate. The model corresponding to wgtq3 was chosen to maximise this rate. Finally, some of the models (wgtq5, wgtq6) offer the possibility to choose \(\mathrm {sgtq\cdot } m_{b\bar{b}}\) instead of the transverse momentum as the argument of \(\alpha _\mathrm{S}\) in the \(g\rightarrow b\bar{b}\) vertices. Here sgtq refers to the TimeShower:scaleGluonToQuark parameter, and is allowed to vary in the range \(0.25\le \mathrm {sgtq}\le 1\), with larger values giving a smaller \(g\rightarrow b\bar{b}\) rate and vice versa. For the model wgtq5, sgtq was set to 1, a combination that minimises the \(g\rightarrow b\bar{b}\) rate, while for wgtq6, sgtq was set to 0.25.

Table 4 Details of the theoretical cross-section calculations. For MadGraph5_aMC@NLO, two different functional forms are used for the renormalisation and factorisation scales. Additionally, the leading-order Pythia calculations were done with three different options for the \(g\rightarrow b\bar{b}\) splitting, as described in the text. The PowhegBox sample is the one used for the nominal \(t\bar{t}\) prediction in the analyses

5 Object and event selection

5.1 Object reconstruction

A description of the main reconstruction and identification criteria applied for electrons, muons, jets and b-jets is given below.

Electrons: Electron candidates [76] are reconstructed from energy clusters in the electromagnetic calorimeter that are matched to reconstructed tracks in the inner detector. The electrons are required to have \(E_{\text {T}} >25\,\mathrm{GeV}\) and \(|\eta _\mathrm{cluster}| < 2.47\). Candidates in the electromagnetic calorimeter barrel/endcap transition region \(1.37 < |\eta _\mathrm{cluster}| < 1.52\) are excluded. The longitudinal impact parameter of the track with respect to the primary vertex, \(|z_{0}|\), is required to be less than 2 mm. Electrons must satisfy tight quality requirements based on the shape of the energy deposit and the match to the track to distinguish them from hadrons. Additionally, isolation requirements are imposed based on nearby tracks or calorimeter energy deposits. These requirements depend on the electron kinematics and are derived to give an efficiency that is constant with respect to the electron \(E_{\text {T}} \) and \(\eta \). The cell-based isolation uses the sum of all calorimeter cell energies within a cone of \(\Delta R = 0.2\) around the electron direction while the track-based isolation sums all track momenta within a cone of \(\Delta R=0.3\); in both cases the track momentum itself is excluded from the calculation. A set of isolation selection criteria with an efficiency of 90 % for prompt electrons in \(Z \rightarrow ee\) events is used in the ttb analyses. Due to the reduced fake lepton background in the ttbb analyses, a looser 98 % efficient set of selection criteria is used.

Muons: Muon candidates are reconstructed by matching tracks formed in the muon spectrometer and inner detector. The final candidates are refit using the complete track information from both detector systems, and are required to have \(p_{\text {T}} >25\,\mathrm{GeV}\), \(|\eta |<2.5\), and \(|z_{0}|<2\) mm. Muons must be isolated from nearby tracks, using a cone-based algorithm with cone size \(\Delta R_{\mathrm {iso}} = 10\,\mathrm{GeV}/ p_\mathrm{T}^{\mu }\). All tracks with momenta above 1 GeV, excluding the muon’s track, are considered in the sum. The ratio of the summed track transverse momenta to the muon \(p_\mathrm{T}\) is required to be smaller than 5 %, corresponding to a 97 % selection efficiency for prompt muons from \(Z\rightarrow \mu \mu \) decays. If a muon and an electron are formed from the same track, the event is rejected.

Jets: Jets are reconstructed with the anti-\(k_t\) algorithm [1921] with a radius parameter \(R=0.4\), using calibrated topological clusters [23] built from energy deposits in the calorimeters. Prior to jet finding, a local cluster calibration scheme is applied to correct the topological cluster energies for the non-compensating response of the calorimeter, dead material, and out-of-cluster leakage [77]. The corrections are obtained from simulations of charged and neutral particles. After energy calibration, jets are required to have \(p_{\text {T}} > 25\) GeV and \(|\eta | < 2.5\). To avoid selecting jets from secondary interactions, a jet vertex fraction (JVF) cut is applied [78]. The variable is defined as the ratio of two sums of the \(p_{\text {T}} \) of tracks associated with a given jet and that satisfy \(p_{\text {T}} >1\,\mathrm{GeV}\). In the numerator, the sum is restricted to tracks compatible with the primary vertex, while in the denominator the sum includes all such tracks. A requirement that its value be above 0.5 is applied to jets with \(p_{\text {T}} <50\,\mathrm{GeV}\), \(|\eta |<2.4\), and at least one associated track.

During jet reconstruction, no distinction is made between identified electrons and other energy deposits. Therefore, if any of the jets lie within \(\Delta R\) = 0.2 of a selected electron, the single closest jet is discarded in order to avoid double-counting electrons as jets. After this, electrons or muons within \(\Delta R\) = 0.4 of a remaining jet are removed.

b -tagged jets: Jets are identified as likely to originate from the fragmentation of a b-quark (b-tagged) using multivariate techniques that combine information from the impact parameters of associated tracks and topological properties of secondary and tertiary decay vertices reconstructed within the jet [17]. The multivariate algorithms are trained either using only light-flavour jets as background (the “MV1” algorithm), or additionally including charm jets in the background to improve the charm jet rejection (the “MV1c” algorithm). The efficiency of identification in simulation is corrected to that measured in data, separately for each flavour of jet [17, 79]. For the analyses using a binned fit of the b-tagging discriminant, the probability for a simulated jet to lie in a particular bin is corrected using data.

5.2 Event selection

To ensure that events originate from proton collisions, events are required to have at least one reconstructed vertex with at least five associated tracks.

Events are required to have exactly one or exactly two selected leptons in the lepton-plus-jets and dilepton measurements, respectively. At least one of the leptons must be matched to the trigger object which triggered the event. For the ttb \(e \mu \) measurement, only events with one electron and one muon are considered. To increase the number of events in the ttbb measurements, all three lepton flavour combinations (ee, \(\mu \mu \) and \(e\mu \)) are considered. Additional lepton requirements are applied in the ttbb analyses to remove the backgrounds from \(Z/\gamma ^{*}\), \(\Upsilon \) and \(J/\psi \) decays. The invariant mass of the two leptons must satisfy \(m_{\ell \ell } > 15\) GeV and, for events with same-flavour leptons (ee or \(\mu \mu \)), must also satisfy \(|m_{\ell \ell }-91 \mathrm {\mathrm{GeV}}|>10\) GeV.

The lepton-plus-jets ttb analysis requires at least five jets, at least two of which must be b-tagged. For this analysis, c-jet rejection is important so the MV1c b-tagging algorithm is used, at a working point with 80 % efficiency for b-jets from top quark decays. This working point is optimised to give the lowest total expected uncertainty on the measurement. The ttb \(e \mu \) and ttbb fit-based dilepton analyses require at least three jets, two of which have to be b-tagged. The same b-tagging algorithm and working point as in the lepton-plus-jets analysis is used to improve the separation between b- and c-jets. The ttbb cut-based analysis requires exactly four b-tagged jets; for this analysis the MV1 algorithm is used at a working point with 70 % efficiency for b-jets from top decays. For this analysis, the tighter working point is chosen to reduce the background as much as possible, while the MV1 algorithm is chosen since the impact of the c-jet background on the analysis is less important. Table 5 summarises the selection criteria applied to the analyses.

Table 5 Summary of the main event selection criteria applied in the various channels. Other requirements which are common to all channels, including muon isolation, are described in the text

After these selection criteria are applied, the number of observed and expected events are shown in Table 6 for the ttb analyses and Table 7 for the ttbb analyses. For all but the ttbb cut-based analysis, the samples are dominated by \(t\bar{t}\) events with an additional light or charm jet. In all cases the data agree with the expectation within the systematic uncertainties described in Sect. 6. The kinematics in all channels are also found to be well-modelled. As an example, Fig. 1 shows the jet multiplicity, b-tagged jet multiplicity, and \(p_{\text {T}}\) distribution of the jet with the third highest MV1c weight in the lepton-plus-jets selection. Figure 2 shows the b-tagged jet multiplicity along with the \(p_{\text {T}}\) distribution of the jets with the third and fourth highest MV1c values in the dilepton selection. The jet \(p_{\text {T}}\) distributions in Figs. 1 and 2 correspond to the jets that are used in the fit to the distributions of the b-tagging discriminant MV1c (see Sect. 7.2).

Table 6 The number of observed and expected events in the ttb lepton-plus-jets and \(e \mu \) analysis signal regions. Indented sub-categories indicate that they are subsets of the preceding category. The uncertainty represents the total uncertainty (pre-fit) on the Monte Carlo samples, or on data events in the case of the fake and non-prompt leptons. In the ttb \(e \mu \) channel, only the \(Z\rightarrow \tau \tau \) contribution is included in Z+jets; the rest is accounted for in the fake lepton component, as is W+jets. The breakdown of the \(t\bar{t}\) sample into the fiducial sub-samples is given, using the template definitions. For illustration, the contributions to ttb from \(t\bar{t}V\) and \(t\bar{t}H\) are also shown
Table 7 The number of observed and expected events in the two ttbb analysis signal regions. Indented sub-categories indicate that they are subsets of the preceding category. The uncertainty represents the total uncertainty (pre-fit) on the Monte Carlo samples, or on data events in the case of the fake and non-prompt leptons. The breakdown of the \(t\bar{t}\) sample into the fiducial sub-samples is given, using the template definitions. For illustration, the contributions to ttbb from \(t\bar{t}V\) and \(t\bar{t}H\) are also shown
Fig. 1
figure 1

Jet multiplicity, b-tagged jet multiplicity, and transverse momentum \(p_{\text {T}}\) of the jet with the third highest MV1c value in the lepton-plus-jets channel. Events are required to have at least five jets, at least two b-tagged jets and one lepton. The data are shown as black points with their statistical uncertainty. The stacked distributions are the nominal predictions from Monte Carlo simulation; the hashed area shows the total uncertainty on the prediction. The bottom sub-plot shows the ratio of the data to the prediction. The non-prompt and fake lepton backgrounds are referred to as ‘NP & fakes’. The last bin of the distribution includes the overflow

Fig. 2
figure 2

Jet multiplicity, b-tagged jet multiplicity, and transverse momentum \(p_{\text {T}}\) of the jets with the third and fourth highest MV1c values, in the dilepton channel using the ttbb fit-based selection; events are required to have at least four jets, two b-tagged jets and two leptons (ee, \(e\mu \) or \(\mu \mu \)). The data are shown in black points with their statistical uncertainty. The stacked distributions are the nominal predictions from Monte Carlo simulation; the hashed area shows the total uncertainty on the prediction. The bottom sub-plot shows the ratio of the data to the prediction. The non-prompt and fake lepton backgrounds are referred to as ‘NP & fakes’. The last bin of the distribution includes the overflow

6 Systematic uncertainties

Several sources of systematic uncertainty are considered that can affect the normalisation of signal and background and/or the shape of their corresponding final discriminant distributions, where relevant. Individual sources of systematic uncertainty are considered as correlated between physics processes and uncorrelated with all other sources. The following sections describe each of the systematic uncertainties considered in these analyses. The uncertainties quoted are illustrative only and the effect of that uncertainty depends on the channel and analysis method used. All analyses use relative normalisation uncertainties. Section 7 details the method by which the uncertainties are included in each analysis and discusses their impact on the measurements.

6.1 Luminosity uncertainty

Using beam-separation scans performed in November 2012, a luminosity uncertainty of 2.8 % for \(\sqrt{s}=8\mathrm{TeV}\) analyses was derived applying the methodology of Ref. [80]. This uncertainty directly affects the cross-section calculation, as well as all background processes determined from MC simulation.

6.2 Physics objects

In this section, uncertainties relevant to the reconstruction of leptons, jets, and b-tagging are described.

Lepton reconstruction, identification and trigger: The reconstruction and identification efficiency of electrons and muons, their isolation, as well as the efficiency of the triggers used to record the events, differ slightly between data and simulation. Correction factors are derived using tag-and-probe techniques on \(Z\rightarrow \ell ^+\ell ^-\) (\(\ell =e,\mu \)) data and simulated samples to correct the simulation for these discrepancies [81, 82]. These have \(\sim \)1 % uncertainty on all simulated samples.

Lepton momentum scale and resolution The accuracy of the lepton momentum scale and resolution in simulation is checked using reconstructed distributions of the \(Z\rightarrow \ell ^+\ell ^-\) and \(J/\psi \rightarrow \ell ^+\ell ^-\) masses [82, 83]. In the case of electrons, E / p studies using \(W\rightarrow e\nu \) events are also used. Small discrepancies between data and simulation are observed and corrected for. In the case of muons, momentum scale and resolution corrections are only applied to the simulation, while for electrons these corrections are applied to data and simulation. Uncertainties on both the momentum scale and resolutions in the muon spectrometer and the tracking systems are considered, and varied separately. These uncertainties have an effect of less than 0.5 % on most samples, but up to 1 % on a few of the smaller backgrounds.

Jet reconstruction efficiency: The jet reconstruction efficiency is found to be about 0.2 % lower in the simulation than in data for jets with \(p_{\text {T}}\) below \(30\,\mathrm{GeV}\), and consistent with data for higher jet \(p_{\text {T}}\). To evaluate the systematic uncertainty due to this small inefficiency, 0.2 % of the jets with \(p_{\text {T}} \) below \(30\,\mathrm{GeV}\) are removed randomly and all jet-related kinematic variables are recomputed. The event selection is repeated using the modified selected jet list. These uncertainties have less than a 0.5 % effect on the acceptance of all samples.

Jet vertex fraction efficiency: The efficiency for each jet to satisfy the jet vertex fraction requirement is measured in \(Z(\rightarrow \ell ^+\ell ^-)\)+1-jet events in data and simulation, selecting separately events enriched in hard-scatter jets and events enriched in jets from other proton interactions in the same bunch crossing (pile-up). The corresponding uncertainty is evaluated in the analysis by changing the nominal JVF cut value. This uncertainty has less than a 1 % effect on the signal sample, and up to 5 % effect on the other samples [78, 84].

Jet energy scale: The jet energy scale (JES) and its uncertainty have been derived by combining information from test-beam data, LHC collision data and simulation [77, 85]. The jet energy scale uncertainty is split into 22 uncorrelated sources, each of which can have different jet \(p_{\text {T}} \) and \(\eta \) dependencies. The largest of these components is the uncertainty specifically related to b-jets, which yields an uncertainty of 1.2–2.5 % on the fiducial cross-section measurements.

Jet energy resolution: The jet energy resolution (JER) has been measured separately for data and simulation using two in situ techniques [86]. The expected fractional \(p_{\text {T}} \) resolution for a given jet is measured as a function of its \(p_{\text {T}} \) and pseudorapidity. A systematic uncertainty is defined as the difference in quadrature between the JER for data and simulation and is applied as an additional smearing to the simulation. This uncertainty is then symmetrised. This uncertainty has a 2–4 % effect on the acceptance of most samples.

Flavour tagging uncertainty: The efficiencies for b, c and light jets to satisfy the b-tagging criteria have been evaluated in data, and corresponding correction factors have been derived for jets in simulation [17, 79]. These scale factors and their uncertainties are applied to each jet depending on its flavour and \(p_{\text {T}} \). In the case of light-flavour jets, the corrections also depend on jet \(\eta \). The scale factors for \(\tau \) jets are set to those for c jets and an additional extrapolation uncertainty is considered. For the fit-based analyses, the effect on the shape of the MV1c templates is considered. A covariance matrix is formed describing how each source of uncertainty in the scale factor measurement affects each \(p_{\text {T}} \) bin. This matrix is diagonalised, leading to a set of statistically independent eigenvectors for each jet. The result is 24 uncorrelated uncertainties affecting the b-jet efficiency, 16 uncorrelated sources each for the c-jets and \(\tau \)-jets, and 48 uncorrelated sources affecting the light jets. The effect of these uncertainties depends on the analysis and the sample in question. The b-tagging uncertainties are typically largest for the ttbb channels, having an effect of up to 10 %. The uncertainty on the measurement from varying the c-jet and light jet mis-tagging rates is usually less than 1 %, but may be larger for individual backgrounds. The uncertainties associated with \(\tau \) jets are less than 0.5 % for all samples.

6.3 Uncertainties on \(t\bar{t}\) modelling

A number of systematic uncertainties affecting the modelling of \(t\bar{t}\) production are considered. In particular, systematic uncertainties due to the choice of parton shower and hadronisation models, the choice of generator, the choice of scale, the parton distribution function (PDF), and the inclusion of \(t\bar{t}V\) and \(t\bar{t}H\) events are considered. These systematic uncertainties are treated as fully correlated between the various components of \(t\bar{t}\) (e.g. between \(ttbX\), \(ttcX\) and \(ttlX\)). The effect of assuming these uncertainties to be uncorrelated among the \(t\bar{t}\) components is found to yield slightly smaller uncertainties on the measured cross-sections. As many of these uncertainties originate from similar physics processes, they are taken to be correlated.

Parton shower: An uncertainty due to the choice of parton shower and hadronisation model is derived by comparing events produced by Powheg interfaced with Pythia 6.427 to Powheg interfaced with Herwig 6.520. The PowhegBox parameter hdamp was set to infinity for this comparison for both samples. The difference between the samples is symmetrised to give the total uncertainty.

Generator: An uncertainty due to the choice of generator is derived by comparing a \(t\bar{t}\) sample generated with MadGraph interfaced to Pythia 6 to a sample generated by PowhegBox+Pythia 6. The MadGraph sample considered was produced with up to three additional partons. It used the CT10 PDF and was showered with Pythia 6.427. The difference between the samples is symmetrised to give the total uncertainty.

Initial- and final-state radiation: An uncertainty on the amount of additional radiation is determined using samples generated with MadGraph interfaced to Pythia 6 but where the renormalisation and factorisation scales are doubled or halved in the matrix element and parton shower simultaneously, which covers the variations allowed by the ATLAS measurement of \(t\bar{t}\) production with a veto on additional central jet activity [87]. The uncertainty is taken as half of the difference between the samples with higher and lower scales, relative to the central MadGraph prediction.

Parton distribution function: The PDF and \(\alpha _{\mathrm {S}}\) uncertainties are calculated using the PDF4LHC recommendations [88] considering the full envelope of the variations of the MSTW2008 68 % CL NLO [89, 90], CT10 NLO [28, 91] and NNPDF2.3 5f FFN [92] PDF sets. Due to limitations in the information available in the Powheg event record, this systematic uncertainty is evaluated on a \(t\bar{t}\) MC sample generated with MC@NLO [9395] using Herwig 6.520 for the parton shower, AUET2 for the underlying-event tune and CT10 as the nominal PDF.

Variation of \(\varvec{t\bar{t}V}\) and \(\varvec{t\bar{t}H}\) contributions: The signal in these analyses includes contributions from \(t\bar{t}V\) and \(t\bar{t}H\) in addition to QCD \(t\bar{t}b\bar{b}\) production. The relative proportion of these processes affects the fraction of ttbb events within the ttb templates, and the fractions of ttcc within the ttc and ttcX templates. It additionally affects the calculation of the fiducial efficiency, due to the different kinematics of the b-jets. In order to avoid making assumptions on the processes being measured, the effect of doubling or removing \(t\bar{t}V\) and \(t\bar{t}H\) is considered as an uncertainty.

Table 8 summarises the MC samples used to evaluate the systematic uncertainties on the \(t\bar{t}\) modelling.

Table 8 Summary of the Monte Carlo event generator parameters for the \(t\bar{t}\) samples used to evaluate the modelling uncertainties. For all PowhegBox samples version 1, r2330 is used. For MSTW2008 the 68 % CL at NLO is used

6.4 Uncertainties on the non \(t\bar{t}\) backgrounds

An uncertainty of 6.8 % is assumed for the theoretical cross-section of single top production [60, 61]. For the Wt channel, the diagram-removal scheme is applied in the default sample, in which all doubly-resonant NLO diagrams that overlap with the \(t\bar{t}\) definition are removed [95]. The difference between this and an alternative scheme, inclusive diagram subtraction, where the cross-section contribution from Feynman diagrams containing two quarks is subtracted, is considered as a systematic uncertainty.

Normalisation uncertainties for W+jets and Z+jets backgrounds are set conservatively to 50 %. The uncertainty on the diboson background rate is taken to be 25 %. In the lepton-plus-jets and ttb \(e \mu \) analyses, a conservative uncertainty of 50 % is used on the number of fake and non-prompt lepton events. Because the data samples are dominated by \(t\bar{t}\) events, the effect of all of these uncertainties on the final result is small.

7 Analysis methods

The common components of the cross-section extraction for all analyses are presented in Sect. 7.1. Three of the four measurements presented make use of the distribution of the multivariate discriminant used for b-jet identification. These distributions are presented in Sect. 7.2. The profile likelihood fits applied in the measurements of the cross-section for ttb production in the lepton-plus-jets and \(e \mu \) channels are presented in Sect. 7.3. The extraction of the cross-section for ttbb in the cut-based approach is presented in Sect. 7.4. This is followed in Sect. 7.5 by the description of the measurement of the same process using a template fit.

7.1 Cross-section extraction

The cross-sections for fiducial ttb and ttbb production (\(\sigma ^{\mathrm {fid}}\)) are obtained from the best estimate of the number of signal events (\(N_{\mathrm {sig}}\)), the fiducial efficiency (\(\epsilon _{\mathrm {fid}}\)), and, where relevant, the correction for the absence of leptons in the fiducial region used in the templates (\(f_{\mathrm {fid}}\)). The method to determine \(N_{\mathrm {sig}}\) is analysis specific and described in detail in each respective analysis section below. The fiducial efficiency is the probability for an event in the fiducial region of the templates to meet all reconstruction and selection criteria. The correction factor \(f_{\mathrm {fid}}\) is defined as the fraction of selected events satisfying the template definition that also meet the fiducial signal definition. It is only needed for the ttb \(e \mu \) and ttbb dilepton fit analyses, which do not include the lepton requirements in the template definitions as documented in Table 2; the ttb lepton-plus-jets analysis uses the same fiducial criteria for defining the signal and building the templates, while the ttbb cut-based does not make use of templates. The cross-section is given by

$$\begin{aligned} \sigma ^{\mathrm {fid}}= \frac{N_{\mathrm {sig}} \cdot f_{\mathrm {fid}}}{\mathcal {L}\cdot \epsilon _{\mathrm {fid}}}, \end{aligned}$$
(1)

where \(\mathcal {L}\) is the integrated luminosity.

The values for \(\epsilon _{\mathrm {fid}}\) and \(f_{\mathrm {fid}}\) are given in Table 9. While the cut-based ttbb analysis has the highest signal-to-background ratio, due to the high requirement on the number of b-tagged jets (at least four instead of at least two), the fiducial acceptance is much smaller than in the other channels.

Table 9 The fiducial efficiency (\(\epsilon _{\mathrm {fid}}\)) and leptonic fiducial acceptance (\(f_{\mathrm {fid}}\)) for all analyses. The uncertainties quoted include only the uncertainty due to the limited number of MC events

7.2 Multivariate discriminant for b-jet identification

The event selection for the three template fit analyses requires the presence of two or more b-tagged jets. Relatively loose working points are chosen with b-tagging efficiencies of \(\sim \)80 %, using the MV1c multivariate algorithm, because this allows for high efficiency and good signal-to-background separation.

The distribution of the MV1c discriminant for jets with the third highest, or third and fourth highest, MV1c weights is found to have significant shape differences between the \(t\bar{t}\) components. The b-tagging probability distribution for these jets has, on average, high values for ttb and ttbb events, intermediate values for events with additional c-jets, and low values for \(t\bar{t}\) events with only additional light jets.

The MV1c distribution is calibrated to data in five exclusive bins. These bin edges correspond to the equivalent cuts on the b-jet identification with efficiencies of approximately 80, 70, 60, and 50 % for b-jets from top quark decays.

The discriminant used in the ttb analyses consists of the distribution of the MV1c of the jet with the third highest MV1c weight, in the five calibrated bins. The templates used for the lepton-plus-jets and ttb \(e \mu \) analyses are shown in the left and right plots of Fig. 3, respectively.

For the dilepton ttbb fit analysis, the MV1c distributions for the jets with third and fourth highest MV1c weights are used. Since these are ordered, the weight of the fourth jet is by construction smaller than that of the third, resulting in 15 possible bins of the discriminant. The distribution of the templates used in the fit is shown in Fig. 4.

Fig. 3
figure 3

Distribution of the MV1c discriminant for the jet with the third highest MV1c weight in the lepton-plus-jets (left) and ttb \(e \mu \) (right) channels. The \(ttb\) signal distribution is compared to the distributions for backgrounds with an additional charm jet (\(ttc\)) and backgrounds with only additional light jets (\(ttl\)). The bin edges correspond to the b-tagging efficiency of the MV1c weight. The plots are normalised such that the sum over the bins is equal to unity. The statistical uncertainty of these distributions is negligible

Fig. 4
figure 4

Distributions of the third and fourth highest MV1c weight among jets for \(ttbb\) signal, \(ttbX\), \(ttcX\) and \(ttlX\) background. The bins are labelled with the upper edge of the efficiency point of the third highest and fourth highest MV1c scores in the event. The order of the bins does not affect the cross-section measurement, for this figure the bins have been ordered by decreasing MV1c efficiency point of the fourth and third MV1c score. The plots are normalised such that the sum over the bins is equal to unity. The statistical uncertainty of these distributions is negligible

7.3 Profile likelihood fit to extract the ttb cross-sections

In the lepton-plus-jets and ttb \(e \mu \) channels, the numbers of events in the \(ttb\), \(ttc\) and \(ttl\) categories are obtained by fitting to data the templates of the third highest MV1c weight. The fit is performed combining the events from both e+jets and \(\mu \)+jets into a single set of templates for the lepton-plus-jets analysis.

A binned likelihood function is constructed as the product of Poisson probability terms over all bins considered in the analysis. This likelihood depends on the signal-strength parameters, which are independent multiplicative factors of the MC predictions for \(ttb\), \(ttc\) and \(ttl\) production cross-sections, henceforth referred to as \(\mu _{ttb}\), \(\mu _{ttc}\) and \(\mu _{ttl}\). The nominal prediction (\(\mu =1\)) for each analysis is obtained from the PowhegBox \(t\bar{t}\) sample. No constraints are applied to the values of these parameters. Nuisance parameters (denoted \(\theta \)) are used to encode the effect of the various sources of systematic uncertainty on the signal and background expectations; these are implemented in the likelihood function with multiplicative Gaussian or log-normal priors. The likelihood is then maximised with respect to the full set of \(\mu \) and \(\theta \) parameters. The values of these parameters after maximisation are referred to as \(\hat{\mu }\) and \(\hat{\theta }\). The cross-section from Eq. (1) can be re-written as:

$$\begin{aligned} \sigma ^{\mathrm {fid}}= \frac{N_{\mathrm {sig}}(\hat{\mu }, \hat{\theta }) \cdot f_{\mathrm {fid}}}{\mathcal {L}\cdot \epsilon _{\mathrm {fid}}( \hat{\theta })}. \end{aligned}$$

The effects from the systematic uncertainties on both the shape and normalisation of the templates are considered, as well as the effect on the fiducial efficiency. In the ttb \(e \mu \) analysis, the uncertainty on \( f_{\mathrm {fid}}\) is also taken into account. The impact of each systematic uncertainty on these different quantities are considered as correlated.

Due to the large number of nuisance parameters considered, the likelihood fit only includes uncertainties with at least a 0.5 % effect on the event yield, or shape uncertainties that cause a relative variation of more than 0.5 % between two bins. This simplification changes the final result or uncertainty by less than 1 % and significantly reduces the execution time.

The shape variations for the PDF uncertainties on \(t\bar{t}\) in the lepton-plus-jets analysis are found to be negligible, therefore only the largest variation in acceptance is considered. In the ttb \(e \mu \) analysis, the PDF uncertainty is evaluated outside of the profile likelihood fit. For each eigenvector of each PDF set, new nominal templates are obtained for each of the components and a statistics-only fit to the Asimov dataset [96] obtained using the central value of the MC@NLO prediction is done. The relative difference between the fitted cross-section and the one obtained from the nominal MC@NLO is considered as the PDF uncertainty of that eigenvector. The envelope of all eigenvectors is then considered as the PDF uncertainty and added in quadrature to the total uncertainty obtained from the full profile likelihood fit.

Figure 5 shows the MV1c distribution used to fit the \(ttb\) signal strength in the lepton-plus-jets analysis (top) and ttb \(e \mu \) analysis (bottom). The left figure shows the predictions from simulation and the uncertainty band from the sum in quadrature of the impact of each source of uncertainty. The right plot shows the fitted results and the final uncertainty on the total prediction, which is largely driven by the size of the available MC samples. Table 10 shows the fitted values of the parameters of interest. The Asimov dataset is used to provide expected results. The total uncertainty on the measurement is found to be similar to the expected one in both analyses and the fitted \(ttb\) signal strength in both analyses is higher than one, but still compatible with unity within uncertainties. The impact of the \(ttc\) and \(ttl\) backgrounds on the measurement may be assessed by considering the correlation of \(\mu _{ttb}\) with \(\mu _{ttc}\) or \(\mu _{ttl}\) within the likelihood function. In the ttb \(e \mu \) analysis, the correlation is \(-0.5\) between \(\mu _{ttb}\) and \(\mu _{ttc}\), and \(+0.5\) between \(\mu _{ttb}\) and \(\mu _{ttl}\); in the lepton-plus-jets analysis, the correlation is \(+0.1\) in both cases.

The effect of the dominant uncertainties on the fitted signal strength is illustrated in Fig. 6. The post-fit effect on \(\mu _{ttb}\) is calculated by fixing the corresponding nuisance parameter at \(\hat{\theta }\pm \sigma _{\theta }\), where \(\hat{\theta }\) is the fitted value of the nuisance parameter and \(\sigma _{\theta }\) is its post-fit uncertainty, and performing the fit again. The difference between the default and the modified \(\hat{\mu }_{ttb}\), \(\Delta \hat{\mu }_{ttb}\), represents the effect on \(\mu _{ttb}\) of this particular uncertainty. The dominant uncertainties on both of these measurements are from \(t\bar{t}\) modelling and b-tagging uncertainties affecting the c-jets. In the lepton-plus-jets analysis, due to the large fraction of \(t\bar{t}\) events where the W-boson decays to a c-quark and a light quark, the effect of the b-tagging uncertainties on the c-jets is large. Other significant contributions come from the effect of b-tagging on b-jets and light jets, and the jet energy scale and resolution. The generator comparison shows a large effect on both the template shapes and normalisations; it is the dominant uncertainty for the ttb \(e \mu \) analysis, while for the lepton-plus-jets analysis it is smaller due to a cancellation in these effects.

Table 11 shows a summary of the uncertainties grouped into categories. The effect of each uncertainty is obtained as above and all sources of uncertainty within a category are added in quadrature to obtain the category uncertainty. The total uncertainty in the table is the uncertainty obtained from the full fit, and is therefore not identical to the sum in quadrature of each component, due to the correlations induced between the uncertainties by the fit. Nonetheless, these correlations are small enough that the difference is less than 3 % in both analyses. In order to obtain separate estimates for the statistical and systematic components of the total uncertainty in both profile likelihood fit analyses, the statistical component of the uncertainty is evaluated by fixing all nuisance parameters to their fitted values and re-evaluating the uncertainty on the fit.

Fig. 5
figure 5

The MV1c distribution of jets with the third highest MV1c weight in the lepton-plus-jets analysis (top) and ttb \(e \mu \) analysis (bottom) for all signal and background components. The data are compared to the nominal predictions (Pre-fit) (left), and to the output of the fit (Post-fit) (right). The points include the statistical uncertainty on the data. The hashed area shows the uncertainty on the total prediction. The non-prompt and fake lepton backgrounds are referred to as ‘NP & fakes’

Table 10 Fitted values for the parameters of interest for the signal strength for \(ttb\), \(ttc\) and \(ttl\) in the lepton-plus-jets and ttb \(e \mu \) analyses. Both the results from the Asimov dataset and the values obtained from the fits to data are shown. The uncertainties quoted are from the total statistical and systematic uncertainties
Fig. 6
figure 6

Effect of the uncertainty on the fitted value of \(\mu _{ttb}\) (\(\Delta \hat{\mu }_{ttb}\)) and pull of the dominant nuisance parameters in the lepton-plus-jets (left) and ttb \(e \mu \) analyses (right). The shaded and hashed areas refer to the top axis: the shaded bands show the initial impact of that source of uncertainty on the precision of \(\mu _{ttb}\); the hatched areas show the impact on the measurement of that source of uncertainty, after the profile likelihood fit at the \(\pm 1 \sigma \) level. The points and associated error bars show the fitted value of the nuisance parameters and their errors and refer to the bottom axis; a mean of zero and a width of 1 would imply no constraint due to the profile likelihood fit. Dashed lines are shown at 0 and \(\pm \)1 for reference. Only the ten highest ranked uncertainties on \(\mu _{ttb}\) are shown. The index on the b-tagging uncertainties refers to the fixed, but arbitrary position in the list of eigenvectors associated with each jet flavour

Table 11 Effect of the various sources of uncertainty on the \(ttb\) and \(ttbb\) cross-section measurements in the lepton-plus-jets and dilepton channels. The uncertainties on the \(R_{ttbb}\) ratio measurement in the dilepton fit analysis are also shown. Asymmetric uncertainties are shown when relevant. For the fit-based measurements, the individual and total uncertainties are evaluated from the fit to the data

7.4 ttbb cross-section from cut-based analysis

This ttbb measurement uses an event counting method in the dilepton channel to extract the cross-section. Events with at least four identified b-jets are considered.

The estimate of the number of signal events is obtained from the total number of observed events passing the final selection (\(N_{\mathrm {data}}\)) and the estimate of the number of background events. A distinction is made between background processes which contain two top and two bottom quarks, but do not pass the fiducial selection (referred to as non-fiducial background), and backgrounds from all other processes (referred to as non-ttbb). In order to avoid making any assumptions about the cross-section for ttbb processes, the prediction for the non-fiducial background is not taken directly from simulation; instead, simulation is used to determine the fractions of ttbb events that are signal and non-fiducial background. In particular, the fraction of particle-level ttbb events that pass the fiducial selection, \(f_{\mathrm {sig}}\), is defined as

$$\begin{aligned} f_{\mathrm {sig}} = \frac{N_{\mathrm {sig}}}{N_{\mathrm {sig}} + N^{\mathrm {non-fiducial}}_{ttbb}}. \end{aligned}$$

The cross-section from Eq. (1) can then be re-written as

$$\begin{aligned} \sigma _{t\bar{t}b\bar{b}}^{\mathrm {fid}}= \frac{(N_{\mathrm {data}} - N_{\mathrm {non}-ttbb}) \cdot f_{\mathrm {sig}}}{\mathcal {L}\cdot \epsilon _{\mathrm {fid}}}. \end{aligned}$$

In order to classify background events as non-fiducial or non-ttbb, an attempt is made to match the four reconstructed b-jets to particle-level jets.Footnote 3 If two or more of the reconstructed b-tagged jets match light-flavour or charm particle-level jets, then the event is classified as non-ttbb, otherwise it is considered as ttbb non-fiducial.

The prediction for the non-ttbb backgrounds is taken from simulation. The prediction has been validated by repeating the calculation with different definitions of the signal region, based on the b-jets with the fourth-highest value in MV1. These alternative signal regions vary in the fraction of non-ttbb backgrounds from less than 1 % to more than 50 %. Nonetheless, the measured cross-sections among the regions agree within their statistical uncertainties, giving confidence that the Monte Carlo simulation provides a sufficient description of these backgrounds.

For the calculations of \(\epsilon _{\mathrm {fid}}\) and \(f_{\mathrm {sig}}\), both electroweak (\(t\bar{t}Z\) and \(t\bar{t}H\)) and QCD production are considered, weighted according to their theoretical cross-sections. The values of the parameters \(N_{\mathrm {data}}\), \(N_{\mathrm {non}-ttbb}\), \(\epsilon _{\mathrm {fid}}\), and \(f_{\mathrm {sig}}\) are shown in Table 12, together with their uncertainties.

Table 12 The number of observed data events \(N_{\mathrm {data}}\), the predicted non-ttbb background \(N_{\mathrm {non-ttbb}}\), the signal fraction \(f_{\mathrm {sig}}\), and the fiducial efficiency \(\epsilon _{\mathrm {fid}}\) in the ttbb cut-based measurement. The numbers include \(t\bar{t}V\) and \(t\bar{t}H\) as signal

Each source of systematic uncertainty is propagated to the cross-section measurement in a coherent way by varying simultaneously the effect on the background prediction, on \(f_{\mathrm {sig}}\) and on \(\epsilon _{\mathrm {fid}}\), where applicable. A symmetrisation of the uncertainties is carried out; for uncertainties for which the positive and negative variations differ (in absolute value) by less than 0.5 %, the larger of the two is used for both variations. The middle column of Table 11 shows the effect of the dominant sources of uncertainty on this cross-section measurement.

7.5 Maximum-likelihood fit to extract the ttbb cross-section

The looser event selection used in this analyses allows a template fit to be performed in the 15 populated bins of the MV1c distribution for the jets with the third and fourth highest MV1c values. A maximum-likelihood fit to the nominal templates of \(ttbb\), \(ttbX\), \(ttcX\), \(ttlX\) and non-\(t\bar{t}\) background is carried out to extract the number of signal events in each category. Systematic uncertainties are not included in the likelihood. The cross-section is then extracted directly from Eq. (1).

This analysis also allows an extraction not only of the \(ttbb\) signal but also of the \(ttbX\), \(ttcX\), \(ttlX\) contributions and of the ratio of ttbb to the total ttjj yield:

$$\begin{aligned} R_{ttbb} = \frac{\sigma _{ttbb}}{\sigma _{ttjj}}, \end{aligned}$$

where ttjj refers to \(t\bar{t}\) production with two additional jets. The cross-section for ttjj is obtained by correcting the \(ttbb\), \(ttbX\), \(ttcX\) and \(ttlX\) cross-sections, which are calculated for events with three or four particle-level jets, to the fraction with four jets only. For \(ttbb\) the fiducial efficiency and fraction as documented in Table 9 are used; for \(ttbX\), \(ttcX\) and \(ttlX\) the fiducial efficiencies and fractions are shown in Table 13.

Table 13 The fiducial efficiency (\(\epsilon _{\mathrm {fid}}\)) and leptonic fiducial acceptance (\(f_{\mathrm {fid}}\)) for the \(ttbX\), \(ttcX\) and \(ttlX\) categories as used in the ttbb fit-based analysis. The uncertainties quoted include only the uncertainty due to the limited number of MC events

Figure 7 shows the MV1c distribution used to fit the number of \(ttbb\) events; the left figure shows the predictions from simulation compared to the observed distribution in data; the right plot shows data compared to the result of the fit. The fitted cross-sections for each of the components are shown in Table 14 along with the predictions from PowhegBox+Pythia 6; the uncertainties shown are the statistical uncertainty of each component as obtained from the fit. The fitted cross-sections are compatible with the predictions within fit uncertainties. The central value for \(t\bar{t}b\bar{b}\) is 1.1 times the predictions from PowhegBox+Pythia 6, consistent with the \(\mu \) values found in the two ttb analyses. In particular the values for the \(ttbX\)  \(ttcX\) and \(ttlX\) may be used to cross-check the assumptions made about the background contributions to the cut-based analysis.

Fig. 7
figure 7

The MV1c distribution of jets with the third and fourth highest MV1c weight in the dilepton channel for all signal and background components. The bins are labelled with the upper edge of the efficiency point of the third highest and fourth highest MV1c scores in the event. The data (left) are compared to the nominal predictions (Pre-fit), and (right) to the output of the fit (Post-fit). The points include the statistical uncertainty on the data. The hashed area shows the total uncertainties. The bottom sub-plot shows the ratio of the data to the prediction. The non-prompt and fake lepton backgrounds are referred to as ‘NP & fakes’

Table 14 Observed and predicted cross-sections for each of the components measured in the ttbb fit analysis and on the \(R_{ttbb}\) ratio. The statistical, total systematic, and total uncertainties on each component are also shown. The predicted values are from Powheg+Pythia 6 \(t\bar{t}\)

For most sources of systematic uncertainty, the templates for signal and background distributions are obtained from the event sample where a \(\pm 1\sigma \) shift of the uncertainty was applied. The new templates and the old templates are fitted to the nominal MC sample, and the relative difference between the yields is taken as the uncertainty on the number of events. For systematic uncertainties that also affect the fiducial efficiencies, the efficiency is varied coherently and the effect on the final cross-section is obtained. The effect due to limited number of MC events in the templates is obtained from the mean of 5000 pseudo-datasets obtained from simulation, where the variance of each bin depends on the total MC statistical uncertainty of that bin. The second to last column of Table 11 shows the effect on the final ttbb cross-section measurement in this analysis whereas the rightmost column shows the uncertainties on the \(R_{ttbb}\) measurement.

The total cross-section uncertainty of each process and on the \(R_{ttbb}\) ratio are shown in Table 14 along with the statistical and total systematic uncertainties. The uncertainties on the \(ttbX\) and \(ttcX\) processes are large and do not allow the cross-sections of these processes to be constrained significantly. The signal strength \(\mu _{ttbb}\) has a correlation of 0.4 with \(\mu _{ttbX}\), \(-0.1\) with \(\mu _{ttlX}\), and nearly 0 with \(\mu _{ttcX}\).

8 Results

The fiducial cross-sections obtained for each analysis in the previous section are shown in Table 15.

The measurements of the ttb cross-section in the lepton-plus-jets and ttb \(e \mu \) analyses are both higher than the predicted cross-section from the Powheg+Pythia 6 sample, with a best fit value for the signal strength \(\mu _{ttb}\) of 1.32 and 1.30, respectively. The total measurement uncertainty in the lepton-plus-jets channel is fractionally smaller than in the ttb \(e \mu \) analysis, \(\sim \)25 % compared to \(\sim \)32 %, owing to the higher acceptance times branching ratio of this decay channel. The uncertainty in this channel is dominated by uncertainties on the tagging efficiency due to c-jets from events in which the W boson decays to a c- and a light quark.

The two measurements of the ttbb cross-section show similar precision despite the different approaches, with the cut-based and fit-based analyses having a total uncertainty of \(\sim \)35 % and \(\sim \)36 %, respectively. The cut-based analysis is largely insensitive to the modelling of the non-ttbb background from \(t\bar{t}\) events as the selection criteria are very tight. In contrast, the fit-based analysis uses looser selection criteria in an attempt to obtain a data-driven constraint on these processes. While the precision of the fit-based analysis does not allow for a measurement of these backgrounds, it does confirm the validity of the simulation, and allows for an explicit measurement of the \(R_{ttbb}\) ratio. The two ttbb measurements select different events and hence are not fully correlated. A small excess of data with respect to the nominal prediction is seen in the events that are common to both measurements, while a small deficit is seen for events with jets that satisfy the MV1c 80 % criterion but fail the MV1 70 % criterion that is used in the cut-based analysis. These two features explain the difference between the observed cross-section in the two analyses.

An alternative set of results is obtained by subtracting the predicted \(t\bar{t}V\) and \(t\bar{t}H\) contribution from the signal; no additional uncertainty due to the cross-section of these processes is considered. This allows a direct comparison of the measurements to QCD-only predictions, although with assumptions about the \(t\bar{t}V\) and \(t\bar{t}H\) cross-sections. These results are summarised in Table 16 and Fig. 8 and compared to theoretical predictions obtained with the generators described in Sect. 4.4 and shown in Table 4. The ratio of the \(t\bar{t}b\bar{b}\) and ttjj cross-sections as measured in the \(t\bar{t}b\bar{b}\) fit-based analysis is compared to theoretical predictions in Fig. 9. The uncertainties on the theoretical predictions are obtained by simultaneously varying the renormalisation and factorisation scales by a factor of two.

Table 15 Measured fiducial cross-section for ttb in the lepton-plus-jets and \(e \mu \) channels, and ttbb in the dilepton channel using a cut-based or a fit-based method. Results for the \(R_{ttbb}\) ratio measurement from the ttbb fit-based method are also shown. The uncertainties quoted are from the statistical and total systematic uncertainties. The predicted cross-section is from PowhegBox with Pythia 6 for the QCD component, from Helac for \(t\bar{t}H\)  and from MadGraph 5 for \(t\bar{t}V\)
Table 16 Observed and predicted cross-sections for the three fiducial phase-space regions. The measurements are shown with the contributions from \(t\bar{t}V\) and \(t\bar{t}H\) removed to allow direct comparison to the predictions containing only the pure QCD matrix elements. Results for the \(R_{ttbb}\) ratio measurement from the ttbb fit-based method are also shown. The measurement uncertainties are separated into statistical (first) and systematic (second) uncertainties. The uncertainties on the theoretical predictions are obtained by simultaneously varying the renormalisation and factorisation scales by a factor of two up or down. These variations have not been calculated for the LO Pythia 8 samples or for the Powheg+Pythia 6 sample

The predictions containing NLO matrix elements for the \(pp\rightarrow t\bar{t}b\bar{b}\) process, as well as the merged LO + PS prediction from MadGraph+Pythia 6 are in agreement with the measured cross-sections within the measurement uncertainties. The cross-sections obtained in the 5FS (Powhel) are higher than the 4FS ones (MadGraph5_aMC@NLO) as expected, however the two predictions agree within the respective scale uncertainties. The models utilizing softer choices for the renormalisation/factorisation scales show the best agreement with the data. Different \(g\rightarrow b\bar{b}\) splitting models significantly affect the ttbb and ttb cross-sections in the samples where all additional b-jets come from the parton shower. The predictions corresponding to wgtq = 3 and wgtq = 5, which correspond to the extreme models, differ by more than a factor of two. The cross-sections obtained with the wgtq = 3 model are significantly higher than the measured ones, thus indicating that this model overestimates the \(g\rightarrow b\bar{b}\) rate. The cross-sections obtained with the other models are both in agreement with the data.

Fig. 8
figure 8

Comparison of the measured cross-sections in the three fiducial phase-space regions with theoretical predictions obtained from a variety of different generators. The measurements are shown with the contributions from \(t\bar{t}V\) and \(t\bar{t}H\) removed to allow direct comparison to the predictions containing only the pure QCD matrix elements. The coloured bands indicate the statistical and total uncertainties of the measurements. The errors on the theoretical prediction are obtained by simultaneously varying the renormalisation and factorisation scales by a factor of two. These variations have not been calculated for the LO Pythia 8 samples or for the Powheg+Pythia 6 sample

Fig. 9
figure 9

Comparison of the measured ratio of the ttbb and ttjj cross-sections in the fiducial phase-space region of the ttbb fit-based analysis with theoretical predictions obtained from a variety of different generators. The measurements are shown with the contributions from \(t\bar{t}V\) and \(t\bar{t}H\) removed to allow direct comparison to the pure QCD generators. The coloured bands indicate the statistical and total uncertainties of the measurement. The error on the MadGraph+Pythia prediction is obtained by simultaneously varying the renormalisation and factorisation scales by a factor of two. These variations have not been calculated for the LO Pythia 8 samples or for the Powheg+Pythia 6 sample

9 Conclusions

Measurements in the fiducial phase space of the detector of the cross-sections for the production of \(t\bar{t}\) events with one or two additional b-jets are performed in proton–proton collisions at a centre-of-mass energy of 8 TeV at the LHC. The results are based on a dataset corresponding to an integrated luminosity of 20.3 fb\(^{-1}\), collected with the ATLAS detector. The cross-section times branching ratio for top pair events with at least one additional b-jet is measured to be 950 \(\pm \) 70 (stat.) \(^{+240}_{-190}\) (syst.) fb in the lepton-plus-jets channel and 50 \(\pm \) 10 (stat.) \(^{+15}_{-10}\) (syst.) fb in the \(e \mu \) channel. The cross-section times branching ratio with at least two additional b-jets is measured to be 19.3 \(\pm \) 3.5 (stat.) \(\pm \) 5.7 (syst.) fb in the dilepton channel (\(e \mu \)\(\mu \mu \), and ee) using a method based on tight selection criteria, and 13.5 \(\pm \) 3.3 (stat.) \(\pm \) 3.6 (syst.) fb using a looser selection which allows extraction of the background normalisation from data. A measurement of the ratio of \(t\bar{t}\) production with two additional b-jets to \(t\bar{t}\) production with any two additional jets is also performed; this ratio is found to be 1.30 \(\pm \) 0.33 (stat.) \(\pm \) 0.28 (syst.)%. The measurements are found to agree within their uncertainties with NLO + PS calculations of the \(pp\rightarrow t\bar{t}b\bar{b}\) process, as well as with merged LO + PS calculations of \(pp\rightarrow t\bar{t}+\le 3\) jets, favouring the predictions obtained with soft renormalisation/factorisation scales. The measurements are shown to be sensitive to the description of \(g\rightarrow b\bar{b}\) splitting in the parton shower, with the most extreme Pythia 8 model being disfavoured by the measurements.