1 Introduction and motivation

Neutral strange particle production has been studied in collider experiments using \(e^{+}e^{-}\) [1,2,3,4,5,6,7,8,9,10,11], pp [12,13,14,15,16,17,18,19], \(p{\bar{p}}\) [20,21,22], ep [23, 24] and heavy-ion collisions [25, 26], as well as in fixed-target experiments [27,28,29,30,31,32,33,34,35,36,37,38,39,40]. These measurements provide interesting tests of theoretical jet fragmentation functions [41] and can be used to validate and tune the values of empirical parameters used in the parton shower and fragmentation parts of the Monte Carlo (MC) models. Since the mass of the strange quark is comparable to the QCD scale parameter \(\Lambda _{\mathrm {QCD}}\), perturbative calculations cannot be performed. These models must be highly accurate to constrain the underlying event (UE) effects in the high transverse momentum (\(p_{\mathrm{T}}\)) production investigated at the Large Hadron Collider (LHC). In particular the ratio \(\gamma _{\mathrm {s}}=s/u\), giving the suppression factor of strange to non-strange meson production in the hadronic final states, is measured to be larger in pp collisions than in \(e^{+}e^{-}\) annihilation. A review is given in Ref. [42].

It was suggested [43] that the suppression factor \(\gamma _{\mathrm {s}}\) would be significantly larger, or even tend to unity, in nucleus–nucleus collisions because the many strings produced within the Lund fragmentation scheme in a limited phase space may interact, giving rise to the formation of ‘colour ropes’. Recent data from RHIC [25, 44] tend to support these ideas and show that neutral strange particle production is enhanced. In pp collisions at LHC energies, many overlapping strings due to multi-parton interactions are also expected to come into play, so that higher rates of strange meson and baryon production are expected [42]. This effect was confirmed recently by the ALICE [16] and CMS [15] collaborations.

The measurements presented in this paper are useful contributions for future determinations of \(|V_{ts}|\). The prospects for directly measuring the CKM matrix element \(|V_{ts}|\)are discussed in Refs. [45, 46]. The idea is to measure the fraction of \(pp\rightarrow t\bar{t}\rightarrow W^+ b W^- {\bar{s}} (W^+ s W^- {\bar{b}})\) in \(t\bar{t}\) decays. Since this is small compared with the dominant background \(pp\rightarrow t\bar{t}\rightarrow W^+ b W^- {\bar{b}}\), a good understanding of neutral strange particle production inside b-jets in \(t\bar{t}\) final states is needed for a future direct measurement of this matrix element.

Studies of neutral strange particle production at the LHC have been carried out using minimum-bias events at low luminosities [12,13,14,15,16,17,18]. The aim of this paper is to extend the studies to \(t\bar{t}\) production, which is known to be a copious source of high-\(p_{\mathrm{T}}\) jets, especially b-jets. In doing so, three cases are considered depending on whether the neutral strange particles are embedded in jets, with or without a b-tag, or not associated with any selected jet.

In current MC generators the production of neutral strange particles within jets in top quark decays exhibits little sensitivity to initial-state radiation effects, different choices of parton distribution functions (PDF) or UE effects. In contrast, neutral strange particle production outside jets is more sensitive to details of the parton shower’s initial- and final-state radiation, the fragmentation scheme and multi-parton interactions (MPI). They are also very sensitive to the ratio \(\gamma _{\mathrm {s}}\) of strange to up quarks.

This analysis was performed using a \(t\bar{t}\) event sample collected with the ATLAS detector in the 2011 running period with pp collisions at \(\sqrt{s} = \) 7 TeV. These data are less affected by multiple pp interactions within the same (in time) or nearby (out of time) bunch crossings, or pile-up, than data collected later.

This paper is organised as follows. Section 2 gives a brief description of the ATLAS detector. Section 3 is devoted to the MC samples used. Section 4 explains the data sample and the event selection criteria. Section 5 is dedicated to the reconstruction and selection of neutral strange particles, as well as the background subtraction procedure. Section 6 shows the results at the detector level compared with MC generator simulations. Neutral strange particle production is studied in terms of distributions of transverse momentum, pseudorapidity, energy and multiplicity for the three cases stated above. Section 7 discusses the efficiency correction calculations and the statistical error propagation. Section 8 gives details of the main systematic uncertainties. Section 9 shows the results corrected to the particle level compared with the predictions of different MC models, thus checking the model-dependence of neutral strange particle production in these events. Finally, Sect. 10 presents a summary and conclusions.

2 The ATLAS detector

The ATLAS detector is described in detail in Ref. [47]. All of its subsystems are relevant for this analysis, including the inner detector (ID), the electromagnetic and hadronic calorimeters and the muon spectrometer.

The inner detector, located within a \(2 \hbox { T}\) axial magnetic field, is used to measure the momentum of charged particles. Its \(\eta \)\(\phi \) coverage includes the full azimuthal range \(-\pi \le \phi \le \pi \) and the pseudorapidity range \(\left| \eta \right| < 2.5\).Footnote 1 The inner detector includes a silicon pixel detector (Pixel), a silicon microstrip tracker (SCT) and a transition radiation tracker (TRT). The calorimeter system covers the pseudorapidity range \(\left| \eta \right| < 4.9\). The electromagnetic section, covering the region \(\left| \eta \right| < 3.2\), uses liquid argon as the active material in barrel and endcap calorimeters with accordion-shaped electrodes and lead absorbers. The hadronic calorimeter system consists of a steel/scintillator-tile barrel calorimeter (\(\left| \eta \right| < 1.7\)) and a copper liquid-argon endcap (\(1.7< \left| \eta \right| < 3.2\)). In addition, a forward calorimeter consisting of liquid argon with copper and tungsten for the absorbers extends the pseudorapidity coverage to \(\left| \eta \right| = 4.9\). The muon spectrometer, located inside a toroidal magnetic field, provides triggering and muon tracking capabilities in the ranges \(\left| \eta \right| < 2.4\) and \(\left| \eta \right| < 2.7\) respectively. This allows identification of muons with momenta above 3 \(\text {GeV}\) and precision determination of the muon transverse momentum up to 1 TeV. In this analysis muons reconstructed in the muon spectrometer are matched with well-measured tracks from the inner detector.

The trigger system [48] uses three consecutive levels: level 1 (L1), level 2 (L2) and the event filter (EF). The L1 triggers are hardware-based and use coarse detector information to identify regions of interest, whereas the L2 triggers are based on fast online data reconstruction algorithms. Finally, the EF triggers use offline data reconstruction algorithms. For this analysis, events are required to pass a single-electron or single-muon trigger.

3 Monte Carlo event simulation

The MC generators used to describe particle production in pp collisions differ in the approximations used to calculate the underlying short-distance QCD process, in the manner parton showers are used to take into account higher-order effects and in the fragmentation scheme responsible for long-distance effects. The generated events were passed through a detailed Geant 4 simulation [49] of the ATLAS detector [50].

The baseline \(t{\bar{t}}\) MC sample was produced with the next-to-leading-order (NLO) generator PowhegBox (referred to hereafter as Powheg) [51,52,53] for the matrix element calculation with the CTEQ66 NLO PDF. The parton shower and hadronisation processes were implemented using Pythia6 [54] with the CTEQ6L PDF [55]. Pythia6 orders the parton shower by \(p_{\mathrm{T}}\) and uses the Lund string fragmentation scheme [56]. The parton shower and UE effects were modelled using a set of tuned parameters called the Perugia2011c tune [57]. Pile-up contributions were accounted for by generating events with Pythia6, using the AMBT2B minimum bias (MB) tune. These were then overlaid onto the signal events at detector level. The strangeness suppression factor \(\gamma _{\mathrm {s}}\) was taken at its default value \(\gamma _{\mathrm {s}} = 0.3\) in the AMBT2B tune, while \(\gamma _{\mathrm {s}} = 0.2\) was used in the Perugia2011c tune. The latter was tuned to LEP data.

Additional MC samples are used to estimate the hadronisation model dependence of \(K_{S}^{0}\) and \(\Lambda \) production. They are based on MC@NLO + Herwig [58, 59], which orders the parton showers by angular separation and uses the cluster hadronisation model [60] and CT10 NLO [61] PDFs. Multi-parton interactions were simulated using Jimmy [62] with the AUET2 tune, while pile-up effects were taken into account as in Powheg+Pythia6. The parameter governing strangeness suppression in Herwig is not \(\gamma _{\mathrm {s}}\), but the probability of producing an \(s{\bar{s}}\)-pair when the clusters are fragmented. This parameter was set at its default value, which is equal to that for the other light quarks. The suppression is then given by the s-quark mass in the non-perturbative gluon splitting \(g \rightarrow s{\bar{s}}\).

The data, corrected for detector effects, are also compared with events from other MC generators at particle level, without detector MC simulation:

  • Sherpa 2.1.1 [63], which uses a different approach than previous generators for the matrix element calculation up to NLO accuracy with the CT10 PDFs, as well as for the parton shower implementation, with cluster hadronisation. Sherpa uses \(\gamma _{\mathrm {s}} = 0.4\).

  • Powheg with the NNPDF3.0 NLO PDF set [64], interfaced to Pythia8 [65] with the NNPDF2.3 LO PDF set and the A14 tune [66] for the parton shower, hadronisation and UE modelling.

  • Powheg interfaced to Herwig7 (v7.1) [67] with the NNPDF3.0 NLO PDF set and H7UE tune, as default, for the parton shower, hadronisation and UE modelling.

  • MadGraph5 aMC@NLO generator (referred to hereafter as aMC@NLO) [68] interfaced to Herwig7 as before.

  • The leading-order (LO) Acermc generator [69] interfaced to Pythia6, with different tunes such as Perugia2011c or TuneAPro [70] for parton showering and hadronisation, as well as with different colour reconnection (CR) schemes.

Table 1 presents a summary of the different signal MC sample tunes used in this analysis.

Table 1 Summary of basic generator settings used to simulate the \(t{\bar{t}}\) events

Background samples were generated for the production of \(Z\) boson in association with jets, including heavy flavours, using the Alpgen [71] generator with the CTEQ6L PDFs [55], and interfaced with Herwig and Jimmy. The same generator was used for the diboson backgrounds, \(WW\), \(WZ\) and \(ZZ\), while MC@NLO was used for the simulation of the single-top-quark background in the \(Wt\) final state.

The MC simulated samples are normalised to their corresponding cross-sections, as described in the following. The \(t\bar{t}\) signal is normalised to the cross-section calculated at approximate next-to-next-to-leading order (NNLO) using the Hathor package [72], while for the single-top-quark production cross-section, the calculations in Ref. [73] were used. The \(Z\) plus jets cross-sections are taken from Alpgen [71] with additional NNLO K-factors as given in Ref. [74].

The simulated events are weighted such that the distribution of the number of interactions per bunch crossing in the simulated samples matches that of the data. The size of the MC samples considered in this analysis exceeds that of the data sample by more than an order of magnitude.

4 Data sample and event selection

The data sample used in this analysis corresponds to an integrated luminosity of 4.6 \(\hbox {fb}^{-1}\), collected in 2011. The uncertainty in the integrated luminosity is 1.8% [75]. The sample consists of data taken while all relevant subdetector systems were operating under stable beam conditions.

In order to reduce the jet activity from hadronic \(W^{\pm }\) decay channels, the dileptonic \(t {\bar{t}}\) decay mode is used in this analysis. Events in this decay mode were selected as described in Refs. [76, 77], using a trigger based upon a high-\(p_{\mathrm{T}}\) electron with a threshold of either 20 or \(22~\text {GeV}\), or a muon with \(p_{\mathrm{T}}(\mu )> 18~\text {GeV}\). Events are required to have at least one primary vertex, with five or more tracks with \(p_{\mathrm{T}}^{\mathrm {track}}\ge 400~\text {MeV}\). If there is more than one primary vertex, the one with the largest \(\sum p_{\mathrm{T}}^2\) is chosen, where the sum is over the transverse momenta of tracks from the vertex.

Electron candidates are reconstructed from energy deposits in the calorimeter that are associated with tracks reconstructed in the ID. The candidates must pass a tight selection [78], which uses calorimeter and tracking variables as well as TRT information for \(|\eta |<2.0\), and are required to have transverse momentum \(p_{\mathrm{T}}(e) > 25~\text {GeV}\) and \(|\eta |<2.47\). Electrons in the transition region between the barrel and endcap calorimeters are not considered. Muon candidates are reconstructed by searching for track segments in different layers of the muon spectrometer. These segments are combined and matched with tracks found in the ID. The candidates are re-fitted using the complete track information from both detector systems and are required to have a good fit for muons with \(p_{\mathrm{T}}(\mu ) > 20~\text {GeV}\) and \(|\eta |<2.5\).

The selected events are required to have exactly two isolated charged leptons (e or \(\mu \)). At least one of them must match with the corresponding trigger object. For electron candidates, the isolation criterion requires that the transverse energy deposited around the electron in the calorimeter in a cone of sizeFootnote 2 \(\Delta R = 0.2\) is below 3.5 \(\text {GeV}\), excluding the electron energy cluster itself. For muon candidates, both the transverse energy in the calorimeter and the transverse momentum in the tracking detector around the muon in a cone of size \(\Delta R = 0.3\) must be below 4 \(\text {GeV}\). The track isolation is calculated from the scalar sum of the transverse momenta of tracks with \(p_{\mathrm{T}}> 1~\text {GeV}\), excluding the muon. Cosmic-ray muons are rejected by a veto on muon candidate pairs back-to-back in the transverse plane and with transverse impact parameter \(|d_{0}|>0.5~\hbox {mm}\) relative to the beam axis [79]. The two isolated leptons are required to have opposite charges. For the ee and \(\mu \mu \) channels, the invariant mass of the two leptons must be greater than 15 \(\text {GeV}\), to reject background from low-mass resonances decaying into lepton pairs, and at least 10 \(\text {GeV}\) away from the \(Z\) boson mass.

Jets are reconstructed with the anti-\(k_t\) algorithm [80] with radius parameter \(R = 0.4\). The input objects to the jet algorithm, for both data and detector-level simulation, are topological clusters of energy in the calorimeter [81]. These clusters are seeded by calorimeter cells with \(|E_{\mathrm {cell}}| > 4 \sigma \), with \(\sigma \) the RMS of the noise. Neighbouring cells are added and clusters are formed following an iterative procedure.

The baseline calibration of these clusters corrects their energy to the electromagnetic energy scale, which is established using test beam measurements for electrons, pions and muons in the electromagnetic and hadronic calorimeters [82,83,84]. Effects due to non-compensating calorimeter response, energy losses in dead material, shower leakage, and inefficiencies in energy clustering and jet reconstruction are taken into account. This is done by matching calorimeter jets with MC particle jets in bins of \(|\eta |\) and \(p_{\mathrm{T}}\). The result is called the jet energy scale (JES), thoroughly discussed in Ref. [85]. It is different for b-jets and light-flavour jets since they have different particle compositions. More details and a discussion of JES uncertainties are given in Ref. [86]. The jet energy resolution (JER) and its uncertainties are discussed in Ref. [87].

The selected events are required to have at least two jets with \(|\eta |< 2.5\) and \(p_{\mathrm{T}}> 25~\text {GeV}\). In addition, jets are required to have a jet vertex fraction [88], defined as the scalar transverse momentum sum of the tracks that are associated with the jet and originate from the hard-scatter vertex divided by the scalar sum of all associated tracks, greater than 0.75 in order to minimise pile-up effects. At least one of the jets must be identified as a b-tagged jet, using the multivariate MV1 algorithm [89] based on the reconstruction of secondary vertices and three-dimensional impact parameter information. The MV1 working point corresponds to a b-tagging efficiency of 70%, calculated using \(t{\bar{t}}\) MC events with an average light-flavour mistag rate of 2%. Jets overlapping with an accepted electron are removed if the separation is \(\Delta R < 0.2\). Electrons are removed if \(0.2< \Delta R < 0.4\). Muons are removed if their separation from a jet is \(\Delta R < 0.4\).

The reconstruction of the direction and magnitude of the missing transverse momentum (\(E_{\mathrm{T}}^{\mathrm{miss}}\)) is described in Ref. [90] and begins with the vector sum of the transverse momenta of all jets with \(p_{\mathrm{T}}> 20~\text {GeV}\) and \(|\eta |<4.5\). The transverse momenta of electron candidates are added. The contributions from all muon candidates and from all calorimeter clusters not belonging to a reconstructed object are also included. The missing transverse momentum is required to be \(E_{\mathrm{T}}^{\mathrm{miss}}> 60~\text {GeV}\) for the ee and \(\mu \mu \) channels, and for the \(e\mu \) channel the requirement is \(H_{\mathrm{T}}>130~\text {GeV}\), where \(H_{\mathrm{T}}\) is the scalar sum of the transverse momenta of the two leptons and the selected jets.

Table 2 Summary of the event selection criteria for the analysis

After applying these selection criteria, which are summarised in Table 2, a sample of 6926 \(t {\bar{t}}\) candidate events is selected. MC studies indicate that the background contamination in the sample after event selection is \(\sim \) 6%, dominated by single-top-quark events. The background contribution from \(Z\) boson production with the \(Z\) boson decaying leptonically, in association with jets (including heavy flavours \(b{\bar{b}}\)), is at the level of 1%. An additional source of background where one or more of the reconstructed lepton candidates are non-prompt or misidentified is found to be at the 1% level with a very large (50%) statistical uncertainty [76, 91], and is not considered in this analysis. The expected composition of the sample is summarised in Table 3, where ‘Diboson’ includes the \(WW\), \(WZ\) and \(ZZ\) contributions. The percentages for signal and background processes quoted in Table 3 are in agreement with those quoted in Ref. [91].

Table 3 Expected composition of the selected sample in terms of number of events (\(N_{\mathrm {evt}}\)) and fractions (%) of different processes. Uncertainties are statistical only

Figure 1 shows the distribution of jet multiplicity and the \(p_{\mathrm{T}}\) spectra of all jets, b-tagged jets and non-b-tagged jets. The jet activity is indeed limited, as 94% of the selected events contain at most four jets. The shapes of the normalised distributions in data are in good agreement with the prediction given by the \(t\bar{t}\) Powheg + Pythia6 simulation only. The small contributions from processes other than \(t\bar{t}\) are neglected in the following analysis.

MC studies show that 99% of the selected b-tagged jets correspond to particle level b-jets, while 28% of jets in the non-b-tagged sample are b-jets which are not tagged by the MV1 algorithm. These fractions are calculated by matching detector-level jets, b-tagged or not, to their corresponding particle-level jets, which are defined in Sect. 7. These fractions are found to be largely independent of whether non-\(t\bar{t}\) backgrounds are considered or not. Furthermore, the purity of b-tagged jets is rather independent of jet \(p_{\mathrm{T}}\) as shown in Figure 1(c).

Fig. 1
figure 1

Spectra of a jet multiplicity \(N_{\mathrm {jets}}\) and b jet \(p_{\mathrm{T}}\) in data compared with \(t{\bar{t}}\) Powheg + Pythia6 predictions at detector level. The expected b-jet flavour fractions for c b-tagged and d non-b-tagged jets as a function of jet \(p_{\mathrm{T}}\) are also compared with data. These distributions are normalised to the total number of selected events in data or MC predictions

5 \(K_{S}^{0}\) and \(\Lambda \) reconstruction

5.1 Neutral strange particle reconstruction

Neutral strange hadrons are reconstructed in the \(K_{S}^{0}\rightarrow \pi ^{\mathrm{+}}\pi ^{\mathrm{-}}\) (\(\Lambda \rightarrow p \pi ^{\mathrm{-}}\), \({\bar{\Lambda }} \rightarrow {\bar{p}} \pi ^{\mathrm{+}}\)) decay mode by identifying two tracks originating from a displaced vertex, thus profiting from the long lifetimes of neutral K mesons (\(\Lambda \) baryons) with \(c\tau _0 \approx \) 2.7 \(\hbox {cm}\) (\(c\tau _0 \approx \) 7.9 \(\hbox {cm}\)).

Tracks are reconstructed within the \(|\eta |< 2.5 \) acceptance of the ID, as described in Refs. [92, 93]. The \(K_{S}^{0}\) (\(\Lambda \)) candidates are oppositely charged track pairs with the transverse momentum of the two-track system \(p_{\mathrm{T}}>\) 100 (500) \(\text {MeV}\). The tracks must have at least two hits in the Pixel or SCT detectors, and are fitted to a common vertex. For \(K_{S}^{0}\) reconstruction, the pion mass is assumed for both tracks, while the proton and pion masses are assumed for the \(\Lambda \) case.Footnote 3 Further requirements on these candidates are given below, and Ref. [12] provides more details:

  • The \(\chi ^2\) of the two-track vertex fit is required to be less than 15 (with 1 degree of freedom).

  • The transverse flight distance (\(R_{xy}\)) is defined to be the distance between the \(K_{S}^{0}\) (\(\Lambda \)) decay point and either the secondary b-tagged vertex, when the \(K_{S}^{0}\) (\(\Lambda \)) is contained in a jet with a b-tag, or the reconstructed primary vertex when the \(K_{S}^{0}\) (\(\Lambda \)) is contained in a jet without a b-tag or is not associated with any selected jet. A requirement 4 \(\hbox {mm}\) \(< R_{xy}<\) 450 \(\hbox {mm}\) (17 \(\hbox {mm}\) \(< R_{xy}<\) 450 \(\hbox {mm}\)) ensures that the tracks are reconstructed inside the Pixel+SCT part of the ID tracker.

  • The angle between the \(K_{S}^{0}\) (\(\Lambda \)) momentum vector and the \(K_{S}^{0}\) (\(\Lambda \)) flight direction (obtained from the line connecting the decay vertex to the primary vertex, or to the secondary vertex if the \(K_{S}^{0}\) (\(\Lambda \)) is inside a b-jet) has to satisfy \(\cos \theta _K>\) 0.999 (\(\cos \theta _{\Lambda }>\) 0.9998).

The \(K_{S}^{0}\) (\(\Lambda \)) candidates that fulfil these conditions are then separated into three classes: candidates inside a b-tagged jet, inside a non-b-tagged jet and outside any jet. To this end the separation \(\Delta R\) between the \(K_{S}^{0}\) (\(\Lambda \)) line of flight and the jet axis in the \(\eta \)\(\phi \) plane is calculated. If \(\Delta R<\) 0.4, a \(K_{S}^{0}\) (\(\Lambda \)) is associated with a jet. Otherwise it is classified as being outside any jet. There are no cases of a single \(K_{S}^{0}\) (\(\Lambda \)) being inside two different jets.

The mass distributions for three classes of \(K_{S}^{0}\) and the total sample of \(\Lambda \) candidates in data are shown in Fig. 2 compared with the Powheg + Pythia6 predictions scaled to the total number of events in the data sample. The three \(K_{S}^{0}\) mass distributions exhibit a resonance structure, centred around the nominal \(K_{S}^{0}\) mass, with constant tails extending on both sides, indicating the presence of fake candidates, i.e. track pairs which not being \(K_{S}^{0}\) or \(\Lambda \) decay products have a mass in the signal mass ranges considered. The \(K_{S}^{0}\) mass distributions are fairly well described by the nominal \(t{\bar{t}}\) MC simulation except for the \(K_{S}^{0}\) candidates not associated with jets, in which case the MC prediction underestimates the data by roughly \(30\%\). Similar features are exhibited by the mass distribution of \(\Lambda \) candidates.

Fig. 2
figure 2

\(K_{S}^{0}\) and \(\Lambda \) candidate mass distributions in data compared with Powheg + Pythia6 simulation. Three classes are presented for \(K_{S}^{0}\): a inside b-tagged jets, b inside non-b-tagged jets, and c outside any jet. The total sample is shown for d \(\Lambda \) candidates

5.2 Background subtraction

In order to take into account the background due to fake candidates in the \(K_{S}^{0}\) (\(\Lambda \)) mass distributions, a simple sideband subtraction in the reconstructed mass distribution is used. The signal range [480–520] \(\text {MeV}\) ([1106–1126] \(\text {MeV}\)) is considered for \(K_{S}^{0}\) (\(\Lambda \)) production. The background sidebands are taken to be [460–480] and [520–540] \(\text {MeV}\) ([1096–1106] and [1126–1136] \(\text {MeV}\)) for \(K_{S}^{0}\) (\(\Lambda \)) production. Candidates in the signal (sideband) region are given positive (negative) weights when filling histograms for neutral strange particle spectra such as the transverse momentum, pseudorapidity or energy. The sideband subtraction is applied to both data and detector-level MC samples. It relies on the assumption that the kinematic distributions for fake candidates in the signal region are similar to those in the sidebands. The validity of this assumption was checked with MC studies. The number of reconstructed events is shown in Table 4. It was checked that after unfolding for detector effects, the results of the analysis are independent of sensible variations of the signal and sideband region widths.

The results of this simple sideband subtraction procedure, used as a baseline, is cross-checked by fitting the mass distributions to a Gaussian function centred at the nominal mass plus a constant (linear) shape for the \(K_{S}^{0}\) (\(\Lambda \)) background. Choosing a different background shape (constant/linear) changes the results by less than 10% of the statistical uncertainties. The estimated numbers of signal and background events obtained using the two background methods agree within statistical uncertainties. The limited sample size precludes extending this fitting procedure for signal extraction as a function of the neutral strange particle kinematic variables under study.

The resulting \(K_{S}^{0}\) and \(\Lambda \) masses from fits to the total \(\pi \pi \) and \(p \pi \) mass distributions are \(497.8 \pm 0.2~\text {MeV}\) and \(1115.8 \pm 0.3~\text {MeV}\), in agreement with the PDG values [94]. The \(K_{S}^{0}\) (\(\Lambda \)) widths from the fits are \(6.83 \pm 0.03~\text {MeV}\) (\(4.16 \pm 0.04~\text {MeV}\)). The signal mass range includes 99% (95%) of the \(K_{S}^{0}\) (\(\Lambda \)) signal, which ensures that the sidebands for the background subtraction are not contaminated by signal.

As previously noted in Refs. [12,13,14, 16, 17], fitting to a double-Gaussian function with a common mean value improves the quality of the fits. This was also tried in this analysis. The resulting Gaussian mean values coincide with those obtained from a single-Gaussian fit and the numbers of signal and background events are stable within statistical uncertainties.

Table 4 The numbers of \(K_{S}^{0}\) and \(\Lambda \) particles (\(N_{K}\) and \(N_{\Lambda }\)) reconstructed in the data after sideband background subtraction for each class and for the total sample with their statistical uncertainties

6 Results at detector level

Neutral strange particle production is studied as a function of the transverse momentum \(p_{\mathrm{T}}\), the energy E, the pseudorapidity \(\eta \), the transverse flight distance \(R_{xy}\) and the multiplicity N. For \(K_{S}^{0}\) and \(\Lambda \) production inside jets, the energy fraction, \(x_{K,\Lambda } = E_{K,\Lambda }/E_{\mathrm {jet}}\), is also considered.

For this purpose, the reconstructed \(K_{S}^{0}\) and \(\Lambda \) mass distributions are obtained in different bins of the variables under study, and the numbers of signal events after proper sideband subtraction are determined, as discussed in Sect. 5.2. They are normalised to the total number of events in data or MC generator fulfilling the dileptonic \(t {\bar{t}}\) selection criteria presented in Table 3. No attempt is made to subtract the non-\(t {\bar{t}}\) background because the normalised \(K_{S}^{0}\) spectra for the \(t{\bar{t}}\) signal are found in MC predictions to be compatible with those for single-top-quark events, which form the main background.

6.1 \(K_{S}^{0}\) production at detector level

The kinematic distributions for \(K_{S}^{0}\) production are displayed in Figs. 3, 4, 5. They are separated into the three different classes defined in Sect. 5 and compared with two different MC models, namely Powheg + Pythia6 + Perugia2011c and MC@NLO + Herwig + Jimmy. The data show both the statistical as well as the total systematic uncertainties. The total uncertainties are obtained as the sum in quadrature of the statistical and detector level systematic uncertainties, namely those due to tracking, JES and JER. The systematic uncertainties are discussed in detail in Sect. 8.

As shown in Table 4, approximately two-thirds of the total \(K_{S}^{0}\) sample are not associated with jets, the remaining one-third being roughly equally distributed between b-tagged and non-b-tagged jets. For those inside jets, the \(K_{S}^{0}\) spectra do not show a strong dependence on whether the jets are b-tagged or not. On the other hand, \(K_{S}^{0}\) candidates not associated with jets are softer in \(p_{\mathrm{T}}\) or energy than those embedded in jets and their pseudorapidity distribution is constant over a wider central plateau. The \(K_{S}^{0}\) multiplicity inside b-tagged jets, Fig. 3f, is similar to that inside non-b-tagged jets, Fig. 4f, while that outside any jet falls off less steeply, Fig. 5e.

The gross features of the data are described fairly well by both MC simulations, except for \(K_{S}^{0}\) production not associated with any jet. Here the MC simulations predict roughly 30% fewer \(K_{S}^{0}\) than observed in data, while the shapes of the distributions exhibit fair agreement but for the multiplicity distribution.

Fig. 3
figure 3

Kinematic characteristics for \(K_{S}^{0}\) production inside b-tagged jets, for data and detector-level MC events simulated with the Powheg + Pythia6 and MC@NLO + Herwig generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Fig. 4
figure 4

Kinematic characteristics for \(K_{S}^{0}\) production inside non-b-tagged jets, for data and detector-level MC events simulated with the Powheg + Pythia6 and MC@NLO + Herwig generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Fig. 5
figure 5

Kinematic characteristics for \(K_{S}^{0}\) production not associated with jets, for data and detector-level MC events simulated with the Powheg + Pythia6 and MC@NLO + Herwig generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

6.2 \(\Lambda \) production at detector level

Similar distributions for \(\Lambda \) production are also obtained. Due to the limited number of events, only distributions for the total sample are shown in Fig. 6.

Fig. 6
figure 6

Kinematic characteristics for the total \(\Lambda \) production, for data and detector-level MC events simulated with the Powheg + Pythia6 and MC@NLO + Herwig generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

The gross features exhibited by the \(\Lambda \) baryons are similar to those of the \(K_{S}^{0}\) mesons. The quality of the MC description of the data is also similar to that discussed in the previous subsection.

7 Unfolding to particle level

In order to take into account detector effects, the data are unfolded to the particle level. This allows a direct comparison with theoretical calculations as well as with measurements from other experiments. For kinematic quantities such as the transverse momentum and pseudorapidity, for which migrations are negligible, this is done by computing the reconstruction efficiencies on a bin-by-bin basis (as also in Refs. [1,2,3,4,5,6,7,8,9,10,11, 23, 24]). For the multiplicity distributions, however, a Bayesian unfolding procedure is applied because the bin-to-bin migrations are relevant.

7.1 Efficiency correction

The reconstruction efficiencies (\(\epsilon \)) are calculated by dividing bin-by-bin each of the distributions (\(p_{\mathrm{T}}\), \(|\eta |\), energy and energy fraction) at detector level by the one at particle level for each of the three classes of candidates considered:

$$\begin{aligned} \epsilon _i = \frac{\frac{1}{N^{\mathrm {det}}_{\mathrm {evt}}} \frac{\mathrm {d}N^{\mathrm {det}}_{K,\Lambda }}{\mathrm {d}x_{i}}}{\frac{1}{N^{\mathrm {particle}}_{\mathrm {evt}}} \frac{\mathrm {d}N^{\mathrm {particle}}_{K,\Lambda }}{\mathrm {d}x_{i}}} \end{aligned}$$

where \(x_{i}\) stands for the i-th bin in the variable x which denotes any of the kinematic variables mentioned above. They are shown in Fig. 7, for each of the classes, as well as for the total sample. For neutral strange particles embedded in b-tagged jets, this efficiency correction also includes the b-tagging efficiency. The small size of the MC sample prevents the use of a multidimensional binning for the correction procedure.

The particle-level distributions are obtained using leptons (from \(W\) decays), jets and neutral strange particles (\(K_{S}^{0}\) and \(\Lambda \)) in the events selected at detector level. Particle-level jets are built using all particles in MC simulation with a lifetime above \(10^{-11}\) s, excluding muons and neutrinos. The kinematic criteria for jets at particle and detector level are the same, namely \(p_{\mathrm{T}}>25~\text {GeV}\) and \(|\eta |<2.5\). The particle-level b-jets are defined as those containing a b-hadron, with \(p_{\mathrm{T}}>5~\text {GeV}\) and \(\Delta R < 0.3\) from the jet axis. Particle-level \(K_{S}^{0}\) and \(\Lambda \) candidates, including those decaying to neutral particles, are required to be within \(|\eta |<2.5\) and have an energy \(E>1~\text {GeV}\), as no \(K_{S}^{0}\) candidates are reconstructed below that energy at detector level. Similar to the detector level, the \(K_{S}^{0}\) (\(\Lambda \)) candidates at particle level which fulfil these conditions are separated into three classes using the same \(\Delta R\) criteria with respect to a particle-level jet.

MC studies show that migrations between classes when going from detector to particle level are generally smaller than 5%. For example, \(K_{S}^{0}\) candidates which are not associated with any jet at detector level, have a 1% (3%) probability to be classified as embedded in a b-jet (non-b-jet) at particle level. A notable exception is that of \(K_{S}^{0}\) candidates inside non-b-tagged jets at detector level, which have a 32% probability to be classified as embedded in a b-jet at particle level. This is due to the b-tagging efficiency, which is included in the reconstruction efficiency as defined above.

The contribution of non-fiducial events, i.e. events which pass the detector-level selection but are not present at the particle level, introduces a small bias which is taken into account as a systematic uncertainty. More details are given in Sect. 8.

The reconstructed distributions of \(K_{S}^{0}\) (\(\Lambda \)) are corrected with a weight given by 1/\(\epsilon _i\), depending on their class. The Powheg + Pythia6 MC sample was used to derive efficiencies. Since the MC simulation does not include pile-up at the particle level, the efficiency calculation effectively corrects for the pile-up effects present at the detector level. This is further discussed in Sect. 8.

Figure 7 shows that the reconstruction efficiency inside b-tagged jets is lower than inside non-b-tagged jets, due to the fact that the average b-tagging efficiency is 70% and the b-jet contamination in the non-b-tagged sample is around 30%. It was checked that the efficiency for \(K_{S}^{0}\) reconstruction inside b-jets is independent of whether they are b-tagged or not at detector level. The efficiency for \(K_{S}^{0}\) (\(\Lambda \)) outside jets peaks at lower \(p_{\mathrm{T}}\) values than for those inside, and falls more sharply in the distributions’ tails. This can be attributed to the differences in their transverse momentum spectra.

Fig. 7
figure 7

The \(K_{S}^{0}\) reconstruction efficiency as a function of a \(p_{\mathrm{T}}\), b energy, c \(|\eta |\) and d energy fraction for Powheg + Pythia6 and four classes of \(K_{S}^{0}\): inside b-tagged jets (triangle), inside non-b-tagged jets (inverted triangle), outside any jet (circle), and the total sample (square)

The efficiency for \(K_{S}^{0}\) (\(\Lambda \)) outside jets is lower than that reported in Ref. [12] for a minimum-bias sample with less pile-up and restricted to lower transverse momenta.

In order to investigate the dependence of the efficiency corrections on the jet multiplicity, these efficiencies are derived for events with more than or at most four jets. They are found to agree within statistical errors. This is expected since each additional jet with \(R=0.4\) represents only about \(1.5\%\) of the total available phase space in the \(\eta \)\(\phi \) plane.

7.2 Bayesian unfolding

The unfolding based on the efficiency calculations discussed so far relies on the assumption that the neutral strange particles, once reconstructed, are measured to a precision which is much smaller than the bin widths. As an alternative an iterative Bayesian unfolding [95] as implemented in the RooUnfold program [96] was tried. The method numerically calculates the inverse of the migration matrices for each of the distributions under study. The Powheg + Pythia6 MC sample is used to determine these migration matrices by matching detector-level and particle-level \(K_{S}^{0}\) that are in the same class and have an angular separation \(\Delta R < 0.01\). The resulting matrices exhibit a very pronounced diagonal correlation. The number of iterations is chosen such that the residual bias, evaluated through a closure test as discussed in Sect. 8, is within a tolerance of 1% for the statistically significant bins as in Ref. [97]. The Bayesian unfolded distributions and the bin-by-bin corrected ones agree with each other with a precision which is much smaller than the statistical uncertainties.

The \(K_{S}^{0}\) multiplicity distributions cannot be unfolded using a bin-by-bin efficiency correction due to the large migrations between the particle multiplicity bins at the detector and particle levels. For the multiplicity unfolding, only the visible decays, \(K_{S}^{0}\rightarrow \pi ^{\mathrm{+}}\pi ^{\mathrm{-}}\), are considered at particle level. This reduces the size of the non-diagonal terms in the migration matrices. For the calculation of average multiplicities, a correction factor accounting for the invisible decays, \(K_{S}^{0}\rightarrow \pi ^0 \pi ^0\), is applied a posteriori when necessary.

The results (\(N^{\mathrm {part},i}\)) of this Bayesian unfolding procedure are given by:

$$\begin{aligned} N^{\mathrm {part},i} = \sum _{j} N_{\mathrm {detec},j} \times \epsilon _{\mathrm {detec}, j} A^{\mathrm {part}, i}_{\mathrm {detec}, j} / \epsilon ^{\mathrm {part}, i} \end{aligned}$$

where i and j are the particle and detector level bin indices, respectively, \(N_{\mathrm {detec},j}\) is the data result at detector level, \(A^{\mathrm {part}, i}_{\mathrm {detec}, j}\) is the migration matrix refined through iteration as explained above, and \(\epsilon _{\mathrm {detec}, j}\) and \(\epsilon ^{\mathrm {part}, i}\) the matching efficiencies for detector and particle level neutral strange particles.

The statistical uncertainties in data and MC simulation are propagated simultaneously through the unfolding procedure by using pseudo-experiments. A set of \(10^3\) replicas is created for each measured distribution by applying a Poisson-distributed fluctuation. Each replica is then unfolded using a statistically independent fluctuated migration matrix. The statistical uncertainty of the unfolded distribution is defined as the standard deviation of the \(10^3\) unfolded replicas. As a cross-check, pulls are obtained from these replicas and found to follow a normal distribution, as expected.

8 Systematic uncertainties

Since the present analysis is concerned with the measurement of normalised distributions, many systematic uncertainty sources considered in top quark cross-section measurements [76], particularly those related to the lepton and b-tagging efficiency scale factors, cancel out. Similarly, the systematic uncertainty due to non-\(t {\bar{t}}\) processes is expected to be very small and therefore not taken into account. The following systematic uncertainties are considered in this analysis:

  • The systematic uncertainties due to tracking inefficiencies: they are taken from minimum-bias events [12] in bins of the track transverse momentum and pseudorapidity and found to be below 2% and dominated by the uncertainties in the modelling of the detector material. They result in an estimated 4–5% uncertainty for two-body decays, which is the case for \(K_{S}^{0}\) and \(\Lambda \) production. This relies on the assumption that the uncertainties from minimum-bias events are also valid in a dense environment as given by a jet. It was checked that there are no systematic effects in the MC description of the neutral strange particle production as a function of the angular separation to the jet-axis.

  • The systematic uncertainty related to the choice of MC generator used in the unfolding: the systematic uncertainties due to modelling are calculated as the relative differences between the unfolded distributions obtained with the nominal Powheg + Pythia6 MC samples and those obtained with the alternative MC@NLO + Herwig samples. For kinematic quantities such as the transverse momentum and pseudorapidity, they can be expressed as the deviation of the efficiency ratios from unity. These systematic uncertainties range up to 20–25%, or even to 50% for the tails of the multiplicity distributions, and thus represent the dominant source of systematic uncertainty. The choice of parton shower (PS) and hadronisation scheme plays the predominant role, as tested by comparing the efficiencies calculated with Powheg + Pythia6 and Powheg + Herwig samples. The matrix element (ME) calculation method plays a minor role, as seen when comparing the efficiencies calculated with Powheg + Herwig and MC@NLO + Herwig samples.

  • The systematic uncertainty related to pile-up: as a first attempt to check how well the MC pile-up modelling describes the data, mass distributions for \(K_{S}^{0}\) and \(\Lambda \) candidates are obtained for two samples of events depending on whether the average number of interactions per bunch crossing, \(\langle \mu \rangle \), is higher or lower than the median (\(\langle \mu \rangle = 8.36\)). This exercise shows that neutral strange particles embedded in jets, with or without a b-tag, are not at all affected by pile-up. This is not the case for neutral strange particles outside jets, for which a clear linear dependence on \(\langle \mu \rangle \) is observed. In order to estimate the systematic uncertainty associated with this class, a data-driven procedure is developed, which compares the \(\langle \mu \rangle \) dependence of the \(K_{S}^{0}\) multiplicity observed in data and MC events at high longitudinal impact parameter values. The resulting systematic uncertainty is found to be of the order of 8%.

  • The systematic uncertainty related to the JES and JER: the propagation of the JES [85] and JER [87] uncertainties is taken into account. They affect most of the distributions indirectly through changes in the number of jets satisfying the selection criteria. The only distribution directly affected by these uncertainties is the energy fraction. The resulting uncertainties from both the JES and JER are found to be well below 5%.

  • The b-tagging efficiency: in order to study the systematic uncertainty due to the choice of b-tagging efficiency, the analysis was repeated using two alternative b-tagging efficiencies of 60% and 85%. The obvious effect of this is that the number of jets classified as b-tagged (or not) changes. However, as shown in Figs. 3 and 4 , the distributions for \(K_{S}^{0}\) production inside b-tagged jets are very similar to those for \(K_{S}^{0}\) production inside non-b-tagged jets, so the uncertainties for normalised distributions are expected to be small. The average multiplicity for \(K_{S}^{0}\) production per b-tagged jet (or non-b-tagged jet) is found to be independent of the choice of b-tagging working point within the statistical uncertainty. Thus the systematic uncertainty due to the choice of b-tagging efficiency is negligible.

  • The unfolding non-closure uncertainty, which is calculated in two steps. In the first step, the particle-level MC distributions are reweighted such that the reweighted detector-level distributions match the data. Then, these reweighted detector-level MC distributions are unfolded to the particle level using the same procedure as for the data, and compared with the reweighted particle-level MC distributions. The relative difference seen in this comparison is taken as the systematic uncertainty and is typically below 1%.

  • Non-fiducial events uncertainty: it is calculated as the difference between two sets of Powheg + Pythia6 \(K_{S}^{0}\) and \(\Lambda \) particle level distributions, normalised to the total number of selected events. One set comes from an event sample selected using detector-level criteria and the other set is selected using particle-level criteria. The same kinematic requirements are applied to leptons and jets at the detector and particle levels. Typically, these systematic uncertainties are below 5%.

Table 5 summarises the approximate magnitude of the systematic uncertainties considered. The total systematic uncertainties are then calculated as the sum in quadrature of the systematic uncertainties due to the sources discussed above.

Table 5 Summary table of systematic uncertainty sources: MC generator choice for the unfolding, pile-up (PU) mismodelling, tracking inefficiency, JES and JER, non-fiducial events, and non-closure test

9 Results at the particle level

The \(K_{S}^{0}\) and \(\Lambda \) average multiplicities per event for corrected data and Powheg + Pythia6 MC events at particle level are shown in Table 6. They are obtained from the unfolded transverse momentum (\(p_{\mathrm{T}}\)) spectra.

Table 6 \(K_{S}^{0}\) and \(\Lambda \) unfolded (particle-level) average multiplicities per event (\(\langle n_{K,\Lambda }\rangle \)), including statistical and systematic uncertainties, for each class and for the total sample, along with the ones obtained from the Powheg + Pythia6 MC generator at particle level

It was checked that the average multiplicities obtained from the iterative Bayesian unfolded multiplicity distributions are in good agreement within statistical uncertainties with the values in Table 6. Since the migration matrices were obtained considering only visible decays at particle level, the resulting multiplicities are corrected for the branching ratio of \(K_{S}^{0}\rightarrow \pi ^{\mathrm{0}}\pi ^{\mathrm{0}}\).

A complete set of results at particle level can be found in Ref. [98].

9.1 \(K_{S}^{0}\) unfolded distributions

The unfolded distributions in \(p_{\mathrm{T}}\), E, \(|\eta |\) and \(N_K\) for \(K_{S}^{0}\) production inside b-jets, inside non-b-jets and outside any jet are shown in Figs. 8, 9, 10. Furthermore, for \(K_{S}^{0}\) production inside jets, the distribution of the energy fraction, \(x_{K}\), is also shown. Numerical results are summarised in the Appendix. The unfolded data are compared with the expectations from six different MC models: Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7.

To be more quantitative in the comparison between data and MC predictions, a \(\chi ^2\) test is performed for the distributions shown in Figs. 8, 9, 10 prior to normalisation. The MC samples are then scaled to the same number of \(t{\bar{t}}\) dileptonic events as in the data. The \(\chi ^2\) is defined as:

$$\begin{aligned} \chi ^2 = V^{\mathrm {T}} \cdot Cov^{-1} \cdot V \end{aligned}$$

where V is the vector of differences between MC predictions and unfolded data, and \(Cov^{-1}\) denotes the inverse of the covariance matrix.

The covariance matrix is obtained by using pseudo-experiments. A set of \(10^{3}\) replicas of the corresponding unfolded data distributions is created. In order to include systematic effects, these replicas are smeared with Gaussian functions whose widths are given by the systematic errors considered as uncorrelated. The results of the \(\chi ^2\) test are summarised in Tables 7, 8, 9, including the associated p-values. They are used to assess the significance of the differences between the various generators and the data for each observable. For the kinematic distributions, which are normalised to the average multiplicities, the number of degrees of freedom has been taken as the number of bins. While for the \(N_K\) distributions, which are normalised to unit area, the number of degrees of freedom has been reduced by one.

The unfolded distributions in Figs. 8, 9, 10 and the \(\chi ^2\) and p-values in Tables 789 show that:

  • On average, the Powheg + Pythia6 or Pythia8 and Sherpa generators give a very similar description of the data, while MC@NLO + Herwig, aMC@NLO + Herwig7 and Powheg + Herwig7 are slightly disfavoured.

  • In general, the MC distributions reproduce the \(K_{S}^{0}\) particle spectra inside jets rather well. This is expected since jet fragmentation functions are studied from S\(p{\bar{p}}\)S to Tevatron energies, so the MC simulations are tuned fairly well.

  • The spectra for \(K_{S}^{0}\) production outside jets are reproduced in shape, but are underestimated by approximately \(30\%\). This observation is consistent with a CMS study of strange particle production in the UE [14]. These data could be used to improve the simulation of the UE, especially to tune the \(\gamma _{\mathrm {s}} = s/u\) parameter. This parameter needs to be larger than 0.2, the value used in the Pythia6 + Perugia2011c tune [57]. The Pythia8 + A14 predictions come closer to the data than the Pythia6 + Perugia2011c ones. This is attributed to the fact that the A14 tune uses \(\gamma _s\) value equal to 0.217, as in the Monash tune [99], which is 10 % larger than that in the default Pythia6 + Perugia2011c tune. Herwig + Jimmy and Herwig7 + H7UE gives a somewhat worse description than Pythia6 + Perugia2011c, which indicates the need to also tune the strangeness suppression here or even to use an improved colour reconnection scheme for MPI as suggested in Ref. [100]. Sherpa, which uses strangeness suppression of \(\gamma _{\mathrm {s}} = 0.4\), tends to overestimate the \(K_{S}^{0}\) yields outside jets.

  • The energy and transverse momentum spectra for \(K_{S}^{0}\) mesons inside b-jets are similar to the spectra for those inside non-b-jets. The spectra for \(K_{S}^{0}\) mesons produced outside jets are much softer than for those produced in association with a jet.

  • The pseudorapidity distributions for \(K_{S}^{0}\) mesons produced outside jets are constant over a wider central plateau than for those produced in association with a jet.

Fig. 8
figure 8

Kinematic characteristics for \(K_{S}^{0}\) production inside b-jets, for corrected data and particle-level MC events simulated with the Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Fig. 9
figure 9

Kinematic characteristics for \(K_{S}^{0}\) production inside non-b-jets, for corrected data and particle-level MC events simulated with the Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Fig. 10
figure 10

Kinematic characteristics for \(K_{S}^{0}\) production not associated with jets, for corrected data and particle-level MC events simulated with the Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Table 7 Values of the \(\chi ^2\) per degree of freedom and their corresponding p-values, for \(K_{S}^{0}\) production inside b-jets, for Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 predictions
Table 8 Values of the \(\chi ^2\) per degree of freedom and their corresponding p-values, for \(K_{S}^{0}\) production inside non-b-jets, for Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 predictions
Table 9 Values of the \(\chi ^2\) per degree of freedom and their corresponding p-values, for \(K_{S}^{0}\) production outside jets, for Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 predictions

9.2 \(\Lambda \) unfolded distributions

The same distributions studied for \(K_{S}^{0}\) production are now presented for the total \(\Lambda \) production. Numerical results are summarised in the Appendix. Comparisons with MC predictions are made in Fig. 11. The \(\Lambda \) production is suppressed relative to \(K_{S}^{0}\) production as expected. Due to poor statistics, \(\Lambda \) production cannot be divided into classes.

The results of a \(\chi ^2\) test for the comparison between unfolded data and MC predictions are summarised in Table 10. Powheg + Pythia6 and Sherpa generators give a similar fair description of the data, while MC@NLO + Herwig is somewhat disfavoured. Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 are even more disfavoured.

Fig. 11
figure 11

Kinematic characteristics for the total \(\Lambda \) production, for corrected data and particle-level MC events simulated with the Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 generators. Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Table 10 Values of the \(\chi ^2\) per degree of freedom and their corresponding p-values, for the total \(\Lambda \) sample, for Powheg + Pythia6, MC@NLO + Herwig, Sherpa, Powheg + Pythia8, Powheg + Herwig7 and aMC@NLO + Herwig7 predictions

9.3 Comparison with other MC generators

Following Ref. [77], the sensitivity of the total neutral strange particle production to different underlying-event tunes and colour reconnection schemes was studied. A comparison with the Acer + Pythia6 MC generator, with two different underlying-event tunes with and without colour reconnection, is presented in Fig. 12. The results of a \(\chi ^2\) test, similar to that described in the previous subsection, are summarised in Table 11. The study shows that:

  • Colour reconnection effects are very small, and therefore difficult to tune with present statistics.

  • TuneAPro is slightly disfavoured relative to the Perugia tune.

Fig. 12
figure 12

Kinematic characteristics for the total \(K_{S}^{0}\) production, for corrected data and particle-level events from the ACER + Pythia6 generator with two different tunes: Perugia and TuneAPro, with and without colour reconnection (CR). Total uncertainties are represented by the shaded area. Statistical uncertainties for MC samples are negligible in comparison with data

Table 11 Values of the \(\chi ^2\) per degree of freedom and their corresponding p-values, for the total \(K_{S}^{0}\) production, along with the Acermc + Pythia6 predictions with the following tunes: Perugia and TuneAPro (with and without colour reconnection)

10 Summary

Measurements of \(K_{S}^{0}\) and \(\Lambda \) production in \(t\bar{t}\) dileptonic final states are reported. They use a data sample with integrated luminosity of 4.6 \(\hbox {fb}^{-1}\) from proton–proton collisions at a centre-of-mass energy of 7 TeV, collected in 2011 with the ATLAS detector at the LHC. The \(K_{S}^{0}\) distributions in energy, \(p_{\mathrm{T}}\) and \(|\eta |\) are presented for three subsamples depending on whether the \(K_{S}^{0}\) is associated with a jet, with or without a b-tag, or is outside any selected jet. The corresponding \(K_{S}^{0}\) multiplicities are also measured. The small sample size precludes such a detailed analysis for \(\Lambda \) production, for which distributions are shown only for the total sample, which includes the sum of \(\Lambda \) and \({\bar{\Lambda }}\). The results are unfolded to the particle level using the neutral strange particle reconstruction efficiencies in each distribution within the kinematic region given by \(E>1~\text {GeV}\) and \(|\eta |<2.5\). The measurements are compared with current MC predictions where the \(t\bar{t}\) matrix elements are calculated at NLO accuracy with Powheg, MC@NLO, Sherpa and aMC@NLO, or at LO with Acermc. Several variations of the MC generators are considered:

  • Fragmentation scheme and UE: Pythia6+Perugia2011C, Pythia8 + A14, Pythia6 + TuneAPro, Herwig + Jimmy, Herwig7 + H7UE or Sherpa.

  • Colour reconnection effects.

The main conclusions to be drawn from the analysis are the following:

  • Strange baryon production is suppressed relative to strange meson production both inside and outside jets.

  • Neutral strange particle production outside jets is much softer than inside jets, and the pseudorapidity distributions are constant over a wider region.

  • Neutral strange particle multiplicities outside jets are larger than inside.

  • Current MC models give a fair description of the gross features exhibited by \(K_{S}^{0}\) and \(\Lambda \) produced inside jets, while the observed yields for neutral strange particles outside jets lie roughly 30% above the Pythia6 + Perugia2011C and Herwig + Jimmy or Herwig7 + H7UE MC predictions, with Pythia8 + A14 falling short of the data by 15–20%.

A better description of the yields for \(K_{S}^{0}\) and \(\Lambda \) outside jets in \(t\bar{t}\) final states would require further tuning of the current MC models, particularly the strangeness suppression mechanisms, and/or more elaborate models for MPI and colour reconnection schemes. For this purpose a Rivet analysis routine and HEPData tables are provided.