1 Introduction

Final states with hadronically decaying tau leptons play an important part in the physics programme of the ATLAS experiment [1]. Examples from Run 1 (2009–2013) of the Large Hadron Collider (LHC) [2] are measurements of Standard Model processes [37], Higgs boson searches [8], including models with extended Higgs sectors [911], and searches for new physics phenomena, such as supersymmetry [1214], new heavy gauge bosons [15] and leptoquarks [16]. These analyses depended on robust tau reconstruction and excellent particle identification algorithms that provided suppression of backgrounds from jets, electrons and muons [17].

With the discovery of a Higgs boson [18, 19] and evidence for the Higgs-boson Yukawa coupling to tau leptons [8, 20], a key future measurement will be that of the CP mixture of the Higgs boson via spin effects in \(H\rightarrow \tau \tau \) decays [2123]. This measurement relies on high-purity selection of the \(\tau ^{-} \rightarrow \pi ^{-} \nu \), \(\tau ^{-} \rightarrow \pi ^{-} \pi ^0 \nu \) and \(\tau ^{-} \rightarrow \pi ^{-} \pi ^+ \pi ^{-} \nu \) decays, as well as the reconstruction of the individual charged and neutral pion four-momenta. The tau reconstruction used in ATLAS throughout Run 1 (here denoted as “Baseline”), however, only differentiates tau decay modes by the number of charged hadrons and does not provide access to reconstructed neutral pions.

This paper presents a new method (called “Tau Particle Flow ”) of reconstructing the individual charged and neutral hadrons in tau decays with the ATLAS detector. Charged hadrons are reconstructed from their tracks in the tracking system. Neutral pions are reconstructed from their energy deposits in the calorimeter. The reconstructed hadrons, which make up the visible part of the tau decay (\(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)), are used to classify the decay mode and to calculate the four-momentum of reconstructed \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates. The superior four-momentum resolution from the tracking system compared to the calorimeter, for charged hadrons with transverse momentum (\(p_{\text {T}}\)) less than \(\sim \)100\(\,\text {GeV}\), leads to a significant improvement in the tau energy and directional resolution. This improvement, coupled with the ability to better identify the hadronic tau decay modes, could lead to better resolution of the ditau mass reconstruction [24]. The performance of the Tau Particle Flow is validated using samples of real hadronic tau decays and jets in Z+jets events selected from data. The samples correspond to \(5\,\mathrm{fb}^{-1}\) of data collected during proton–proton collisions at a centre-of-mass energy of \(\sqrt{s} = 8\,\text {TeV}\), which was the amount of data reprocessed using Tau Particle Flow. While similar concepts for the reconstruction of hadronic tau decays have been employed at other experiments [2531], the Tau Particle Flow is specifically designed to exploit the features of the ATLAS detector and to perform well in the environment of the LHC.

The paper is structured as follows. The ATLAS detector, event samples, and the reconstruction of physics objects used to select \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates from the \(8\,\text {TeV}\) data are described in Sect. 2. The properties of \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) decays and the Tau Particle Flow method are described in Sect. 3, including its concepts (Sect. 3.1), neutral pion reconstruction (Sect. 3.2), reconstruction of individual photon energy deposits (Sect. 3.3), decay mode classification (Sect. 3.4) and \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum reconstruction (Sect. 3.5). Conclusions are presented in Sect. 4.

2 ATLAS detector and event samples

2.1 The ATLAS detector

The ATLAS detector [1] consists of an inner tracking system surrounded by a superconducting solenoid, electromagnetic (EM) and hadronic (HAD) calorimeters, and a muon spectrometer. The inner detector is immersed in a 2 T axial magnetic field, and consists of pixel and silicon microstrip detectors inside a transition radiation tracker, which together provide charged-particle tracking in the region \(|\eta |<2.5\).Footnote 1 The EM calorimeter is based on lead and liquid argon as absorber and active material, respectively. In the central rapidity region, the EM calorimeter is divided radially into three layers: the innermost layer (EM1) is finely segmented in \(\eta \) for optimal \(\gamma / \pi ^0\) separation, the layer next in radius (EM2) collects most of the energy deposited by electron and photon showers, and the third layer (EM3) is used to correct leakage beyond the EM calorimeter for high-energy showers. A thin presampler layer (PS) in front of EM1 and in the range \(|\eta | < 1.8\) is used to correct showers for upstream energy loss. Hadron calorimetry is based on different detector technologies, with scintillator tiles (\(|\eta | < 1.7\)) or liquid argon (\(1.5 < |\eta | < 4.9\)) as active media, and with steel, copper, or tungsten as absorber material. The calorimeters provide coverage within \(|\eta |<4.9\). The muon spectrometer consists of superconducting air-core toroids, a system of trigger chambers covering the range \(|\eta |<2.4\), and high-precision tracking chambers allowing muon momentum measurements within \(|\eta |<2.7\). A three-level trigger system is used to select interesting events [32]. The first-level trigger is implemented in hardware and uses a subset of detector information to reduce the event rate to a design value of at most 75 kHz. This is followed by two software-based trigger levels which together reduce the average event rate to 400 Hz.

2.2 Physics objects

This section describes the Baseline \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) reconstruction and also the reconstruction of muons and the missing transverse momentum, which are required for the selection of samples from data. Tau Particle Flow operates on each reconstructed Baseline tau candidate to reconstruct the charged and neutral hadrons, classify the decay mode and to provide an alternative \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum. Suppression of backgrounds from other particles misidentified as \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) is achieved independently of the Tau Particle Flow.

The Baseline \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) reconstruction and energy calibration, and the algorithms used to suppress backgrounds from jets, electrons and muons are described in detail in Ref. [17]. Candidates for hadronic tau decays are built from jets reconstructed using the anti-\(k_{t}\) algorithm [33, 34] with a radius parameter value of 0.4. Three-dimensional clusters of calorimeter cells calibrated using a local hadronic calibration [35, 36] serve as inputs to the jet algorithm. The calculation of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum uses clusters within the core region (\(\Delta R<0.2\) from the initial jet-axis). It includes a final tau-specific calibration derived from simulated samples, which accounts for out-of-cone energy, underlying event, the typical composition of hadrons in hadronic tau decays and contributions from multiple interactions occurring in the same and neighbouring bunch crossings (called pile-up). Tracks reconstructed in the inner detector are matched to the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidate if they are in the core region and satisfy the following criteria: \(p_{\text {T}} >1\,\text {GeV}\), at least two associated hits in the pixel layers of the inner detector, and at least seven hits in total in the pixel and silicon microstrip layers. Furthermore, requirements are imposed on the distance of closest approach of the tracks to the tau primary vertex in the transverse plane, \(|d_0|<1.0\) mm, and longitudinally, \(|z_0\sin \theta |<1.5\) mm. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) charge is reconstructed from the sum of the charges of the associated tracks.

Backgrounds for \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates originating from quark- and gluon-initiated jets are discriminated against by combining shower shape and tracking information in a multivariate algorithm that employs boosted decision trees (BDTs) [37]. The efficiency of the jet discrimination algorithm has little dependence on the \(p_{\text {T}}\) of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates (evaluated using candidates with \(p_{\text {T}} >15\,\text {GeV}\)) or on the number of reconstructed primary vertices, which is correlated to the amount of pile-up, and has been evaluated up to a maximum of 25 primary vertices per event. All \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates are required to have \(p_{\text {T}} {} > 15\,\text {GeV}\), to be in the fiducial volume of the inner detector, \(|\eta | < 2.5\), and to have one or three associated tracks. They must also meet jet discrimination criteria, corresponding to an efficiency of about 55 % (40 %) for simulated \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) with one (three) charged decay products [17], leading to a rate of false identification for quark- and gluon-initiated jets of below a percent. A discriminant designed to suppress candidates arising from the misidentification of electrons [17] is also applied.

Muons are reconstructed using tracks in the muon spectrometer and inner detector [38]. The missing transverse momentum is computed from the combination of all reconstructed and fully calibrated physics objects and the remaining clustered energy deposits in the calorimeter not associated with those objects [39].

2.3 Event samples and selection

The optimisation and measurement of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) reconstruction performance requires Monte Carlo simulated events. Samples of simulated pp collision events at \(\sqrt{s}=8\,\text {TeV}\) are summarised in Table 1. Tau decays are provided by \(Z \rightarrow \tau \tau \) events. The sophisticated tau decay option of Pythia 8 is used, which provides fully modelled hadronic decays with spin correlations [40]. Tau decays in the \(t\bar{t}\) sample are generated by Tauola [41]. Photon radiation is performed by Photos [42]. Single-pion samples are also used, in which the pions originate from the centre of the ATLAS detector and are generated to have a uniform distribution in \(\phi \) and \(\eta \) (\(|\eta |<5.5\)) and also in \(\log (E)\) (\(200\,\text {MeV}< E < 2\,\text {TeV}\)).

Table 1 Details regarding the simulated samples of pp collision events. The following information is provided for each sample: the generator of the hard interaction, parton shower, hadronisation and multiple parton interactions; the set of parton distribution functions (PDFs) and the underlying event (UE) tune of the Monte Carlo

The response of the ATLAS detector is simulated using Geant4 [54, 55] with the hadronic-shower model QGSP_BERT [56, 57]. The parameters of the underlying event (UE) simulation were tuned using collision data. Simulated pp collision events are overlaid with additional minimum-bias events generated with Pythia 8 to account for the effect of pile-up. When comparing to the data, the simulated events are reweighted so that the distribution of the number of pile-up interactions matches that in the data. The simulated events are reconstructed with the same algorithm chain as used for the collision data.

Samples of \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates are selected from the data using a tag-and-probe approach. Candidates originating from hadronic tau decays and jets are obtained by selecting \(Z \rightarrow \tau \tau \) and \(Z(\rightarrow \mu \mu )\mathrm{+jets}\) events, respectively. The data were collected by the ATLAS detector during pp collisions at \(\sqrt{s}=8\,\text {TeV}\). The sample corresponds to an integrated luminosity of 5 fb\(^{-1}\) after making suitable data quality requirements for the operation of the tracking, calorimeter, and muon spectrometer subsystems. The data have a maximum instantaneous luminosity of \(7\cdot 10^{33}\) cm\(^{-2}\) s\(^{-1}\) and an average number of 19 pp interactions in the same bunch crossing.

The \(Z \rightarrow \tau \tau \) tag-and-probe approach follows Ref. [17]; events are triggered by the presence of a muon from a leptonic tau decay (tag) and must contain a \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidate (probe) with \(p_{\text {T}} {} > 20\,\text {GeV}\), which is used to evaluate the tau reconstruction performance. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) selection criteria described in Sect. 2.2 are used. In addition the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) must have unit charge which is opposite to that of the muon. A discriminant designed to suppress candidates arising from the misidentification of muons [17] is also applied to increase signal purity. The invariant mass of the muon and \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\), \(m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}})\), is required to be in the range \(50\,\text {GeV}< m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}) < 85\,\text {GeV}\), as expected for \(Z \rightarrow \tau \tau \) decays. The background is dominated by multijet and \(W(\rightarrow \mu \nu )\mathrm{+jets}\) production and is estimated using the techniques from Ref. [7].

The \(Z(\rightarrow \mu \mu )\mathrm{+jets}\) tag-and-probe approach follows Ref. [58], with the following differences: both muons are required to have \(p_{\text {T}} >26\) \(\,\text {GeV}\), the dimuon invariant mass must be between 81 and 101 \(\,\text {GeV}\), and the highest-\(p_{\text {T}}\) jet is selected as a probe \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidate if it satisfies the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) selection criteria described in Sect. 2.2 but with \(p_{\text {T}} {} > 20\,\text {GeV}\) and without the electron discriminant. In this approach, two more steps are made when comparing simulated events to the data. Before the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) selection, the simulated events are reweighted so that the \(p_{\text {T}}\) distribution of the Z boson matches that in data. After the full event selection, the overall normalisation of the simulation is scaled to that in the data.

3 Reconstruction of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)

Over 90 % of hadronic tau decays occur through just five dominant decay modes, which yield one or three charged hadrons (\(h^{\pm }\)), up to two neutral pions (\(\pi ^0\)) and a tau neutrino. The neutrino goes undetected and is omitted in further discussion of the decay modes. Table 2 gives the following details for each of the five decay modes: the branching fraction, \(\mathcal {B}\); the fraction of simulated \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates that pass the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) selection described in Sect. 2.2 without the jet and electron discrimination, \(\mathcal {A}\cdot \varepsilon _\mathrm{reco}\); and the fraction of those that also pass the jet and electron discrimination, \(\varepsilon _\mathrm{ID}\). The \(h^{\pm }\) ’s are predominantly \(\pi ^{\pm } \)’s with a minor contribution from \(K^{\pm }\)’s. The modes with two or three pions proceed mainly through the intermediate \(\rho \) or \(a_1\) resonances, respectively. The \(h^{\pm }\) ’s are sufficiently long-lived that they typically interact with the detector before decaying and are therefore considered stable in the Tau Particle Flow. The \(\pi ^0 \)’s decay almost exclusively to a pair of photons. Approximately half of the photons convert into an \(e^+e^-\) pair because of interactions with the beampipe or inner-detector material. Modes with more \(\pi ^0 \)’s tend to have lower \(\varepsilon _\mathrm{ID}\) as they have wider showers that are more similar to those produced by quark- and gluon-initiated jets. The mode dependence of \(\mathcal {A}\cdot \varepsilon _\mathrm{reco}\) is due to a mixture of effects. The fraction of energy carried by visible decay products is mode dependent and the response of the calorimeter to \(h^{\pm }\) ’s and \(\pi ^0 \)’s is different, both of which impact the efficiency of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(p_{\text {T}}\) requirement. The efficiency of the track association is also dependent on the number of \(h^{\pm }\) ’s and to a lesser extent the number of \(\pi ^0 \)’s, which can contribute tracks from conversion electrons.

The goal of the Tau Particle Flow is to classify the five decay modes and to reconstruct the individual \(h^{\pm }\) ’s and \(\pi ^0 \)’s. The performance is evaluated using the energy and directional residuals of \(\pi ^0\) and \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) and the efficiency of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) decay mode classification. The \(\eta \) and \(\phi \) residuals are defined with respect to the generated values: \(\eta - \eta ^\mathrm{gen}\) and \(\phi - \phi ^\mathrm{gen}\), respectively. For \(E_{\text {T}} \), the relative residual is defined with respect to the generated value \(E_{\text {T}}/E_{\text {T}} ^\mathrm{gen}\). The core and tail resolutions for \(\eta \), \(\phi \) and \(E_{\text {T}} \) are defined as half of the 68 and 95 % central intervals of their residuals, respectively. Decays into higher-multiplicity states are accommodated by including modes with more than two \(\pi ^0 \)’s in the \(h^{\pm } \,{\ge }2\pi ^0 \) category and more than one \(\pi ^0\) in the \(3h^{\pm } \,{\ge }1\pi ^0 \) category. Decays with more than three charged hadrons are not considered. No attempt is made to reconstruct neutral kaons or to separate charged kaons from charged pions.

Table 2 Five dominant \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) decay modes [59]. Tau neutrinos are omitted from the table. The symbol \(h^\pm \) stands for \(\pi ^{\pm } \) or \(K^\pm \). Decays involving \(K^\pm \) contribute \(\sim \)3 % to the total hadronic branching fraction. Decays involving neutral kaons are excluded. The branching fraction (\(\mathcal {B}\)), the fraction of generated \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s in simulated \(Z \rightarrow \tau \tau \) events that are reconstructed and pass the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) selection described in Sect. 2.2 without the jet and electron discrimination (\(\mathcal {A}\cdot \varepsilon _\mathrm{reco}\)) and the fraction of those \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates that also pass the jet and electron discrimination (\(\varepsilon _\mathrm{ID}\)) for each decay mode are given

3.1 Concepts of the Tau Particle Flow method

The main focus of the Tau Particle Flow method is to reconstruct \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s with \(p_{\text {T}}\) values between 15 and \(100\,\text {GeV}\), which is the relevant range for tau leptons produced in decays of electroweak and SM Higgs bosons. In this case the hadrons typically have \(p_{\text {T}}\) lower than \(20\,\text {GeV}\) (peaked at \(\sim \)4\(\,\text {GeV}\)) and have an average separation of \(\Delta R\approx 0.07\). The \(h^{\pm }\) ’s are reconstructed using the tracking system, from which the charge and momentum are determined. Each track associated with the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidate in the core region is considered to be a \(h^{\pm }\) and the \(\pi ^{\pm }\) mass hypothesis is applied. Approximately 2 % of the selected \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s have a misclassified number of \(h^{\pm }\) ’s. Overestimation of the number of \(h^{\pm }\) ’s is primarily due to additional tracks from conversion electrons, which are highly suppressed by the strict track selection criteria described in Sect. 2.2. Underestimation of the number of \(h^{\pm }\) ’s is primarily caused by tracking inefficiencies (\(\sim \)10 % for charged pions with \(p_{\text {T}} >1\,\text {GeV}\) [1]), which arise from interactions of the \(h^{\pm }\) ’s with the beampipe or detector material. The \(h^{\pm }\) ’s also produce a shower in the calorimeter from which their energy and direction can be determined, but the tracker has a better performance in the relevant momentum range. The shower shapes of \(h^{\pm }\) ’s are also highly irregular, with a typical width of \(0.02<\Delta R<0.07\) in the EM calorimeter, combined with large fluctuations in the fractional energy depositions in the layers of the calorimeter. The \(\pi ^0 \)’s are reconstructed from their energy deposits in the EM calorimeter. The main challenge is to disentangle their energy deposits from \(h^{\pm }\) showers, which have a width similar to the average separation between hadrons. The photons from \(\pi ^0\) decays are highly collimated, with a typical separation of \(0.01<\Delta R < 0.03\). The majority of the \(\pi ^0\) energy is reconstructed in a single cluster in the EM calorimeter. Compared to \(h^{\pm }\) ’s, \(\pi ^0\) showers are smaller and more regular, leaving on average 10, 30 and 60 % of their energy in PS, EM1 and EM2, respectively. Almost no \(\pi ^0\) energy is deposited beyond EM2, so EM3 is considered part of the HAD calorimeter in Tau Particle Flow. The characteristic shower shapes and the kinematics of \(h^{\pm }\) ’s and \(\pi ^0 \)’s are used to identify \(\pi ^0 \)’s and to classify the tau decay mode.

In the following sections, the individual steps of the Tau Particle Flow method for \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) reconstruction are described. The first step is the reconstruction and identification of neutral pions. Next, energy deposits from individual photons in the finely segmented EM1 layer are reconstructed to identify cases where two \(\pi ^0 \)’s are contained within a single cluster. The decay mode is then classified by exploiting the available information from the reconstructed \(h^{\pm }\) ’s and \(\pi ^0 \)’s and the photons reconstructed in EM1. Following the decay mode classification, the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum is reconstructed from the individual hadrons and then combined with the Baseline energy calibration to reduce tails in the \(E_{\text {T}}\) residual distribution. The performance of the Tau Particle Flow is evaluated using \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates from simulated \(Z \rightarrow \tau \tau \) events.

3.2 Reconstruction and identification of neutral pions

The reconstruction of neutral pion candidates (\(\pi ^0 _\mathrm{cand}\)) within hadronic tau decays using the Tau Particle Flow proceeds as follows. First, \(\pi ^0 _\mathrm{cand}\)’s are created by clustering cells in the EM calorimeter in the core region of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\). In the next step, the \(\pi ^0 _\mathrm{cand}\) energy is corrected for contamination from \(h^{\pm }\) ’s. To do this, the energy that each \(h^{\pm }\) deposits in the EM calorimeter (\(E_{h^{\pm }}^\mathrm{EM}\)) is estimated as the difference between the energy of the \(h^{\pm }\) from the tracking system (\(E_{h^{\pm }}^\mathrm{trk}\)) and the energy deposited in the HAD calorimeter which is associated with the \(h^{\pm }\) (\(E_{h^{\pm }}^\mathrm{HAD}\)): \(E_{h^{\pm }}^\mathrm{EM}= E_{h^{\pm }}^\mathrm{trk}- E_{h^{\pm }}^\mathrm{HAD}\). To calculate \(E_{h^{\pm }}^\mathrm{HAD}\), all clustered energy deposits in the HAD calorimeter in the core region are assigned to the closest \(h^{\pm }\), determined using the track position extrapolated to the calorimeter layer that contains most of the cluster energy. The \(E_{h^{\pm }}^\mathrm{EM}\) of each \(h^{\pm }\) is then subtracted from the energy of the closest \(\pi ^0 _\mathrm{cand}\) if it is within \(\Delta R =0.04\) of the \(h^{\pm }\).

At this stage, many of the \(\pi ^0 _\mathrm{cand}\)’s in reconstructed hadronic tau decays do not actually originate from \(\pi ^0 \)’s, but rather from \(h^{\pm }\) remnants, pile-up or other sources. The purity of \(\pi ^0 _\mathrm{cand}\)’s is improved by applying a minimum \(p_{\text {T}}\) requirement and an identification criterion designed to reject \(\pi ^0 _\mathrm{cand}\)’s not from \(\pi ^0 \)’s. The \(p_{\text {T}}\) thresholds are in the range 2.1–\(2.7\,\text {GeV}\). After the \(p_{\text {T}}\) requirement the background is dominated by \(h^{\pm }\) remnants. The \(\pi ^0\) identification uses a BDT and exploits the properties of the \(\pi ^0 _\mathrm{cand}\) clusters, such as the energy density and the width and depth of the shower. The variables used for \(\pi ^0 _\mathrm{cand}\) identification are described in Table 3. The BDT is trained using \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s that have only one \(h^{\pm }\), and which are produced in simulated \(Z \rightarrow \tau \tau \) events. The \(\pi ^0 _\mathrm{cand}\)’s are assigned to signal or background based on whether or not they originated from a generated \(\pi ^0\). Figure 1a shows signal and background distributions for the logarithm of the second moment in energy density, which is one of the more important identification variables. The discriminating power of the \(\pi ^0\) identification is quantified by comparing the efficiency of signal and background \(\pi ^0 _\mathrm{cand}\)’s to pass thresholds on the identification score, as shown in Fig. 1b. The \(p_{\text {T}}\) and identification score thresholds are optimised in five \(|\eta |\) ranges, corresponding to structurally different regions of the calorimeter, to maximise the number of \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s with the correct number of reconstructed \(h^{\pm }\) ’s and identified \(\pi ^0 _\mathrm{cand}\)’s  (\(\pi ^0 _\mathrm{ID}\)’s).

The \(h^{\pm }\) and \(\pi ^0\) counting performance is depicted in Fig. 2 by a decay mode classification matrix which shows the probability for a given generated mode to be reconstructed as a particular mode. Only \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) decays that are reconstructed and pass the selection described in Sect. 2.2 are considered (corresponding efficiencies are given in Table 2). The total fraction of correctly classified tau decays (diagonal fraction) is 70.9 %. As can be seen, for \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s with one \(h^{\pm }\), the separation of modes with and without \(\pi ^0 \)’s is quite good, but it is difficult to distinguish between \(h^{\pm } \,\pi ^0 \) and \(h^{\pm } \,{\ge }2\pi ^0 \). The largest contributions to the misclassification arise from \(h^{\pm } \,{\ge }2\pi ^0 \) decays where one of the \(\pi ^0 \)’s failed selection or where the energy deposits of both \(\pi ^0 \)’s merge into a single cluster. It is also difficult to distinguish between the \(3h^{\pm } \) and \(3h^{\pm } \,{\ge }1\pi ^0 \) modes because the \(\pi ^0 \)’s are typically soft with large overlapping \(h^{\pm }\) deposits.

Table 3 Cluster variables used for \(\pi ^0 _\mathrm{cand}\) identification. The variables \(|\eta ^\mathrm{clus}|\), \(\langle r^2\rangle ^\mathrm{clus}\), \(\lambda _\mathrm{centre}^\mathrm{clus}\), \(f_\mathrm{core}^\mathrm{clus}\) and \(\log \langle \rho ^2\rangle ^\mathrm{clus}\) are taken directly from the cluster reconstruction [36]. To avoid confusion with other variables used in tau reconstruction, the superscript clus has been added to each variable
Fig. 1
figure 1

a Distribution of the logarithm of the second moment in energy density of \(\pi ^0 _\mathrm{cand}\) clusters that do (signal) or do not (background) originate from \(\pi ^0 \)’s, as used in the \(\pi ^0\) identification. b 1 \(-\) efficiency for background \(\pi ^0 _\mathrm{cand}\)’s vs. the efficiency for signal \(\pi ^0 _\mathrm{cand}\)’s to pass thresholds on the \(\pi ^0\) identification score. The \(\pi ^0 _\mathrm{cand}\)’s in both figures are associated with \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s selected from simulated \(Z \rightarrow \tau \tau \) events

Fig. 2
figure 2

Decay mode classification efficiency matrix showing the probability for a given generated mode to be reconstructed as a particular mode by the Tau Particle Flow after \(\pi ^0\) reconstruction in simulated \(Z \rightarrow \tau \tau \) events. Decays containing neutral kaons are omitted. Only decays from \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s that are reconstructed and pass the selection described in Sect. 2.2 are considered. The statistical uncertainty is negligible

Two alternative methods for \(\pi ^0\) reconstruction were also developed. In the first method (Pi0Finder) the number of \(\pi ^0 \)’s in the core region is first estimated from global tau features measured using calorimetric quantities and the momenta of the associated \(h^{\pm }\) tracks. Clusters in the EM calorimeter are then chosen as \(\pi ^0 _\mathrm{cand}\)’s using a \(\pi ^0\) likeness score based on their energy deposition in the calorimeter layers and the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) track momenta. The likeness score does not exploit cluster moments to the same extent as the \(\pi ^0\) identification of the Tau Particle Flow and cluster moments are not used at all to estimate the number of \(\pi ^0\). This method was used to calculate variables for jet discrimination in Run 1 [17], but was not exploited further. The other method (shower shape subtraction, SSS) is a modified version of Tau Particle Flow, which attempts to subtract the \(h^{\pm }\) shower from the calorimeter at cell level using average shower shapes derived from simulation. The shower shapes are normalised such that their integral corresponds to \(E_{h^{\pm }}^\mathrm{EM}\) and centred on the extrapolated position of the \(h^{\pm }\) track. They are then subtracted from the EM calorimeter prior to the clustering, replacing the cluster-level subtraction of \(E_{h^{\pm }}^\mathrm{EM}\).

The \(\pi ^0\) \(E_{\text {T}}\), \(\eta \) and \(\phi \) residual distributions for all \(\pi ^0\) reconstruction algorithms are shown in Fig. 3a–c, respectively. The core angular resolutions for each algorithm are quite similar with \(\sim \)0.0056 in \(\eta \) and \(\sim \)0.012 rad in \(\phi \). The Pi0Finder algorithm has the poorest performance, with core resolutions of 0.0086 and 0.016 rad in \(\eta \) and \(\phi \), respectively, and significantly larger tails. The core \(E_{\text {T}}\) resolutions are almost identical for the Tau Particle Flow and SSS, both with 16 %, compared to 23 % for Pi0Finder. The Tau Particle Flow and SSS both show a shift in the reconstructed \(E_{\text {T}}\) of a few percent, due to incomplete subtraction of the \(h^{\pm }\) remnant. In the calculation of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum in the Tau Particle Flow (Sect. 3.5), this bias is corrected for by a decay-mode-dependent calibration. Despite the more sophisticated shower subtraction employed in the SSS algorithm, it does not perform significantly better; the improvement in the total fraction of correctly classified tau decays is \(\sim \)1 %. This is partly because many of the \(\pi ^0 _\mathrm{cand}\)’s are sufficiently displaced from \(h^{\pm }\) ’s so that they have little energy contamination and are unaffected by the subtraction, and partly because the signature of clusters that contain \(\pi ^0 \)’s, even in the presence of overlapping \(h^{\pm }\) energy, is distinct enough for the BDT to identify. Contributions from pile-up have little effect on the \(\pi ^0 _\mathrm{cand}\) reconstruction in Tau Particle Flow; on average the \(E_{\text {T}}\) increases by \(\sim \)15\(\,\text {MeV}\) and its resolution degrades fractionally by \(\sim \)0.5\(\,\%\) per additional reconstructed vertex.

Fig. 3
figure 3

Distributions of the \(\pi ^0\) residuals in a transverse energy \(E_{\text {T}}\), b pseudorapidity \(\eta \) and c azimuth \(\phi \) in correctly reconstructed \(h^{\pm } \,\pi ^0 \) decays of tau leptons in simulated \(Z \rightarrow \tau \tau \) events

3.3 Reconstruction of individual photon energy deposits in EM1

During the \(\pi ^0\) reconstruction, the energy deposits from both photons typically merge into a single cluster. Furthermore, for \(Z \rightarrow \tau \tau \) events, in about half of the \(h^{\pm } \,{\ge }2\pi ^0 \) decays misclassified as \(h^{\pm } \,\pi ^0 \) by the \(\pi ^0\) reconstruction, at least three of the photons from two \(\pi ^0 \)’s are grouped into a single cluster. The fraction increases for higher \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(p_{\text {T}}\) due to the collimation of the tau decay products. The identification of the energy deposits from individual photons in the finely segmented EM1 layer can be exploited to improve the \(\pi ^0\) reconstruction, as discussed in the following.

Almost all photons begin to shower by the time they traverse EM1, where they deposit on average \(\sim \)30 % of their energy. In contrast, particles that do not interact electromagnetically rarely deposit a significant amount of energy in this layer, making it ideal for the identification of photons. Furthermore, the cell segmentation in \(\eta \) in this layer is finer than the average photon separation and comparable to the average photon shower width, allowing individual photons to be distinguished.

The reconstruction of energy deposits in EM1 proceeds as follows. First, local energy maxima are searched for within the core region. A local maximum is defined as a single cell with \(E_{\text {T}} >100\,\text {MeV}\) whose nearest neighbours in \(\eta \) both have lower \(E_{\text {T}}\). Maxima found in adjacent \(\phi \) cells are then combined: their energy is summed and the energy-weighted mean of their \(\phi \) positions is used. Figure 4 shows the efficiency for photons to create a local maximum (maxima efficiency), evaluated in the sample of single \(\pi ^0 \)’s. The efficiency decreases rapidly at low photon \(p_{\text {T}}\) as many of the photons fall below the \(100\,\text {MeV}\) threshold. The fraction of misreconstructed maxima due to noise or fluctuations from the photon shower is very low for maxima with \(E_{\text {T}} >500\,\text {MeV}\), but increases quickly at lower \(E_{\text {T}}\). At high photon \(p_{\text {T}}\) , corresponding to high \(\pi ^0\) \(p_{\text {T}}\) , the boost of the \(\pi ^0\) becomes large enough that the pair of photons almost always creates a single maximum. Figure 4 also shows the probability that a maximum is shared with the other photon in the single \(\pi ^0\) sample (share probability).

Fig. 4
figure 4

Efficiency for a photon to create a maximum in the first layer of the EM calorimeter in simulated \(\pi ^{0}\rightarrow \gamma \gamma \) events and the corresponding probability to create a maximum that is shared with the other photon. The photons are required to not interact with the material in the tracking system

The \(h^{\pm } \,{\ge }2\pi ^0 \) decay mode classification is improved by counting the number of maxima associated with \(\pi ^0 _\mathrm{cand}\)’s. An energy maximum is assigned to a \(\pi ^0 _\mathrm{cand}\) if its cell is part of the \(\pi ^0 _\mathrm{cand}\) cluster and it has an \(E_{\text {T}}\) of more than 300–\(430\,\text {MeV}\) (depending on the \(\eta \) region). The energy threshold is optimised to maximise the total number of correctly classified tau decays. Maxima with \(E_{\text {T}} >10\,\text {GeV}\) are counted twice, as they contain the merged energy deposits of two photons from a \(\pi ^0\) decay with a probability larger than 95 %. Finally, \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates that were classified as \(h^{\pm } \,\pi ^0 \) , but have a \(\pi ^0 _\mathrm{cand}\) with at least three associated maxima are reclassified as \(h^{\pm } \,{\ge }2\pi ^0 \). The method recovers 16 % of misclassified \(h^{\pm } \,{\ge }2\pi ^0 \) decays with a misclassification of \(h^{\pm } \,\pi ^0 \) decays of 2.5 %.

3.4 Decay mode classification

Determination of the decay mode by counting the number of reconstructed \(h^{\pm }\) ’s and \(\pi ^0 _\mathrm{ID}\)’s alone can be significantly improved by simultaneously analysing the kinematics of the tau decay products, the \(\pi ^0\) identification scores and the number of photons from the previous reconstruction steps. Exploitation of this information is performed via BDTs.

As the most difficult aspect of the classification is to determine the number of \(\pi ^0 \)’s, three decay mode tests are defined to distinguish between the following decay modes: \(h^{\pm }\)’s with zero or one \(\pi ^0\), \(h^{\pm } \,\{0,1\}\pi ^0 \); \(h^{\pm }\)’s with one or more \(\pi ^0 \)’s, \(h^{\pm } \,\{1,{\ge }2\}\pi ^0 \); and 3\(h^{\pm }\)’s with and without \(\pi ^0 \)’s, \(3h^{\pm } \,\{0,{\ge }1\}\pi ^0 \). Which of the three tests to apply to a \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidate is determined as follows. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates with one or three associated tracks without any reconstructed \(\pi ^0 _\mathrm{cand}\)’s are always classified as \(h^{\pm } \) or \(3h^{\pm } \) , respectively. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates with one associated track and at least two \(\pi ^0 _\mathrm{cand}\)’s, of which at least one is \(\pi ^0 _\mathrm{ID}\), enter the \(h^{\pm } \,\{1,{\ge }2\}\pi ^0 \) test. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates with one \(\pi ^0 _\mathrm{ID}\) that are classified as \(h^{\pm } \,{\ge }2\pi ^0 \) by counting the photons in this cluster, as described in Sect. 3.3, retain their classification and are not considered in the decay mode tests. The remaining \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates with one or three associated tracks enter the \(h^{\pm } \,\{0,1\}\pi ^0 \) or \(3h^{\pm } \,\{0,{\ge }1\}\pi ^0 \) tests, respectively.

A BDT is trained for each decay mode test using \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates from simulated \(Z \rightarrow \tau \tau \) events, to separate \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s of the two generated decay types the test is designed to distinguish. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates entering each decay mode test are then further categorised based on the number of \(\pi ^0 _\mathrm{ID}\)’s. A threshold is placed on the output BDT score in each category to determine the decay mode. The thresholds are optimised to maximise the number of correctly classified \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates. The BDT training was not split based on the number of \(\pi ^0 _\mathrm{ID}\)’s due to the limited size of the training sample.

The variables used for the decay mode tests are designed to discriminate against additional misidentified \(\pi ^0 _\mathrm{cand}\)’s, which usually come from imperfect \(h^{\pm }\) subtraction, pile-up or the underlying event. The associated clusters typically have low energy and a low \(\pi ^0\) identification score. Remnant clusters from imperfect \(h^{\pm }\) subtraction are also typically close to the \(h^{\pm }\) track and have fewer associated photon energy maxima. If the \(\pi ^0 _\mathrm{cand}\) clusters originate from tau decays, their directions and fractional energies are correlated with each other. Additionally, with increasing number of tau decay products, the available phase space per decay product becomes smaller. Each variable used in the BDTs is described briefly in Table 4. Table 5 summarises the decay mode tests and indicates which variables are used in each.

Table 4 Variables used in the BDTs for the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) decay mode classification. They are designed to discriminate against additional misidentified \(\pi ^0 _\mathrm{cand}\)’s, which usually come from imperfect subtraction, pile-up or the underlying event
Table 5 Details regarding the decay mode classification of the Tau Particle Flow. BDTs are trained to distinguish decay modes in three decay mode tests. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s entering each test are further categorised based on the number of reconstructed, \(N(\pi ^0 _\mathrm{cand})\), and identified, \(N(\pi ^0 _\mathrm{ID})\), neutral pions. The variables used in the BDTs for each test are listed

Figure 5 shows the discrimination power of the tests categorised by the number of \(\pi ^0 _\mathrm{cand}\)’s and \(\pi ^0 _\mathrm{ID}\)’s. The decay mode fractions at the input of each test vary strongly, which impacts the position of the optimal BDT requirements. The resulting classification matrix is shown in Fig. 6. The total fraction of correctly classified tau decays is 74.7 %. High efficiencies in the important \(h^{\pm } \), \(h^{\pm } \,\pi ^0 \) and \(3h^{\pm } \) modes are achieved. The decay mode purity is defined as the fraction of \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates of a given reconstructed mode which originated from a generated \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) of the same mode, also calculated using \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s in simulated \(Z \rightarrow \tau \tau \) events. The purity of the \(h^{\pm } \), \(h^{\pm } \,\pi ^0 \) and \(3h^{\pm } \) decay modes is 70.3, 73.5 and 85.2 %, respectively. For comparison, in the Baseline reconstruction where \(\pi ^0\) reconstruction was not available, the fractions of generated \(h^{\pm } \) and \(h^{\pm } \,\pi ^0 \) in \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s with one reconstructed track are 27.4 and 52.2 %, respectively, and the fraction of \(3h^{\pm } \) in \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s with three reconstructed tracks is 68.9 %. Decays containing neutral kaons are omitted from the table. They are classified as containing \(\pi ^0 \)’s approximately half of the time. Contributions from pile-up have little effect on the classification efficiency, degrading it by \(\sim \)0.04 % per additional reconstructed vertex. The number of \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates for each classified decay mode is shown in Fig. 7a for real \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s from the \(Z \rightarrow \tau \tau \) tag-and-probe analysis and in Fig. 7b for jets from the \(Z(\rightarrow \mu \mu )\mathrm{+jets}\) tag-and-probe analysis. While systematic uncertainties have not been evaluated, the figures indicate reasonable modelling of the decay mode classification for \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s and jets. In both selections, the \(3h^{\pm } \) efficiency is slightly underestimated and the \(h^{\pm } \,{\ge }2\pi ^0 \) and \(3h^{\pm } \,{\ge }1\pi ^0 \) efficiencies are slightly overestimated.

Fig. 5
figure 5

Decay mode classification efficiency for the \(h^{\pm } \,\{0,1\}\pi ^0 \) , \(h^{\pm } \,\{1,{\ge }2\}\pi ^0 \) , and \(3h^{\pm } \,\{0,{\ge }1\}\pi ^0 \) tests. For each test, “decay mode 1” corresponds to the mode with fewer \(\pi ^0 \)’s. Working points corresponding to the optimal thresholds on the BDT score for each test are marked

Fig. 6
figure 6

Decay mode classification efficiency matrix showing the probability for a given generated mode to be reconstructed as a particular mode by the Tau Particle Flow after final decay mode classification in simulated \(Z \rightarrow \tau \tau \) events. Decays containing neutral kaons are omitted. Only decays from \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\)’s that are reconstructed and pass the selection described in Sect. 2.2 are considered. The statistical uncertainty is negligible

Fig. 7
figure 7

Number of \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidates for each classified decay mode in the a \(Z \rightarrow \tau \tau \) and the b \(Z(\rightarrow \mu \mu )\mathrm{+jets}\) tag-and-probe analyses. The simulated \(Z \rightarrow \tau \tau \) sample is split into contributions from each generated tau decay mode. The background in the \(Z \rightarrow \tau \tau \) analysis is dominated by multijet and \(W(\rightarrow \mu \nu )\mathrm{+jets}\) production. The simulated \(Z(\rightarrow \mu \mu )\mathrm{+jets}\) events are reweighted so that the Z boson \(p_{\text {T}}\) distribution and the overall normalisation match that in the data. The hatched band represents the statistical uncertainty on the prediction

3.5 Four-momentum reconstruction

The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum reconstruction begins with summing the four-momenta of the \(h^{\pm }\) and \(\pi ^0 _\mathrm{cand}\) constituents (Constituent-based calculation). Only the first n \(\pi ^0 _\mathrm{cand}\)’s with the highest \(\pi ^0\) identification scores are included, where n is determined from the decay mode classification, and can be at most 2 \(\pi ^0 _\mathrm{cand}\)’s in the \(h^{\pm } \,{\ge }2\pi ^0 \) mode and at most 1 \(\pi ^0 _\mathrm{cand}\) in the \(3h^{\pm } \,{\ge }1\pi ^0 \) mode. A pion mass hypothesis is used for \(\pi ^0 _\mathrm{cand}\)’s. There are two exceptions: if the decay mode is classified as \(h^{\pm } \,\pi ^0 \) but there are two identified \(\pi ^0 _\mathrm{cand}\)’s, the mass of each is set to zero and both are added to the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum as they are most likely photons from a \(\pi ^0\) decay; or if the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) candidate is classified as \(h^{\pm } \,{\ge }2\pi ^0 \) because three or more photons are found in a single \(\pi ^0 _\mathrm{cand}\), only this \(\pi ^0 _\mathrm{cand}\) is added and its mass is set to twice the \(\pi ^0\) mass. A calibration is applied to the Constituent-based \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) energy in each decay mode as a function of the Constituent-based \(E_{\text {T}}\) , to correct for the \(\pi ^0 _\mathrm{cand}\) energy bias. The resulting four-momentum is used to set the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) direction in the Tau Particle Flow. Figure 8a, b show distributions of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(\eta \) and \(\phi \) residuals of the Tau Particle Flow and the Baseline four-momentum reconstruction. The core angular resolutions of the Tau Particle Flow are 0.002 in \(\eta \) and 0.004 rad in \(\phi \), which are more than five times better than the Baseline resolutions of 0.012 and 0.02 rad, respectively.

Figure 9a shows distributions of the \(E_{\text {T}}\) residuals. The Constituent-based calculation is inherently stable against pile-up as both the decay-mode classification used to select \(h^{\pm }\) ’s and \(\pi ^0 _\mathrm{cand}\)’s, and the reconstruction of \(h^{\pm }\) ’s and \(\pi ^0 _\mathrm{cand}\)’s themselves, are stable against pile-up. The \(E_{\text {T}}\) increases by \(\sim \)6\(\,\text {MeV}\) and its resolution degrades fractionally by \(\sim \)0.6 % per additional reconstructed vertex. Figure 9b shows the resolution as a function of the \(E_{\text {T}}\) of the generated \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) . For the final energy calibration of the Tau Particle Flow, the Constituent-based \(E_{\text {T}}\) is combined with the Baseline \(E_{\text {T}}\) by weighting each by the inverse-square of their respective \(E_{\text {T}}\)-dependent core resolutions, which ensures a smooth transition to high \(p_{\text {T}}\) where the Baseline calibration is superior. The Baseline \(E_{\text {T}}\) is used if the two \(E_{\text {T}}\) values disagree by more than five times their combined core resolutions, as it has smaller resolution tails. The resolution of the Tau Particle Flow is superior in both the core and tails at low \(E_{\text {T}}\) with a core resolution of 8 % at an \(E_{\text {T}}\) of 20 \(\,\text {GeV}\), compared to 15 % from the Baseline. It approaches the Baseline performance at high \(E_{\text {T}}\) . Contributions from pile-up have little effect on the four-momentum reconstruction of the Tau Particle Flow; the \(E_{\text {T}}\) increases by \(\sim \)4\(\,\text {MeV}\) and its core resolution degrades fractionally by \(\sim \)0.5 % per additional reconstructed vertex. The \(E_{\text {T}}\) residual distributions of the Tau Particle Flow split into the reconstructed decay modes are shown in Fig. 9c. The total is non-Gaussian, as it is the sum of contributions with different functional forms. Correctly reconstructed decays containing only \(h^{\pm }\) ’s have the best resolution, followed by correctly reconstructed decays containing \(\pi ^0 _\mathrm{cand}\)’s. The excellent resolution of these decays leads to a superior overall core resolution. Misreconstructed decays have the poorest resolution and result in larger tails. In particular, misestimation of the number of \(\pi ^0 _\mathrm{cand}\)’s leads to a bias of up to 25 %. Decays containing neutral kaons exhibit a large low-energy bias because at least some of their energy is typically missed by the reconstruction.

An alternative method for the \(E_{\text {T}}\) calibration was also developed, based on Ref. [30]. It also uses a combination of calorimetric and tracking measurements and the Tau Particle Flow decay mode classification. The \(h^{\pm }\) \(p_{\text {T}}\) is measured using tracks and the \(\pi ^0\) \(E_{\text {T}}\) is estimated as the difference between the \(E_{\text {T}}\) of the seed jet at the EM scale [36] and the \(E_{\text {T}}\) from the summed momenta of all \(h^{\pm }\) ’s, scaled by their expected calorimeter response [60]. The method has similar overall performance to the Tau Particle Flow.

Fig. 8
figure 8

The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) a \(\eta \) and b \(\phi \) residual distributions of the Tau Particle Flow compared to the Baseline reconstruction

Fig. 9
figure 9

The a \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) relative \(E_{\text {T}}\) residual distribution and b the half-widths spanned by the 68 and 95 % quantiles, i.e. the core and tail resolutions, of the relative \(E_{\text {T}}\) residual distributions as a function of the generated \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(E_{\text {T}}\). The Baseline, Constituent-based and Tau Particle Flow calculations are shown. The relative \(E_{\text {T}}\) residual distribution of the Tau Particle Flow split in the reconstructed decay mode c is also shown

Figure 10a shows the distribution of the invariant mass of the muon and \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\), \(m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}})\), calculated using the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum reconstruction from the Tau Particle Flow in the \(Z \rightarrow \tau \tau \) tag-and-probe analysis before selection on \(m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}})\). The \(m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}})\) has a linear dependence on the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(E_{\text {T}}\) and analysis of the distribution has previously been used to calibrate the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(E_{\text {T}}\)  [17]. Data and simulation agree well, indicating that the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) \(E_{\text {T}}\) is well modelled by the simulation. Finally, Fig. 10b shows the mass spectrum of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) reconstructed with the Tau Particle Flow in the \(Z \rightarrow \tau \tau \) tag-and-probe analysis. The \(a_1\) resonance in the \(3h^{\pm } \) mode is reconstructed with negligible experimental resolution compared to the intrinsic line shape due to the excellent four-momentum resolution of the inner detector for \(h^{\pm }\) ’s. The \(\rho \) and \(a_1\) resonances in the \(h^{\pm } \,\pi ^0 \) and \(h^{\pm } \,{\ge }2\pi ^0 \) modes are also visible, but have significant degradation due to the resolution from the reconstructed \(\pi ^0 _\mathrm{cand}\) four-momentum. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) mass spectra in data and simulation agree well, suggesting good modelling of the individual \(h^{\pm }\) and \(\pi ^0 _\mathrm{cand}\) four-momenta.

Fig. 10
figure 10

Distribution of a the invariant mass of the muon and \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\), \(m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}})\) before selection on \(m(\mu ,\tau _{\mathrm{had}{\text {-}}\mathrm{vis}})\) is applied; and b the reconstructed mass of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) , when using the Tau Particle Flow \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) four-momentum reconstruction in the \(Z \rightarrow \tau \tau \) tag-and-probe analysis. The simulated \(Z \rightarrow \tau \tau \) sample is split into contributions from each generated tau decay mode. The background is dominated by multijet and \(W(\rightarrow \mu \nu )\mathrm{+jets}\) production. The hatched band represents the statistical uncertainty on the prediction

4 Summary and conclusions

This paper presents a new method to reconstruct the individual charged and neutral hadrons in tau decays with the ATLAS detector at the LHC. The neutral pions are reconstructed with a core energy resolution of \(\sim \)16 %. The reconstructed hadrons are used to calculate the visible four-momentum of reconstructed tau candidates and to classify the decay mode, allowing the decays to be distinguished not only by the number of \(h^{\pm }\) ’s but also by the number of \(\pi ^0 \)’s, which is not possible with the existing tau reconstruction. This improves the purity with which the \(\tau ^{-} \rightarrow \pi ^{-} \nu \), \(\tau ^{-} \rightarrow \pi ^{-} \pi ^0 \nu \) and \(\tau ^{-} \rightarrow \pi ^{-} \pi ^+ \pi ^{-} \nu \) decays can be selected, by factors of 2.6, 1.4 and 1.2, respectively. The \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) core directional resolution is improved by more than a factor of five and the core energy resolution is improved by up to a factor of two at low \(E_{\text {T}}\) (\(20\,\text {GeV}\)). The performance was validated using samples of \(Z \rightarrow \tau \tau \) and \(Z(\rightarrow \mu \mu )\mathrm{+jets}\) events selected from pp collision data at \(\sqrt{s}=8\,\text {TeV}\), corresponding to an integrated luminosity of 5 \(\mathrm{fb}^{-1}\). The results suggest good modelling of the \(\tau _{\mathrm{had}{\text {-}}\mathrm{vis}}\) decay mode classification efficiency and four-momentum reconstruction.