1 Introduction

The efficient identification of muons and the accurate measurement of their momenta are two of the main features of the ATLAS detector [1] at the LHC. These characteristics are often crucial in physics analysis, as for example in precise measurements of Standard Model processes [24], in the discovery of the Higgs boson, in the determination of its mass [5, 6], and in searches for physics beyond the Standard Model [7, 8]. This publication presents the performance of the ATLAS muon reconstruction during the LHC run at \(\sqrt{s}=7\)–8 TeV, focusing mainly on data collected in 2012. The performance of the ATLAS muon reconstruction has already been presented in a recent publication [9] based on 2010 data. The results presented here are based on an integrated luminosity \(\approx 500\) times larger, which allows a large reduction of the uncertainties. The measurements of the efficiency, of the momentum scale and resolution are discussed with a particular emphasis on the comparison between data and Monte Carlo (MC) simulation, on the corrections used in the physics analyses and on the associated systematic uncertainties. Muons with very large transverse momentum,Footnote 1 \(p_{\mathrm{T}}> 120\) GeV, are not treated here as they will be the subject of a forthcoming publication on the alignment of the ATLAS muon spectrometer and its high-\(p_{\mathrm{T}}\) performance.

This publication is structured as follows: Sect. 2 gives a short description of muon detection in ATLAS and Sect. 3 describes the real and simulated data samples used in the performance analysis. The measurement of the reconstruction efficiency is described in Sect. 4 while Sect. 5 reports the momentum scale and resolution. A method for including photons from final-state radiation in the reconstruction of the muon kinematics, is described in Sect. 6. Conclusions are given in Sect. 7.

2 Muon identification and reconstruction

A detailed description of the ATLAS detector can be found elsewhere [1]. The ATLAS experiment uses the information from the muon spectrometer (MS) and from the inner detector (ID) and, to a lesser extent, from the calorimeter, to identify and precisely reconstruct muons produced in the \(pp\) collisions.

The MS is the outermost of the ATLAS sub-detectors: it is designed to detect charged particles in the pseudorapidity region up to \(|\eta | = 2.7\), and to provide momentum measurement with a relative resolution better than 3 % over a wide \(p_{\mathrm{T}}\) range and up to 10 % at \(p_{\mathrm{T}}\approx 1\) TeV. The MS consists of one barrel part (for \(|\eta | < 1.05\)) and two end-cap sections. A system of three large superconducting air-core toroid magnets provides a magnetic field with a bending integral of about \(2.5\) Tm in the barrel and up to \(6\) Tm in the end-caps. Triggering and \(\eta \), \(\phi \) position measurements, with typical spatial resolution of \(5\)–10 mm, are provided by the Resistive Plate Chambers (RPC, three doublet layers for \(|\eta | < 1.05\)) and by the Thin Gap Chambers (TGC, three triplet and doublet layers for \(1.0<|\eta | < 2.4\)). Precise muon momentum measurement is possible up to \(|\eta |=2.7\) and it is provided by three layers of Monitored Drift Tube Chambers (MDT), each chamber providing six to eight \(\eta \) measurements along the muon track. For \(|\eta |>2\) the inner layer is instrumented with a quadruplet of Cathode Strip Chambers (CSC) instead of MDTs. The single hit resolution in the bending plane for the MDT and the CSC is about \(80\) \(\upmu \)m and \(60\) \(\upmu \)m, respectively. Tracks in the MS are reconstructed in two steps: first local track segments are sought within each layer of chambers and then local track segments from different layers are combined into full MS tracks.

The ID provides an independent measurement of the muon track close to the interaction point. It consists of three sub-detectors: the Silicon Pixels and the Semi-Conductor Tracker (SCT) detectors for \(|\eta | < 2.5\) and the Transition Radiation Tracker (TRT) covering \(|\eta | < 2.0\). They provide high-resolution coordinate measurements for track reconstruction inside an axial magnetic field of \(2\) T. A track in the barrel region has typically 3 Pixel hits, 8 SCT hits, and approximately 30 TRT hits.

The material between the interaction point and the MS ranges approximately from 100 to 190 radiation lengths, depending on \(\eta \), and consists mostly of calorimeters. The sampling liquid-argon (LAr) electromagnetic calorimeter covers \(|\eta |<3.2\) and is surrounded by hadronic calorimeters based on iron and scintillator tiles for \(|\eta |\lesssim 1.5\) and on LAr for larger values of \(|\eta |\).

Muon identification is performed according to several reconstruction criteria (leading to different muon “types”), according to the available information from the ID, the MS, and the calorimeter sub-detector systems. The different types are:

  • Stand-Alone (SA) muons: the muon trajectory is reconstructed only in the MS. The parameters of the muon track at the interaction point are determined by extrapolating the track back to the point of closest approach to the beam line, taking into account the estimated energy loss of the muon in the calorimeters. In general the muon has to traverse at least two layers of MS chambers to provide a track measurement. SA muons are mainly used to extend the acceptance to the range \(2.5<|\eta |<2.7\) which is not covered by the ID;

  • Combined (CB) muon: track reconstruction is performed independently in the ID and MS, and a combined track is formed from the successful combination of a MS track with an ID track. This is the main type of reconstructed muons;

  • Segment-tagged (ST) muons: a track in the ID is classified as a muon if, once extrapolated to the MS, it is associated with at least one local track segment in the MDT or CSC chambers. ST muons can be used to increase the acceptance in cases in which the muon crossed only one layer of MS chambers, either because of its low \(p_{\mathrm{T}}\) or because it falls in regions with reduced MS acceptance;

  • Calorimeter-tagged (CaloTag) muons: a track in the ID is identified as a muon if it could be associated to an energy deposit in the calorimeter compatible with a minimum ionizing particle. This type has the lowest purity of all the muon types but it recovers acceptance in the uninstrumented regions of the MS. The identification criteria of this muon type are optimized for a region of \(|\eta |<0.1\) and a momentum range of \(25\lesssim p_{\mathrm{T}}\lesssim 100\) GeV.

CB candidates have the highest muon purity. The reconstruction of tracks in the spectrometer, and as a consequence the SA and CB muons, is affected by acceptance losses mainly in two regions: at \(\eta \approx 0\), where the MS is only partially equipped with muon chambers in order to provide space for the services for the ID and the calorimeters, and in the region (\(1.1 < \eta < 1.3\)) between the barrel and the positive \(\eta \) end-cap, where there are regions in \(\phi \) with only one layer of chambers traversed by muons in the MS, due to the fact that some of the chambers of that region were not yet installed.Footnote 2

The reconstruction of the SA, CB and ST muons (all using the MS information) has been performed using two independent reconstruction software packages, implementing different strategies [10] (named “Chains”) both for the reconstruction of muons in the MS and for the ID-MS combination. For the ID-MS combination, the first chain (“Chain 1”) performs a statistical combination of the track parameters of the SA and ID muon tracks using the corresponding covariance matrices. The second (“Chain 2”) performs a global refit of the muon track using the hits from both the ID and MS sub-detectors. The use of two independent codes provided redundancy and robustness in the ATLAS commissioning phase. A unified reconstruction programme (“Chain 3”) has been developed to incorporate the best features of the two chains and has been used, in parallel to the other two, for the reconstruction of 2012 data. It is planned to use only Chain 3 for future data taking. So far, the first two chains were used in all ATLAS publications. As the three chains have similar performance, only results for “Chain 1” are shown in the present publication. A summary of the results for the other two chains is reported in Appendix A.

The following quality requirements are applied to the ID tracks used for CB, ST or CaloTag muons:

  • at least 1 Pixel hit;

  • at least 5 SCT hits;

  • at most 2 active Pixel or SCT sensors traversed by the track but without hits;

  • in the region of full TRT acceptance, \(0.1<|\eta |<1.9\), at least 9 TRT hits.

The number of hits required in the first two points is reduced by one if the track traverses a sensor known to be inefficient according to a time-dependent database. The above requirements are dropped in the region \(|\eta |>2.5\), where short ID track segments can be matched to SA muons to form a CB muon.

3 Data and Monte Carlo samples

3.1 Data samples

The results presented in this article are mostly obtained from the analysis of \(\sqrt{s}=8\) TeV \(pp\) collision events corresponding to an integrated luminosity of \(20.3\) fb\(^{-1}\) collected by the ATLAS detector in 2012. Results from \(pp\) collisions at \(\sqrt{s}=7\) TeV, collected in 2011, are presented in Appendix B. Events are accepted only if the ID, the MS and the calorimeter detectors were operational and both solenoid and toroid magnet systems were on.

The online event selection was performed by a three-level trigger system described in Ref. [11]. The performance of the ATLAS muon trigger during the 2012 data taking period is reported in Ref. [12]. The \(Z\rightarrow \mu \mu \) candidates have been selected online by requiring at least one muon candidate with \(p_{\mathrm{T}}>24\) GeV, isolated from other activity in the ID. The \({J/\psi \rightarrow \mu \mu }\) and the \({\Upsilon \rightarrow \mu \mu }\) samples used for momentum scale and resolution studies have been selected online with two dedicated dimuon triggers that require two opposite-charge muons compatible with the same vertex, with transverse momentum \(p_{\mathrm{T}}>6\) GeV, and the dimuon invariant mass in the range 2.5–4.5 GeV for the \(J/\psi \) and 8–11 GeV for the \(\Upsilon \) trigger. The \({J/\psi \rightarrow \mu \mu }\) sample used for the efficiency measurement was instead selected using a mix of single-muon triggers and a dedicated trigger requiring a muon with \(p_{\mathrm{T}}>6\) GeV and an ID track with \(p_{\mathrm{T}}> 3.5\) GeV, such that the invariant mass of the muon+track pair, under a muon mass hypothesis, is in the window \(2.7\)–3.5 GeV. This dedicated trigger operated during the whole data taking period with a prescaled rate of \(\approx 1\) Hz.

3.2 Monte Carlo samples

Monte Carlo samples for the process \(pp \rightarrow (Z/\gamma ^*) X \rightarrow \mu ^+\mu ^- X\), called \(Z \rightarrow \mu \mu \) in the following, were generated using POWHEG [13] interfaced to PYTHIA8 [14]. The CT10 [15] parton density functions (PDFs) have been used. The PHOTOS [16] package has been used to simulate final state photon radiation (FSR), using the exponentiated mode that leads to multi-photon emission taking into account \(\gamma ^*\) interference in \(Z\) decays. To improve the description of the dimuon invariant mass distribution, the generated lineshape was reweighted using an improved Born approximation with a running-width definition of the \(Z\) lineshape parameters. The ALPGEN [17] generator, interfaced with PYTHIA6 [18], was also used to generate alternative \(Z \rightarrow \mu \mu \) samples.

Samples of prompt \({J/\psi \rightarrow \mu \mu }\) and of \({\Upsilon \rightarrow \mu \mu }\) were generated using PYTHIA8, complemented with PHOTOS to simulate the effects of final state radiation. The samples were generated requiring each muon to have \(p_{\mathrm{T}}>6.5\)(\(6\)) GeV for \(J/\psi \) (\(\Upsilon \)). The \(J/\psi \) distribution in rapidity and transverse momentum has been reweighted in the simulated samples to match the distribution observed in the data. The samples used for the simulation of the backgrounds to \(Z \rightarrow \mu \mu \) are described in detail in [19], they include \(Z\rightarrow \tau \tau \), \(W\rightarrow \mu \nu \) and \(W\rightarrow \tau \nu \), generated with POWHEG, \(WW\), \(ZZ\) and \(WZ\) generated with SHERPA [20], \(t\bar{t}\) samples generated with MC@NLO [21] and \(b\bar{b}\) as well as \(c\bar{c}\) samples generated with PYTHIA6.

All the generated samples were passed through the simulation of the ATLAS detector based on GEANT4 [22, 23] and were reconstructed with the same programs used for the data. The ID and the MS were simulated with an ideal geometry without any misalignment. To emulate the effect of the misalignments of the MS chambers in real data, the reconstruction of the muon tracks in the simulated samples was performed using a random set of MS alignment constants. The amount of random smearing applied to these alignment constants was derived from an early assessment of the precision of the alignment, performed with special runs in which the toroidal magnetic field was off. The knowledge of the alignment constants improved with time. In particular the alignment constants used for the reconstruction of the data were more precise than those used to define the random smearing applied in the simulation, resulting in some cases in a worse MS resolution in MC than in data.

4 Efficiency

The availability of two independent detectors to reconstruct the muons (the ID and the MS) enables a precise determination of the muon reconstruction efficiency in the region \(|\eta |<2.5\). This is obtained with the so called tag-and-probe method described in the next section. A different methodology, described in Sect. 4.2, is used in the region \(2.5<|\eta |<2.7\) in which only one detector (the MS) is available.

4.1 Muon reconstruction efficiency in the region \(|\eta |<2.5\)

The tag-and-probe method is employed to measure the reconstruction efficiencies of all muon types within the acceptance of the ID (\(|\eta |<2.5\)). The conditional probability that a muon reconstructed by the ID is also reconstructed using the MS as a particular muon type, \(P (\mathrm Type | \mathrm ID )\), with \(\mathrm Type = (\mathrm CB, ST )\), can be measured using ID probes. Conversely, the conditional probability that a muon reconstructed by the MS is also reconstructed in the ID, \(P (\mathrm ID | \mathrm MS )\), is measured using MS tracks as probes.

For each muon type, the total reconstruction efficiency is given by:

$$\begin{aligned} \varepsilon (\mathrm Type ) = \varepsilon (\mathrm Type | \mathrm ID ) \cdot \varepsilon (\mathrm ID ), \end{aligned}$$
(1)

where \(\varepsilon (\mathrm ID )\) is the probability that a muon is reconstructed as an ID track. The quantity \(\varepsilon (\mathrm ID )\) cannot be measured directly and is replaced by \(\varepsilon (\mathrm ID | \mathrm MS )\) to give the tag-and-probe approximation:

$$\begin{aligned} \varepsilon (\mathrm Type ) \simeq \varepsilon (\mathrm Type | \mathrm ID ) \cdot \varepsilon (\mathrm ID |\mathrm MS ). \end{aligned}$$
(2)

The level of agreement of the measured efficiency, \(\varepsilon ^{\mathrm{Data }}(\mathrm Type )\!,\) with the efficiency measured with the same method in MC, \(\varepsilon ^{\mathrm{MC }}(\mathrm Type )\), is expressed as the ratio between these two numbers, called “efficiency scale factor” or SF:

$$\begin{aligned} SF=\frac{\varepsilon ^\mathrm{Data }(\mathrm Type )}{\varepsilon ^\mathrm{MC }(\mathrm Type )}. \end{aligned}$$
(3)

Possible biases introduced by the tag-and-probe approximation and other systematic effects on the efficiency measurement, which appear both in data and in MC, cancel in the SF. The SF is therefore used to correct the simulation in physics analysis.

4.1.1 The tag-and-probe method with \(Z \rightarrow \mu \mu \) events

For \(Z\rightarrow \mu \mu \) decays, events are selected by requiring two oppositely charged isolated muonsFootnote 3 with transverse momenta of at least \(p_{\mathrm{T}}> 25 \) and \(10\) GeV respectively and a dimuon invariant mass within \(10\) GeV of the \(Z\)-boson mass. The muons are required to be back to back in the transverse plane (\(\Delta \phi > 2\)). One of the muons is required to be a CB muon, and to have triggered the readout of the event. This muon is called the “tag”. The other muon, the so-called “probe”, is required to be a MS track (i.e. a SA or a CB muon) when \(\varepsilon (\mathrm ID | \mathrm MS )\) is to be measured. The probe is required to be a CaloTag muon for the measurement of \(\varepsilon (\mathrm Type | \mathrm ID )\). The use of CaloTag muons as the ID probes reduces the background in the \(Z \rightarrow \mu \mu \) sample by an order of magnitude without biasing the efficiency measurement. The MS probes are also used to measure the efficiency of CaloTag muons. After selecting all tag-probe pairs, an attempt is made to match the probe to a reconstructed muon: a match is successful when the muon and the probe are close in the \(\eta -\phi \) plane (\(\Delta R<0.01\) for CaloTag probes to be matched with CB or ST muons and \(\Delta R<0.05\) for MS probes to be matched to ID or CaloTag muons).

4.1.2 Background treatment in \(Z \rightarrow \mu \mu \) events

Apart from \(Z\rightarrow \mu \mu \) events, a small fraction of the selected tag-probe pairs may come from other sources. For a precise efficiency measurement, these backgrounds have to be estimated and subtracted. Contributions from \(Z\rightarrow \tau \tau \) and \(t\bar{t}\) decays are estimated using MC simulation. Additionally, QCD multijet events and \(W\rightarrow \mu \nu \) decays in association with jet activity (\(W+\)jets) can yield tag-probe pairs through secondary muons from heavy- or light-hadron decays. As these backgrounds are approximately charge-symmetric, they are estimated from the data using same-charge (SC) tag-probe pairs. This leads to the following estimate of the opposite-charge (OC) background for each region of the kinematic phase-space:

$$\begin{aligned} N(\text {Bkg}) = N_{\text{ O }C}^{Z,t\bar{t}\text { MC}} + T \cdot ( N_{\text {SC}}^{\text {Data}} - N_{\text {SC}}^{Z,t\bar{t}\text { MC}} ) \end{aligned}$$
(4)

where \(N_{\text{ O }C}^{Z,t\bar{t}\text { MC}}\) is the contribution from \(Z\rightarrow \tau \tau \) and \(t\bar{t}\) decays, \(N_{\text {SC}}^{\text {Data}}\) is the number of SC pairs measured in data and \(N_{\text {SC}}^{Z,t\bar{t}\text { MC}}\) is the estimated contribution of the \(Z\rightarrow \mu \mu \), \(Z\rightarrow \tau \tau \) and \(t\bar{t}\) processes to the SC sample. \(T\) is a global transfer factor that takes into account the residual charge asymmetry of the QCD multijet and W+jets samples, estimated using the simulation:

$$\begin{aligned} T = 1 + \theta ;\qquad \theta = \frac{N_{\text {OC}}^{\text {QCD+W MC}} - N_{\text {SC}}^{\text {QCD+W MC}}}{N_{\text {SC}}^{\text {Data}}}. \end{aligned}$$
(5)

For the kinematic region covered by the measurement, the transfer factor is \(T=1.15\) for CaloTag probes. For the MS probes the misidentification rate is low and the residual QCD multijet background has a large contribution from oppositely charged muon pairs in \(b\bar{b}\) decays, leading to \(T=2.6\). The efficiency for finding a muon of type \(\text {A}\) given a probe of type \(\text {B}\), corrected for the effect of background, can then be computed as:

$$\begin{aligned} \varepsilon (\text {A}|\text {B}) = \frac{N^\mathrm{Match}_\mathrm{Probes}(\text {Data}) - N^\mathrm{Match}_\mathrm{Probes}(\text {Bkg})}{N^\mathrm{All}_\mathrm{Probes}(\text {Data}) - N^\mathrm{All}_\mathrm{Probes}(\text {Bkg})}, \end{aligned}$$
(6)

where \(N^\mathrm{All}_\mathrm{Probes}\) stands for the total number of probes considered and \(N^\mathrm{Match}_\mathrm{Probes}\) is the number of probes successfully matched to a reconstructed muon of type \(\text {A}\). According to the background estimate reported above, the sample of selected CaloTag probes is more than \(99.5\,\%\) pure in \(Z \rightarrow \mu \mu \) decays, as shown in Fig. 1. The \(Z \rightarrow \mu \mu \) purity is maximal for muon \(p_{\mathrm{T}}\simeq 40\) GeV and decreases to \(98.5\,\%\) (\(97\,\%\)) for \(p_{\mathrm{T}}=10\) (\(100\)) GeV. The \(Z \rightarrow \mu \mu \) purity has a weak dependence on the average number of inelastic \(pp\) interactions per bunch crossing, \(\langle \mu \rangle \), decreasing from \(99.8\,\%\) at \(\langle \mu \rangle = 10\) to \(99.5\,\%\) at \(\langle \mu \rangle =34\). A purity above \(99.8\,\%\) is obtained in the selection of MS probes, with weaker dependence on \(p_{\mathrm{T}}\) and \(\langle \mu \rangle \).

Fig. 1
figure 1

Pseudorapidity distribution of the CaloTag (top) or MS (bottom) probes used in the tag-and-probe analysis. The bottom panel shows the ratio between observed and expected counts. The sum of the MC samples is normalized to the number of events in the data. The green band represents the statistical uncertainty

4.1.3 Low \(p_{\mathrm{T}}\) efficiencies from \({J/\psi \rightarrow \mu \mu }\) decays

The efficiencies extracted from \(Z\rightarrow \mu \mu \) decays are complemented at low \(p_{\mathrm{T}}\) with results derived from a sample of \(J/\psi \rightarrow \mu \mu \) events. In 2012 ATLAS collected approximately 2M \(J/\psi \rightarrow \mu \mu \) decays which were not biased by dimuon triggers requirements, using a combination of single muon triggers (isolated and non-isolated) and the dedicated “muon + track” trigger described in Sect. 3.1.

The analysis proceeds in a similar manner to the \(Z \rightarrow \mu \mu \) with some modifications due to the different kinematics of the \(J/\psi \). Tags are required to be CB muons with \(p_{\mathrm{T}}>4\) GeV and \(|\eta |<2.5\). As with the \(Z\), the tag must have triggered the read-out of the event. Probes are sought from amongst the ID tracks and must have \(p_{\mathrm{T}}> 2.5\) GeV and \(|\eta | < 2.5\), opposite charge to the tag muon, and must form with the tag an invariant mass in the window \(2.7\)–3.5 GeV. Finally the tag-probe pairs must fit to a common vertex with a very loose quality cut of \(\chi ^2 < 200\) for one degree of freedom, which removes tracks from different vertices, without any significant efficiency loss. Muon reconstruction efficiencies are then derived by binning in small cells of \(p_{\mathrm{T}}\) and \(\eta \) of the probe tracks. Invariant mass distributions are built in each cell for two samples: (a) all tag-probe pairs and (b) tag-probe pairs in which the probe failed to be reconstructed in the MS. The invariant mass distributions are fitted with a signal plus background model to obtain the number of \(J/\psi \) signal events in the two samples, called \(N_a(p_{\mathrm{T}},\eta )\) and \(N_b(p_{\mathrm{T}},\eta )\), respectively. The fit model is a Gaussian plus a second order polynomial for the background. The two samples are fitted simultaneously using the same mean and width to describe the signal. The MS reconstruction efficiency in a given \((p_{\mathrm{T}},\eta )\) cell is then defined as:

$$\begin{aligned} \varepsilon _{p_{\mathrm{T}}, \eta }(\mathrm {Type}|\mathrm {ID}) = 1 - \frac{N_b(p_{\mathrm{T}},\eta )}{N_a(p_{\mathrm{T}},\eta )}. \end{aligned}$$
(7)

The largest contribution to the systematic uncertainty originates from the model used in the fit. This uncertainty was estimated by changing the background model to a first or a third order polynomial and by relaxing the constraint that the mass and the width of the \(J/\psi \) signal are the same between the two samples. The resulting variations in the efficiency are added in quadrature to the statistical uncertainty to give the total uncertainty on the efficiency. The efficiency integrated over the full \(\eta \) region is obtained as an average of the efficiencies of the different \(\eta \) cells. This method ensures a reduced dependency on local variations of background and resolution, and on the kinematic distribution of the probes.

4.1.4 Systematic uncertainties

The main contributions to the systematic uncertainty on the measurement of the efficiency SFs are shown in Fig. 2, as a function of \(\eta \) and \(p_{\mathrm{T}}\), and are discussed below (the labels in parenthesis refer to the legend of Fig. 2):

  • (Bkg) the uncertainty on the data-driven background estimate is evaluated by varying the charge-asymmetry parameter \(\theta \) of Eq. (5) by \(\pm 100\,\%\). This results in an uncertainty of the efficiency measurement below \(0.1\,\%\) in a large momentum range, reaching up to \(0.2\,\%\) for low muon momenta where the contribution of the background is most significant.

  • (dR) the choice of the cone size used for matching reconstructed muons to probe objects has been optimized to minimize the amount of matches with wrong tracks while keeping the maximum match efficiency for correct tracks. A systematic uncertainty is evaluated by varying the cone size by \(\pm 50\,\%\). This yields an uncertainty of \(\approx 0.1\,\%\).

  • (TP approximation) possible biases in the tag-and-probe method, for example due to different distributions between MS probes and “true” muons or due to correlation between ID and MS efficiencies, are investigated. The simulation is used to compare the efficiency measured with the tag-and-probe method with the “true” MC efficiency calculated as the fraction of generator-level muons that are successfully reconstructed. Agreement within less than \(0.1\,\%\) is observed, with the exception of the region \(|\eta |<0.1\). In the extraction of the data/MC scale factors, the difference between the measured and the “true” efficiency cancels to first order. To take into account possible imperfection of the simulation, half the observed difference is used as an additional systematic uncertainty on the SF.

  • (Probes) the scale factor maps may be sensitive to disagreements between data and simulation in the kinematic distributions of the probes. The corresponding systematic uncertainty is estimated by reweighting the distribution of the probes in the simulation to bring it into agreement with the data. The resulting effect on the efficiency is below \(0.1\,\%\) over most of the phase space.

  • (Low \(p_{\mathrm{T}}\)) for \(4<p_{\mathrm{T}}<10\) GeV the systematic uncertainties are obtained from the analysis performed with the \({J/\psi \rightarrow \mu \mu }\) sample, as discussed in Sect. 4.1.3 (not shown in Fig. 2). The resulting uncertainty on the low-\(p_{\mathrm{T}}\) SFs ranges between 0.5 % and 2 %, depending on \(p_{\mathrm{T}}\) and \(\eta \) and is dominated by the uncertainty on the background model.

  • (High \(p_{\mathrm{T}}\)) no significant dependence of the measured SFs with \(p_{\mathrm{T}}\) was observed in the momentum range considered. An upper limit on the SF variation for large muon momenta has been extracted by using a MC simulation with built-in imperfections, including a realistic residual misalignment of the detector components or a 10 % variation of the muon energy loss. On the basis of this, a systematic uncertainty of \(\pm 0.42~\% \times (p_{\mathrm{T}}/ 1\) TeV\()\) is obtained.

Fig. 2
figure 2

Systematic uncertainty on the efficiency scale factor for CB+ST muons, obtained from \(Z \rightarrow \mu \mu \) data, as a function of \(\eta \) (top) and \(p_{\mathrm{T}}\) (bottom) for muons with \(p_{\mathrm{T}}>10\) GeV. The background systematic uncertainty in the last two bins of the bottom plot is affected by a large statistical uncertainty. The combined systematic uncertainty is the sum in quadrature of the individual contributions

4.1.5 Results

Figure 3 shows the muon reconstruction efficiency \(\varepsilon (\mathrm Type )\) as a function of \(\eta \) as measured from \(Z \rightarrow \mu \mu \) events. The combination of all the muon reconstruction types (for CB, ST, and CaloTag muons) gives a uniform muon reconstruction efficiency of about \(99\,\%\) over most the detector regions. The use of ST muons allows the recovery of efficiency especially in the region \(1.1<\eta < 1.3\) (from \(85\,\%\) to \(99\,\%\)) in which part of the MS chambers were not installed, as discussed in Sect. 2. The remaining inefficiency of the combination of CB or ST muons (CB+ST) at \(|\eta |<0.1\) (\(66\,\%\)) is almost fully recovered by the use of CaloTag muons (\(97\,\%\)).

Fig. 3
figure 3

Muon reconstruction efficiency as a function of \(\eta \) measured in \(Z \rightarrow \mu \mu \) events for muons with \(p_{\mathrm{T}}>10\) GeV and different muon reconstruction types. CaloTag muons are only shown in the region \(|\eta |<0.1\), where they are used in physics analyses. The error bars on the efficiencies indicate the statistical uncertainty. The panel at the bottom shows the ratio between the measured and predicted efficiencies. The error bars on the ratios are the combination of statistical and systematic uncertainties

The efficiencies measured in experimental and simulated data are in good agreement, in general well within 1 %. The largest differences are observed in the CB muons. To reconstruct an MS track, the Chain 1 reconstruction requires track segments in at least two layers of precision chambers (MDT or CSC) and at least one measurement of the \(\phi \) coordinate from trigger chambers (RPC or TGC). These requirements introduce some dependency on detector conditions and on the details of the simulation in the regions in which only two layers of precision chambers or only one layer of trigger chambers are crossed by the muons. This results in a reduction of efficiency in data with respect to MC of approximately 1 % in the region of \(\eta \sim 0.5\) due the RPC detector conditions and to local deviations up to about 2 % at \(0.9<|\eta |<1.3\) related to imperfections in the simulation of the barrel-endcap transition region. For the CB+ST muons the agreement between data and MC is very good, with the only exception of a low-efficiency region in data at \(\eta = 0.3\)–0.4 related to an inactive portion of an MDT chamber (not included in MC) in a region with reduced coverage due to the supporting structure of the ATLAS detector.Footnote 4

The ID muon reconstruction efficiency, \(\varepsilon (\mathrm ID |\mathrm MS )\), for \(p_{\mathrm{T}}>10\) GeV as a function of \(\eta \) and \(p_{T}\) is shown in Fig. 4. The efficiency is greater than \(0.99\) and there is very good agreement between data and MC. The small efficiency reduction in the region \(1.5<\eta <2\) is related to temporary hardware problems in the silicon detectors. The larger uncertainty at \(|\eta |<0.1\) is related to the limited MS coverage in that region.

Fig. 4
figure 4

ID muon reconstruction efficiency as a function of \(\eta \) (top) and \(p_{\mathrm{T}}\) (bottom) measured in \(Z \rightarrow \mu \mu \) events for muons with \(p_{\mathrm{T}}>10\) GeV. The error bars on the efficiencies indicate the statistical uncertainty. The panel at the bottom shows the ratio between the measured and predicted efficiencies. The green areas depict the pure statistical uncertainty, while the orange areas also include systematic uncertainties

Fig. 5
figure 5

Reconstruction efficiency for CB (top), CB+ST (middle) and CaloTag (bottom) muons as a function of the \(p_{\mathrm{T}}\) of the muon, for muons with \(0.1 <|\eta |< 2.5\) for CB and CB+ST muons and for \(|\eta | < 0.1\) for CaloTag muons. The upper two plots also show the result obtained with \(Z \rightarrow \mu \mu \) and \({J/\psi \rightarrow \mu \mu }\) events. The insets on the upper plots show the detail of the efficiency as a function of \(p_{\mathrm{T}}\) in the low \(p_{\mathrm{T}}\) region. The CaloTag muon efficiency (bottom) is only measured with \(Z \rightarrow \mu \mu \) events. The error bars on the efficiencies indicate the statistical uncertainty for \(Z \rightarrow \mu \mu \) and include also the fit model uncertainty for \({J/\psi \rightarrow \mu \mu }\). The panel at the bottom shows the ratio between the measured and predicted efficiencies. The green areas show the pure statistical uncertainty, while the orange areas also include systematic uncertainties

Figure 5 shows the reconstruction efficiencies for CB and for CB+ST muons as a function of the transverse momentum, including results from \(Z \rightarrow \mu \mu \) and \({J/\psi \rightarrow \mu \mu }\). A steep increase of the efficiency is observed at low \(p_{\mathrm{T}}\), in particular for the CB reconstruction, since a minimum momentum of approximately 3 GeV is required for a muon to traverse the calorimeter material and cross at least two layers of MS stations before being bent back by the magnetic field. Above \(p_{\mathrm{T}}\approx 20\) GeV, the reconstruction efficiency for both CB and CB+ST muons is expected to be independent of the transverse momentum. This is confirmed within \(0.5\,\%\) by the \(Z \rightarrow \mu \mu \) data. The drop in efficiency observed in the \(J/\psi \) data at \(p_{\mathrm{T}}>15\) GeV is due to the inefficiency of the MS reconstruction for muon pairs with small angular separation as in the case of highly boosted \(J/\psi \). This effect is well reproduced by MC and the SF of the \({J/\psi \rightarrow \mu \mu }\) analysis are in good agreement with those from \(Z \rightarrow \mu \mu \) in the overlap region. The CaloTag muon efficiency reaches a plateau of approximately \(0.97\) above \(p_{\mathrm{T}}\gtrsim 30\) GeV, where it is well predicted by the MC.

Figure 6 shows the reconstruction efficiency for CB+ST muons as a function of \(\langle \mu \rangle \), showing a high value (on average above \(0.99\)) and remarkable stability. A small efficiency drop of about \(1\,\%\) is only observed for \(< \mu >\gtrsim 35\). This is mainly caused by limitations of the MDT readout electronics in the high-rate regions close to the beam lines. These limitations are being addressed in view of the next LHC run.

Fig. 6
figure 6

Measured CB+ST muon reconstruction efficiency for muons with \(p_{\mathrm{T}}>10\) GeV as a function of the average number of inelastic \(pp\) collisions per bunch crossing \(\angle \mu \rangle \). The error bars on the efficiencies indicate the statistical uncertainty. The panel at the bottom shows the ratio between the measured and predicted efficiencies. The green areas depict the pure statistical uncertainty, while the orange areas also include systematic uncertainties

4.2 Muon reconstruction efficiency for \(|\eta |> 2.5\)

As described in the previous sections, the CB muon reconstruction is limited by the ID acceptance which covers the pseudo-rapidity region \(|\eta |<2.5\). Above \(|\eta |=2.5\), SA muons are the only muon type that provides large efficiency. A measurement of the efficiency SF for muons in the range \(2.5<|\eta |<2.7\), hereafter called high-\(\eta \), is needed for the physics analyses that exploit the full MS acceptance.

A comparison with the Standard Model calculations for \(Z \rightarrow \mu \mu \) events is used to measure the reconstruction efficiency SF in the high-\(\eta \) region. To reduce the theoretical and experimental uncertainties, the efficiency SF is calculated from the double ratio

$$\begin{aligned} \mathrm{SF} = \frac{ \frac{N^\mathrm{Data}(2.5<|\eta _\mathrm{fwd}|<2.7)}{N^\mathrm{MC}(2.5<|\eta _\mathrm{fwd}|<2.7)}}{ \frac{N^\mathrm{Data}(2.2<|\eta _\mathrm{fwd}|<2.5)}{N^\mathrm{MC}(2.2<|\eta _\mathrm{fwd}|<2.5)}}, \end{aligned}$$
(8)

where the numerator is the ratio of the number of \(Z \rightarrow \mu \mu \) candidates in data and in MC for which one of the muons, called the forward muon, is required to be in the high-\(\eta \) region \(2.5<|\eta _\mathrm{fwd}|<2.7\) while the other muon from the \(Z\) decay, called the central muon, is required to have \(|\eta |<2.5\). The denominator is the ratio of \(Z \rightarrow \mu \mu \) candidates in data over MC with the forward muon lying in the control region \(2.2<|\eta _\mathrm{fwd}|<2.5\) and the central muon in the region \(|\eta | < 2.2\). In both the numerator and denominator the central muon is required to be a CB muon while the forward muon can either be a CB or SA muon. The simulation of muons with \(|\eta |<2.5\) is corrected using the standard SF described in the previous section.

The selection of the central muon is similar to that of the tag muon in the tag-and-probe method. It is required to have triggered the event readout, to be isolated and to have transverse momentum \(p_{\mathrm{T}}>25\) GeV. The requirements for the forward muon include calorimeter-based isolation, requiring the transverse energy \(E_T\) measured in the calorimeter in a cone of \(\Delta R = 0.2\) (excluding the energy lost by the muon itself) around the muon track, to be less than \(10\,\%\) of the muon \(p_{\mathrm{T}}\). The central and forward muons are required to have opposite charge, a dimuon invariant mass within 10 GeV of the \(Z\) mass, and a separation in \((\eta ,\phi )\) space of \(\Delta R>0.2\).

Different sources of systematic uncertainties have been considered: a first group is obtained by varying the \(p_{\mathrm{T}}\) and isolation cuts on the central muons and the dimuon mass window. These variations produce effects of less than 0.3 % in the efficiency SF for the \(p_{\mathrm{T}}\) range 20–60 GeV. The effect of the calorimetric isolation on the efficiency SF yields an uncertainty of less than 1 %, which is estimated by comparing the nominal SF values with the ones extracted when no calorimetric isolation is applied on the forward muons and by studying the dependence of this cut on the number of \(pp\) interactions. The contribution from the background processes, mainly dimuons from \(b\) and \(\bar{b}\) decays, has been studied using MC background samples and found to be negligible.

Fig. 7
figure 7

Reconstruction efficiency for muons within \(2.5<|\eta |<2.7\) from \(Z \rightarrow \mu \mu \) events. The upper plot shows the efficiency obtained as the product of scale factor (Eq.  8) and the MC efficiency. The lower plot shows the scale factor. The error bars correspond to the statistical uncertainty while the green shaded band corresponds to the statistical and systematic uncertainty added in quadrature

The theoretical uncertainty from higher-order corrections is estimated by varying the renormalization and factorization scales in the POWHEG NLO calculation at the generator level and is found to produce a negligible effect on the ratio of Eq. (8). The uncertainty from the knowledge of the parton densities is estimated by reweighting the PDFs used in the MC samples from CT10 to MSTW2008NLO [24] and by studying, at the generator level, the effect of the uncertainty associated to the MSTW2008 PDF set on the double ratio of Eq. (8), obtaining an overall theoretical uncertainty of less than \(0.55\,\%\).

The efficiency in this region is obtained as the product of the SF and the “true” MC efficiency, calculated as the fraction of generator-level muons that are successfully reconstructed. The reconstruction efficiency and the SF for muons in the high-\(\eta \) region is shown in Fig. 7 as a function of the muon \(p_{\mathrm{T}}\).

4.3 Scale factor maps

The standard approach used in ATLAS for physics analysis is to correct the muon reconstruction efficiency in the simulation using efficiency scale factors (SFs). The SFs are obtained with the tag-and-probe method using \(Z \rightarrow \mu \mu \) events, as described above, and are provided to the analyses in the form of \(\eta \)\(\phi \) maps. Since no significant \(p_{\mathrm{T}}\) dependence of the SF has been observed, no \(p_{\mathrm{T}}\) binning is used in the SF maps. Different maps are produced for different data taking sub-periods with homogeneous detector conditions. The whole 2012 dataset is divided into 10 sub-periods. For each analysis, the final map is obtained as an average of the maps for all sub-periods, weighted by the periods’ contribution to the integrated luminosity under study.

Figures 8 and 9 show the maps of the efficiencies measured using the data in the \(\eta \)\(\phi \) plane and the corresponding Scale Factors. The large data sample allows for a precise resolution of localized efficiency losses, for example in the muon spectrometer for \(|\eta | \sim 0\) due to limited coverage. The SF maps show local differences between data and MC related to detector conditions as discussed in Sect. 4.1.5.

Fig. 8
figure 8

Reconstruction efficiency measured in the experimental data (top), and the data/MC efficiency scale factor (bottom) for CB muons as a function of \(\eta \) and \(\phi \) for muons with \(p_{\mathrm{T}}>10\) GeV

Fig. 9
figure 9

Reconstruction efficiency measured in the experimental data (top) and the data/MC efficiency scale factor (bottom) for CB+ST muons as a function of \(\eta \) and \(\phi \) for muons with \(p_{\mathrm{T}}>10\) GeV

5 Momentum scale and resolution

The large samples of \({J/\psi \rightarrow \mu \mu }\), \({\Upsilon \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) decays collected by ATLAS are used to study in detail the muon momentum scale and resolution. The ATLAS simulation includes the best knowledge of the detector geometry, material distribution, and physics model of the muon interaction at the time of the MC events were generated. Additional corrections are needed to reproduce the muon momentum resolution and scale of experimental data at the level of precision that can be obtained using high-statistics samples of dimuon resonances. Section 5.1 describes the methodology used to extract the corrections to be applied to the MC simulation. In Sect. 5.2, the muon momentum scale and resolution is studied in the data and in MC samples with and without corrections.

5.1 Corrections to the muon momentum in MC

Similarly to Ref. [9], the simulated muon transverse momenta reconstructed in the ID and in the MS sub-detectors, \(p_{\mathrm{T}}^\mathrm{MC, Det}\), where \(\mathrm{Det} = \mathrm{ID, MS}\), are corrected using the following equation:

$$\begin{aligned} p_{\mathrm{T}}^\mathrm{Cor,Det}&= \frac{p_{\mathrm{T}}^\mathrm{MC,Det} + \sum \limits ^{1}_{n=0} s^\mathrm{Det}_{n}(\eta ,\phi )(p_{\mathrm{T}}^\mathrm{MC,Det})^{n}}{1+\sum \limits ^{2}_{m=0} \Delta r^\mathrm{Det}_{m}(\eta ,\phi )(p_{\mathrm{T}}^\mathrm{MC,Det})^{m-1}g_{m}} \nonumber \\&\mathrm (with s^\mathrm{ID}_{0}=0\mathrm and \Delta r^\mathrm{ID}_{0} =0), \end{aligned}$$
(9)

where \(g_{m}\) are normally distributed random variables with mean 0 and width 1 and the terms \(\Delta r_{m}^\mathrm{Det}(\eta ,\phi )\) and \(s_n^\mathrm{Det}(\eta , \phi )\) describe, respectively, the momentum resolution smearing and the scale corrections applied in a specific \(\eta \), \(\phi \) detector region. The motivations for Eq. (9) are the following:

  • corrections are defined in \(\eta - \phi \) detector regions such that in each region the variation of momentum resolution and scale, and therefore of their possible corrections, are expected to be small. In particular the nominal muon identification acceptance region (up to \(|\eta |=2.7\)) is divided in 18 \(\eta \) sectors of size \(\Delta \eta \) between 0.2 and 0.4, for both the MS and the ID. In addition, the MS is divided into two types of \(\phi \) sectors of approximate size of \(\pi /8\), exploiting the octagonal symmetry of the magnetic system : the sectors that include the magnet coils (called “small sectors”) and the sectors between two coils (called “large sectors”).

  • The \(\Delta r_{m}^\mathrm{Det}(\eta ,\phi )\) correction terms introduce a \(p_{\mathrm{T}}\) dependent momentum smearing that effectively increases the relative momentum resolution, \(\frac{ \sigma (p_{\mathrm{T}})}{p_{\mathrm{T}}}\), when under-estimated by the simulation. The \(\Delta r_{m}^\mathrm{Det}(\eta ,\phi )\) terms can be related to different sources of experimental resolution by comparing the coefficient of the \(p_{\mathrm{T}}\) powers in the denominator of Eq. (9) to the following empirical parametrization of the muon momentum resolution (see for example [25]):

    $$\begin{aligned} \frac{ \sigma (p_{\mathrm{T}})}{p_{\mathrm{T}}} = r_0/p_{\mathrm{T}}\oplus r_1 \oplus r_2\cdot p_{\mathrm{T}}, \end{aligned}$$
    (10)

    where \(\oplus \) denotes a sum in quadrature. The first term (proportional to \(1/p_{\mathrm{T}}\)) accounts for fluctuations of the energy loss in the traversed material. Multiple scattering, local magnetic field inhomogeneities and local radial displacements are responsible for the second term (constant in \(p_{\mathrm{T}}\)). The third term (proportional to \(p_{\mathrm{T}}\)) describes intrinsic resolution effects caused by the spatial resolution of the hit measurements and by residual misalignment. Energy loss fluctuations are relevant for muons traversing the calorimeter in front of the MS but they are negligible in the ID measurement. For this reason \( \Delta r^\mathrm{ID}_{0}\) is set to zero in Eq. (9).

  • Imperfect knowledge of the magnetic field integral and of the radial dimension of the detector are reflected in the multiplicative momentum scale difference \(s_1^\mathrm{Det} \) between data and simulation. In addition, the \(s^\mathrm{MS}_0(\eta , \phi )\) term is necessary to model the \(p_{\mathrm{T}}\) scale dependence observed in the MS momentum reconstruction due to differences between data and MC in the energy loss of muons passing through the calorimeter and other materials between the interaction point and the MS. As the energy loss between the interaction point and the ID is negligible, \(s^\mathrm{ID}_0(\eta )\) is set to zero.

The separate correction of ID and MS momentum reconstruction allows a direct understanding of the sources of the corrections. In a second step the corrections are propagated to the CB momentum reconstruction, \(p_{\mathrm{T}}^\mathrm{Cor,CB}\), using a weighted average:

$$\begin{aligned} p_{\mathrm{T}}^\mathrm{Cor,CB} = f\cdot p_{\mathrm{T}}^\mathrm{Cor,ID} + (1 - f)\cdot p_{\mathrm{T}}^\mathrm{Cor, MS}, \end{aligned}$$
(11)

with the weight \(f\) derived for each muon by expressing the CB transverse momentum before corrections, \(p_{\mathrm{T}}^\mathrm{MC,CB}\), as a linear combination of \(p_{\mathrm{T}}^\mathrm{MC,ID}\) and \(p_{\mathrm{T}}^\mathrm{MC,MS}\):

$$\begin{aligned} p_{\mathrm{T}}^\mathrm{MC,CB} = f\cdot p_{\mathrm{T}}^\mathrm{MC,ID} + (1 - f)\cdot p_{\mathrm{T}}^\mathrm{MC, MS} \end{aligned}$$
(12)

and solving the corresponding linear equation.

5.1.1 Correction extraction using a template fit to \({J/\psi \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) events

The MS and ID correction parameters contained in Eq. (9) need to be extracted from data. For this purpose, a MC template maximum likelihood fit is used to compare the simulation to the data for \({J/\psi \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) candidate events: this gives sensitivity to reconstructed muon momenta in the \(p_{\mathrm{T}}\) range from a few GeV to \(\approx 100\) GeV. The dataset used for the correction extraction consists of \(6\)M \({J/\psi \rightarrow \mu \mu }\) and \(9\)M \(Z \rightarrow \mu \mu \) candidates passing the final selection.

The \({J/\psi \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) candidates have been selected online according to the requirements described in Sect. 3.1 and, offline, by requiring two CB muons. For the correction extraction in a specific \(\eta - \phi \) Region Of Fit (ROF), the ID and MS reconstructed momenta are considered individually. All the events with at least one of the two muons in the ROF contribute to the correction extraction fit. The angles from the CB reconstruction are used to define the ROF and to calculate the invariant mass distributions.

The ID corrections are extracted using the distribution of the ID dimuon invariant mass, \(m_{\mu \mu }^\mathrm{ID }\). Events with \(m_{\mu \mu }^\mathrm{ID }\) in the window \(2.76\)–3.6 GeV and \(p_{\mathrm{T}}^\mathrm{ID }\) in the range \(8\)–17 GeV are selected as \({J/\psi \rightarrow \mu \mu }\) candidate decays; events with \(m_{\mu \mu }^\mathrm{ID }\) between \(76\) and \(96\) GeV and the leading (sub-leading) muons with \(26< p_{\mathrm{T}}^\mathrm{ID }< 300\) GeV (\(15< p_{\mathrm{T}}^\mathrm{ID }< 300\) GeV) are selected as \(Z \rightarrow \mu \mu \) candidate decays. To enhance the sensitivity to the \(p_{\mathrm{T}}\) dependent correction effects, the \(m_{\mu \mu }^\mathrm{ID }\) is classified according to the \(p_{\mathrm{T}}\) of the muons: for \({J/\psi \rightarrow \mu \mu }\) candidates the \(p_{\mathrm{T}}^\mathrm{ID }\) of the sub-leading muon defines three bins with lower thresholds at \(p_{\mathrm{T}}^\mathrm{ID }={8, 9, 11}\) GeV, for \(Z \rightarrow \mu \mu \) candidates the \(p_{\mathrm{T}}^\mathrm{ID }\) of the leading muon defines three bins with lower thresholds at \(p_{\mathrm{T}}^\mathrm{ID }={26, 47, 70}\) GeV.

Similarly, the MS corrections are extracted using the distribution of the MS reconstructed dimuon invariant mass, \(m_{\mu \mu }^\mathrm{MS }\), in the same way as for the ID. However, as in the MS part of Eq. (9) more correction parameters and more ROFs are present, an additional variable sensitive to the momentum scale and resolution is added to the MS fit. The variable, used only in \(Z \rightarrow \mu \mu \) candidate events, is defined by the following equation:

$$\begin{aligned} \rho = \frac{p_{\mathrm{T}}^\mathrm{MS } - p_{\mathrm{T}}^\mathrm{ID }}{p_{\mathrm{T}}^\mathrm{ID }}, \end{aligned}$$
(13)

representing a measurement of the \(p_{\mathrm{T}}\) imbalance between the measurement in the ID and in the MS. The \(\rho \) variable is binned according to \(p_{\mathrm{T}}^\mathrm{MS }\) of the muon in the ROF: the lower thresholds are \(p_{\mathrm{T}}^\mathrm{MS }={20, 30, 35, 40, 45, 55, 70}\) GeV.

In order to compare the simulation to the data distributions, the corresponding templates of \(m_{\mu \mu }^\mathrm{ID }\), \(m_{\mu \mu }^\mathrm{MS }\), and \(\rho \) are built using the MC samples of the \({J/\psi \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) signals. The background in the \(Z \rightarrow \mu \mu \) mass region is added to the templates using the simulation and corresponds to approximately 0.1 % of the \(Z \rightarrow \mu \mu \) candidates. The non-resonant background to \({J/\psi \rightarrow \mu \mu }\), coming from decays of light and heavy hadrons and from Drell–Yan production, accounts for about 15 % of the selected \({J/\psi \rightarrow \mu \mu }\) candidates. As it is not possible to accurately simulate it, a data driven approach is used to evaluate it: an analytic model of the background plus the \(J/\psi \) signal is fitted to the dimuon mass spectrum of the \({J/\psi \rightarrow \mu \mu }\) candidates in a mass range \(2.7\)–4.0 GeV, then the background model and its normalization are used in the template fit from which the momentum correction are extracted. The analytic fit is performed independently on the ID and MS event candidates. The non-resonant dimuon background is parametrized with an exponential function, while the \(J/\psi \) and \(\psi ^{2S}\) resonances are parametrized by a Crystal-Ball function [26] in the ID fits, or by a Gaussian distribution convoluted with a Landau in the MS fits, where energy loss effects due to the calorimeter material are larger.

The template fit machinery involves several steps: first a binned likelihood function \(\mathcal {L}\) is built to compare the data to the MC templates of signal plus background. Then modified templates are generated by varying the correction parameters in Eq. (9) and applying them to the muon momentum of the simulated signal events. The \(-2\ln \mathcal {L}\) between data and the modified template is then minimized using MINUIT [27]. The procedure is iterated across all the ROFs: the first fit is performed using only events with both muons in the ROF, the following fits allow also one of the muons in a previously analysed ROF and one in the ROF under investigation. After all the detector ROFs have been analysed, the fit procedure is iterated twice in order to improve the stability of the results. The correction extraction is performed first for the ID and then for the MS, such that the ID transverse momentum present in Eq. (13) can be kept constant during the MS correction extraction.

Although the use of \(p_{\mathrm{T}}\) bins for the construction of the templates gives a good sensitivity to the \(p_{\mathrm{T}}\) dependence of the scale corrections, the fit is not very sensitive to the resolution correction terms \(\Delta r^\mathrm{MS }_0(\eta ,\phi )\) and \(\Delta r^\mathrm{MS }_2(\eta ,\phi )\) of Eq. (9). The reasons for this are, at low \(p_{\mathrm{T}}\), the \(p_{\mathrm{T}}>8\) GeV selection cut applied to the \(J/\psi \) data sample, which limits the sensitivity to \(\Delta r^\mathrm{MS }_0(\eta ,\phi )\), and, at high \(p_{\mathrm{T}}\), the limited statistics of the \(Z \rightarrow \mu \mu \) data sample with \(p_{\mathrm{T}}^\mathrm{MS }>100\) GeV, which limits the sensitivity to \(\Delta r^\mathrm{MS }_2(\eta ,\phi )\). As the energy loss fluctuations do not show significant disagreement between data and MC for \(|\eta |>0.8\), the parameter \(\Delta r^\mathrm{MS }_0(\eta ,\phi )\) has been fixed to zero in this region. The effect of the misalignment of MS chambers in real data, which is expected to be the largest contribution to \(\Delta r^\mathrm{MS }_2(\eta ,\phi )\), is already taken into account in the simulation as described in Sect. 3.2. Therefore the \(\Delta r^\mathrm{MS }_2(\eta ,\phi )\) term is also fixed to zero in the MS correction extraction. Two of the systematic uncertainties described in Sect. 5.1.2 are used to cover possible deviations from zero of these two terms.

5.1.2 Systematic uncertainties

Systematic uncertainties cover imperfections in the model used for the muon momentum correction and in the fit procedure used for the extraction of the correction terms. In particular the correction extraction procedure has been repeated using the following different configurations:

  • variation of \(\pm 5\) GeV in the dimuon mass window used for the \(Z \rightarrow \mu \mu \) event selection. This is intended to cover resolution differences between data and MC that are beyond a simple Gaussian smearing. This results in one of the largest systematic uncertainties on the resolution corrections, with an average effect of \(\approx \) 10 % on the \(\Delta r_{1}^\mathrm{ID }\), \(\Delta r_{2}^\mathrm{ID }\), and \(\Delta r_{1}^\mathrm{MS }\) parameters.

  • Two variations of the \(J/\psi \) templates used in the fit. The first concerns the \(J/\psi \) background parametrization: new \(m_{\mu \mu }^\mathrm{MS }\) and \(m_{\mu \mu }^\mathrm{ID }\) background templates are generated using a linear model, for the MS fits, and a linear-times-exponential model, for the ID fits. The second variation concerns the \(J/\psi \) event selection: the minimum muon \(p_{\mathrm{T}}^{MS,ID}\) cut is raised from 8 to 10 GeV, thus reducing the weight of low-\(p_{\mathrm{T}}\) muons on the corrections. The resulting variations on the resolution correction parameters are \(\approx 10\) % of \(\Delta r_{1}^\mathrm{ID }\) and \(\Delta r_{1}^\mathrm{MS }\). The effect is also relevant for the MS scale corrections with a variation of \(\approx 0.01\) GeV on \(s_0^\mathrm{MS }\) and of \(\approx 4\times 10^{-4}\) on \(s_1^\mathrm{MS }\).

  • The ID correction extraction is repeated using \({J/\psi \rightarrow \mu \mu }\) events only or \(Z \rightarrow \mu \mu \) events only. Since such configurations have a reduced statistical power, only the \(s_1^\mathrm{ID }\) correction parameter is left free in the fit, while the resolution correction terms are fixed to nominal values. The resulting uncertainty on \(s_1^\mathrm{ID }\), ranging from 0.01 % to 0.05 % from the central to the forward region of the ID, accounts for non-linear effects on the ID scale.

  • The parameter \(\Delta r^\mathrm{MS }_0\) of Eq. (9) is left free in all the regions, instead of fixing it to zero for \(|\eta |>0.8\). The largest variation of 0.08 GeV is applied as an additional systematic uncertainty on the parameter.

  • The MS correction is extracted using a special \(Z \rightarrow \mu \mu \) MC sample with ideal geometry, i.e. where no simulation of the misalignment of the MS chambers is applied. This is needed because the standard simulation has a too pessimistic resolution in the \(|\eta |<1.25\) region, forcing the \(\Delta r^\mathrm{MS }_1\) parameter to values compatible with zero. The template fit performed with the ideal-geometry \(Z \rightarrow \mu \mu \) MC sample gives \(\Delta r^\mathrm{MS }_1> 0\) in the region \(0.4<|\eta |<1.25\). The largest variation of \(\Delta r^\mathrm{MS }_1\), corresponding to \(0.012\), is applied as an additional systematic uncertainty for this region.

  • Variation of the normalization of the MC samples used in \(Z \rightarrow \mu \mu \) background estimate by factors of two and one half. The resulting systematic uncertainty is small except for the detector regions with \(|\eta |>2.0\), where the effect is comparable to the other uncertainties.

Independently from the fit procedure, the following studies are used to derive additional systematic uncertainties:

  • The simulation of the ID includes an excess of material for \(|\eta |>2.3\) resulting in a muon momentum resolution with is too pessimistic. Such imperfection is covered by adding a systematic uncertainties of \(2\times 10^{-3}\) on the \(s^\mathrm{ID}_{1}\) parameter, and of 0.01 on the \(\Delta r^\mathrm{ID}_{1}\) parameter, both for \(|\eta |>2.3\). These are the largest systematic uncertainties on the ID correction parameters.

  • The position of the mass peak in the \(Z \rightarrow \mu \mu \) sample is studied in finer \(\eta \) bins than those used to extract the corrections, using the fit that will be discussed in Sect. 5.2 as an alternative to the template fitting method. An additional uncertainty of \(2\times 10^{-4}\) on the \(s_1^\mathrm{ID }(\eta )\) parameter is found to cover all the observed deviations between data and corrected MC.

  • The effect of the measurement of the angle of the muon tracks has been checked by using the \(J/\psi \) MC and conservatively increasing the track angular resolution by \(\approx 40\,\%\). The maximum effect is an increase of the resolution correction \(\Delta r^\mathrm{ID }_1\) of \(0.001\), which is added to the systematic uncertainties.

  • Special runs with the toroidal magnetic field off have been used to evaluate the quality of the MS chamber alignment. These results are compared to the chamber misalignments in the simulation to define the systematic uncertainty on the \(\Delta r^\mathrm{MS }_2(\eta ,\phi )\) resolution correction parameter.

The final uncertainty on each of the eight muon momentum correction parameters is derived from the sum in quadrature of all the listed uncertainty sources. This is simplified for use in standard physics analyses, for which only four systematic variations are provided: global upper and lower scale variations and independent resolution variations for the ID and the MS. The upper and lower scale variations are obtained by a simultaneous variation of all the ID and MS scale correction parameters by \(1\sigma \). The resolution variation for ID (MS) is obtained by the simultaneous variation of all the ID (MS) correction parameters.

The MC-smearing approach of Eq. (9) cannot be used to correct the MC when the resolution in real data is better than in the simulation. To deal with these cases, the amount of resolution that should be subtracted in quadrature from the simulation to reproduce the data is included in the positive ID and MS resolution variations. Then the prescription for physics analysis is to symmetrize the effect of the positive variation of resolution parameters around the nominal value of the physical observables under study.

Table 1 Summary of ID muon momentum resolution and scale corrections used in Eq. (9), averaged over three main detector regions. The corrections are derived in 18 \(\eta \) detector regions, as described in Sect. 5.1.1, and averaged according to the \(\eta \) width of each region. The uncertainties are the result of the sum in quadrature of the statistical and systematic uncertainties. Only upper uncertainties are reported for the \(\Delta r\) parameters; lower uncertainties are evaluated by symmetrization, as described in Sect. 5.1.2
Table 2 Summary of MS momentum resolution and scale corrections for small and large MS sectors, averaged over three main detector regions. The corrections for large and small MS sectors are derived in 18 \(\eta \) detector regions, as described in Sect. 5.1.1, and averaged according to the \(\eta \) width of each region. The parameters \(\Delta r_0^\mathrm{MS }\), for \(|\eta |>1.05\), and \(\Delta r_2^\mathrm{MS }\), for the full \(\eta \) range, are fixed to zero. The uncertainties are the result of the sum in quadrature of the statistical and systematic uncertainties. Only upper uncertainties are reported for the \(\Delta r\) parameters; lower uncertainties are evaluated by symmetrization, as described in Sect. 5.1.2

5.1.3 Result of the muon momentum scale and resolution corrections

The ID and MS correction parameters used in Eq. (9) are shown in Tables 1 and 2, averaged over three \(\eta \) regions. The scale correction to the simulated ID track reconstruction is always below 0.1 % with an uncertainty ranging from \(\approx 0.02\) %, for \(|\eta |<1.0\), to 0.2 %, for \(|\eta |>2.3\). The correction to the MS scale is \(\lesssim 0.1\) % except for the large MS sectors in the barrel region of the detector, where a correction of \(\approx \)0.3 % is needed, and for specific MS regions with \(1.25<|\eta |<1.5\) where a correction of about \(-0.4\) % is needed. An energy loss correction of approximately \(30\) MeV is visible for low values of \(p_{\mathrm{T}}\) in the MS reconstruction. This correction corresponds to about \(1\,\%\) of the total energy loss in the calorimeter and in the dead material in front of the spectrometer and is compatible with the accuracy of the material budget used in the simulation. Depending on the considered \(p_{\mathrm{T}}\) range, total resolution smearing corrections below 10 % and below 15 % are needed for the simulated ID and MS track reconstructions.

5.2 Measurement of the dimuon mass scale and resolution

The collected samples of \({J/\psi \rightarrow \mu \mu }\), \({\Upsilon \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) decays have been used to study the muon momentum resolution and to validate the momentum corrections obtained with the template fit method described in the previous section with a different methodology. In addition the \(\Upsilon \) sample, not used in the extraction of the corrections, provides an independent validation.

Neglecting angular effects, the invariant mass resolution \(\sigma (m_{\mu \mu })\) is related to the momentum resolution by

$$\begin{aligned} \frac{\sigma (m_{\mu \mu })}{m_{\mu \mu }} \, = \, \frac{1}{2}\frac{\sigma (p_1)}{p_1} \oplus \frac{1}{2}\frac{\sigma (p_2)}{p_2}, \end{aligned}$$
(14)

where \(p_1\) and \(p_2\) are the momenta of the two muons. If the momentum resolution is similar for the two muons then the relative mass resolution is proportional to the relative momentum resolution:

$$\begin{aligned} \frac{\sigma (m_{\mu \mu })}{m_{\mu \mu }} \, = \, \frac{1}{\sqrt{2}} \frac{\sigma (p)}{p}. \end{aligned}$$
(15)

The mass resolution has been obtained by fitting the width of the invariant mass peaks. In the \({J/\psi \rightarrow \mu \mu }\) and \({\Upsilon \rightarrow \mu \mu }\) decays, the intrinsic width of the resonance is negligible with respect to the experimental resolution. In the \(Z \rightarrow \mu \mu \) case the fits have been performed using a convolution of the true line-shape obtained from the MC simulation with an experimental resolution function. The momentum scale was obtained by comparing the mass peak position in data and in MC. Details of the event selection and of the invariant mass fits are given below.

Fig. 10
figure 10

Dimuon invariant mass distribution of \({J/\psi \rightarrow \mu \mu }\) (left), \({\Upsilon \rightarrow \mu \mu }\) (center) and \(Z \rightarrow \mu \mu \) (right) candidate events reconstructed with CB muons. The upper panels show the invariant mass distribution for data and for the signal MC simulation plus the background estimate. The points show the data, the filled histograms show the simulation with the MC momentum corrections applied and the dashed histogram shows the simulation when no correction is applied. Background estimates are added to the signal simulation. The lower panels show the Data/MC ratios. The band represents the effect of the systematic uncertainties on the MC momentum corrections. In the \(J/\psi \) case the background was fitted in a sideband region as described in the text. In the \(\Upsilon \) case a simultaneous fit of the normalization of the three simulated \({\Upsilon \rightarrow \mu \mu }\) distributions and of a linear background was performed. In the \(Z\) case, the MC background samples are added to the signal sample according to their expected cross sections. The sum of background and signal MC is normalized to the data

5.2.1 Event selection and mass fitting

The \(J/\psi \) and \(\Upsilon \) events are selected online by the dedicated dimuon triggers described in Sect. 3.1. The offline event selection requires in addition that both muons are reconstructed as CB muons and have \(p_{\mathrm{T}}>7\) GeV. The trigger acceptance limits the muons to the region \(|\eta |<2.4\). The resulting data samples consist of \(17\)M and \(4.7\)M candidates for \(J/\psi \) and \(\Upsilon \), respectively. The \(Z \rightarrow \mu \mu \) sample was selected online with the single-muon trigger described in Sect. 4.1. One of the two muons can be outside the trigger acceptance, allowing coverage of the full range \(|\eta |<2.7\). The offline selection requires two opposite-charge muons, one with \(p_{\mathrm{T}}>25\) GeV and one with \(p_{\mathrm{T}}>20\) GeV. The two muons are required to be isolated, to have opposite charges and to be compatible with the primary interaction vertex.

The invariant mass distribution of the \({J/\psi \rightarrow \mu \mu }\), \({\Upsilon \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) samples are shown in Fig. 10 and compared with uncorrected and corrected MC. With the uncorrected MC the signal peaks have smaller width and are slightly shifted with respect to data. After correction, the lineshapes of the three resonances agree very well with the data. For a detailed study, the position \(\langle m_{\mu \mu } \rangle \) and the width \(\sigma (m_{\mu \mu })\) of the mass peaks are extracted in bins of \(\eta \) and \(p_{\mathrm{T}}\) from fits of the invariant mass distributions of the three resonances.

In the \(J/\psi \) case, for each bin, the background is obtained from a fit of two sideband regions outside the \(J/\psi \) mass peak (\(2.55<m_{\mu \mu }<2.9\) and \(3.3<m_{\mu \mu }<4.0\) GeV) using a second order polynomial. The background is then subtracted from the signal mass window. The parameters \(\langle m_{\mu \mu } \rangle \) and \(\sigma (m_{\mu \mu })\) of the background subtracted signal distribution are obtained with a Gaussian fit in the range \(\langle m_{\mu \mu } \rangle \pm 1.5 \sigma (m_{\mu \mu })\), obtained using an iterative procedure. Systematic uncertainties associated to the fit are evaluated by repeating the fit using a third order polynomial as the background model and by varying the fit range to \(\pm 1 \times \) and \(\pm 2 \times \sigma (m_{\mu \mu })\).

As shown in Fig. 10, the three \(\Upsilon \) resonances (1S, 2S, 3S) partially overlap. Moreover in the \(\Upsilon \) case the mass window imposed by the trigger limits considerably the size of the sidebands available for fixing the background level. Therefore a different fit strategy is adopted in this case. For each bin, the whole invariant mass distribution in the range \(8.5<m_{\mu \mu }<11.5\) GeV is fitted with a linear background plus three Crystal-Ball functions representing the three resonances. The \(\alpha \) and \(n\) parameters that fix the tail of the Crystal-Ball function are fixed to the values obtained from a fit of the signal MC mass distribution. The relative mass shifts of the three signal peaks are fixed using the PDG masses of the three resonances, while the widths of the three peaks, divided by the corresponding PDG masses, are constrained to be equal. The remaining free parameters in the fit are the mass scale, the width \(\sigma (m_{\mu \mu })\) of the \(\Upsilon (1S)\), the relative normalizations of the \(\Upsilon (2S)\) and \(\Upsilon (3S)\) distributions with respect to \(\Upsilon (1S)\) and two parameters for the linear background. A similar fit is performed on the MC simulation of the invariant mass distribution obtained by adding the three signal peaks and a flat background distribution. The fit systematic uncertainties have been evaluated by chaining the fit range to \(8.25< m_{\mu \mu }< 11.75\) and \(8.75< m_{\mu \mu }< 11.0\) GeV and by varying the \(\alpha \) and \(n\) parameters in the range allowed by fits to the simulation.

Fig. 11
figure 11

Ratio of the fitted mean mass, \(\langle m_{\mu \mu } \rangle \), for data and corrected MC from \(Z\) (top), \(\Upsilon \) (middle), and \(J/\psi \) (bottom) events as a function of the pseudorapidity of the highest-\(p_{\mathrm{T}}\) muon. The ratio is shown for corrected MC (filled symbols) and uncorrected MC (empty symbols). The error bars represent the statistical and the systematic uncertainty on the mass fits added in quadrature. The bands show the uncertainty on the MC corrections calculated separately for the three samples

In the \(Z \rightarrow \mu \mu \) case, for each bin, the true lineshape predicted by the MC simulation is parametrized with a Breit–Wigner function. The measured dimuon mass spectrum is fitted with a Crystal-Ball function, representing the experimental resolution effects, convoluted with the Breit–Wigner parametrization of the true lineshape. The fit is repeated in different ranges around the mass peak (corresponding approximately to one to two standard deviations) and the spread of the results is used to evaluate the systematic uncertainty of the fit.

Fig. 12
figure 12

Ratio of the fitted mean mass, \(\langle m_{\mu \mu } \rangle \), for data and corrected MC from \(J/\psi \), \(\Upsilon \) and \(Z\) events as a function of the average transverse momentum in three \(|\eta |\) ranges. Both muons are required to be in the same \(|\eta |\) range. The \(J/\psi \) and \(\Upsilon \) data are shown as a function of the \(\bar{p}_\mathrm{T} = \frac{1}{2}(p_{\mathrm {T},1}+p_{\mathrm {T},2})\) while for \(Z\) data are plotted as a function of \(p_{\mathrm{T}}^*\) as defined in Eq. (16). The error bars represent the statistical uncertainty and the systematic uncertainty on the fit added in quadrature. The bands show the uncertainty on the MC corrections calculated separately for the three samples

5.2.2 Mass scale results

Figure 11 shows the Data/MC ratio of the mean mass \(\langle m_{\mu \mu } \rangle \) obtained from the fits to the \(Z\), \(J/\psi \), \(\Upsilon \) samples described above, as a function of the pseudorapidity of the highest-\(p_{\mathrm{T}}\) muon for pairs of CB muons. For the uncorrected MC, the ratio deviates from unity in the large \(|\eta |\) region of the \(j/\psi \) and \(\Upsilon \) cases by up to \(5\,\%\). This is mainly due to imperfections in the simulation of the muon energy loss that have a larger effect at low \(p_{\mathrm{T}}\) and in the forward \(\eta \) region where the MS measurement has a larger weight in the MS-ID combination. The corrected MC is in very good agreement with the data, well within the scale systematics that are \(\approx 0.035\,\%\) in the barrel region and increase with \(|\eta |\) to reach \(\sim 0.2\,\%\) in the region \(|\eta |>2\) for the \(Z \rightarrow \mu \mu \) case.

Figure 12 shows the data/MC ratio for \(\langle m_{\mu \mu } \rangle \) as a function of the transverse momentum \(\langle p_{\mathrm{T}}\rangle \) for muons in three different pseudorapidity regions.

For the \(J/\psi \) and \(\Upsilon \) cases, \(\langle p_{\mathrm{T}}\rangle \) is defined as the average momentum \(\bar{p}_T = \frac{1}{2}(p_{\mathrm {T},1}+p_{\mathrm {T},2})\) while in the \(Z\) case it is defined as

$$\begin{aligned} p_{\mathrm{T}}^{*} = m_{Z} \sqrt{ \frac{\sin {\theta _1}\sin {\theta _2}}{2 (1-\cos \alpha _{12})}}, \end{aligned}$$
(16)

where \(m_{Z}\) is the \(Z\) pole mass [28], \(\theta _1\), \(\theta _2\) are the polar angles of the two muons and \(\alpha _{12}\) is the opening angle of the muon pair. This definition, based on angular variables only, removes the correlation between the measurement of the dimuon mass and of the average \(p_{\mathrm{T}}\) that is particularly relevant around the Jacobian peak at \(p_{\mathrm{T}}=m_Z/2\) in the distribution of muons from \(Z\) decays.

The data from the three resonances span from \(\langle p_{\mathrm{T}}\rangle =7\) GeV to \(\langle p_{\mathrm{T}}\rangle =120\) GeV and show that the momentum scale is well known and within the assigned systematic uncertainties in the whole \(p_{\mathrm{T}}\) range.

5.2.3 Resolution results

The dimuon mass width \(\sigma (m_{\mu \mu })\) for CB muons is shown as a function of the leading-muon \(\eta \) in Fig. 13 for the three resonances. The width of the uncorrected MC is 5–10 % smaller than that of the data. After correction the MC reproduces the width of the data well within the correction uncertainties.

At a given \(\eta \), the relative dimuon mass resolution \(\sigma (m_{\mu \mu })/m_{\mu \mu }\) depends approximately on \(\langle p_{\mathrm{T}}\rangle \) (Eq. 15). This allows a direct comparison of the momentum resolution using different resonances. This is shown in Fig. 14, where the relative mass resolution from \({J/\psi \rightarrow \mu \mu }\), \({\Upsilon \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) events is compared in three regions of \(|\eta |\). The \({J/\psi \rightarrow \mu \mu }\) and \({\Upsilon \rightarrow \mu \mu }\) resolutions are in good agreement.

Fig. 13
figure 13

Dimuon invariant mass resolution for CB muons for \({J/\psi \rightarrow \mu \mu }\) (a), \({\Upsilon \rightarrow \mu \mu }\) (b) and \(Z \rightarrow \mu \mu \) (c) events for data and for uncorrected and corrected MC as a function of the pseudorapidity of the highest-\(p_{\mathrm{T}}\) muon. The upper plots show the fitted resolution parameter for data, uncorrected MC and corrected MC. The lower panels show the data/MC ratio, using uncorrected and corrected MC. The error bars represent the statistical uncertainty and the systematic uncertainty on the fit added in quadrature. The bands in the lower panels represent the systematic uncertainty on the correction

Fig. 14
figure 14

Dimuon invariant mass resolution for CB muons measured from \(J/\psi \), \(\Upsilon \) and \(Z\) events as a function of the average transverse momentum in three \(|\eta |\) ranges. Both muons are required to be in the same \(|\eta |\) range. The \(J/\psi \) and \(\Upsilon \) data are plotted as a function of \(\bar{p}_\mathrm{T} = \frac{1}{2}(p_{\mathrm {T},1}+p_{\mathrm {T},2})\) while for \(Z\) data are plotted as a function of \(p_{\mathrm{T}}^*\) as defined in Eq. (16). The error bars represent statistical and systematic errors added in quadrature. The lower panel shows the ratio between data and the corrected MC, with bands representing the uncertainty on the MC corrections for the three calibration samples

Fig. 15
figure 15

Dimuon invariant mass resolution for muons reconstructed with the ID only, measured from \(J/\psi \), \(\Upsilon \) and \(Z\) events as a function of the average transverse momentum in three \(|\eta |\) ranges. Other details as in Fig. 14

Fig. 16
figure 16

Dimuon invariant mass resolution for muons reconstructed with the MS only, measured from \(J/\psi \), \(\Upsilon \) and \(Z\) events as a function of the average transverse momentum in three \(|\eta |\) ranges. Other details as in Fig. 14

In the \(Z \rightarrow \mu \mu \) sample, due to the decay kinematics, below \(\langle p_{\mathrm{T}}\rangle = m_Z /2\) there is a strong correlation between \(\langle p_{\mathrm{T}}\rangle \) and the pseudorapidity of the muons, in such a way that the lower is the \(\langle p_{\mathrm{T}}\rangle \), the larger is the \(|\eta |\) of the muons. Above \(\langle p_{\mathrm{T}}\rangle = m_Z /2\), the correlation effect is strongly reduced and the \(Z\) measurements are well aligned with those from the lighter resonances. In the barrel region, \(|\eta |<1\), the mass resolution increases from \(\sigma (m_{\mu \mu })/m_{\mu \mu } \approx 1.2\,\%\) at \(p_{\mathrm{T}}<10\) GeV to \(\sigma (m_{\mu \mu })/m_{\mu \mu } \approx 2\,\%\) at \(p_{\mathrm{T}}=100\) GeV. For \(|\eta |>1\) it goes from \(\sigma (m_{\mu \mu })/m_{\mu \mu } \approx 2\,\%\) to \(\approx 3\,\%\) in the same \(p_{\mathrm{T}}\) range. This behavior is very well reproduced by the corrected MC. Following Eq. (15), it is possible to scale \(\sigma (m_{\mu \mu })/m_{\mu \mu }\) by \(\sqrt{2}\) to extract a measurement of the relative momentum resolution \(\sigma (p)/p\), which ranges from \(\approx 1.7\,\%\) in the central region and at low \(p_{\mathrm{T}}\) to \(\approx 4\,\%\) at large \(\eta \) and \(p_{\mathrm{T}}= 100\) GeV.

To understand better the \(p_{\mathrm{T}}\) dependence of the momentum resolution of CB muons, it is useful to study separately the resolution of the ID and of the MS measurements, as shown in Figs. 15 and 16. The ID measurement has a better resolution than the MS in the \(p_{\mathrm{T}}\) range under study for \(|\eta |<2\) while the MS has a better resolution at larger \(|\eta |\). The resolution of the CB muons is significantly better than the ID or the MS measurements taken separately in the whole \(|\eta |\) range. The ID resolution has an approximately linear increases with \(p_{\mathrm{T}}\), corresponding to a non-zero \(r_2\) term in Eq. (10). The MS resolution is largest in the region \(1<|\eta |<2\) which contains the areas with the lowest magnetic field integral. In the region \(|\eta |<1\) there is a visible increase at low \(p_{\mathrm{T}}\) that corresponds to the presence of a non-zero \(r_0\) term in Eq. (10). The \(p_{\mathrm{T}}\) dependence of the resolutions for both the ID and the MS measurements is well reproduced by the corrected MC. According to studies based on MC, the MS measurement is expected to dominate over the ID in the whole \(|\eta |\) range for sufficiently large \(p_{\mathrm{T}}\).

6 Final state radiation recovery

The invariant mass distributions of resonances that decay into muons, such as \(Z \rightarrow \mu \mu \) and \(H\rightarrow ZZ\rightarrow 4\ell \), is affected by QED final state radiation of photons, causing the mass reconstructed using muons to be shifted to lower values. In this section, a dedicated method to include FSR photons in the reconstruction of resonances decaying into muons is introduced and tested with \(Z \rightarrow \mu \mu \) data. This method has been used in several ATLAS publications [6, 29].

Fig. 17
figure 17

Invariant mass distribution of \(Z \rightarrow \mu \mu \) events with identified FSR in data before (filled triangles) and after (filled circles) FSR correction, for collinear (top) and non-collinear (bottom) FSR. The MC prediction is shown before correction (red histogram) and after correction (blue histogram)

Final state radiation photons emitted collinearly to muons can be reconstructed with the LAr calorimeter: electromagnetic clusters are searched for within a narrow cone around the axis defined by the muon momentum direction at the interaction point (i.e. the direction which would be followed by an uncharged particle). The longitudinal segmentation of the LAr calorimeter is exploited to reduce fake photon clusters produced by muon energy losses in the calorimeter. This is achieved by using as a discriminant the fraction \(\mathrm {f_1}\) of the cluster energy deposited in the first segment of the calorimeter divided by the total cluster energy. Collinear FSR photon candidates are required to have \(E_{T}>1.5\) GeV, \(\Delta R_{\mathrm {cluster},\mu }<0.15\) and \(\mathrm {f_1}>0.1\). In addition, non-collinear FSR photons are recovered using the standard ATLAS photon reconstruction, selecting isolated photons emitted with \(\Delta R_{\mathrm {cluster},\mu }>0.15\) and with \(E_{T}>10\) GeV [30].

The effect of adding a collinear or non-collinear FSR photon to the \(Z \rightarrow \mu \mu \) invariant mass in data is studied in a sample obtained with a dedicated selection of \(Z \rightarrow \mu \mu \) candidates plus at least one radiated photon candidate.

The correction for collinear FSR is applied for events in the mass window \(66\) GeV\( < m_{\mu \mu } <89\) GeV while the correction for non-collinear FSR photons is applied only if the collinear search has failed and the dimuon mass satisfies \(m_{\mu \mu } < 81\) GeV.

In Fig. 17 the invariant mass distributions for the sample of \(Z \rightarrow \mu \mu \) events with a FSR photon candidate are shown before and after the addition of collinear and non-collinear FSR photons. A good agreement between data and MC is observed for the corrected \(Z \rightarrow \mu \mu \) events. According to MC studies, the collinear FSR selection has an efficiency of \(70 \pm 4\) % for FSR photons emitted with \(E_{T}>1.5\) GeV and \(\Delta R_{\gamma ,\mu }<0.15\) in the fiducial region defined requiring \(|\eta |<2.37\) and excluding the calorimeter crack region \(1.37< |\eta | < 1.52\). About \(85\,\%\) of the corrected events have genuine FSR photons, with the remaining photons coming from muon bremsstrahlung or ionization or from random matching with energy depositions from other sources. The fraction of all \(Z \rightarrow \mu \mu \) events corrected with a collinear FSR photon is \(\simeq \) 4 %. The non-collinear FSR selection has an efficiency of \(60\pm 3\) % in the fiducial region and a purity of \(\ge 95\) %. The fraction of \(Z \rightarrow \mu \mu \) events corrected with a non-collinear FSR photon is \(\simeq 1\) %.

The FSR correction may introduce systematic variations in the invariant mass scale and resolution. To study these effects, a Gaussian fit of the \(Z \rightarrow \mu \mu \) distribution has been performed in the mass range \(91.18 \pm 3.00\) GeV. The FSR correction induces a mass shift of \(+40\pm 3\) MeV and an improvement of the resolution of \(3 \pm 1\) % in the full \(Z \rightarrow \mu \mu \) sample. The effects observed in the data are well reproduced by the MC. The systematic uncertainty introduced by the FSR recovery on the inclusive \(Z\) mass scale can be understood by considering a 0.5 % photon energy scale uncertainty, the fact that only 5 % of the \(Z\) events are corrected, and that the fraction of energy carried by the photons is a few %. This leads to a systematic uncertainty smaller than 2 MeV.

The effect of pile up on the FSR correction has been estimated by dividing the data and the MC into three categories based on the average number of interactions per bunch crossing: \(\langle \mu \rangle =0\)–17, 17–23, 23–40. A comparison of the fitted \(Z\) mass between data and MC has been performed in the three categories and no dependence on \(\langle \mu \rangle \) was observed. Good agreement between data and MC within the statistical uncertainties was found.

7 Conclusions

The performance of the ATLAS muon reconstruction has been measured using data from LHC \(pp\) collisions at \(\sqrt{s} = 7\)–8 TeV. The muon reconstruction efficiency is close to \(99\,\%\) over most of the pseudorapidity range of \(|\eta |<2.5\) and for \(p_{\mathrm{T}}>10\) GeV. The large collected sample of 9M \(Z \rightarrow \mu \mu \) decays allows the measurement of the efficiency over the full acceptance of \(|\eta |<2.7\), and with a precision at the 1 per-mille level for \(|\eta |<2.5\). By including \({J/\psi \rightarrow \mu \mu }\) decays, the efficiency measurement has been extended over the transverse momentum range from \(p_{\mathrm{T}}\approx 4\) GeV to \(p_{\mathrm{T}}\approx 100\) GeV.

The muon momentum scale and resolution has been studied in detail using large calibration samples of \({J/\psi \rightarrow \mu \mu }\), \({\Upsilon \rightarrow \mu \mu }\) and \(Z \rightarrow \mu \mu \) decays. These studies have been used to correct the MC simulation to improve the data-MC agreement and to minimize the uncertainties in physics analyses. The momentum scale for combined muons is known with an uncertainty of \(\pm 0.05\,\%\) for \(|\eta | <1\), which increases to \(\lesssim 0.2\,\%\) for \(|\eta |>2.3\) for \(Z \rightarrow \mu \mu \) events. The dimuon mass resolution is \(\approx 1.2\,\%\) (\(2\,\%\)) at low-\(p_{\mathrm{T}}\) increasing to \(\approx 2\,\%\) (\(3\,\%\)) at \(p_{\mathrm{T}}\approx 100\) GeV for \(|\eta |<1\) (\(|\eta |>1\)). The resolution is reproduced by the corrected simulation within relative uncertainties of \(3\,\%\) to \(10\,\%\) depending on \(\eta \) and \(p_{\mathrm{T}}\).

The mass resolution for the \(Z \rightarrow \mu \mu \) resonance was found to improve when photons from QED final state radiation are recovered. The FSR recovery allows to recover \(\approx 4\,\%\) of the events from the low-mass tail to the peak region, improving the dimuon mass resolution by \(\approx 3\,\%\).