1 Introduction

Precise lifetime measurements involving the weak interaction play an important role in the study of nonperturbative aspects of quantum chromodynamics (QCD). The phenomenology is commonly described by the QCD-inspired heavy-quark expansion model, which provides estimates of the ratio of lifetimes for hadrons containing a common heavy quark [1]. In this paper, we report measurements of the lifetimes of the \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s} \), \(\varLambda _\mathrm {b}^0\), and \(\mathrm {B}_\mathrm {c}^+\) hadrons.

The measurements are based on the reconstruction of the transverse decay length \(L_{xy}\), where \(\vec {L}_{xy}\) is defined as the flight distance vector from the primary vertex to the decay vertex of the \(\mathrm {b}\) hadron, projected onto the transverse component \(\vec {p}_\text {T}\) (perpendicular to the beam axis) of the \(\mathrm {b}\) hadron momentum. The proper decay time of the \(\mathrm {b}\) hadron times the speed of light is measured using

$$\begin{aligned} ct =c L_{xy}\frac{M}{p_{\mathrm {T}}}, \end{aligned}$$
(1)

where M is the world-average value of the mass of the \(\mathrm {b}\) hadron [2].

In this analysis, the \(\mathrm {b}\) hadrons are reconstructed from decays containing a \(\mathrm {J}/\psi \) meson. The data were recorded by the CMS detector [3] at the CERN LHC using dedicated triggers that require two oppositely charged muons consistent with originating from a common vertex and with an invariant mass compatible with that of the \(\mathrm {J}/\psi \) meson. Specifically, we reconstruct the decay modes \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^0_S}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\), \(\varLambda _\mathrm {b}^0\!\rightarrow \!\mathrm {J}/\psi \mathrm {\Lambda }^{0}\), and \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\), where \(\mathrm {J}/\psi \!\rightarrow \! \upmu ^+ \upmu ^-\), \(\mathrm {K^{*}(892)}{}^{0} \!\rightarrow \! \mathrm {K^+}\mathrm {\pi ^-}\), \(\mathrm {K^0_S}\!\rightarrow \! \mathrm {\pi ^+}\mathrm {\pi ^-}\), \(\mathrm {\phi (1020)}\!\rightarrow \! \mathrm {K^+}\mathrm {K^-}\), and \(\mathrm {\Lambda }^{0}\!\rightarrow \! \mathrm {p}\mathrm {\pi ^-}\). The \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) decay is used as a reference mode and in evaluating some of the systematic uncertainties. Charge conjugation is implied throughout, unless otherwise indicated.

The decay rate of neutral \(\mathrm {B}_{\mathrm {q}}\) (\(\hbox {q} = \hbox {s or d}\)) mesons is characterized by two parameters: the average decay width \(\varGamma _{\mathrm {q}} = (\varGamma _{\mathrm {L}}^{\mathrm {q}} + \varGamma _{\mathrm {H}}^{\mathrm {q}})/2\) and the decay width difference \(\varDelta \varGamma _{\mathrm {q}} = \varGamma _{\mathrm {L}}^{\mathrm {q}} - \varGamma _{\mathrm {H}}^{\mathrm {q}}\), where \(\varGamma _{\mathrm {L,H}}^{\mathrm {q}}\) are the decay widths of the light (L) and heavy (H) mass eigenstates. Assuming equal amounts of \(\mathrm {B}_{\mathrm {q}}\) and its antiparticle are produced in the proton–proton collisions, the time-dependent decay rate into a final state f that is accessible by both particle and antiparticle can be written as [4]:

$$\begin{aligned} R_{\mathrm {L}}^{f} \mathrm {e}^{- \varGamma _{\mathrm {L}}^{\mathrm {q}} t } + R_{\mathrm {H}}^{f} \mathrm {e}^{- \varGamma _{\mathrm {H}}^{\mathrm {q}} t}, \end{aligned}$$
(2)

where \(R_{\mathrm {L}}^{f}\) and \(R_{\mathrm {H}}^{f}\) are the amplitudes of the light and heavy mass states, respectively. Since the neutral B mesons have two eigenstates with different lifetimes, the \(ct\) distribution consists of the sum of two exponential contributions. The effective lifetime of the neutral \(\mathrm {B}_{\mathrm {q}}\) meson, produced as an equal admixture of particle and antiparticle flavour eigenstates and decaying into a final state f, can be written as [4]:

$$\begin{aligned} \tau _\text {eff} = \frac{ \frac{ R_\mathrm {L}^f}{ \left( \varGamma _{\mathrm {L}}^{\mathrm {q}} \right) ^2 } + \frac{R_\mathrm {H}^f}{ \left( \varGamma _\mathrm {H}^{\mathrm {q}} \right) ^2 }}{ \frac{R_\mathrm {L}^f}{\varGamma _{\mathrm {L}}^{\mathrm {q}} } + \frac{R_\mathrm {H}^f}{\varGamma _{\mathrm {H}}^{\mathrm {q}} } }. \end{aligned}$$
(3)

Since the amplitudes \(R_{\mathrm {H}}^f\) and \(R_{\mathrm {L}}^f\) are specific to the decay channel, the effective lifetime depends on the final state f and is measured by fitting an exponential function to a distribution consisting of the sum of two exponential contributions. Because the \({\mathrm {B}^0}\) system has a small lifetime difference with respect to the average lifetime, \(\varDelta \varGamma _\mathrm {d}/\varGamma _\mathrm {d} = (-0.2 \pm 1.0)\%\) [5], the \(ct\) distribution is close to an exponential, and it is treated as such for the lifetime measurement. Following Ref. [6], the \({\mathrm {B}^0}\) lifetimes measured in the flavour-specific channel \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\) and the CP eigenstate channel \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^0_S}\) are used to determine values for \(\varDelta \varGamma _\mathrm {d}\), \(\varGamma _\mathrm {d}\), and \(\varDelta \varGamma _\mathrm {d}/\varGamma _\mathrm {d}\).

In the \(\mathrm {B}^0_\mathrm {s} \) system, \(\varDelta \varGamma _\mathrm {s}/\varGamma _{\mathrm {s}} = (13.0 \pm 0.9)\%\) [5] and the deviation from an exponential \(ct\) distribution is sizeable. In this analysis, the two lifetimes associated with the \(\mathrm {B}^0_\mathrm {s} \) meson are measured in the \(\mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) and \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) decay channels. The \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) decays are reconstructed in the invariant mass range \(0.9240< M( \pi ^{+}\pi ^{-} ) < 1.0204\) \(\,\text {GeV}\), which is dominated by the \(f_{0}(980)\) resonance [7, 8], making it a CP-odd final state. Therefore, the lifetime measured in this channel is related to the inverse of the decay width of the heavy \(\mathrm {B}^0_\mathrm {s} \) mass eigenstate, \(\tau _{\mathrm {B}^0_\mathrm {s}}^{\text {CP-odd}} \approx 1/\varGamma _\mathrm {H}\), as CP violation in mixing is measured to be negligible [2]. The \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) decay channel is an admixture of CP-even and CP-odd states, corresponding to the light and heavy mass eigenstates, respectively, neglecting CP violation in mixing. Rewriting Eq. (3), the effective lifetime of the \(\mathrm {B}^0_\mathrm {s} \) meson decaying to \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) can be expressed as

$$\begin{aligned} \tau _{\text {eff}} = f_\mathrm {H} \tau _\mathrm {H} + (1-f_\mathrm {H}) \tau _\mathrm {L}, \end{aligned}$$
(4)

where \(\tau _\mathrm {L}\) and \(\tau _\mathrm {H}\) are the lifetimes of the light and heavy mass states, respectively, and \(f_\mathrm {H}\) is the heavy-component fraction, defined as:

$$\begin{aligned} f_\mathrm {H} = \frac{| A_{\perp } |^2 \tau _\mathrm {H}}{ |A|^2 \tau _\mathrm {L} + |A_\perp |^2 \tau _\mathrm {H}}. \end{aligned}$$
(5)

Here, \(|A|^2 = | A_0(0) |^2 + | A_{\parallel }(0) |^2\) is the sum of the squares of the amplitudes of the two CP-even states, and \(| A_{\perp } |^2 = | A_{\perp }(0) |^2\) is the square of the amplitude of the CP-odd state. The amplitudes are determined at the production time \(t=0\). Normalization constraints require \(|A|^2 = 1 - |A_\perp |^2\) and therefore

$$\begin{aligned} f_\mathrm {H} = \frac{| A_{\perp } |^2 \tau _\mathrm {H}}{ (1 - |A_\perp |^2) \tau _\mathrm {L} + |A_\perp |^2 \tau _\mathrm {H}}. \end{aligned}$$
(6)

By combining the \(\mathrm {B}^0_\mathrm {s} \) lifetimes obtained from the final states \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) and \(\mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\), it is possible to determine the lifetime of the light \(\mathrm {B}^0_\mathrm {s} \) mass eigenstate. The results in this paper are complementary to the CMS weak mixing phase analysis in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel [9], which provided measurements of the average decay width \(\varGamma _\mathrm {s}\) and the decay width difference \(\varDelta \varGamma _\mathrm {s}\).

The weak decay of the \(\mathrm {B}_\mathrm {c}^+\) meson can occur through either the b or c quark decaying, with the other quark as a spectator, or through an annihilation process. The latter is predicted to contribute 10% of the decay width [10], and lifetime measurements can be used to test the \(\mathrm {B}_\mathrm {c}^+\) decay model. As fewer and less precise measurements of the \(\mathrm {B}_\mathrm {c}^+\) lifetime exist [11,12,13,14,15,16] compared to other b hadrons, the \(\mathrm {B}_\mathrm {c}^+\) lifetime measurement presented in this paper is particularly valuable.

2 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6 \(\text {m}\) internal diameter, providing a magnetic field of 3.8 \(\text {T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter, each composed of a barrel and two endcap sections. Forward calorimeters extend the pseudorapidity coverage provided by the barrel and endcap detectors. Muons are detected in gas-ionization chambers embedded in the steel flux-return yoke outside the solenoid.

The main subdetectors used for this analysis are the silicon tracker and the muon detection system. The silicon tracker measures charged particles in the pseudorapidity range \(|\eta |<2.5\). It consists of 1440 silicon pixel and 15 148 silicon strip detector modules. For charged particles of \(1< p_{\mathrm {T}} < 10\,\text {GeV} \) and \(|\eta | < 1.4\), the track resolutions are typically 1.5% in \(p_{\mathrm {T}}\) and 25–90 (45–150)\(\,\upmu \text {m}\) in the transverse (longitudinal) impact parameter [17]. Muons are measured in the pseudorapidity range \(|\eta | < 2.4\), with detection planes made using three technologies: drift tubes, cathode strip chambers, and resistive-plate chambers.

Events of interest are selected using a two-tiered trigger system [18]. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100 \(\text {kHz}\) within a time interval of less than 4\(\,\upmu \text {s}\). The second level, known as the high-level trigger (HLT), consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing, and reduces the event rate to around 1 \(\text {kHz}\) before data storage. At the HLT stage, there is full access to the event information, and therefore selection criteria similar to those applied offline can be used.

A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [3].

3 Data and Monte Carlo simulated samples

The data used in this analysis were collected in 2012 from proton–proton collisions at a centre-of-mass energy of 8\(\,\text {Te}\text {V}\), and correspond to an integrated luminosity of 19.7\(\,\text {fb}^\text {-1}\).

Fully simulated Monte Carlo (MC) samples of \({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^0_S}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\), and \(\varLambda _\mathrm {b}^0\!\rightarrow \!\mathrm {J}/\psi \mathrm {\Lambda }^{0}\) were produced with pythia 6.424 [19] to simulate the proton–proton collisions, and subsequent parton shower and hadronization processes. The \(\mathrm {B}_\mathrm {c}^+\) MC sample was produced with the dedicated generator bcvegpy 2.0 [20, 21] interfaced to pythia . Decays of particles containing b or c quarks are simulated with the evtgen package [22], and final-state radiation is included via photos  [23]. Events are passed through the CMS detector simulation based on Geant4 [24], including additional proton–proton collisions in the same or nearby beam crossings (pileup) to match the number of multiple vertices per event in the data. Simulated events are processed with the same reconstruction and trigger algorithms as the data.

4 Reconstruction of \(\mathrm {b}\) hadrons

The data are collected with a trigger that is designed to identify events in which a \(\mathrm {J}/\psi \) meson decays to two oppositely charged muons. The transverse momentum of the \(\mathrm {J}/\psi \) candidate is required to be greater than 7.9\(\,\text {GeV}\) and both muons must be in the pseudorapidity region \(|\eta | < 2.2\). The distance of closest approach of each muon to the event vertex in the transverse plane must be less than 0.5\(\,\text {cm}\) and a fit of the two muons to a common vertex must have a \(\chi ^2\) probability greater than 0.5%. The invariant mass of the dimuon system must lie within ±5 times the experimental mass resolution (typically about 35\(\,\text {MeV}\)) of the world-average \(\mathrm {J}/\psi \) mass [2].

The offline selection starts from \(\mathrm {J}/\psi \) candidates that are reconstructed from pairs of oppositely charged muons. The standard CMS muon reconstruction procedure [25] is used to identify the muons, which requires multiple hits in the pixel, strip, and muon detectors with a consistent trajectory throughout. The offline selection requirements on the dimuon system replicate the trigger selection. From the sample of collected \(\mathrm {J}/\psi \) events, candidate b hadrons are reconstructed by combining a \(\mathrm {J}/\psi \) candidate with track(s) or reconstructed neutral particles, depending on the decay mode. Only tracks that pass the standard CMS high-purity requirements [17] are used. The b hadron candidate is fitted to a common vertex with the appropriate masses assigned to the charged tracks and the dimuon invariant mass constrained to the world-average \(\mathrm {J}/\psi \) mass [2]. In fits that include a \(\mathrm {K^0_S}\) or \(\mathrm {\Lambda }^0\) hadron, the world-average mass is used for those particles. Primary vertices (PV) are fitted from the reconstructed tracks using an estimate of the proton–proton interaction region (beamspot) as a constraint. The PV having the smallest pointing angle, defined as the angle between the reconstructed \(\mathrm {b}\) hadron momentum and the vector joining the PV with the decay vertex, is used. As the proper decay times are measured in the transverse plane, where the PV position is dominated by the beamspot, the choice of PV has little effect on the analysis and is accounted for as a systematic uncertainty.

4.1 Reconstruction of \({\mathrm {B}^{+}}\), \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s} \), and \(\varLambda _\mathrm {b}^0\) hadrons

The \({\mathrm {B}^{+}}\), \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s} \), and \(\varLambda _\mathrm {b}^0\) hadrons are reconstructed in the decays \({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^0_S}\), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\), and \(\varLambda _\mathrm {b}^0\!\rightarrow \!\mathrm {J}/\psi \mathrm {\Lambda }^{0}\). The \(\mathrm {K^{*}(892)}{}^{0}\), \(\mathrm {K^0_S}\), \(\mathrm {\phi (1020)}\), and \(\mathrm {\Lambda }^{0}\) candidates are reconstructed from pairs of oppositely charged tracks that are consistent with originating from a common vertex. Because of the lack of charged particle identification, the labelling of tracks as pions, kaons, and protons simply means the mass that is assigned to the track. The mass assignments for the \(\mathrm {K^0_S}\) and \(\mathrm {\phi (1020)}\) decay products are unambiguous (either both pions or both kaons). For the kinematic region considered in this analysis, simulations show that the proton always corresponds to the track with the larger momentum (leading track) from the \(\mathrm {\Lambda }^{0}\) decay. The \(\mathrm {K^{*}(892)}{}^{0}\) candidates are constructed from a pair of tracks with kaon and pion mass assignments.

Since two \(\mathrm {K^{*}(892)}{}^{0}\) candidates can be formed with a single pair of tracks, we select the combination for which the mass of the \(\mathrm {K^{*}(892)}{}^{0}\) candidate is closest to the world-average value [2]. This selects the correct combination 88% of the time.

All tracks must have a transverse momentum greater than 0.5\(\,\text {GeV}\). The decay vertices of the \(\mathrm {K^0_S}\) and \(\mathrm {\Lambda }^{0}\) particles are required to have a transverse decay length larger than 15\(\sigma \) and their two decay products must each have a transverse impact parameter of at least 2\(\sigma \), where the distances are with respect to the beamspot and \(\sigma \) is the calculated uncertainty in the relevant quantity. The intermediate candidate states \(\mathrm {K^{*}(892)}{}^{0}\), \(\mathrm {K^0_S}\), \(\mathrm {\phi (1020)}\), and \(\mathrm {\Lambda }^{0}\) are selected if they lie within the following mass regions that correspond to 1–2 times the experimental resolution or natural width around the nominal mass: \(0.7960<M(\mathrm {K^+}\mathrm {\pi ^-})<0.9880\) \(\,\text {GeV}\), \(0.4876<M(\mathrm {\pi ^+}\mathrm {\pi ^-})<0.5076\) \(\,\text {GeV}\), \(1.0095< M(\mathrm {K^+}\mathrm {K^-}) < 1.0295\) \(\,\text {GeV}\), and \(1.1096<M(\mathrm {p}\mathrm {\pi ^-})<1.1216\) \(\,\text {GeV}\). The accepted mass region of the \(\mathrm {\pi ^+}\mathrm {\pi ^-}\) system in \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) decay is \(0.9240< M(\mathrm {\pi ^+}\mathrm {\pi ^-}) < 1.0204\) \(\,\text {GeV}\). The \(\mathrm {K^0_S}\) contamination in the \(\mathrm {\Lambda }^{0}\) sample is removed by discarding candidates in which the leading particle in the \(\mathrm {\Lambda }^{0}\) decay is assigned the pion mass and the resulting \(\mathrm {\pi ^+}\mathrm {\pi ^-}\) invariant mass is in the range \(0.4876<M(\mathrm {\pi ^+}\mathrm {\pi ^-})<0.5076\) \(\,\text {GeV}\). Conversely, the \(\mathrm {\Lambda }^{0}\) contamination is removed from the \(\mathrm {K^0_S}\) sample by discarding candidates in the \(\mathrm {p}\mathrm {\pi ^-}\) mass region \(1.1096<M(\mathrm {p}\mathrm {\pi ^-})<1.1216\) \(\,\text {GeV}\), when the proton mass is assigned to the leading pion from the \(\mathrm {K^0_S}\) decay. The \(p_{\mathrm {T}}\) of the \(\mathrm {K^+}\) candidate track from the \({\mathrm {B}^{+}}\) decay must be larger than 1\(\,\text {GeV}\). The \(p_{\mathrm {T}}\) of the \(\mathrm {\pi ^+}\mathrm {\pi ^-}\) system in \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) decays and the \(\mathrm {K^{*}(892)}{}^{0}\) candidates in \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\) decays must be greater than 3.5\(\,\text {GeV}\), with the leading (trailing) charged hadrons in these decays required to have a \(p_{\mathrm {T}}\) greater than 2.5 (1.5)\(\,\text {GeV}\). The \(p_{\mathrm {T}}\) of the \(\mathrm {b}\) hadrons must be at least 13\(\,\text {GeV}\), except for the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) decay where no requirement is imposed. The \(p_{\mathrm {T}}\) of the leading track from the \(\mathrm {K^0_S}\) and \(\mathrm {\Lambda }^{0}\) decays must be larger than 1.8\(\,\text {GeV}\). The minimum \(p_{\mathrm {T}}\) for the kaons forming a \(\mathrm {\phi (1020)}\) candidate is 0.7\(\,\text {GeV}\).

The \(\mathrm {b}\) hadron vertex \(\chi ^2\) probability is required to be greater than 0.1% in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel only. The lifetime measurement is limited to events in which the \(\mathrm {b}\) hadron \(ct\) is greater than 0.02\(\,\text {cm}\) to avoid resolution and reconstruction effects present in the low-\(ct\) region. No attempt is made to select a single b hadron candidate in the relatively rare \((<1\%)\) events in which more than one b hadron candidate is found.

4.2 Reconstruction of \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\)

The \(\mathrm {B}_\mathrm {c}^+\) lifetime is measured using the method developed by the LHCb Collaboration [12] in which the measured difference in total widths between the \(\mathrm {B}_\mathrm {c}^+\) and \({\mathrm {B}^{+}}\) mesons is used in combination with the precisely known \({\mathrm {B}^{+}}\) lifetime to obtain the \(\mathrm {B}_\mathrm {c}^+\) lifetime. This method does not require modelling the background \(ct\) distribution, avoiding a source of systematic uncertainty. The same reconstruction algorithm and selection criteria are used for both decays, \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) and \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\). As a result, the dependence of the efficiencies on the proper decay time is similar.

The charged hadron tracks are required to have at least 2 pixel hits, at least 6 tracker hits (strips and pixels together), a track fit \(\chi ^2\) less than 3 times the number of degrees of freedom, and \(|\eta |< 2.4\). The dimuon invariant mass is required to lie in the range ±3\(\sigma \) from the nominal \(\mathrm {J}/\psi \) meson mass, where \(\sigma \) is the average resolution for the \(\mathrm {J}/\psi \) signal, which depends on the \(\mathrm {J}/\psi \) pseudorapidity and ranges from 35 to 50\(\,\text {MeV}\). The \(p_{\mathrm {T}}\) of the charged hadron tracks and the b hadrons are required to be greater than 3.3 and 10\(\,\text {GeV}\), respectively. The b hadrons must have a rapidity of \(|y|<2.2\), a vertex \(\chi ^2\) probability greater than 5%, a dimuon vertex \(\chi ^2\) probability greater than 1%, and \(\cos \theta > 0.98\), where \({\cos \theta = \vec {L}_{xy}\cdot \vec {p}_{\mathrm {T},{\mathrm {B}}} / (|L_{xy}|\cdot |p_{\mathrm {T},{\mathrm {B}}}|)}\) and \(\vec {L}_{xy}\) and \(\vec {p}_{\mathrm {T},{\mathrm {B}}}\) refer to the transverse decay length and momentum of the \({\mathrm {B}^{+}}\) or \(\mathrm {B}_\mathrm {c}^+\) mesons. The lifetime measurement is limited to events in which the b hadron has \(ct > 0.01\,\text {cm} \), which ensures that the ratio of the \(\mathrm {B}_\mathrm {c}^+\) to \({\mathrm {B}^{+}}\) meson efficiencies is constant versus \(ct\). The analysis of the \(\mathrm {B}_\mathrm {c}^+\) lifetime is described in Sect. 6.

5 Measurement of the \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s} \), and \(\varLambda _\mathrm {b}^0\) lifetimes

For each decay channel, we perform a simultaneous fit to three input variables, the b hadron mass, \(ct\), and \(ct\) uncertainty (\(\sigma _{ct}\)). For the \({\mathrm {B}^{+}}\), \({\mathrm {B}^0}\), and \(\varLambda _\mathrm {b}^0\) hadrons, an unbinned maximum-likelihood fit is performed with a probability density function (PDF) given by:

$$\begin{aligned} \mathrm {PDF}= & {} f_s\; M_{s}(M)\; T_{s}(ct)\; E_{s}(\sigma _{ct})\; \varepsilon (ct) \nonumber \\&+ (1-f_s)\; M_{b}(M)\; T_{b}(ct)\; E_{b}(\sigma _{ct}), \end{aligned}$$
(7)

where \(f_s\) is the fraction of signal events, and \(M_s\) (\(M_b\)), \(T_s\) (\(T_b\)), and \(E_s\) (\(E_b\)) are the functions describing the signal (background) distributions of the b hadron mass, \(ct\), and \(\sigma _{ct}\), respectively, while \(\varepsilon \) is the efficiency function. These functions are derived below. For the \(\mathrm {B}^0_\mathrm {s} \) modes, we use an extended maximum-likelihood fit in order to correctly incorporate background sources whose yields are obtained from the fit.

5.1 Reconstruction and selection efficiency

The reconstruction and selection efficiency \(\varepsilon \) for each decay mode is determined as a function of \(ct\) by using fully simulated MC samples. This efficiency is defined as the generated \(ct\) distribution of the selected events after reconstruction and selection divided by the \(ct\) distribution obtained from an exponential decay with the lifetime set to the value used to generate the events. The efficiency for the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel is defined as the generated \(ct\) distribution of the selected events after reconstruction divided by the sum of the two exponentials generated with the theoretical \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) decay rate model [26]. In the theoretical model, the values of the physics parameters are set to those used in the simulated sample.

Figure 1 shows the efficiency as a function of \(ct\) for the various decay modes, with an arbitrary normalization since only the relative efficiency is relevant. The efficiencies display a sharp rise as \(ct\) increases from 0 to 0.01 \(\text {cm}\), followed by a slow decrease as \(ct\) increases further. The \(ct\) efficiency is modelled with an inverse power function.

Fig. 1
figure 1

The combined reconstruction and selection efficiency from simulation versus \(ct\) with a superimposed fit to an inverse power function for \({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\) (upper left), \(\varLambda _\mathrm {b}^0\!\rightarrow \!\mathrm {J}/\psi \mathrm {\Lambda }^{0}\) (upper right), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^0_S}\) (centre left), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\) (centre right), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) (lower left), and \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) (lower right). The efficiency scale is arbitrary

5.2 Data modelling

Depending on the decay channel, the invariant mass distribution for the signal \(M_s\) is modelled with one or two Gaussian functions, and a linear polynomial or an exponential function is used to model the combinatorial background \(M_b\). For the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) decay, three additional terms are added to \(M_b\) to include specific sources of background. The \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) decays are modelled by a Gaussian function, the \({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\) decays by a shape taken from simulation, and the \({\mathrm {B}^0}_{\mathrm {(d,s)}} \!\rightarrow \! \mathrm {J}/\psi \mathrm {h}^{+}_{1} \mathrm {h}^{-}_{2}\) decays, where \(\mathrm {h}^{+}_{1}\) and \(\mathrm {h}^{-}_{2}\) are charged hadron tracks that are not both pions, by a Gaussian function.

Fig. 2
figure 2

Invariant mass (left) and \(ct\) (right) distributions for \({\mathrm {B}^{+}}\) (upper) and for \(\varLambda _\mathrm {b}^0\) (lower) candidates. The curves are projections of the fit to the data, with the contributions from signal (dashed), background (dotted), and the sum of signal and background (solid) shown. the lower panels of the figures on the right show the difference between the observed data and the fit divided by the data uncertainty. The vertical bars on the data points represent the statistical uncertainties

Fig. 3
figure 3

Invariant mass (left) and \(ct\) (right) distributions for \({\mathrm {B}^0}\) candidates reconstructed from \(\mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\) (upper) and \( \mathrm {J}/\psi \mathrm {K^0_S}\) (lower) decays. The curves are projections of the fit to the data, with the contributions from signal (dashed), background (dotted), and the sum of signal and background (solid) shown. the lower panels of the figures on the right show the difference between the observed data and the fit divided by the data uncertainty. The vertical bars on the data points represent the statistical uncertainties

Fig. 4
figure 4

Invariant mass (left) and \(ct\) (right) distributions for \(\mathrm {B}^0_\mathrm {s} \) candidates reconstructed from \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) (upper) and \(\mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) (lower) decays. The curves are projections of the fit to the data, with the full fit function (solid) and signal (dashed) shown for both decays, the total background (dotted) shown for the upper plots, and the combinatorial background (dotted), misidentified \({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\) background (dashed-dotted), \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) contribution (dashed-dotted-dotted-dotted), and partially reconstructed and other misidentified \({\mathrm {B}}\) backgrounds (dashed-dotted-dotted) shown for the lower plots. the lower panels of the figures on the right show the difference between the observed data and the fit divided by the data uncertainty. The vertical bars on the data points represent the statistical uncertainties

The signal \(ct\) distribution \(T_s\) is modelled by an exponential function convolved with the detector resolution and then multiplied by the function describing the reconstruction and selection efficiency. The resolution is described by a Gaussian function with the per-event width taken from the \(ct\) uncertainty distribution. The backgrounds \(T_b\) are described by a superposition of exponential functions convolved with the resolution. The number of exponentials needed to describe the background is determined from data events in the mass sideband regions for each decay mode.

The signal \(E_s\) and background \(E_b\) \(\sigma _{ct}\) distributions are modelled with a sum of two gamma functions for the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel and two exponential functions convolved with a Gaussian function for the other channels. The background parameters are obtained from a fit to the mass sideband distributions. The signal parameters are obtained from a fit to the signal region after subtracting the background contribution using the mass sideband region to estimate the background. The parameters of the efficiency function and the functions modelling the \(\sigma _{ct}\) distributions are kept constant in the fit. The remaining fit parameters are allowed to vary freely.

For the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) mode, the parameters of the mass model for the \({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\) contamination are taken from the simulation, and the yield and lifetime are determined by the fit. The mass of the \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) contamination is fixed to the weighted average of the masses measured from our two \({\mathrm {B}^0}\) decay modes, and the width of the Gaussian function is the same as the width used for the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) signal, corrected by a factor of \(M_{{\mathrm {B}^0}}/M_{\mathrm {B}^0_\mathrm {s}}\). The lifetime of this contamination is fixed to the world-average value, corrected by the same factor as the width, and the yield is a free parameter of the fit.

5.3 Fit results

The invariant mass and \(ct\) distributions obtained from data are shown with the fit results superimposed in Figs. 2, 3 and 4. The \(ct\) distributions are fitted in the range 0.02–0.50\(\,\text {cm}\) for all modes except the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel, where the upper limit is increased to 0.60\(\,\text {cm}\). The average lifetimes times the speed of light obtained from the fits are: \(c\tau _{{\mathrm {B}^{+}}} = 490.9 \pm 0.8\,\upmu \text {m} \), \(c\tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}} = 453.0 \pm 1.6\,\upmu \text {m} \), \(c\tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^0_S}} = 457.8 \pm 2.7\,\upmu \text {m} \), \(c\tau _{\mathrm {B}^0_\mathrm {s} \rightarrow \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}} = 502.7 \pm 10.2\,\upmu \text {m} \), \(c\tau _{\mathrm {B}^0_\mathrm {s} \rightarrow \mathrm {J}/\psi \mathrm {\phi (1020)}} = 445.2 \pm 2.0\,\upmu \text {m} \), and \(c\tau _{\varLambda _\mathrm {b}^0} = 442.9 \pm 8.2\,\upmu \text {m} \), where all uncertainties are statistical only. The \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) value given here is uncorrected for two offsets described in Sect. 7. There is good agreement between the fitted functions and the data. The probabilities calculated from the \(\chi ^2\) of the \(ct \) distributions in Figs. 2, 3 and 4 all exceed 25%.

6 Measurement of the \(\mathrm {B}_\mathrm {c}^+\) lifetime

The decay time distribution for the signal \(N_\mathrm {B}(ct)\) can be expressed as the product of an efficiency function \(\varepsilon _\mathrm {B}(ct)\) and an exponential decay function \(E_\mathrm {B}(ct) =\exp (-ct/c\tau _\mathrm {B})\), convolved with the time resolution function of the detector r(ct). The ratio of \(\mathrm {B}_\mathrm {c}^+\) to \({\mathrm {B}^{+}}\) events at a given proper time can be expressed as

$$\begin{aligned} \frac{N_{\mathrm {B}_\mathrm {c}^+}(ct)}{N_{{\mathrm {B}^{+}}}(ct)} \equiv R(ct) = \frac{\varepsilon _{\mathrm {B}_\mathrm {c}^+}(ct) [r(ct)\otimes E_{\mathrm {B}_\mathrm {c}^+}(ct)]}{\varepsilon _{{\mathrm {B}^{+}}}(ct) [r(ct)\otimes E_{{\mathrm {B}^{+}}}(ct)]}. \end{aligned}$$
(8)

We have verified through studies of simulated pseudo-events that Eq. (8) is not significantly affected by the time resolution, and therefore this equation can be simplified to

$$\begin{aligned} R(ct) \approx R_{\varepsilon }(ct) \exp (-\varDelta \varGamma t), \end{aligned}$$
(9)

where the small effect from the time resolution is evaluated from MC simulations and is included in \(R_{\varepsilon }(ct)\), which denotes the ratio of the \(\mathrm {B}_\mathrm {c}^+\) and \({\mathrm {B}^{+}}\) efficiency functions. The quantity \(\varDelta \varGamma \) is defined as

$$\begin{aligned} \varDelta \varGamma \equiv \varGamma _{\mathrm {B}_\mathrm {c}^+} - \varGamma _{{\mathrm {B}^{+}}} = \frac{1}{\tau _{\mathrm {B}_\mathrm {c}^+}} - \frac{1}{\tau _{{\mathrm {B}^{+}}}}. \end{aligned}$$
(10)

The \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) and \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) invariant mass distributions, shown in Fig. 5, are each fit with an unbinned maximum-likelihood estimator. The \(\mathrm {J}/\psi \mathrm {\pi ^+}\) invariant mass distribution is fitted with a Gaussian function for the \(\mathrm {B}_\mathrm {c}^+\) signal and an exponential function for the background. An additional background contribution from \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) decays is modelled from a simulated sample of \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) events, and its contribution is constrained using the value of the branching fraction relative to \(\mathrm {J}/\psi \mathrm {\pi ^+}\)  [27]. The \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) signal yield is \(1128 \pm 60\) events, where the uncertainty is statistical only. The \({\mathrm {B}^{+}}\) meson invariant mass distribution is fit with a sum of two Gaussian functions with a common mean for the signal and a second-order Chebyshev polynomial for the background. Additional contributions from partially reconstructed \({\mathrm {B}^0}\) and \({\mathrm {B}^{+}}\) meson decays are parametrized with functions determined from \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \pi ^+\) and inclusive \({\mathrm {B}^0}\!\rightarrow \!\mathrm {J}/\psi \mathrm {X}\) MC samples.

Fig. 5
figure 5

The \(\mathrm {J}/\psi \mathrm {\pi ^+}\) invariant mass distribution (left) with the solid line representing the total fit, the dashed line the signal component, the dotted line the combinatorial background, and the dashed-dotted line the contribution from \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) decays. The \(\mathrm {J}/\psi \mathrm {K^+}\) invariant mass distribution (right) with the solid line representing the total fit, the dashed line the signal component, the dotted-dashed curves the \({\mathrm {B}^{+}}\!\rightarrow \! {\mathrm {J}/\psi } \mathrm {\pi ^+}\) and \({\mathrm {B}^0}\) contributions, and the dotted curve the combinatorial background. The vertical bars on the data points represent the statistical uncertainties

6.1 The fit model and results

The \(\mathrm {B}_\mathrm {c}^+\) lifetime is extracted through a binned \(\chi ^2\) fit to the ratio of the efficiency-corrected \(ct\) distributions of the \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) and \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) channels. The \(\mathrm {B}_\mathrm {c}^+\) and \({\mathrm {B}^{+}}\) \(ct\) signal distributions from data are obtained by dividing the data sample into \(ct\) bins and performing an unbinned maximum-likelihood fit to the \(\mathrm {J}/\psi \mathrm {\pi ^+}\) and \(\mathrm {J}/\psi \mathrm {K^+}\) invariant mass distribution in each bin, in the same manner as the fit to the full samples, except that the peak position and resolution are fixed to the values obtained by the fits to the full samples. Varied \(ct\) bin widths are used to ensure a similar statistical uncertainty in the \(\mathrm {B}_\mathrm {c}^+\) signal yield among the bins. The bin edges are defined by requiring a relative statistical uncertainty of 12% or better in each bin. The same binning is used for the \({\mathrm {B}^{+}}\) \(ct\) distribution. The \(\mathrm {B}_\mathrm {c}^+\) and \({\mathrm {B}^{+}}\) meson yields are shown versus \(ct\) in the left plot of Fig. 6, where the number of signal events is normalized by the bin width. Efficiencies are obtained from the MC samples and are defined as the \(ct\) distribution of the selected events after reconstruction divided by the \(ct\) distribution obtained from an exponential decay with the lifetime set to the same value used to generate each MC sample. The ratio of the two efficiency distributions, using the same binning scheme as for the data, is shown in the right plot of Fig. 6.

Fig. 6
figure 6

Yields of \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) and \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) events (left) as a function of \(ct\), normalized to the bin width, as determined from fits to the invariant mass distributions. Ratio of the \(\mathrm {B}_\mathrm {c}^+\) and \({\mathrm {B}^{+}}\) efficiency distributions (right) as a function of \(ct\), as determined from simulated events. The vertical bars on the data points represent the statistical uncertainties, and the horizontal bars show the bin widths

The ratio of the \(\mathrm {B}_\mathrm {c}^+\) to \({\mathrm {B}^{+}}\) efficiency-corrected \(ct\) distributions, \(R/R_{\varepsilon }\), is shown in Fig. 7, along with the result of a fit to an exponential function. The fit returns \(\varDelta \varGamma = 1.24 \pm 0.09\) ps\(^{-1}\). Using the known lifetime of the \({\mathrm {B}^{+}}\) meson, \(c\tau _{{\mathrm {B}^{+}}}\) = 491.1 ± 1.2 \(\,\upmu \text {m}\)  [5], a measurement of the \(\mathrm {B}_\mathrm {c}^+\) meson lifetime, \(c\tau _{\mathrm {B}_\mathrm {c}^+}\) = 162.3 ± 7.8 \(\,\upmu \text {m}\), is extracted, where the uncertainty is statistical only.

Fig. 7
figure 7

Ratio of the \(\mathrm {B}_\mathrm {c}^+\) to \({\mathrm {B}^{+}}\) efficiency-corrected \(ct\) distributions, \(R/R_{\varepsilon }\), with a line showing the result of the fit to an exponential function. The vertical bars give the statistical uncertainty in the data, and the horizontal bars show the bin widths

Table 1 Summary of the sources and values of systematic uncertainties in the lifetime measurements (in \(\,\upmu \text {m}\)). The total systematic uncertainty is the sum in quadrature of the individual uncertainties

7 Systematic uncertainties

The systematic uncertainties can be divided into uncertainties common to all the measurements, and uncertainties specific to a decay channel. Table 1 summarizes the systematic uncertainties for the sources considered below and the total systematic uncertainty in the \(\mathrm {B}^0_\mathrm {s} \), \({\mathrm {B}^0}\), and \(\varLambda _\mathrm {b}^0\) lifetime measurements. The systematic uncertainties in \(\varDelta \varGamma \) and the \(\mathrm {B}_\mathrm {c}^+\) meson lifetime are collected in Table 2. Using the known lifetime of the \({\mathrm {B}^{+}}\) meson, the uncertainties in \(\varDelta \varGamma \) are converted into uncertainties in the \(\mathrm {B}_\mathrm {c}^+\) meson lifetime measurement. The uncertainty in the \(\mathrm {B}_\mathrm {c}^+\) meson lifetime due to the uncertainty in the \({\mathrm {B}^{+}}\) meson lifetime [5] is quoted separately.

We have verified that the results are stable against changes in the selection requirements on the quality of the tracks and vertices, the kinematic variables, and \(ct\), as well as in detector regions and data-taking periods. The effect of replacing the mass of the b hadron in the \(ct\) definition of Eq. (1) from the world-average to the reconstructed candidate mass is found to be negligible. The lifetimes for all decay channels were measured by treating MC samples as data. No bias was found and all results were consistent with the input lifetimes of the generated samples.

Table 2 Summary of the systematic uncertainties in the \(\varDelta \varGamma \) and \(c\tau _{\mathrm {B}_\mathrm {c}^+}\) measurements

7.1 Common systematic uncertainties

  1. 1.

    Statistical uncertainty in the MC samples

    The number of events in the simulation directly affects the accuracy of the efficiency determination. In the case of the \(\mathrm {B}^0_\mathrm {s} \), \({\mathrm {B}^0}\), and \(\varLambda _\mathrm {b}^0\) lifetime measurements, 1000 efficiency curves are generated with variations of the parameter values. The parameter values are sampled using a multivariate Gaussian PDF that is constructed from the covariance matrix of the efficiency fit. The analysis is performed 1000 times, varying the parameters of the efficiency function. The distribution of the measured lifetimes is fitted with a Gaussian function, whose width is taken as the systematic uncertainty associated with the finite size of the simulated samples. In the measurement of the \(\mathrm {B}_\mathrm {c}^+\) lifetime, the bin-by-bin statistical uncertainty in the efficiency determination is propagated to the R(ct) distribution, the fit is performed, and the difference in quadrature of the uncertainty in \(\varDelta \varGamma \) with respect to the nominal value is taken as the systematic uncertainty.

  2. 2.

    Modelling of the mass distribution shape

    Biases related to the modelling of the shapes of the b hadron mass signal and background PDFs are quantified by changing the signal and background PDFs individually and using the new models to fit the data. For the \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s}\), and \(\mathrm {\Lambda }_\mathrm {b}^0\) lifetime measurements, the background model is changed to a higher-degree polynomial, a Chebyshev polynomial, or an exponential function, and the signal model is changed from two Gaussian functions to a single Gaussian function or a sum of three Gaussian functions. Differences in the measured lifetime between the results of the nominal and alternative models are used to estimate the systematic uncertainty, with the variations due to the modelling of signal and background components evaluated separately and added in quadrature. For the \(\mathrm {B}_\mathrm {c}^+\) lifetime measurement, the signal peak is alternatively modelled with a Crystal Ball distribution [28]. The alternative description for the background is a first-order Chebyshev distribution. The removal of the Cabibbo-suppressed \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) contribution is also considered. The maximum deviation of the signal yield in each \(ct\) bin from the nominal value is propagated to the statistical uncertainty in the per-bin yield. The fit to R(ct) is performed and the difference in quadrature between the uncertainty from this fit and the nominal measurement is taken as the systematic uncertainty.

7.2 Channel-specific systematic uncertainties

  1. 1.

    Modelling of the background \(ct\) shape in the \(\mathrm {B}^0_\mathrm {s} \), \({\mathrm {B}^0}\), and \(\varLambda _\mathrm {b}^0\) channels

    To estimate a systematic uncertainty due to the \(ct\) background model, we add an additional background contribution modelled with its own lifetime, and compare the result to that obtained with the nominal fit model. The difference between the results of the nominal and alternative fit models is used as the systematic uncertainty from the \(ct\) shape modelling.

  2. 2.

    The \({\mathrm {B}^{+}}\) contamination in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) sample

    In the nominal fit, the yield and lifetime of the \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) contamination are determined from the fit with the mass shape obtained from simulation. An alternative estimate of the \(\mathrm {J}/\psi \mathrm {K^+}\) contamination is obtained from data by taking the leading pion of the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) decay to be the kaon. The lifetime and yield of the \({\mathrm {B}^{+}}\!\rightarrow \!\mathrm {J}/\psi \mathrm {K^+}\) decays contaminating the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) sample are determined from a fit of the \({\mathrm {B}^{+}}\) signal candidates in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) sample, with the mass shape also obtained from the data. The difference between the \(\mathrm {B}^0_\mathrm {s} \) lifetime found with this model and the nominal model is considered as the systematic uncertainty due to \({\mathrm {B}^{+}}\) contamination.

  3. 3.

    Invariant mass window of the \(\pi ^+ \pi ^-\) in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) channel

    Although the events selected by the \(\pi ^+\pi ^-\) mass window are dominated by the \(f_0(980)\), its width is not well known and possible backgrounds under the \(f_0(980)\) peak could be increased or decreased, depending on the mass window. The effect on the lifetime is studied by using mass windows of \(\pm 30\) and \(\pm 80\) \(\,\text {MeV}\) around the signal peak, compared to the nominal fit result with a ±50\(\,\text {MeV}\) window. The maximum variation of the lifetime is taken as the systematic uncertainty.

  4. 4.

    The \(\mathrm {K^+}\mathrm {\pi ^-}\) mass assignments for \(\mathrm {K^{*}(892)}{}^{0}\) candidates in the \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\) channel

    The \(\mathrm {K^{*}(892)}{}^{0}\) candidates are constructed from a pair of tracks with kaon and pion mass assignments. The combination with invariant mass closest to the world-average \(\mathrm {K^{*}(892)}{}^{0}\) mass is chosen to reconstruct the \({\mathrm {B}^0}\) candidate. To estimate the effect on the lifetime due to a possible misassignment of kaon and pion, both combinations are discarded if both are within the natural width of the \(\mathrm {K^{*}(892)}{}^{0}\) mass, and the difference between the lifetime obtained with this sample and the nominal sample is taken as the systematic uncertainty.

  5. 5.

    The \(ct\) range in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel

    Since the \(ct>0.02\) cm requirement distorts the fractions of heavy and light mass eigenstates, the measured \(\mathrm {B}^0_\mathrm {s} \) effective lifetime must be corrected. The correction and systematic uncertainty are quantified analytically. The correction to the effective lifetime is

    $$\begin{aligned} \delta _{ct}= & {} c\tau _{\text {eff}}^{\mathrm {cut}} - c\tau _{\text {eff}} \nonumber \\= & {} \frac{ (1-|A_\perp |^2) (c\tau _{\mathrm {L}})^2 \mathrm {e}^{-a/c\tau _\mathrm {L}} + |A_\perp |^2 (c\tau _{\mathrm {H}})^2 \mathrm {e}^{-a/c\tau _\mathrm {H}} }{(1-|A_\perp |^2) c\tau _{\mathrm {L}} \mathrm {e}^{-a/c\tau _\mathrm {L}} + |A_\perp |^2 c\tau _{\mathrm {H}} \mathrm {e}^{-a/c\tau _\mathrm {H}} } \nonumber \\&- \frac{ (1-|A_\perp |^2) (c\tau _{\mathrm {L}})^2 + |A_\perp |^2 (c\tau _{\mathrm {H}})^2 }{ (1-|A_\perp |^2) c\tau _{\mathrm {L}}+ |A_\perp |^2 c\tau _{\mathrm {H}} } , \end{aligned}$$
    (11)

    where the first term represents the effective lifetime in the presence of a \(ct>a\) requirement and the latter term is the unbiased effective lifetime. In this analysis, a is equal to 0.02\(\,\text {cm}\). The world-average values [2] for \(c\tau _\mathrm {H}=482.7 \pm 3.6\,\upmu \text {m} \), \(c\tau _\mathrm {L}= 426.3 \pm 2.4\,\upmu \text {m} \), and \({|A_\perp |^2 = 0.250 \pm 0.006}\) are used to obtain the correction \({\delta _{ct} = 0.62 \pm 0.10\,\upmu \text {m}}\).

  6. 6.

    The S-wave contamination in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) channel

    The \(\mathrm {B}^0_\mathrm {s} \) candidates reconstructed in the \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) final state contain a small fraction of nonresonant and CP-odd \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\mathrm {K^-}\) decays, where the invariant mass of the two kaons happens to be near the \(\phi \) meson mass. The fraction of \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\mathrm {K^-}\) decays among the selected events is measured in the weak mixing phase analysis [9] to be \(f_\mathrm {S} = (1.2^{+0.9}_{-0.7})\)%. Because of the different trigger and signal selection criteria of the present analysis, the S-wave fraction is corrected according to the simulation to be \((1.5^{+1.1}_{-0.9})\)%. The bias caused by the contamination of nonresonant \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\mathrm {K^-}\) decays is estimated by generating two sets of pseudo-experiments, one with just \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\) events and one with a fraction of S-wave events based on the measured S-wave fraction and its uncertainty. The difference in the average of the measured lifetimes of these two samples is 0.74\(\,\upmu \text {m}\), which is used to correct the measured lifetime. The systematic uncertainty associated with this correction is obtained by taking the difference in quadrature between the standard deviation of the distribution of lifetime results from the pseudo-experiments with and without the S-wave contribution.

  7. 7.

    PV selection in the \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) channel

    From the multiple reconstructed PVs in an event, one is selected to compute the \(ct\) value of the candidate. Two alternative methods to select the PV position are studied: using the centre of the beamspot and selecting the PV with the largest sum of track \(p_{\mathrm {T}}\). While all three methods are found to be effective and unbiased, there were small differences, and the maximum deviation with respect to the nominal choice is taken as the systematic uncertainty. The \({\mathrm {B}^{+}}\) and \(\mathrm {B}_\mathrm {c}^+\) primary vertex choices were changed coherently.

  8. 8.

    Detector alignment in the \(\mathrm {B}_\mathrm {c}^+ \!\rightarrow \!\mathrm {J}/\psi \mathrm {\pi ^+}\) channel

    Possible effects on the lifetime due to uncertainties in the detector alignment [29] are investigated for each decay topology using 20 different simulated samples with distorted geometries. These distortions include expansions in the radial and longitudinal dimensions, rotations, twists, offsets, etc. The amount of misalignment is chosen such that it is large enough to be detected and corrected by the alignment procedure. The standard deviation of the lifetimes for the tested scenarios is taken as the systematic uncertainty from this source. The \({\mathrm {B}^{+}}\) and \(\mathrm {B}_\mathrm {c}^+\) geometries were changed coherently.

  9. 9.

    Absolute ct accuracy in the \(\mathrm {B}^0_\mathrm {s} \), \({\mathrm {B}^0}\), and \(\varLambda _\mathrm {b}^0\) lifetime measurements

    The lifetime of the most statistically precise mode (\({\mathrm {B}^{+}}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^+}\)) is used to validate the accuracy of the simulation and various detector calibrations. The difference between our measurement of \(490.9 \pm 0.8\,\upmu \text {m} \) (statistical uncertainty only) and the world-average value of \(491.1 \pm 1.2\,\upmu \text {m} \) [5] is \(0.2 \pm 1.4\,\upmu \text {m} \). This implies a limit to the validation of \(1.4/491 = 0.3\%\). Four systematic effects that we expect to be included were checked independently. The systematic uncertainties from PV selection and detector alignment were found to be 0.7\(\,\upmu \text {m}\) and 0.3–0.7\(\,\upmu \text {m}\), respectively. Varying the efficiency functional form changed the lifetimes by 0.3–0.6\(\,\upmu \text {m}\), while varying \(\sigma _{ct}\) by factors of 0.5 and 2.0 resulted in lifetime differences of no more than \(0.2\,\upmu \text {m} \). As the sum in quadrature of these uncertainties is less than that obtained from the \({\mathrm {B}^{+}}\) lifetime comparison, we assign a value of 0.3% as the systematic uncertainty for the absolute ct accuracy.

8 Lifetime measurement results

Our final results for the \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s} \), and \(\varLambda _\mathrm {b}^0\) hadron lifetimes are:

$$\begin{aligned}&c\tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}} = 453.0 \pm 1.6\,\text {(stat)} \pm 1.8\,\text {(syst)} \,\upmu \text {m}, \end{aligned}$$
(12)
$$\begin{aligned}&c\tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^0_S}} = 457.8 \pm 2.7\,\text {(stat)} \pm 2.8\,\text {(syst)} \,\upmu \text {m},\end{aligned}$$
(13)
$$\begin{aligned}&c\tau _{\mathrm {B}^0_\mathrm {s} \rightarrow \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}} = 502.7 \pm 10.2\,\text {(stat)} \pm 3.4\,\text {(syst)} \,\upmu \text {m}, \end{aligned}$$
(14)
$$\begin{aligned}&c\tau _{\mathrm {B}^0_\mathrm {s} \rightarrow \mathrm {J}/\psi \mathrm {\phi (1020)}} = 443.9 \pm 2.0\,\text {(stat)} \pm 1.5\,\text {(syst)} \,\upmu \text {m},\end{aligned}$$
(15)
$$\begin{aligned}&c\tau _{\varLambda _\mathrm {b}^0{}} = 442.9 \pm 8.2\,\text {(stat)} \pm 2.8\,\text {(syst)} \,\upmu \text {m}. \end{aligned}$$
(16)

The value of the \(\mathrm {B}^0_\mathrm {s} \) lifetime using the \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) decay has been corrected for the \(ct\) range and S-wave contamination effects described in Sect. 7. The lifetime ratios \(\tau _{\mathrm {B}^0_\mathrm {s}}/\tau _{{\mathrm {B}^0}}\) and \(\tau _{\varLambda _\mathrm {b}^0}/\tau _{{\mathrm {B}^0}}\) have been determined using the decay channels \({\mathrm {B}^0}\!\rightarrow \! \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\), and \(\varLambda _\mathrm {b}^0\!\rightarrow \!\mathrm {J}/\psi \mathrm {\Lambda }^{0}\). Including the statistical and correlated and uncorrelated systematic uncertainties, the results are:

$$\begin{aligned}&\tau _{\varLambda _\mathrm {b}^0}/\tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}} \nonumber \\&\quad = 0.978 \pm 0.018\,\text {(stat)} \pm 0.006\,\text {(syst)}, \end{aligned}$$
(17)
$$\begin{aligned}&\tau _{\varLambda _\mathrm {b}^0}/\tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}} \nonumber \\&\quad = 0.978 \pm 0.018\,\text {(stat)} \pm 0.006\,\text {(syst)}, \end{aligned}$$
(18)

These ratios are compatible with the current world-average values.

The measured lifetimes for the \({\mathrm {B}^0}\) meson in the two different channels are in agreement. Combining the two results, including the statistical and the correlated and uncorrelated systematic uncertainties, gives \(c\tau _{{\mathrm {B}^0}} = 454.1 \pm 1.4\,\text {(stat)} \pm 1.7\,\text {(syst)} \,\upmu \text {m} \). The lifetime measurements can also be used to estimate \(\varGamma _\mathrm {d}\) and \(\varDelta \varGamma _\mathrm {d}\) [6]. In the standard model, the effective lifetimes of the two \({\mathrm {B}^0}\) decay modes can be written as:

$$\begin{aligned} \tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}}&= \frac{1}{\varGamma _\mathrm {d}} \left( \frac{1}{1-y_\mathrm {d}^2}\right) \left( \frac{1+2\cos {(2\beta )}y_\mathrm {d}+y_\mathrm {d}^2}{1+\cos {(2\beta )}y_\mathrm {d}}\right) , \end{aligned}$$
(19)
$$\begin{aligned} \tau _{{\mathrm {B}^0}\rightarrow \mathrm {J}/\psi \mathrm {K^0_S}}&= \frac{1}{\varGamma _\mathrm {d}} \left( \frac{1+y_\mathrm {d}^2}{1-y_\mathrm {d}^2}\right) , \end{aligned}$$
(20)

where \(y_\mathrm {d}=\varDelta \varGamma _\mathrm {d}/2\varGamma _\mathrm {d}\), and \(\beta =(21.9\pm 0.7)^\circ \) [5] is one of the CKM unitarity triangle angles. Using our measured values for the two \({\mathrm {B}^0}\) lifetimes, we fit for \(\varGamma _\mathrm {d}\) and \(\varDelta \varGamma _\mathrm {d}\) and use the values to determine \(\varDelta \varGamma _\mathrm {d}/\varGamma _\mathrm {d}\). The results are:

$$\begin{aligned} \varGamma _\mathrm {d}&= 0.662 \pm 0.003\,\text {(stat)} \pm 0.003\,\text {(syst)} \,\mathrm {ps}^{-1},\end{aligned}$$
(21)
$$\begin{aligned} \varDelta \varGamma _\mathrm {d}&= 0.023 \pm 0.015\,\text {(stat)} \pm 0.016\,\text {(syst)} \,\mathrm {ps}^{-1}, \end{aligned}$$
(22)
$$\begin{aligned} \varDelta \varGamma _\mathrm {d}/\varGamma _\mathrm {d}&= 0.034 \pm 0.023\,\text {(stat)} \pm 0.024\,\text {(syst)}. \end{aligned}$$
(23)

Neglecting CP violation in mixing, the measured \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) lifetime can be translated into the width of the heavy \(\mathrm {B}^0_\mathrm {s} \) mass eigenstate:

$$\begin{aligned} \varGamma _\mathrm {H} = 1/\tau _{\mathrm {B}^0_\mathrm {s}} = 0.596 \pm 0.012\,\text {(stat)} \pm 0.004\,\text {(syst)} \,\mathrm {ps}^{-1}. \end{aligned}$$
(24)

Solving for \(c\tau _{\mathrm {L}}\) from Eq. (4) gives

$$\begin{aligned} c\tau _\mathrm {L} = \frac{1}{2} c\tau _{\text {eff}} + \sqrt{ \frac{1}{4} (c\tau _{\text {eff}})^2 - \frac{|A_\perp |^2}{1-|A_\perp |^2} c\tau _\mathrm {H}( c\tau _\mathrm {H} - c\tau _{\text {eff}})}. \end{aligned}$$
(25)

Using the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) result in Eq. (14), the measured \(\mathrm {B}^0_\mathrm {s} \) effective lifetime in Eq. (15), and the world-average value of the magnitude squared of the CP-odd amplitude \(|A_\perp |^2 = 0.250 \pm 0.006\) [2], the lifetime of the light component is found to be \(c\tau _{\mathrm {L}} = 420.4 \pm 6.2\,\upmu \text {m} \). The uncertainty includes all statistical and systematic uncertainties, taking into account the correlated uncertainties. The result is consistent with the world-average value of \(423.6 \pm 1.8\,\upmu \text {m} \) [5].

Our measured lifetimes for \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\phi (1020)}\), and \(\varLambda _\mathrm {b}^0\) are compatible with the current world-average values [5] of \(455.7 \pm 1.2\), \(443.4 \pm 3.6\), and \(440.7 \pm 3.0\) \(\,\upmu \text {m}\), respectively. In addition, our measurement of the \(\mathrm {B}^0_\mathrm {s} \) lifetime in the \(\mathrm {B}^0_\mathrm {s} \!\rightarrow \! \mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\) channel is in agreement with the results from CDF, LHCb, and D0: \(510\,^{+36}_{-33}\,\text {(stat)} \pm 9\,\text {(syst)} \) \(\,\upmu \text {m}\)  [30], \(495.3 \pm 7.2\,\text {(stat)} \pm 7.2\,\text {(syst)} \) \(\,\upmu \text {m}\)  [31], and \(508 \pm 42\,\text {(stat)} \pm 16\,\text {(syst)} \) \(\,\upmu \text {m}\)  [32], respectively.

Our final result for the \(\mathrm {B}_\mathrm {c}^+\) lifetime using the \(\mathrm {J}/\psi \mathrm {\pi ^+}\) mode is:

$$\begin{aligned} c\tau _{\mathrm {B}_\mathrm {c}^+} = 162.3 \pm 7.8\,\text {(stat)} \pm 4.2\,\text {(syst)} \pm 0.1 (\tau _{{\mathrm {B}^{+}}})\,\upmu \text {m},\nonumber \\ \end{aligned}$$
(26)

where the systematic uncertainty from the \({\mathrm {B}^{+}}\) lifetime uncertainty [5] is quoted separately in the result. This measurement is in agreement with the world-average value \((152.0 \pm 2.7\,\upmu \text {m})\) [5]. Precise measurements of the \(\mathrm {B}_\mathrm {c}^+\) lifetime allow tests of various theoretical models, which predict values ranging from 90 to 210\(\,\upmu \text {m}\)  [33,34,35,36]. Furthermore, they provide new constraints on possible physics beyond the standard model from the observed anomalies in \({\mathrm {B}}\rightarrow \mathrm {D}^{(*)}\tau \nu \) decays [37].

9 Summary

The lifetimes of the \({\mathrm {B}^0}\), \(\mathrm {B}^0_\mathrm {s}\), \(\varLambda _\mathrm {b}^0\), and \(\mathrm {B}_\mathrm {c}^+\) hadrons have been measured using fully reconstructed decays with a \(\mathrm {J}/\psi \) meson. The data were collected by the CMS detector in proton–proton collision events at a centre-of-mass energy of 8\(\,\text {Te}\text {V}\), and correspond to an integrated luminosity of 19.7\(\,\text {fb}^\text {-1}\). The \({\mathrm {B}^0}\) and \(\mathrm {B}^0_\mathrm {s}\) meson lifetimes have each been measured in two channels: \(\mathrm {J}/\psi \mathrm {K^{*}(892)}{}^{0}\), \( \mathrm {J}/\psi \mathrm {K^0_S}\) for \({\mathrm {B}^0}\) and\(\mathrm {J}/\psi \mathrm {\pi ^+}\mathrm {\pi ^-}\), \(\mathrm {J}/\psi \mathrm {\phi (1020)}\) for \(\mathrm {B}^0_\mathrm {s}\). The precision from each channel is as good as or better than previous measurements in the respective channel. The \({\mathrm {B}^0}\) lifetime results are used to obtain an average lifetime and to measure the decay width difference between the two mass eigenstates. The \(\mathrm {B}^0_\mathrm {s}\) lifetime results are used to obtain the lifetimes of the heavy and light \(\mathrm {B}^0_\mathrm {s} \) mass eigenstates. The precision of the \(\varLambda _\mathrm {b}^0\) lifetime measurement is also as good as any previous measurement in the \(\mathrm {J}/\psi \mathrm {\Lambda }^{0}\) channel. The measured \(\mathrm {B}_\mathrm {c}^+\) meson lifetime is in agreement with the results from LHCb and significantly more precise than the CDF and D0 measurements. All measured lifetimes are compatible with the current world-average values.