1 Introduction

Lepton Flavour Violation (LFV) [13], meaning local interactions that change the flavour of charged leptons, should occur because neutrinos have mass and mix. This motivates sensitive searches for processes such as \(\mu \rightarrow e \gamma \) [4] and \(\mu \mathrm{-}e ~\mathrm{conversion}\) [5, 6]. However, the mechanism responsible for neutrino masses is unknown, so it is interesting to parametrise LFV with contact interactions, and to look for it everywhere. In this context, the LHC could have the best sensitivity to LFV processes involving a heavy leg, such as the Z [731], the Higgs [3162], or a top [63, 64]. In this paper, we study the LFV top decays \(t\rightarrow q e^\pm \mu ^\mp \), where \(q \in \{u,c\}\).

We suppose that these decays are mediated by a 4-fermion interaction, and outline in Sect. 2 the current bounds on LFV branching ratios of the top. The bounds arise from rare decays and HERA’s single-top search, and they are discussed in more detail in the appendices. We find that, while these bounds place strong constraints on some specific Lorentz structures for the 4-fermion interactions, they still allow for \(t \rightarrow q e^\pm \mu ^\mp \) decays with rates within the LHC reach. In Sect. 3, we estimate the LHC sensitivity to \(t\rightarrow q e^\pm \mu ^\mp \), with 20 fb\(^{-1}\) of LHC data at 8 TeV. This estimate relies on simulations of the background and signal and is inspired by the CMS search for \(t\rightarrow Z q\) [65]. The extrapolation to higher energies and luminosities is discussed in Sect. 4.

Quark-flavour-changing top decays, such as \(t\rightarrow cZ\) and \(t\rightarrow h c\), have been studied in the context of explicit models [62, 6679] or described by contact interaction parametrisations [8087], and they have been searched for at the LHC [65, 8890]. Quark-flavour-changing (but lepton-flavour-conserving) three-body decays of the top, \(t \rightarrow c f\bar{f}\), where f is a lepton or quark, have also been calculated in explicit models [9194]. Top interactions that change quark and lepton flavour, and, in addition, baryon and lepton number, have been explored in [95, 96] and searched for by CMS [97]. In models with weak-scale neutrinos N [98], there can be lepton number- and flavour-changing W decays: \(W^- \rightarrow N \ell \rightarrow q \bar{q}' \ell ' \ell \), which could appear in the final state of top decays. In the presence of this decay, \(t\bar{t}\) production could give a final state with 3 leptons, missing energy and jets, as in the decay we study (see Fig. 2). However, a different combination of leptons and jets should reconstruct to the top mass. Finally, Fernandez et al. [63] studied almost the same process as us, \(t \rightarrow q\tau ^\pm \mu ^\mp \), but mediated by a (pseudo)-scalar boson. They obtained separately the low-energy bounds on the quark- and lepton-flavour-changing couplings of their boson, and they show that LFV top branching ratios can be \({\sim } 10^{-5}\) if the boson mass is \({\lesssim } 2 m_{W}\). Heng et al. [64] calculated LFV top decay rates in R-parity non-conserving supersymmetry, finding rates comparable to our leptoquark estimates.

2 Current bounds

We are interested in the decays of a top (or anti-top) to a jet and a pair of oppositely charged leptons of different flavour. In this work, we focus on the processes \(t \rightarrow q e^{\pm } \mu ^{\mp }\), where \(q\in \{u,c\}\), because the e and \({\mu }\) are easy to identify at the LHC, and \(e \leftrightarrow \mu \) flavour violation is the most strictly constrained at low energy. We leave the decays to \(q e^{\pm } \tau ^{\mp }\) and \(q \mu ^{\pm } \tau ^{\mp }\) for a later analysis.

We suppose that these decays are mediated by 4-fermion contact interactions. A complete list of the \(SU(3) \times U(1)\) invariant operators that we study is given in Appendix A. We do not impose SU(2) on our operators, because the scale we will probe is not far from the electroweak scale. We refer to these LFV operators as “top operators”. Here, as an example, consider the exchange of a heavy SU(2) singlet leptoquark \(S_0\) with couplings \(\lambda _{et} S_{0} \overline{e_{R}} t^{c}\) and \(\lambda _{\mu {c}} S_{0} \overline{\mu _{R}} c^{c}\), which (after Fierz rearrangement) generates the dimension-six contact interaction

$$\begin{aligned}&\frac{\lambda _{et}^*\lambda _{\mu {c}}}{2 m^2_{LQ}} (\overline{\mu }\gamma ^{\alpha } P_{R e})(\overline{c}\gamma ^{\alpha } P_{R t})\nonumber \\&\quad \equiv -\epsilon ^{\mu e ct}_{RR} \frac{4G_F}{\sqrt{2}} (\overline{\mu }\gamma ^\alpha P_{R e})(\overline{c}\gamma ^{\alpha } P_{R t})\nonumber \\&\quad = -\epsilon ^{\mu e ct}_{RR} \frac{4G_{F}}{\sqrt{2}} \mathcal{O}^{RR}_{\mu e ct}. \end{aligned}$$
(1)

Alternatively, we could define the operator coefficient as \(-1/\Lambda ^2\), in which case \(\epsilon \simeq m_t^2/\Lambda ^2\) because \(2 \sqrt{2}G_{F} \simeq m_{t}^{-2}\) (we take \(m_{t} = 173.3\) GeV). We will quote low-energy bounds on such interactions as limits on the dimensionless \(\epsilon \)s. In the case of our leptoquark example, \(m_{LQ}\gtrsim 1~\hbox {TeV}\) to satisfy current bounds on second generation leptoquarks from the LHC [99101], and thus, for \(\lambda \lesssim 1\), one expects \(\epsilon ^{\mu e ct}_{RR} \lesssim 0.02\).

Notice that \(t \rightarrow q e^{-} \mu ^{+}\) and \(t \rightarrow q e^{+} \mu ^{-}\) are mediated by different operators. Most of the bounds we quote will apply to \(|\epsilon ^{\mu e ct}|^{2} +|\epsilon ^{e \mu ct}|^{2}\), and we will study the LHC sensitivity assuming equal rates for \(t \rightarrow q e^{-} \mu ^+\) and \(t \rightarrow q e^{+} \mu ^{-}\).

2.1 Decay of the top

In the Standard Model, the top decays almost always to \(bW^+\), at a tree-level rate given by

$$\begin{aligned} \Gamma (t\rightarrow bW)= & {} \frac{g^{2}|V_{tb}|^2m_{t}^{3}}{64\pi m_{W}^{2}} \left( 1-\frac{m_W^2}{m_t^2} \right) ^2 \left( 1+2\frac{m_{W}^{2}}{m_{t}^{2}}\right) \nonumber \\\simeq & {} 1.3~\mathrm{GeV}. \end{aligned}$$
(2)

In the presence of the operator of Eq. (1), the three-body decay rate is

$$\begin{aligned} \Gamma (t\rightarrow e^{+} \mu ^{-} + c) = |\epsilon ^{\mu e ct}_{RR}|^{2} \frac{G_{F}^{2} m_{t}^{5}}{192\pi ^{3}}, \end{aligned}$$
(3)

so the branching ratio, allowing for all the operators listed in Appendix A, and neglecting fermion masses other than the top (to remove interferences), is

$$\begin{aligned}&BR (t \rightarrow \ell _{i}^{+} \ell _{j}^{-} + q)\nonumber \\&\quad \simeq \frac{1.3}{48\pi ^2}{\Big (}|\epsilon ^{ijqt}_{LL}|^2 + |\epsilon ^{ijqt}_{LR}|^2 + |\epsilon ^{ijqt}_{RL}|^2 + |\epsilon ^{ijqt}_{RR}|^2\nonumber \\&\qquad + \frac{1}{4} {\Big [}|\epsilon ^{ijqt}_{S+P,R}|^2 + |\epsilon ^{ijqt}_{LQ,R}|^2 -2 \mathrm{Re}\{\epsilon ^{ijqt}_{S+P,R} \epsilon ^{ijqt *}_{LQ,R}\} {\Big ]}\nonumber \\&\qquad + \frac{1}{4} {\Big [}|\epsilon ^{ijqt}_{S+P,L}|^2 + |\epsilon ^{ijqt}_{LQ,L}|^2 - 2\mathrm{Re}\{ \epsilon ^{ijqt}_{S+P,L} \epsilon ^{ijqt *}_{LQ,L}\} {\Big ]}\nonumber \\&\qquad + \frac{1}{4} {\Big [}|\epsilon ^{ijqt}_{S-P,L}|^2x + |\epsilon ^{ijqt}_{S-P,R}|^2 {\Big ]} {\Big )}. \end{aligned}$$
(4)

Here \(q \in \{ u,c\}\), and we approximated \(y_t |V_{tb}|\simeq 1\). This is small (\(\frac{1.3}{48\pi ^2 } \simeq 2.8 \times 10^{-3}\)), due to the three-body phase space. For the leptoquark example discussed above, \(BR(t \rightarrow c e^+ \mu ^-) \simeq 10^{-6}\) for \(\epsilon ^{\mu e ct}_{RR} = 0.02\).

The phase space distribution of the \(\ell _i^+ \ell _j^- \) and q depends on the Lorentz and spinor structure of the contact interaction and could affect the efficiency of experimental searches for this decay. The squared matrix elements for the individual contact interactions have the form \(|\mathcal{M}|^2 \propto x(1-x)\), where \(x = m_{ab}^2/m_t^2\) and \(m_{ab}^2\) is the invariant mass-squared of a pair of final state fermions a and b. Our study will not take this into account, since we found, in some explicit examples, only a small relative effect on the selection efficiency (of the order of 5 %).

2.2 Bounds from flavour physics and HERA

Low-energy constraints on 4-fermion operators involving 2 leptons and 2 quarks have been estimated and compiled for many operators taken one at a time [102104], and carefully studied for selected flavour combinations (see e.g. [105], or global fits to \(b\overline{s} \mu \overline{\mu }\) operators [106109]). However, even in the more recent compilations [103, 104], bounds on LFV operators involving a single top are not quoted. In Appendices B and C, we estimate bounds on such operators from their possible contributions, inside a loop, to rare \(\mu \), B and K decays. In Appendix D, we estimate bounds from single-top searches at HERA. Here, we summarise the resulting bounds, and we list in Tables 1 and 2 the best limits on the coefficients of the various operators. We will find that only the coefficients of some operators are stringently constrained, while others could mediate LFV top decays within the sensitivity of the LHC.

The current upper limit \(BR(\mu \rightarrow e \gamma ) < 5.7\times 10^{-13}\) [4] severely restricts \(e\leftrightarrow \mu \) flavour change. For our top operators to contribute, the quark lines must be closed, which requires at least two loops and a CKM factor, see the second diagram of Fig. 5. Nonetheless, in the case of scalar or tensor operators involving \(t_R\) this diagram can overcontribute to \(\mu \rightarrow e \gamma \) by several orders of magnitude, because the lepton chirality flip is provided by the operator (rather than \(m_\mu \)), so the diagram is enhanced by a factor \(m_t/m_\mu \). We make order-of-magnitude estimates in Appendix B, and we quote the resulting bounds in Table 2.

Table 1 Constraints on the dimensionless coefficient \(\epsilon _{XY}^{ijqt}\) of the 4-fermion operator \(\mathcal{O}^{XY}_{ijqt}=2 \sqrt{2} G_F (\bar{e}_{i} \gamma ^{\mu } P_{X} e_{j})(\bar{u}_{q}\gamma _{\mu } P_{Y} t)\) (see the operator list in Appendix A), where XY are given in the top row, and generation indices ijqt are given in the first column. The bounds are on the first line of each box, their origin on the second. They can arise from the HERA single-top search, or from the loop contribution to the operator involving lighter fermions whose coefficient is given in parentheses below the bound (see Appendix C.1 for current experimental bounds on the lighter-fermion operator coefficients). The \(\sim \) bounds are discussed in Appendix D.4. We expect that \(\epsilon < 1\), so in boldface are the bounds that impose an upper limit smaller than 1
Table 2 Constraints on the dimensionless coefficient \(\epsilon ^{ijqt}\), of the scalar and tensor 4-fermion interactions. See Appendix A for operator definitions corresponding to the subscript of \(\epsilon \). The generation indices ijqt are given in the first column. Beneath each bound is given its origin in parentheses; \(\epsilon ^{ij \alpha \beta }_{S \pm P,X}\) are the limits of Table 7, and \(\epsilon ^{i \nu q \beta }_{CC}\) are from Table 8. See the caption of Table 1 for additional details

Exchanging a W between the t and q quark legs of the top operator will generate an operator with down-type external quark legs, see the left diagram of Fig. 1. The coefficient of this light quark operator will be suppressed by a loop, CKM factors, and various masses. Numerical values for these suppression factors are given in Table 9 of the appendix; however, their approximate magnitude is simple to estimate. If the top is singlet (\(t_R\)), then the loop is finite; in the case of \(V \pm A\) interactions, this is because mass insertions are required on both internal quark lines to flip chirality. In the case of scalar operators, one internal quark mass for chirality flip is still required; then terms linear in the loop momentum vanish, so the diagram is also proportional to an external quark mass. For scalar operators involving \(t_L\) (which require an \(m_q\) insertion to connect the W to the q line), the best limit can arise from exchanging a W between the t and a charged lepton leg, which generates a charged-current operator as represented in the second diagram of Fig. 1. In the appendix are also given current bounds on the coefficients of the various light quark operators that the top operators can induce. Comparing these bounds to the induced coefficients, gives the limits of Tables 1 and 2 that are labelled with (\(\epsilon \))s.

Fig. 1
figure 1

Diagrams for generating an LFV operator with light external quark legs, by dressing a top operator with a W loop. To reduce index confusion, down-type quarks have Greek indices

HERA collided protons with positrons (or electrons) at a centre-of-mass energy of 319 GeV, and searched for single tops in the final state. The H1 collaboration had a few events with energetic isolated leptons and missing energy, consistent with \(e^\pm p\rightarrow te^\pm + X\) followed by leptonic decay of the top [110, 111]. However, this signal was not confirmed by the ZEUS experiment [112], and neither collaboration had a signal consistent with hadronic top decays. Both collaborations set bounds on \(\sigma (e^\pm p\rightarrow e^\pm tX)\); we follow H1, since they had some events and a weaker bound:

$$\begin{aligned} \sigma (e^\pm p\rightarrow e^\pm t +X) \le 0.30~\hbox {pb}= & {} \frac{2.3\times 10^{-5}}{m_t^2} \quad \hbox {at }95~\% \hbox { CL}.\nonumber \\ \end{aligned}$$
(5)

Contact interactions of the form \((\overline{e} \Gamma \mu ) (\overline{u} \Gamma t )\) and \((\overline{\mu } \Gamma e ) (\overline{u} \Gamma t )\) could, respectively, mediate \(e^-p\rightarrow \mu ^- t X\) and \(e^+p\rightarrow \mu ^+ t X\). As discussed in Appendix D, the limit of Eq. (5) translates to \(\epsilon \lesssim 0.3 \rightarrow 1\) for the various operators, as given in Tables 1 and 2.

It can be seen from Tables 1 and 2, that rare decays give very weak bounds on some contact interactions of the form \((\overline{e} \Gamma \tau ) (\overline{u} \Gamma t )\) and \((\overline{\tau } \Gamma e ) (\overline{u} \Gamma t )\). Such interactions might have contributed a signal at HERA via leptonic \(\tau \) decays, so we make some approximate estimates in Appendix D.4, and we include the bounds in the tables with a \(\sim \).

2.3 Implications

The current bounds on LFV branching ratios of the top can be obtained from Tables 1 and 2, and from and Eq. (4). In these tables, the bound on \(\epsilon ^{ijqt}_{XYZ}\) appears on the first row of each box, where the flavour indices ijqt are given in the left column, and the operator label XYZ is given in the first row (see Appendix A for the operator definitions). In parentheses below the bound is a clue to where it comes from: HERA means the single-top searches at HERA that are discussed in Appendix D, and the (\(\epsilon \))s mean the bound comes from dressing the top operator with a W loop. For instance, the bound 0.0037 in the second row and third column of Table 1 arises by exchanging a W between the t and u legs of the top operator, which gives the operator \((\overline{e} \gamma ^\rho P_L \mu ) (\overline{d} \gamma _\rho P_Ls )\). This would mediate the unobserved decay \(K^0 \rightarrow e\mu \), so its coefficient is bounded above, as indicated in Table 6. (The bounds on lighter-quark operators relevant to constraining top operators are given in Tables 6, 7 and 8, and Table 9 gives the loop suppression factors with which the top operators generate the lighter-quark operators.) Translated back to the top operator, the upper limit on \(BR(K^0 \rightarrow e\mu )\) gives the quoted limit on the top-operator coefficient.

In this paper, we are interested in top decays to \(e^\pm \mu ^\mp \), the bounds on which are given in the second and third rows of the tables. For manyFootnote 1 of the operators involving the doublet component of the top (\(t_L\); recall that the last index in the operator label is the top chirality), the rare decay bounds are restrictive, implying that these operators could only induce \(BR(t\rightarrow q \mu ^\pm e^\mp ) \le 10^{-6}\). Scalar and tensor operators involving e, \(\mu \) and \(t_R\) would overcontribute to \(\mu \rightarrow e \gamma \). However, there remain operators which are weakly or not constrained, allowing for a branching ratio \({\lesssim } 10^{-3}\). It is therefore interesting to explore the sensitivity of the LHC to \( t\rightarrow e^\pm \mu ^\mp q\) decays.

Finally, it is interesting to consider how large the \(\epsilon \) coefficient of the top operators can be. Some of the upper bounds quoted in Tables 1 and 2 are \(\gg \)1, and should not be interpreted as relevant constraints.Footnote 2 Indeed, the width of the top is given by D0 [113] as \(2.0 \pm 0.5\) GeV (the theoretical decay rate to bW is 1.3 GeV), which constrains \(\epsilon ^{ijqt}_{XYZ} < 10{-}20\). Furthermore, phenomenological prejudice and the leptoquark example of Eq. (1), suggest that \(\epsilon <1\), because the three-body decay should be mediated by sufficiently heavy (\(m>m_t)\) particles, with sufficiently small couplings to have not yet been detected. We therefore quote in boldface the “relevant” bounds that impose \(\epsilon < 1\).

3 \({t\rightarrow e^\pm \mu ^\mp q}\) at the 8 TeV LHC

In this section, we estimate the sensitivity of current LHC data to the LFV top decays \(t \rightarrow q e^{\pm }\mu ^{\mp }\), where \(q=u, c\). We consider strong production of a \(t\bar{t}\) pair, because this is the most abundant source of tops at the LHC, followed by the leptonic decay on one top, and the LFV decay of the other. This is illustrated in Fig. 2 and gives a final state containing 3 isolated muons or electrons,Footnote 3 which has small Standard Model backgrounds.

3.1 Simulation setup

This study is performed for proton–proton collisions at the LHC, with a centre-of-mass energy of 8 TeV and an integrated luminosity of 20 \(\mathrm {fb}^{-1}\), corresponding to the LHC Run1. The details of the signal and background generation are given in Sect. 3.2. The detector simulation is carried out by Delphes [114] using a CMS setup parametrisation.

Delphes uses a particle-flow-like reconstruction. The relative isolation of leptons is calculated from the total \(p_T\) of the particles inside a cone of \(\Delta R\) around the lepton direction (\(\Delta R=0.3\) for electrons and 0.4 for muons), divided by the \(p_T\) of the lepton. Jets are clustered using the fastjet package [115] with the Anti-kt [116] algorithm with distance parameter \(R=0.5\). The b-tagging performances are tuned on the typical efficiency and fake rate obtained in CMS.

For this study, no additional interactions in the same or neighbouring bunch crossing (pileup) are simulated.

3.2 Signal and SM backgrounds generation

The signal is generated with PYTHIA 8.205 [117] using tune 4C. Top quarks are pair produced, then one top is forced to decay to charm, \(\mu ^\pm \), and \( e^\mp \), with equal probability between \(\mu ^+ \, e^-\) and \(\mu ^- \, e^+\). The decay products are distributed according to the available phase space. 100k events have been generated both for LFV top and anti-top decays.

Fig. 2
figure 2

Feynman diagram for the considered signal where \(\ell =e\) or \(\mu \) (the conjugate diagram is also considered)

The backgrounds for this search, listed in Table 3, are processes that can give rise to 3 isolated leptons and at least 2 jets in the final state. Most of them are related to the production of real isolated leptons, e.g. from a top pair or vector bosons in the final state. In the table, are also shown the details as regards the number of generated events and production cross section for 8 and 13 TeV proton–proton collisions. The number of generated events refers to the generation at 8 TeV. The \(t\bar{t}\) cross section is calculated with the Top++2.0 program to next-to-next-to-leading order in perturbative QCD, including soft-gluon resummation to next-to-next-to-leading-log order (see [118] and references therein), and assuming a top-quark mass of \(m_{t} = 173.3\) GeV. When an explicit calculation was not available, the cross sections have been calculated with the MCFM package [119], version 7.0. The kinematic cuts used for the calculation are also shown in the table.

Table 3 Number of events generated, and cross sections at NLO (except for \(t\bar{t}{+}\)jets), for each background category. Here \(l=e, \mu , \tau \)

The leading order (LO) matrix element generator, MADGRAPH 5 [123], with CTEQ6 parton distribution functions, is used to generate top pair production, and associated production of a top pair and a vector boson (ttW, ttZ). MADGRAPH, interfaced with tauola for \(\tau \) decays, is used to generate vector–vector production (WW, WZ and ZZ) and the contribution of weak processes giving rise to final states with one top quark, one b quark and a Z boson (decaying to leptons). For the vector–vector production, we only consider final states with at least 2 real charged leptons. This means that for the WW system, the considered final states are 2 charged leptons and 2 neutrinos; for WZ, they are 3 charged leptons and one neutrino or 2 charged leptons and 2 quarks; and for ZZ, they are 2 charged leptons and 2 neutrinos, 2 charged leptons and 2 quarks, or 4 charged leptons. In all cases, MADGRAPH accounts for the presence of up to 2 additional jets at matrix-element level, and the hadronisation is carried out by PYTHIA 8.205. The details of the SM background simulation and cross sections are shown in Table 3.

3.3 Event selection

The signal for this search is \(t\bar{t}\) production, followed by the lepton-flavour violating decay of one top (which will be denoted as LFV top in the following), and the leptonic decay of the other (standard top, in the following). The number of expected signal events is given by

$$\begin{aligned} N_{\mathrm{SIG}}={{\mathcal {L}}}\cdot \epsilon _{\mathrm{SIG}}\cdot 2\cdot \sigma _{t\bar{t}}\cdot BR(W\rightarrow l\nu ) \cdot BR(t\rightarrow q \mu ^\pm e^\mp ),\nonumber \\ \end{aligned}$$
(6)

where \(l \in \{e,\mu ,\tau \}\), \({\mathcal {L}}\) the integrated luminosity, \(\epsilon _{\mathrm{SIG}}\) the selection efficiency on the signal, and \(\sigma _{t\bar{t}}=246.7\) pb (see [118] and references therein).

The considered signature is 3 isolated leptons (with one pair of opposite sign and opposite flavour from the LFV decay), 2 jets (one of which is a b-jet), and missing transverse energy. For the event selection, we consider only muons of \(p_T>20\) GeV and \(|\eta |<2.4\), electrons of \(p_T>20\) GeV and \(|\eta |<2.5\), and jets of \(p_T>30\) GeV and \(|\eta |<2.4\). These criteria are comparable to those used in real analyses by the CMS or ATLAS collaborations. A muon is considered isolated if its relative isolation value is less that 0.12, and an electron is considered isolated if its relative isolation value is less that 0.1. We select events containing:

  1. 1.

    exactly 3 isolated charged leptons (electrons or muons), 2 of which must be of opposite sign and opposite flavour.

  2. 2.

    Events are requested to contain at least 2 jets, and

  3. 3.

    exactly one b-tagged jet, and

  4. 4.

    the missing transverse energy has to be higher than 20 GeV.

  5. 5.

    In order to exclude events where 2 of the isolated leptons come from a real Z boson, we reject events containing any pair of opposite sign isolated muons or electrons with invariant mass between 78 and 102 GeV/c\(^2\). This cut is particularly helpful in rejecting background arising from \(t\bar{t}\) associated production with a Z.

  6. 6.

    The charged lepton that does not belong to the pair of opposite sign and opposite flavour leptons is assigned to the standard top in the event, and assumed to come from the W decay (bachelor lepton). Following a common procedure in reconstruction of \(t\bar{t}\) semi-leptonic events, the x and y components of the missing transverse energy are taken as a measurement of the neutrino \(p_{x}\) and \(p_{y}\), and the longitudinal component of the neutrino momentum is calculated imposing the requirement that the invariant mass of the system composed of the bachelor lepton and the neutrino must equal the mass of the W boson. The bachelor lepton and the neutrino 4-momenta are then combined with that of the b-tagged jet, to build a candidate standard top. When more choices of the bachelor lepton are possible (there can be up to two possible pairs of opposite sign opposite flavour charged leptons in one event), all are considered and the one giving the best standard top mass is chosen. We reject events in which the invariant mass of the standard top candidate is more than 45 GeV away from the nominal top mass. After the choice of the bachelor lepton, there is only one possible pair of opposite sign and opposite flavour leptons in each event. This is combined with all good (non b-tagged) jets present in the event to build a list of candidates for the LFV top. Events are required to have at least one combination giving a LFV top mass within 25 GeV of the nominal value.

The efficiency of the final selection on signal events is

$$\begin{aligned} \epsilon _{\mathrm{SIG}}= (1.85\pm 0.03)~\%, \end{aligned}$$
(7)

where the uncertainty is statistical only. The signal efficiency is calculated on \(t\bar{t}\) events where one top decays through \(t\rightarrow q \mu ^\pm e^\mp \), and the other one decays to a b quark and a W, which subsequently decays to a charged lepton (e, \(\mu \) or \(\tau \)) and a neutrino, and is defined as the fraction of such events passing the selection criteria.

The number of expected events, for the signal and for each background category, on 20 fb\(^{-1}\) of proton–proton data at 8 TeV is shown in Table 4, for different subsequent selection requirements.

Table 4 Number of expected events for a luminosity of 20 fb\(^{-1}\), at various steps of the selection, for the signal process normalised to a branching ratio \(BR(t\rightarrow qe\mu )=6.3 \times 10^{-5}\), and the various backgrounds considered in this study normalised to their NLO cross sections. All uncertainties are statistical only. The considered backgrounds are the same as in Table 3, grouped in wider categories. In particular, the numbers in the “no selection” column are relative only to the final states detailed in Table 3. The steps in the selection are as follows; step 1: 3 leptons with 2 opposite sign opposite flavour, step 2: at least 2 jets, step 3: exactly one b-tag, step 4: missing \(E_T\) greater than 20 GeV, step 5: Z boson veto, step 6: invariant masses cuts (see Sect. 3.3 and Fig. 3). Uncertainties are statistical only

3.4 Results and expected limits on the branching ratio

The selection and its efficiency, on signal and background, are discussed in Sect. 3.3, and summarised in Table 4. Assuming a branching ratio of \(BR(t\rightarrow q \mu ^\pm e^\mp )=6.3 \times 10^{-5}\) for the signal, an uncertainty of 2.5 % on the luminosity, and 20 fb\(^{-1}\) of data, we would expect \(N_{\mathrm{SIG}}=3.75\pm 0.06\) signal events, to compare to the \(N_{\mathrm{BKG}}=1.20\pm 0.18\) expected events from known backgrounds.

In Fig. 3, we show the invariant mass of the LFV top candidate (left) and the standard top candidate (right), in events passing all the cuts except those on the masses themselves.

Fig. 3
figure 3

Invariant mass of the LFV top candidates (left) and standard top candidate (right) in events passing all the selection, apart from the cut on the masses themselves. The different background contributions are shown in filled histograms and stacked, normalised to the number of events expected in 20 fb\(^{-1}\). For comparison, the distribution for signal events is also shown as a dashed line, normalised to the number of events expected in 20 fb\(^{-1}\), for an example signal branching ratio \(BR(t\rightarrow q \mu ^\pm e^\mp ) = 6.3 \times 10^{-5}\), equal to the expected limit extracted in Sect. 3.4

In order to evaluate the sensitivity of this search for \(t\rightarrow q \mu ^\pm e^\mp \), we calculate the expected upper limit that could be set, in the case of absence of the signal. The calculation is based on the number of expected background events surviving the final selection, in 20 fb\(^{-1}\) of 8 TeV LHC data, so the result can be interpreted as the possible upper limit the CMS or ATLAS collaborations (alone) could expect to set with Run1 data, if the signal were not there.

A 95 \(\%\) confidence level (CL) upper limit on the branching fraction of \(t\rightarrow q \mu ^\pm e^\mp \) is calculated using the modified frequentist approach (CL\(_{s}\) method [124, 125]), as it is implemented in the RooStats framework [126]. Based on the number of expected background events, summarised in Table 4, and on Eq. (6), the obtained limit is

$$\begin{aligned} BR(t\rightarrow q \mu ^\pm e^\mp ) < 6.3 \times 10^{-5}\quad \hbox {at }95~\%\hbox { CL}. \end{aligned}$$

Alternative techniques for limit calculations, as implemented in RooStats, have been tried, leading to compatible results. An eventual variation of 100 \(\%\) in the number of expected background events would lead, in the worst case, to an expected limit of \(BR(t\rightarrow q \mu ^\pm e^\mp ) < 7.4 \times 10^{-5}\) at 95 % CL.

Table 5 Expected upper limits on \(BR(t\rightarrow q \mu ^\pm e^\mp )\), under the hypothesis of the absence of signal, for 8, 13 TeV (in two scenarios: the case of 20 and 100 fb\(^{-1}\) collected luminosity) and 14 TeV for 3000 fb\(^{-1}\) collected luminosity

As explained in Appendix E, we emulate in our framework the published CMS search for \(BR(t\rightarrow Z q)\) [65]. The reason for this exercise is twofold: on one hand it allows one to validate our procedure on simulated samples, by comparing with the CMS background expectations. On the other hand, it provides an estimate of the constraint set on \(BR(t\rightarrow q \mu ^\pm e^\mp )\) from this previous analysis, which is found to be \(BR(t\rightarrow q \mu ^\pm e^\mp )< 3.7\times 10^{-3}\), on the verge of probing LFV top decays mediated by a 4-fermion operator. As proven in the present study, a dedicated analysis would set a limit 50 times stronger (of the order of \(BR(t\rightarrow q \mu ^\pm e^\mp ) < 6.3 \times 10^{-5}\)), showing that the existing LHC data from Run1 can still be used to obtain interesting constraints on lepton-flavour violation.

4 Discussion

4.1 Perspectives at 13 and 14 TeV

To estimate the reach of the described search at a centre-of-mass energy of 13 TeV, we extrapolate the 8 TeV results, rather than performing a full simulation of signal and background processes at 13 TeV. The increase of the production cross sections for SM processes, from 8 to 13 TeV (see Table 3), is taken into account. The selection requirements and efficiencies are kept the same as for the 8 TeV analysis. For the signal, we have checked on simulated events that the efficiencies at 8 and 13 TeV are consistent within 5 %.

The sensitivity is estimated by calculating the expected upper limits on \(BR(t\rightarrow q \mu ^\pm e^\mp )\), in the absence of signal, for two scenarios: the case of 20 and 100 fb\(^{-1}\) proton–proton data collected by the LHC at 13 TeV centre-of-mass energy. We also extrapolate the sensitivity to the case of 3000 fb\(^{-1}\) of integrated luminosity at 14 TeV. In this last case, we simply rescale signal and background rates from 13 to 14 TeV, and use the square root of the number of expected background events as an estimate of their uncertainty. The obtained values are summarised in Table 5. The upper limits presented here are derived using statistical uncertainties only, so do not take into account the possibility for such analyses to become systematically dominated in the future. In order to have an accurate evaluation of the systematics evolution, a deeper study from the LHC experiments would be needed. On the other hand, for an analysis on 13 or 14 TeV data, the selection would have to be re-optimised, possibly leading to an increase in sensitivity.

4.2 Single top

In addition to mediating LFV top decays, the top operators listed in Appendix A could lead to single-top production with an \(e^\pm \mu ^\mp \) pair, as illustrated in Fig. 4. The objects in the final state would be the same as for the \(t\bar{t} \) process we studied: 3 leptons, missing energy, and \({\ge } 2\) jets, of which one is a b. We estimate that at 8 TeV, the cross section for \(pp \rightarrow e^\pm \mu ^\mp t\rightarrow e^\pm \mu ^\mp , \overline{\ell } \nu b\) is of similar order to the cross section for \(pp \rightarrow \bar{t} t \rightarrow e^\pm \mu ^\mp \bar{q} , \overline{\ell } \nu b\), for operators involving a u quark and slightly less for a c quark.

Fig. 4
figure 4

Single-top production via the LFV contact interaction, which produces a final state similar to the considered signal

We neglect this process for two reasons. First, the contact interaction approximation (the |4-momentum\(|^2\) in the process \({\ll }\Lambda ^2\)) is more difficult to justify than in top decay, because the energy scale in the process can be \({\gg }m_t\). Second, finding such events in the backgrounds could be more challenging because the \(e^\pm \mu ^\mp \) cannot be required to participate in the reconstruction of a top (step 6 of Sect. 3.3).

We envisage that it makes sense to neglect the LFV single top process in a first search for LFV top decays. This is conservative, because LFV single-top production could contribute events that pass our selection. In the absence of a signal, such a search could sufficiently constrain the contact interaction scale \(\Lambda \), to justify including the single top process in subsequent analyses.

5 Summary

The aim of this paper was to explore the LHC sensitivity to the decay \(t\rightarrow q \mu ^\pm e^\mp \), which is lepton and quark flavour-changing, but baryon and lepton number conserving. We parametrise this decay as occurring via a contact interaction, and list a complete set of \(SU(3)\times U(1)\) invariant dimension-six operators in Appendix A. We parametrise the coefficient of these interactions, which we refer to as “top operators”, as \(\epsilon 2\sqrt{2} G_F\) or equivalently \(1/\Lambda ^2\), with

$$\begin{aligned} \epsilon \simeq \frac{m_t^2}{\Lambda ^2}. \end{aligned}$$

Model-building prejudice (see Sect. 2.3) suggests that \(\epsilon \lesssim 1\). The top branching ratio is then

$$\begin{aligned} BR(t\rightarrow q \ell _i^+ \ell _j^-) \simeq 1.3 \frac{|\epsilon |^2}{48\pi ^2} \times \left\{ \begin{array}{cc} 1 &{} V\pm A\\ \frac{1}{4} &{} S\pm P. \end{array} \right. \end{aligned}$$

These contact interactions are currently constrained from their contribution in loops to rare decays, and from single top searches at HERA. These bounds are discussed in the appendices, and summarised in Sect. 2. For interactions involving e and \(\mu \), rare decay bounds impose \(\epsilon \lesssim 0.01\) for some operators, but others can have \(\epsilon \sim 1\).

In Sect. 3, we evaluate the sensitivity reach of a dedicated search for lepton-flavour violation in top decays, at the 8 TeV LHC. The search targets \(t\bar{t}\) events, where one top decays to an up-type quark (u or c) and a pair of leptons of opposite sign and opposite flavour, and the other one decays to a b quark and a W, which subsequently decays to a charged lepton and a neutrino. This is illustrated in Fig. 2.

The relevant signal and SM background processes are simulated for LHC Run1-like conditions: proton–proton collisions at 8 TeV centre-of-mass energy, for an integrated luminosity of about 20 fb\(^{-1}\). The detector simulation is based on Delphes, with parameters tuned on the CMS detector reconstruction and performances, but it does not include pileup. The analysis setup is validated by emulating an existing CMS search for rare top decays to Zq in \(t\bar{t}\) events, showing reasonable results.

We find that a dedicated search by a single experiment using 20 fb\(^{-1}\) of 8 TeV data could be sensitive to

$$\begin{aligned} BR(t \rightarrow e \bar{\mu } + ~\mathrm{jet}) \sim 6.3\times 10^{-5}, \end{aligned}$$

and we extrapolate that a sensitivity of \( {\sim } 1.2 \times 10^{-5}\) (\( {\sim } 2 \times 10^{-6}\)) could be reached with 100 fb\(^{-1}\) at 13 TeV (3000 fb\(^{-1}\) at 14 TeV). From Eq. (4), we see that the 100 fb\(^{-1}\) data could impose \(|\epsilon | \le 0.06\) for the \(V \pm A\) operators, and \(|\epsilon | \le 0.1\) for the \(S \pm P\) and LQ operators. This analysis shows that the existing LHC data from Run1 can still be used to obtain interesting constraints on lepton-flavour violation. Although this is understandably not the priority focus in the times of the Run2 startup, let’s not to leave unchecked this possible path to New Physics.