1 Introduction

With the Higgs discovery, an era in humankind’s quest for the fundamental laws of Nature has been completed [1, 2]. At the same time, new uncharted territory has been opened: the spin-zero window to the universe. (Pseudo)Nambu-Goldstone scalars (pGBs) are strongly motivated from fundamental problems of the known particle physics laws, that is, of the Standard Model of Particle Physics (SM). They are the generic tell-tale of exact, although “hidden” (i.e. spontaneously broken), global symmetries of nature. A paradigmatic example is the axion, which results from the dynamical solution to the strong CP problem of the SM [3,4,5,6,7]. The traditional “invisible axion” is expected to be extremely light, with mass \(m_a<10^{-2}\) eV, and its scale \(f_a\) to be out of direct experimental reach, although recently tantalizing axion solutions to the strong CP problem are being explored with scales as low as \({\mathcal {O}}(\hbox {TeV})\) [8,9,10,11,12,13,14,15,16,17,18,19]. PGBs of deep interest extend well beyond axions, though. They appear in a plethora of constructions which reach beyond the SM (BSM), typically as SM scalar singlets, e.g. in: (i) Theories with extra space-time dimensions, (ii) Dynamical explanations to the smallness of neutrino masses: the Majoron [20], (iii) String theory models [21], (iv) Supersymmetric extensions of the SM [22], and v) Many dynamical flavor theories with hidden global U(1) symmetries, a particular class of which identifies the QCD axion as a flavon “à la Froggat-Nielsen” (axiflavon or flaxion) [23,24,25]. These pGBs are often denoted by the general name of axion-like particles (ALPs), as anomalous couplings to gauge currents often appear in addition to purely derivative ones. ALPs may or may not have anomalous couplings to gluons, and they are not required to solve the strong CP problem. One practical difference between a generic ALP and true axions which solve the strong CP problem is that, for ALPs, \(f_a\) and \(m_a\) are treated as independent parameters. Outstandingly, and as a wonderful byproduct, both axions and ALPs may be excellent candidates to explain the nature of dark matter (DM) [26,27,28].

The parameter space for very light ALPs, with masses below the MeV scale, is delimited mainly by astrophysical and cosmological constraints. Regarding terrestrial experiments, ADMX has finally entered the critical territory expected for an invisible axion signal if DM is made of axions. In addition, the investment in axion and ALP searches in a large range of masses is accelerating at present with CAST [29], IAXO [30, 31], and future projects such as Madmax, CASPEr, QUAX, HeXenia, FUNK and electric dipole moment searches (PSI and Co) [32,33,34,35,36]. Also, DM experiments like Xenon [37] and the future Darwin [38] target keV mass ALP dark matter (and solar axions). On the precision arena, flavor experiments provide valuable constraints. For instance, NA62 [39] is taking data, and new fixed target facilities (e.g. SHIP [40]) are in preparation, with sensitivity to MeV-GeVs ALPs and strong complementary potential to tackle ALP couplings to gauge bosons and fermions. Belle-II [41] will also have some sensitivity to this mass range, as well as the LHC with Mathusla, Faser and CodexB [42,43,44]. Indeed, ALPs may well show up first at colliders [45, 46]. Intense work on ALP signals at the LHC and future colliders is underway [47, 48], and the synergy between collider and low-energy fixed target experiments is increasingly explored [49]. All couplings must be analyzed combining fixed-term and accelerator data in a complementary approach.

In this work, we explore ALP contributions to flavor changing neutral current (FCNC) processes, formulating them in a model-independent approach via the linear realization of the ALP effective Lagrangian. The complete basis of bosonic and CP-even ALP couplings to the electroweak sector is considered. That is, the set of gauge invariant and independent leading-order couplings to the W, Z, photon and Higgs doublet is discussed. Given that these operators are flavor blind, they may impact flavor-changing data only at loop level. The couplings of ALPs to heavy SM bosons had been largely disregarded until recently, even if a priori they are all expected to be on equal footing with the pure photonic ones because of electroweak gauge invariance. In addition to novel collider signatures [47, 48], rare hadron decays provide a superb handle on the ALP couplings to massive vector bosons [50] for ALP masses below 5 GeV. The one-loop impact on FCNC processes of the anomalous ALP-W-W coupling was first considered in Ref. [49]: it was shown to induce flavor-changing rare meson decays via W exchange, with the ALP radiated from the W boson [49, 54, 55]. The axion can then either decay in some visible channel or escape the detector unnoticed, and novel bounds were derived in both cases. Given the level of accuracy provided by present flavor experiments, it is most pertinent to take into account the competing contribution of other electroweak ALP couplings leading to the same final states. In other words, the ensemble of the linearly independent ALP-electroweak couplings should be considered simultaneously in order to delimitate the parameter space. Putative anomalous couplings of ALPs to gluons could also contribute to flavor-blind decays into visible channels, but not to FCNC processes other than via pseudoscalar (e.g. ALP-\(\eta '\) and ALP-pion) mixing in SM flavor-changing decays, and they are not considered in this paper.

The analysis of two (or more) couplings simultaneously has the potential to change the experimental perspective on ALPs. Our theoretical analysis is confronted with the prospects for ALP detection in present and upcoming fixed-target experiments and B-physics experiments. After the theoretical analysis, the structure of this paper reflects successively the two alternative scenarios mentioned above, in which the ALP produced in FCNC meson decays can then either decay into visible channels within the detector, or it can be invisible by escaping the detector (or decaying to a hidden sector). For both cases, the comparison with data considers first each coupling separately and then the ensemble in combination, and the resulting interference patterns are worked out in detail.

2 Bosonic ALP Lagrangian

The most general effective Lagrangian describing ALP couplings contains – at leading order in the linear expansion – only three independent operators involving electroweak gauge bosons [47, 51,52,53],

$$\begin{aligned} \delta {\mathcal {L}}_{\mathrm {eff}}&= \dfrac{\partial _\mu a\, \partial ^\mu a}{2} - \dfrac{m_a^2 \,a^2}{2} \nonumber \\&\quad + {c_{a\Phi }} \,{\mathcal {O}}_{a\Phi }+{c_B}\, {\mathcal {O}}_B+{c_W} \,{\mathcal {O}}_W, \end{aligned}$$
(1)

with

$$\begin{aligned} \begin{aligned} {\mathcal {O}}_{a\Phi }&\equiv i\dfrac{\partial ^\mu a}{f_a}\, \Phi ^\dagger \overleftrightarrow {D}_\mu \Phi ,\\ {\mathcal {O}}_B&\equiv - \dfrac{a}{f_a}B_{\mu \nu } {\widetilde{B}}^{\mu \nu },\\ {\mathcal {O}}_W&\equiv - \dfrac{a}{f_a}W_{\mu \nu }^a {\widetilde{W}}^{\mu \nu }_a, \end{aligned} \end{aligned}$$
(2)

where \(\Phi \) is the SM Higgs doublet, \(f_a\) is the ALP decay constant, \(c_i\) are real operator coefficients and \(\Phi \overleftrightarrow {D}_\mu \Phi \equiv \Phi ^\dagger (D_\mu \Phi \big )-(D_\mu \Phi )^\dagger \Phi {}\). The dual field strengths are defined as \({\tilde{X}}^{\mu \nu }\equiv \frac{1}{2} \epsilon ^{\mu \nu \rho \sigma }X_{\rho \sigma }\), with \(\varepsilon ^{0123}=1\).

Upon electroweak symmetry breaking, \({\mathcal {O}}_{a\Phi }\) induces a mixing between a and the would-be Goldstone boson eaten by the Z. Its physical impact is best illustrated via an ALP-dependent rotation of the Higgs field, namely \(\Phi \rightarrow \Phi \, e^{i c_{a\Phi } a/f_a}\) [51], which trades \({\mathcal {O}}_{a\Phi }\) for the following fermionic couplings:

$$\begin{aligned} {\mathcal {O}}_{a\Phi } \rightarrow i\, \dfrac{a}{f_a} \Big [{\overline{Q}}\, Y_u {\widetilde{\Phi }}\, u_R - {\overline{Q}}\, Y_d \Phi \, d_R - {\overline{L}} \, Y_\ell \Phi \,\ell _R\Big ]+\mathrm {h.c.}, \end{aligned}$$
(3)

where \(Y_{u,d,\ell }\) denote the SM Yukawa matrices, flavor indices are omitted, and neutrino masses are disregarded. The ALP-electroweak operators in Eq. (2) are flavor blind, but \({\mathcal {O}}_{a\Phi }\) and \({\mathcal {O}}_{W}\) can participate in FCNC processes at one loop via \(W^\pm \) gauge boson exchange. At this order, the parameter space of ALP-electroweak couplings in FCNC processes is thus reduced to two dimensions spanned by the coefficients

$$\begin{aligned} \{c_W, {c_{a\Phi }}\}. \end{aligned}$$
(4)

They may contribute to rare decays as illustrated in the left (\({c_{a\Phi }}\)) and right (\({c_W}\)) panels of Fig. 2. While \({c_W}\) has been discussed separately in Refs. [49, 50], and the effective ALP-fermionic interactions have also been considered by themselves before [48, 54,55,56,57,58], the interplay between \(c_W\) and \(c_{a\Phi }\) will be shown below to lead to interesting new features.

3 FCNC ALP interactions

The effective interaction between a pGB and left-handed fermions can be expressed in all generality as

$$\begin{aligned} {\mathcal {L}}_\text {eff}^{d_i \rightarrow d_j} = -g_{ij}^a\, \left( \partial _\mu a\right) \, \bar{d_j} \gamma ^\mu P_L d_i+\mathrm {h.c.}, \end{aligned}$$
(5)

where latin indices ij denote flavor and \(g_{ij}^a\) is an effective coupling.

The impact of \({\mathcal {O}}_{a\Phi }\) and \({\mathcal {O}}_{W}\) on \(d_i \rightarrow d_j a\) (with \(i\ne j\)) transitions via one-loop \(W^\pm \) exchange induces a left-handed current of the form in Eq. (5), and thus a contribution to rare meson decays. The corresponding Feynman diagrams at the quark level are those contained in the illustration in Fig. 1, as well as the corresponding self-energy diagrams with the ALP operator inserted on the quark lines external to the W loop. At the quark level, those one-loop W exchanges result in a contribution to \(g_{ij}^a\), for \(i\ne j\), given by

$$\begin{aligned} g_{ij}^a = g^2\sum _{q=u,c,t} \dfrac{V_{qi} V_{qj}^*}{16\pi ^2} \Bigg [ \dfrac{3c_W}{f_a}g(x_q)-\dfrac{c_{a\Phi }}{4f_a} x_q \log \left( \dfrac{f_a^2}{m_q^2}\right) \Bigg ], \end{aligned}$$
(6)

where g is the electroweak gauge coupling, and \(V_{qi}\) are the CKM matrix elements. In this equation, \(m_q\) denotes the mass of a given up-type quark q that runs in the loop, the approximation \(m_{d_j} ,\, m_{d_i} \ll m_W\) has been used, \(x_q=m_q^2/m_W^2\), and the loop function is given by

$$\begin{aligned} g(x)=\dfrac{x\left[ 1+x (\log x-1)\right] }{(1-x)^2}. \end{aligned}$$
(7)
Fig. 1
figure 1

Illustration of diagrams giving one-loop contributions to the process \(K^+\rightarrow \pi ^+ a\) via the interactions defined in Eq. (3)

Fig. 2
figure 2

Invisible ALP: constraints on the absolute value of \(c_W\) (left panel) and \(c_{a\Phi }\) (right panel) as a function of the ALP mass, considering each of these couplings separately. The exclusion contours have been derived from the experimental limits on \({\mathcal {B}}(K^+\rightarrow \pi ^+ +\mathrm {inv})\) [62] (green) and \({\mathcal {B}}(B\rightarrow K +\mathrm {inv})\) [63] (blue) by fixing \(f_a = 1\) TeV and by setting the other couplings to zero. Projections for NA62 [49] and Belle-II [71] experiments are illustrated by dashed lines

It follows that the decay rate for the process \(K^+\rightarrow \pi ^+ a\) can be expressed as

$$\begin{aligned} \Gamma (K^+\rightarrow \pi ^+ a)&=\dfrac{m_{K}^3 |g_{sd}^a|^2}{64\pi } f_0(m_a^2)^2 \lambda _{\pi a}^{1/2}\left( 1-\dfrac{m_{\pi }^2}{m_{K}^2}\right) ^2, \end{aligned}$$

with \(\lambda _{\pi a}=\left[ 1-\frac{{( m_a}+m_\pi )^2}{m_K^2}\right] \left[ 1-\frac{({m_a}-m_\pi )^2}{{m_K}^2}\right] \). In this expression, \(f_0\) denotes the \(K\rightarrow \pi \) scalar form factor, which has been computed in lattice QCD in Ref. [64]. An analogous expression can be obtained mutatis mutandis for the decay \(B\rightarrow K a\), in which case the relevant form factors can be found in Refs. [65, 66].

Fig. 3
figure 3

Allowed \(\{c_W, c_{a\Phi }\}\) parameter space for the invisible ALP when those two couplings are simultaneously present. The superposition of the constraints from \(K^+\rightarrow \pi ^+ + \mathrm {inv} \) (green) and \(B^+\rightarrow K^+ + \mathrm {inv} \) (blue) data is shown for an illustrative case with \(f_a=1\) TeV and \(m_a \lesssim 100\) MeV. The left (right) panel shows the destructive (constructive) interference of the two couplings for \(c_{W}/c_{a\Phi }>0\) (\(c_{W}/c_{a\Phi }<0\)). The red solid (dashed) lines correspond to the current (projected) limits from mono-W searches at the LHC with \(3.2~\textit{fb}^{-1}\) (\(\,3~\textit{ab}^{-1}\!\)) of data [47]

In Eq. (6), the contribution proportional to \(c_W\) is finite due to the Glashow–Iliopoulos–Maiani (GIM) mechanism, in agreement with the results of Ref. [49]. The \(c_{a\Phi }\) term is instead logarithmically sensitive to the ultraviolet scale of the theory \(f_a\), and its contribution is thus approximated by the leading log model-independent component. Furthermore, because \(g(x)\sim x + {\mathcal {O}}(x^2)\) for small x, the contributions from the up and charm quarks are sub-leading in both terms with respect to that of the top quark. Also, note that the logarithmic enhancement of the \(c_{a\Phi }\) term (\(\propto \log \left( f_a/m_t\right) \)) should be particularly relevant for large values of \(f_a\). This logarithmic divergence is a consequence of the operator \({\mathcal {O}}_{a\Phi }\) being non-renormalizable [55,56,57], in contrast with renormalizable scenarios such as two-Higgs doublet models [55, 59,60,61].

The interplay between \(c_{a\Phi }\) and \(c_W\) presents interesting features which depend on their relative sign. Their contributions to ALP production in rare decays can interfere destructively if and only if \(c_{a\Phi }/c_{W}>0\). Such a cancellation would leave a region in parameter space which cannot be probed by relying only on FCNC decays such as \(K\rightarrow \pi a\) and \(B\rightarrow K a\). An alternative to lift this degeneracy using LHC constraints will be discussed further below, after deriving the constraints that follow from rare meson decays.

In order to determine the detection possibilities for a given final state channel, an important element is whether the ALP can decay into visible particles within the detector, or whether it escapes and contributes to an “invisible” channel. We discuss next both cases.

4 The invisible ALP

Let us consider first the scenario of an ALP that does not decay into visible particles in the detector, which we shall refer to as the “invisible ALP”. This situation can arise if a is sufficiently light, making a long-lived, or if there are large couplings of a to a dark sector, making \({\mathcal {B}}(a\rightarrow \mathrm {inv})\) sufficiently large. The analysis performed below is general and applies to both cases.

The experimental constraints relevant for different \(m_a\) ranges are listed next:

  • \(m_a \in (0, m_K-m_\pi )\): Searches for the decay \(K\rightarrow \pi \nu {\bar{\nu }}\) have been performed at the E787 and E949 experiments. The bounds obtained can be directly reinterpreted to limit the parameter space of new undetected particles. E787 and E949 experiments take measurements in two regions of pion momentum, namely \(p_\pi \in (140,199)\) MeV and \(p_\pi \in (211,229)\) MeV, which can be translated into the ALP mass ranges \(150~\mathrm {MeV}\lesssim m_a \lesssim 260\) MeV and \(m_a \lesssim 115\) MeV, respectively. The limits reported in these searches are \({\mathcal {B}}(K^+ \rightarrow \pi ^+ \nu {\bar{\nu }})^{\mathrm {exp}}=\left( 1.73^{+1.15}_{-1.05}\right) \times 10^{-10}\) [62] and \({\mathcal {B}}(K^+ \rightarrow \pi ^+ \nu {\bar{\nu }})^{\mathrm {exp}}< 2.2 \times 10^{-9}\) [63], which lie slightly above the SM prediction, \({\mathcal {B}}(K^+\rightarrow \pi ^+\nu {\bar{\nu }})^{SM}= \left( 9.11\pm 0.72\right) \times 10^{-11}\) [67]. Similar searches have been performed at the NA62 experiment, which aims at attaining the SM rates in the very near future [68]. In our analysis, we consider the E787 and E949 constraints, as summarized in Ref. [62].

  • \(m_a \in (0, m_B-m_K)\): The most constraining experimental limits on \({\mathcal {B}}(B\rightarrow K^{(*)}+\mathrm {inv})\) were obtained by the Belle collaboration. These are \({\mathcal {B}}(B\rightarrow K \nu {\bar{\nu }})<1.6 \times 10^{-5}\) and \({\mathcal {B}}(B\rightarrow K^*\nu {\bar{\nu }})<2.7 \times 10^{-5}\) (90\(\%\) C.L.) [69], which lie respectively a factor of 3.9 and 2.7 above the SM predictions [70]. In the near future, Belle-II aims at measuring the SM value with a \({\mathcal {O}}(10\%)\) precision [71]. For the new physics scenario considered here, the strongest constraint arises from the \(B\rightarrow K \nu {\bar{\nu }}\) result.

We have explicitly checked that \(\Delta F=2\) constraints on the effective couplings \(g_{ij}^a\) are less stringent than the ones presented above for most of the ALP parameter space considered here. Nevertheless, those constraints should provide the best bounds on \(c_{a\Phi }\) and \(c_{W}\) for masses larger than \(\sim 5\) GeV, which are out of reach of rare decays, see Fig. 2. Those observables are not included in our analysis, though, since the consistent assessment of the corresponding limits would require a complete two-loop computation, as well as the additional consideration of higher dimension ALP operators, which goes beyond the scope of this paper.

The constraints set on ALP-electroweak coefficients by data will be analyzed in two steps: first within a one coupling at a time approach, where either only \(c_W\) or \(c_{a\Phi }\) are switched on; next, the \(\{c_{a\Phi }, c_W \}\) parameter space spanned by the simultaneous presence of both couplings will be considered.

Figure 2 depicts the allowed values of \(c_W\) (left panel) and \(c_{a\Phi }\) (right panel) as a function of the ALP mass, when only one of these two couplings is added to the SM. The constraints obtained on the \(\{ m_a, c_W\}\) plane (left panel) coincide with those derived in Ref. [49]. The constraints on the parameter space for \(\{ m_a, c_{a\Phi }\}\) (right panel) are a novel contribution of this work. The case illustrated corresponds to \(f_a=1\) TeV. The quantitative similarity of the exclusion limits on the two couplings depicted in Fig. 2 is fortuitous; it is easy to check that the constraints on \(c_{a\Phi }\) become stronger than those for \(c_W\) for larger values of \(f_a\), as expected from the logarithmic dependence of its contribution, see Eq. (6).

These plots also indicate that kaon constraints are typically one order of magnitude stronger than those derived from B-meson decays, although limited to a more restricted \(m_a\) range. Future prospects from NA62 and Belle-II are also illustrated in Fig. 2 with dashed lines.

When both \(c_{a\Phi }\) and \(c_W\) are simultaneously considered, an interesting pattern of destructive interference can take place, as anticipated in Sect. 3. Figure 3 depicts the result of combining the different experimental constraints for fixed values of \(f_a\) and \(m_a\lesssim 0.1\) GeV. This shows indeed that when the relative sign of both couplings is positive, a blind direction in parameter space appears. This unconstrained direction is exactly aligned for kaon and B-meson decays. For this reason, additional experimental information is then needed to lift the degeneracy. One possibility is to consider the decays \(D\rightarrow \pi (a\rightarrow \mathrm {inv})\), which are sensitive to a different combination of \(c_{a\Phi }\) and \(c_W\), since the up- and down-type quark contributions to the term proportional to \(c_{a\Phi }\) have opposite signs, see Eq. 3. These decays, however, suffer from a heavy GIM suppression, and no such experimental searches have been performed to our knowledge. A more promising possibility is to consider LHC constraints that are sensitive to a specific ALP coupling. For example, LHC searches for mono-W final states are only sensitive to \(c_W\).Footnote 1 In Ref. [47] the authors derived the current (projected) bounds

$$\begin{aligned} \dfrac{|c_W|}{f_a}\lesssim 0.41~(0.16)~\mathrm {TeV}^{-1}, \end{aligned}$$
(8)

from \(3.2~\mathrm {fb}^{-1}\) (\(3~\mathrm {ab}^{-1}\)) of LHC data: these have been superimposed in Fig. 3. Similarly, a reinterpretation of \(pp \rightarrow t {\bar{t}}+\mathrm {MET}\) at the LHC would constrain only \(c_{a\Phi }\), but such analysis goes beyond the scope of this letter. Typically, LHC constraints are weaker than flavor bounds, except in the region of parameter space where the flavor signal is suppressed due to a cancellation between two contributions. In this case, the complementarity of low and high-energy constraints becomes an important handle on new physics.

5 The visible ALP

We analyze next the case of ALPs produced at loop level via rare meson decays, but decaying into visible states via the same set of bosonic interactions introduced in Eq. (2). For the \(m_a\) range considered in this work, the kinematically accessible decays are \(a\rightarrow \gamma \gamma \), \(a\rightarrow \mathrm {hadrons}\) and \(a\rightarrow \ell \ell \), with \(\ell =e,\mu ,\tau \). Both tree-level and loop-level contributions to the decays are to be taken into account. Indeed, experimental limits on ALP couplings to photons, electrons, and nucleons are so stringent that (indirect) loop-induced observables can give stronger constraints than (direct) tree-level ones [48, 50].

At tree level, \(c_W\) and \(c_{a\Phi }\) contribute respectively to ALP decays into photons and into fermions. Nevertheless, the coupling \(c_B\) may also enter the game for these decays: at tree level for the photonic channel and at loop level for the fermionic channel. That is, while the parameter space for the production of an ALP via rare meson decays is still the two-dimensional one in Eq. (4), the whole set of ALP electroweak couplings \(\{c_{a\Phi }, c_W, c_B\}\) is relevant for the analysis of visible decay channels. For consistency, all one-loop contributions induced by these three couplings are to be taken into account.

For instance, the partial width for ALP decay into leptons, including one-loop corrections, reads

$$\begin{aligned} \Gamma (a\rightarrow \ell ^+\ell ^-) = |c_{\ell \ell }|^2\dfrac{ m_a m_\ell ^2 }{8\pi f_a^2} \sqrt{1-\dfrac{4 m_\ell ^2}{m_a^2}} \end{aligned}$$
(9)

where \(\alpha _\mathrm {em}\) is the fine structure constant and \(c_{\ell \ell }\) is given at one-loop order by

$$\begin{aligned} \begin{aligned} c_{\ell \ell }&= c_{a\Phi }+\frac{3\,\alpha _\mathrm {em}}{4\pi } \left( \frac{3\,c_W }{s_w^2}+ \frac{5\,c_B}{c_w^2 } \right) \log \dfrac{f_a}{m_W}\\&\quad + \dfrac{6\, \alpha _{\mathrm {em}}}{\pi }\left( c_B \, c_w^2+c_W\, s_w^2\right) \log \dfrac{m_W}{m_\ell } , \end{aligned} \end{aligned}$$
(10)

where \(s_w=\sin \theta _w\), \(c_w=\cos \theta _w\) and \(\theta _W\) denotes the weak mixing angle. For the \(a\rightarrow \gamma \gamma \) decay, the partial width reads

$$\begin{aligned} \Gamma (a\rightarrow \gamma \gamma )&= |c_{a\gamma \gamma }|^2 \dfrac{m_a^3 }{4\pi f_a^2}, \end{aligned}$$
(11)

where the \(c_{a\gamma \gamma }\) coupling is defined at tree level, as

$$\begin{aligned} c_{a\gamma \gamma }\Big |_{\mathrm {tree}}\equiv c_B \,c_w^2 +c_W\,s_w^2 . \end{aligned}$$
(12)

Furthermore, bosonic loops give corrections to \(c_{a\gamma \gamma }\) proportional to \(c_W\). Fermionic loops may also induce nonzero values of \(c_{a\gamma \gamma }\) at the scale \(\mu = f_a\), even if the ALP has no tree-level couplings to gauge bosons, i.e. \(c_W = c_B = 0\) [48]. To sum up, both \(c_W\) and \(c_{a\Phi }\) induce one-loop corrections to the photonic width. Specifically, for \(m_a\ll \Lambda _{\text {QCD}}\),

$$\begin{aligned} \begin{aligned} c_{a\gamma \gamma }\Big |_\mathrm {1-loop}&= c_W\,\Big [s_w^2\,+\frac{2\,\alpha _{\text {em}}}{\pi } B_2(\tau _W)\Big ] +c_B\,c_w^2 \\&\quad - c_{a\Phi } \,\frac{\alpha _\mathrm {em}}{4\pi }\,\bigg ( B_0 +\frac{m_a^2}{m_{\pi }^2-m_a^2}\bigg ), \end{aligned} \end{aligned}$$
(13)

where \(B_0\) and \(B_2(\tau _f)\) are loop functions, which are detailed in Appendix A. For \(m_a\gg \Lambda _{\text {QCD}}\), the second term in the last line of the above equation is absent, since it stems from \(\pi \)-a mixing which becomes negligible in this mass range.

Fig. 4
figure 4

ALP partial decay widths to various two-particle channels as a function of \(m_a\), in presence of either \(c_W\) (left panel) or \(c_{a\Phi }\) (right panel), for \(c_{W}/f_a=1~\mathrm {TeV}^{-1}\) and \(c_{a\Phi }/f_a=1~\mathrm {TeV}^{-1}\!\), respectively. The grey shaded areas correspond: (i) to the pion mass region, which is experimentally excluded due to the large \(\pi ^0\)-a mixing; (ii) the interval (1, 3) GeV, in which the hadronic width cannot be fully assessed either with chiral estimates or perturbatively

For hadronic decays, it is pertinent to consider two separate \(m_a\) regions: (i) between \( 3\, m_\pi \) and 1 GeV, and (ii) above 3 GeV. In the former region, the dominant hadronic decay is \(a\rightarrow 3\pi \) which can be computed by employing chiral pertubation theory [48]. In the region above 3 GeV, the dominant decays are \(a\rightarrow c {\bar{c}}\) and \(a\rightarrow b {\bar{b}}\), which are well described by a perturbative expression analogous to Eq. (9) multiplied by the color factor \(N_c=3\).Footnote 2 In this work we remain agnostic about the intermediate region \(m_a\in (1,3)\) GeV, since several hadronic channels, which are particularly difficult to estimate reliably, open up for these masses.Footnote 3 In this region, the total hadronic width \(\Gamma _a\) will be replaced by its value at the range frontier at \(m_a=3\) GeV. Note that this is the most conservative choice, since the hadronic width is a continuous and strictly increasing function of \(m_a\).

Figure 4 illustrates the ALP partial widths as a function of \(m_a\), when either only \(c_W\) (left panel) or \(c_{a\Phi }\) (right panel) are present, for the benchmark values \(c_{W}/f_a=1~\mathrm {TeV}^{-1}\) and \(c_{a\Phi }/f_a=1~\mathrm {TeV}^{-1}\). The mass thresholds for each of the fermionic channels are clearly delineated.

In order to analyze the impact of an intermediate on-shell ALP on rare meson decays to visible channels, ALP production via the couplings in Eq. (2) needs to be convoluted with ALP decay into SM particles via that same set of couplings. When \(c_{a\Phi }\) and \(c_W\) are simultaneously present, a very interesting pattern of constructive/destructive interference is expected. We will assume for simplicity \(c_B=c_W\) to illustrate the effect. While a positive sign for \(c_W/ c_{a\Phi }\) leads to destructive interference in ALP production (see Eq. (6) and Fig. 3), the opposite can occur in the subsequent ALP decay into visible channels. Indeed, the decay into leptons shows destructive interference for negative \(c_{a\Phi }/c_{W}\), see Eq. (9). The expectation for the photonic channel is more involved and depends on the ALP mass: for \(m_a< m_{\pi }\) the terms in the last parenthesis in Eq. (13) are both real and positive and the interference pattern is thus analogous to that for ALP production, while for larger masses it may differ. Table 1 summarizes the interference pattern expected.

Three sets of experimental data that will be considered in order to constrain the \(\{m_a,c_{a\Phi },c_{W}\}\) parameter space for a visible ALP: 1) displaced vertices; 2) semileptonic and photonic meson decays; 3) leptonic meson decays.

  1. 1.

    Displaced vertices. Of particular interest are searches for long-lived scalars, which would result in displaced vertices. Two \(m_a\) ranges are pertinent:

    1. (a)

      \(m_a \in (2 m_\mu , m_B-m_K)\) The LHCb collaboration perfomed searches for long-lived (pseudo)scalar particles in the decays \(B\rightarrow K^{(*)}a\), with \(a\rightarrow \mu \mu \) [73, 74]. Limits on \({\mathcal {B}}(B\rightarrow K^{(*)}a)\cdot {\mathcal {B}}(a\rightarrow \mu \mu )\) which vary between \(10^{-10}\) and \(10^{-7}\) are reported as a function of \(m_a\) and the proper lifetime, \(\tau _a\). For \(\tau _a<1\) ps, the limit derived is independent of \(\tau _a\) since the ALP would decay promptly. The best constraints are those for values of \(\tau _a\) between 1 ps and 100 ps, for which the dimuon vertex would be displaced from the interaction vertex. See also Ref. [75] for a recent reinterpretation of these limits.

    2. (b)

      \(m_a \in (2 m_\mu , m_K-m_\pi )\) Similar searches have also been performed by the NA48/2 Collaboration for the decay \(K^+\rightarrow \pi ^+ a\), followed by \(a\rightarrow \mu \mu \) [76]. The limits reported on \({\mathcal {B}}(K^+\rightarrow \pi ^+ a)\cdot {\mathcal {B}}(a\rightarrow \mu \mu )\) decrease with ALP lifetime until \(\tau _a = 10\) ps, becoming constant for smaller values of \(\tau _a\). The best experimental limits are \({\mathcal {O}}(10^{-10})\) and obtained for \(\tau _a \le 10\) ps.

  2. 2.

    Semileptonic and photonic meson decays. Relevant constraints on ALPs can be inferred from their indirect contributions to low-energy meson decays. In particular:

    1. (a)

      Kaon decays. The measured kaon branching fractions \({\mathcal {B}}(K^+\rightarrow \pi ^+ee)^{\mathrm {exp}}=(3.00\pm 0.09)\times 10^{-7}\), \({\mathcal {B}}(K^+\rightarrow \pi ^+\mu \mu )^{\mathrm {exp}}=(9.4\pm 0.6)\times 10^{-8}\) [77], and \({\mathcal {B}}(K^+\rightarrow \pi ^+\gamma \gamma )^{\mathrm {exp}}=(1.01\pm 0.06)\times 10^{-7}\) [78] will be taken into account. In order to avoid the uncertainty related to the unknown SM long-distance contributions, it will be required that the ALP contribution alone does not saturate the \(2\sigma \) experimental bounds.

    2. (b)

      B-meson decays. Recently, LHCb observed several deviations from the expected values in ratios of \(B\rightarrow K^{(*)}\mu \mu \) and \(B\rightarrow K^{(*)}ee\) decays in different bins of dilepton squared mass [79, 80]. If these anomalies turn out to imply new physics, ALP couplings would not explain them. More precisely, pseudoscalar effective operators induced by a heavy mediator cannot reproduce current deviations due to the constraints derived from \({\mathcal {B}}(B_s \rightarrow \mu ^+\mu ^-)^{\mathrm {exp}}\) [81]. On the other hand, a light ALP with \(m_a \lesssim m_B-m_K\) would face stringent limits from LHCb searches for long-lived (pseudo)scalar particles in \(B\rightarrow K^{(*)}a(\rightarrow \mu \mu )\), as mentioned above [73, 74]. For these reasons, we leave out of our analysis the constraints that would stem from the comparison of exclusive \(B\rightarrow K^{(*)}\mu \mu \) measurements with the SM expectation until further clarification is provided by the B-physics experiments.

  3. 3.

    Leptonic \(B_s\) and \(K_L\) decays: While the constraints in (1) and (2) above correspond to on-shell ALPs, off-shell contributions are relevant in leptonic meson decays. LHCb measured \({\mathcal {B}}(B_s\rightarrow \mu \mu )^{\mathrm {exp}}=(3.0 \pm 0.6^{+0.3}_{-0.2})\times 10^{-9}\) [81], which agrees with the SM prediction, \({\mathcal {B}}(B_s\rightarrow \mu \mu )^{\mathrm {SM}}=(3.65\pm 0.23)\times 10^{-9}\) [82]. The ALP contribution to this observable can be computed by a straightforward modification of the expressions provided in Ref. [61]. Similarly, we consider the kaon decay \({\mathcal {B}}(K_L\rightarrow \mu \mu )^{\mathrm {exp}}=(6.84 \pm 0.11)\times 10^{-9}\) [77]. In the latter case, we impose once again the conservative requirement that the ALP (short-distance) contribution does not saturate the \(2\sigma \) experimental values. When the complete set of electroweak couplings in Eq. (2) will be simultaneously considered for an off-shell ALP, the interference pattern in the amplitudes can be understood analogously to the separate discussion on production and decay for on-shell ALPs.

Table 1 ALP-mediated rare meson decays: interference pattern between \(c_{a\Phi }\) and \(c_W\) in ALP production and decay as a function of \(c_W/c_{a\phi }\) sign, by assuming \(c_B=c_W\). The \(a\rightarrow \gamma \gamma \) column assumes \(m_a<m_\pi \), see text for details
Fig. 5
figure 5

Visible ALP: constraints on the absolute value of \(c_W\) (left panel) and \(c_{a\Phi }\) (right panel) when these couplings are considered separately, as a function of the ALP mass and for \(f_a=1\) TeV. The exclusion contours follow from the experimental limits on \(K^+\rightarrow \pi ^{+} \, a(\rightarrow \mu \mu )\) (red) [76], \(B\rightarrow K^{(*)} \, a(\rightarrow \mu \mu )\) (orange) [73, 74], \({\mathcal {B}}(K_L\rightarrow \mu \mu )\) (green) [77], \({\mathcal {B}}(B_s\rightarrow \mu \mu )\) (blue) [81, 82], \({\mathcal {B}}(K\rightarrow \pi ee)\) (purple) [77] and \({\mathcal {B}}(K\rightarrow \pi \gamma \gamma )\) (cyan) [78]. The grey dashed lines are projections for the SHiP experiment [40]. The unconstrained regions in the range of the LHCb bounds correspond to the masses of several hadronic resonances which are vetoed in their analysis

Fig. 6
figure 6

Visible ALP: Allowed parameter space when the couplings \(\{c_{aW},c_{a\Phi }\}\) are simultaneously present, for \(f_a=1\) TeV and \(m_a=0.1\) GeV (upper plots) or \(m_a=0.3\) GeV (lower plots). The different flat directions observed in the figures correspond to the destructive interferences of both couplings in ALP production and/or the various ALP channel decays, which depend on the sign of \(c_W/c_{a\Phi }\). See text for details

In analogy with the case of the invisible ALP in the previous section, all of these data will be analyzed first within a one coupling at a time approach, where either only \(c_W\) or \(c_{a\Phi }\) are switched on (as \(c_B\) by itself cannot mediate FCNC processes). In a second step, the simultaneous presence of \(\{c_{a\Phi }, c_W , c_B\}\) will be taken into account. We assume \(c_B=c_W\) in the figures because \(c_B\) has only a modulating role, and this choice does not preclude or fine-tune any particular decay channel.

Figure 5 illustrates the allowed values of \(|c_W|\) (left panel) and \(|c_{a\Phi }|\) (right panel) in the one-coupling-at-a-time analysis, as a function of the ALP mass and for \(f_a=1\) TeV. Constraints from \({\mathcal {B}}(K\rightarrow \pi \mu \mu )\) [77] are not displayed, since they are superseded by NA48/2 constraints on long-lived particles in \(K^+\rightarrow \pi ^{+} \, a(\rightarrow \mu \mu )\) decays. The grey dashed lines are projections for the SHiP experiment [40]. The figure reflects the stringent constraints from LHCb searches for displaced vertices in the dimuon channel [73,74,75] for the large mass range \(m_a \in (2\mu , m_B-m_K)\), see point 1.(a) above. These limits are more constraining than the analogous searches performed in the kaon sector [76]. Remarkably, this is in contrast to the invisible scenario discussed in Sect. 4, for which kaon constraints are considerably stronger than those derived from B-meson decays if \(K^+ \rightarrow \pi ^+ a\) is kinematically allowed. These results, which take into account only one coupling at a time, could be of special interest in specific new physics scenarios. For instance, the case of a non-vanishing \(c_{a\Phi }\) with \(c_{B}\) and \(c_{W}\) disregarded (right panel) is motivated by perturbative models producing \(c_{a\Phi }\) at tree level but \(\{c_{B}, c_{W}\}\) only at loop level (e.g. \(c_{B} \sim c_{W} \simeq g^2/(16 \pi ^2)\,c_{a\Phi }\)). Nevertheless, in all generality and for a rigorous approach, the simultaneous presence of all couplings in the electroweak bosonic basis in Eq. (2) must be considered. This may essentially modify the bounds inferred, as discussed above and illustrated next.

Figure 6 depicts the bounds resulting when \(c_{a\Phi }\), \(c_W\) and \(c_B\) are simultaneously considered. Once again, in the ALP mass region in which B-physics data on displaced vertices apply, they are seen to be more constraining than the bounds inferred from the kaon sector, see Fig. 6c, d. Furthermore, the four panels in the figure clearly illustrate – for two values of \(m_a\) and \(c_W=c_B\) – the remarkable pattern of constructive/destructive interference expected from the analysis in Sect. 5 and Table 1. For instance, the two flat directions in the photonic channel in Fig. 6a result from destructive interference in both production and decay for positive \(c_W/c_{a\Phi }\) and \(m_a< m_\pi \). The rest of the figures can be analogously understood. Once again, the various flat directions in different channels call for complementarity with collider data and other experimental projects. In particular, the degeneracy in parameter space which induces the flat direction in Fig. 6a and Fig. 6c, common to all rare decay channels discussed in this work, could be resolved by LHC data. Some of the flat directions appearing in ALP decays (cf. e.g. Fig. 6d) could also be probed by proposed beam-dump experiments such as SHiP, since they can measure ALP decays into both photons and muons, and because \({\mathcal {B}}(a\rightarrow \mu \mu )\) and \({\mathcal {B}}(a\rightarrow \gamma \gamma )\) do not simultaneously vanish. This is also true for various LHC searches, and so both experiments could be good handles on removing flat directions, though a full analysis is beyond the scope of this work.

6 Conclusions

The field of axions and ALPs is blooming, with an escalation of efforts both in theory and experiment. Theoretically, the fact that no new physics has shown up yet at colliders or elsewhere positions the SM fine-tuning issues as the most pressing ones and leads to further implications for our perspective of dark matter. The silence of data is calling for a rerouting guided by fundamental issues such as the strong CP problem and an open-minded approach to hunt for the generic tell-tale of global hidden symmetries: derivative couplings as given by axions (light or heavy) and ALPs. Experimentally, the worldwide program to hunt specifically for axions and ALPs is growing fast. At the same time, other experimental programs are realizing their potential to tackle the axion and ALP parameter space, e.g. the LHC and beam dump experiments.

In the absence of data supporting any concrete model of physics beyond the SM, effective Lagrangians provide a model-independent tool based on the SM gauge symmetries. Very often the effective analyses rely on considering one effective coupling at a time, though, instead of the complete basis of independent couplings. The time is ripe for further steps in the direction of a multi-parameter analysis of the ALP effective field theory, and this is the path taken by this work.

We have considered the impact on FCNC processes of the complete basis of bosonic electroweak ALP effective operators at leading order (dimension 5), taking into account the simultaneous action of those couplings. As this basis is flavor-blind, its impact on flavor-changing transitions (e.g. \(d_i\rightarrow d_j a\), with \(i\ne j\)) starts at loop level. Indeed, the experimental accuracy achieved on rare-decay physics, as well as on limits of ALP couplings to photons, electrons, and nucleons, is so stringent that loop-induced contributions may provide the best bounds in a large fraction of the parameter space.

We first revisited previous results in the literature, which had been derived considering just one operator at a time. We studied next the simultaneous action of the various electroweak couplings. An interesting pattern of constructive/destructive interference has been uncovered, which depends on the relative sign of the couplings and on the channel and mass range considered. In this way, the previous very stringent bounds stemming from kaon and B-decay data are alleviated. Furthermore, LHC searches for light pseudoscalar particles have been highlighted as more important in regions where deconstructive interference weakens flavor bounds. While they are generally considerably less sensitive than flavor observables, LHC searches are shown to provide complementary information to low-energy probes, exploring otherwise inaccessible directions in the ALP parameter space. We have also explicitly illustrated how they can overcome some of the blind directions on rare meson decays identified here.

We have derived the most up-to-date constraints on the effective electroweak ALP parameter space for two well-motivated scenarios: (i) an ALP decaying into channels invisible at the detector; (ii) an ALP decaying into \(\gamma \gamma \), ee and/or \(\mu \mu \). The conclusion is that searches for \(K\rightarrow \pi \nu {\bar{\nu }}\) decays provide the most stringent constraints in the first case. In contrast, for the second scenario, the strongest constraints arise from searches at LHCb for long-lived (pseudo)scalars (displaced vertices) in the decays \(B\rightarrow K^{(*)} a(\rightarrow \mu \mu )\). This illustrates beautifully the potential of flavor-physics observables to constrain new physics scenarios. These searches will be improved in the years to come thanks to the experimental effort at NA62, KOTO, LHCb and Belle-II, providing tantalizing oportunites to discover new physics, complementary to the direct searches performed at the LHC.

Much remains to be done to fully encompass the ALP parameter space. For instance, the anomalous ALP gluonic coupling has not been considered in this work. Even if it cannot mediate FCNC processes, it may impact our results for the visible ALP via the quantitative modification of the branching ratios. In fact, recent ALP analyses of FCNC decays [50] take into account the simultaneous presence of the gluonic coupling and just one electroweak ALP coupling, but no work considers all ALP bosonic couplings together, let alone the complete basis of operators including the most general fermionic ones. This effort is very involved and will be the object of future work. In a different realm, note that the type of effective operators considered above assumes a linear realization of electroweak symmetry breaking; the alternative of analyzing ALP FCNC processes via the non-linear effective SM Lagrangian is pertinent and also left for future consideration.