1 Introduction

Although the discovery of the Higgs boson and its properties have represented a major advance for verifying the mass generation mechanism through spontaneous symmetry breaking, along with its consequences, an explanation for the values of most of the elementary particles masses is still missing. It is understood in the standard model (SM) that the photon has zero mass due to an unbroken gauge symmetry, and the weak vector bosons W and Z 0 have interdependent masses resulting from the electroweak symmetry breakdown. Still, according to the Standard Model, the Higgs boson and all the charged fermions have arbitrary nonzero masses, with the neutrinos being massless. This last feature is in contradiction with the neutrinos oscillation phenomena, whose description requires nonzero neutrino mass differences. As a matter of fact, the present experimental limits show that neutrinos are ultralight compared to the other known massive particles. All of this might suggest that a new principle or mechanism is necessary to reach a more satisfactory understanding of the elementary particles masses.

It was found in ref. [1] that the peak of a function constructed by multiplying the basic fourteen Standard Model branching ratios of the Higgs boson decay channels occurs for a Higgs boson mass which is in good agreement with the experimental value measured at the LHC [2]. Additionally, it was also mentioned in ref. [1] a possible analogy with some sort of entropy arguing that the mass of the Higgs boson has a value that allows for the largest number of ways of decays into elementary particles. The work of ref. [3] indeed showed that the value of the Higgs boson mass results from the maximum entropy principle (MEP) [4,5,6], where the information entropy is suitably defined in terms of the Higgs branching ratios into Standard Model particles, furnishing the most accurate theoretical Higgs boson mass determination to date.

Our premise in invoking MEP is to consider an ensemble of N non-interacting identical spinless particles which have m basic decay modes. The probability that the ensemble evolves to a final state configuration, with n 0 bosons decaying into the mode with branching ratio B R 0, n 1 bosons decaying into the mode with branching ratio B R 1, and so on until n m bosons decaying into the mode having branching ratio B R m , is given by the following multinomial distribution

$$ \text{Pr}(\{n_{k}\}_{k=0}^{m})\equiv \frac{N!}{n_{0}!n_{1}!{\cdots} n_{m}!} \prod\limits_{k=0}^{m}(BR_{k})^{n_{k}}, $$
(1)

in which \({\sum }_{k=0}^{m}BR_{k}=1\), and \({\sum }_{k=0}^{m}n_{k}=N\). From these probabilities, the Shannon entropy [7] associated to the evolution of the initial ensemble to the final state in which all N scalars have decayed is given by

$$ S_{N}=-\sum\limits_{\{n_{i}\}}^{N}\text{Pr}(\{n_{k}\}_{k=0}^{m})\,\mathrm{ln\,Pr} (\{n_{k}\}_{k=0}^{m})\,, $$
(2)

with \({\sum }_{\{n_{i}\}}^{N}(\bullet )\equiv {\sum }_{n_{0}=0}^{N}{\sum }_{n_{1}=0}^{N}{\cdots } {\sum }_{n_{m}=0}^{N}(\bullet )\times \delta \left (\vphantom {{\sum }_{i=0}^{m}} N-\right . \left .{\sum }_{i=0}^{m}n_{i}\right ) \) [3]. The entropy S N is a function of unknown quantities as masses of particles and coupling constants entering in the branching ratios B R k . We propose that such quantities can be inferred through maximization of S N , also taking into account constraints that may enter as prior information. Similar approaches using information and configurational entropies can be found, for example, in refs. [8,9,10,11,12,13].

Our aim in this work is to use MEP to infer the mass of the axion, taking into account a model in which the axion decays to neutrinos and photons. More specifically, we made the following assumptions on the axion field, A(x). First, its particle excitation, the axion, decays dominantly into a pair photons and also into a pair of neutrinos. Second, its low energy effective Lagrangian describing the interactions with photons and neutrinos is given by

$$\begin{array}{@{}rcl@{}} \mathcal{L}_{eff}=\frac{1}{2}\partial_{\mu }A\,\partial^{\mu }A&-&\frac{1}{2} {m_{A}^{2}}\,A^{2}-\frac{g_{A\gamma }}{4}\,A\,F_{\mu \nu }{\widetilde{F}}^{\mu \nu }\\ &-&\frac{g_{A\nu }}{2}\,\overline{\nu }_{i}\gamma^{\mu }\gamma_{5}\nu_{i}\,\partial_{\mu }A \end{array} $$
(3)

where F μν is the electromagnetic field strength with \(\widetilde {F }^{\mu \nu }\equiv \epsilon ^{\mu \nu \lambda \rho }F_{\lambda \rho }/2\) its dual; ν i (x), i = 1, 2, 3, denote the mass eigenstates neutrinos fields; m A is the axion mass; f A the axion decay constant, which is a high energy scale; and g A γ and g A ν are the axion-photon and axion-neutrino coupling constants, respectively. Another important remark is that we are tacitly assuming that neutrinos are Dirac fermions. Therefore, the coupling between the axion and the neutrinos vanish in the massless limit. In the Appendix A, we show an example of an ultraviolet completed model leading to the effective Lagrangian in (3).

The axion is a hypothetical pseudo Nambu-Goldstone boson remnant of an anomalous U(1) symmetry, spontaneously broken at the energy scale f A , present in extensions of the Standard Model motivated to solve the strong CP problem through the Peccei-Quinn mechanism [14,15,16] (for a review of the strong CP problem and axions, see, for example, refs. [17, 18]). Experiments searching for the axion constrain f A to be much above the electroweak scale, i.e., f A v = 246 GeV. Consequently, the axion interacts very weakly with all other particles by the reason that the associated coupling constants are suppressed by f A . We define the axion-neutrino coupling constant in (3) as

$$ g_{A\nu }=\frac{C_{A\nu }}{f_{A}}, $$
(4)

where C A ν is the coefficient of the axion-neutrino coupling which depends on the ratios of vacuum expectation values (see Appendix A). The axion-photon coupling constant is, by its turn,

$$ g_{A\gamma }=\frac{\alpha }{2\pi f_{A}}{\widetilde{C}}_{A\gamma }\,, $$
(5)

with α the fine structure constant, and the coefficient of the axion-photon coupling

$$ {\widetilde{C}}_{A\gamma }=\left( \frac{C_{a^{\prime }\gamma }}{C_{a^{\prime }g}}-\frac{2}{3}\,\frac{4+m_{u}/m_{d}}{1+m_{u}/m_{d}}\right) \,. $$
(6)

In the coefficient \({\widetilde {C}}_{A\gamma }\) the anomaly coefficients \( C_{a^{\prime }\gamma }\) and \(C_{a^{\prime }g}\) are model dependent and typically of order one, with m u /m d ≈ 0.56 the ratio of up and down quark masses. Such coefficients for different models, as well as other features of the axion, can be found in [19]. Additionally, the axion mass is also suppressed by the energy scale f A and given by [15]

$$ m_{A}\simeq \frac{m_{\pi }f_{\pi }}{f_{A}}\frac{\sqrt{m_{u}/m_{d}}}{ 1+m_{u}/m_{d}}\approx 0.48\frac{m_{\pi }f_{\pi }}{f_{A}}, $$
(7)

in which m π and f π are the pion mass and its decay constant, respectively. This makes the axion an ultralight particle for f A ≫ 246 GeV.

Our paper is organized as follows: In the next section, we briefly discuss how we intend to use the MEP in order to obtain the axion mass considering the lightest neutrino mass and the coupling constants as inputs. In Section 3, we present a first part of our results which consider axions decaying into pairs of neutrinos and pair of photons, and a second part of the results corresponding to axions decaying only into pair of the lightest neutrinos, in addition to a pair of photons, which is presented in Section 4. We present our conclusions in Section 5.

2 Parameters Inferences from MEP

Let us consider an initial state ensemble with a very large number N of axions. After a time t ≫ 1/Γ, with the total axion decay width given by the sum of the partial decay widths into a pair of photons, Γ0, plus the ones into pairs of neutrinos, Γ i , i. e.,

$$ {\Gamma} ={\Gamma}_{0}+\sum\limits_{i}{\Gamma}_{i}\;, $$
(8)

the initial state ensemble evolves to a final state bath of photons and neutrinos. The summation above (and below) extends only over those neutrinos whose masses are less than m A /2. If the axion is massive enough, it could decay in all the three active neutrinos, where i = 1, 2, 3.

The axion partial decay widths derived from the effective Lagrangian in (3) are

$$\begin{array}{@{}rcl@{}} {\Gamma}_{0} &=&\frac{g_{A\gamma }^{2}}{64\pi }{m_{A}^{3}}\,, \end{array} $$
(9)
$$\begin{array}{@{}rcl@{}} {\Gamma}_{i} &=&\frac{g_{A\nu }^{2}}{8\pi }m_{A}m_{{i}}^{2}\beta_{i}\,, \end{array} $$
(10)

where m i is the i th neutrino mass and \(\beta _{i}=\sqrt {1-\frac { 4{m_{i}^{2}}}{{m_{A}^{2}}}}\).

These widths lead to the following branching ratios for the axion decaying into a pair of photons and into pairs of neutrinos

$$\begin{array}{@{}rcl@{}} BR_{0} &=&\frac{{\Gamma}_{0}}{\Gamma }=\left[ 1+\sum\limits_{i}32\pi^{2}\frac{ r_{\nu }^{2}}{\alpha^{2}}\frac{{m_{i}^{2}}\beta_{i}}{{m_{A}^{2}}}\right]^{-1}\,, \end{array} $$
(11)
$$\begin{array}{@{}rcl@{}} BR_{i}&=&\frac{{\Gamma}_{i}}{\Gamma }=\left[ 1+\frac{\alpha^{2}}{32\pi^{2}r_{\nu }^{2}}\frac{{m_{A}^{2}}}{{m_{i}^{2}}\beta_{i}}+\sum\limits_{j\neq i}\frac{ {m_{j}^{2}}\beta_{j}}{m_{{i}}^{2}\beta_{i}}\right]^{-1}\,, \end{array} $$
(12)

in which we define \(r_{\nu }=\left \vert C_{A\nu }/\widetilde {C}_{a\gamma }\right \vert \) as the ratio of the anomaly coefficients of the axion-neutrino coupling, C A ν , and the axion-photon coupling, \({ \widetilde {C}}_{A\gamma }\). This ratio is equivalent to the ratio of the associated coupling constants given by \(\left \vert g_{A\nu }/g_{A\gamma }\right \vert =2\pi r_{\nu }/\alpha \).

There are many possible final states characterized by the number of axions which decay into a pair of photons, n 0, and by the numbers axions which decay into each possible pair of neutrinos, n i , i = 1, 2, 3. Considering that the axion can decay into the three neutrinos plus the photon, according to (1), the probability that the ensemble of N axions decay into a particular final state characterized by n 0, n 1, n 2, and n 3 is

$$ \text{Pr}(\{n_{k}\}_{k=0}^{3})=\frac{N!}{n_{0}!n_{1}!n_{2}!n_{3}!} \prod\limits_{k=0}^{3}(BR_{k})^{n_{k}}\;. $$
(13)

The entropy function is then constructed from (2) summing over all the partitions satisfying \({\sum }_{k=0}^{3}n_{k}=N\).

According to MEP, the initial ensemble evolves to a final state of maximum entropy permitting us to infer the values of the unknown quantities through maximization of S N . We do this by taking into account the prior information of the neutrinos masses from the best fit mass squared differences determined by the data of neutrinos oscillations, considering the normal hierarchy pattern [20],

$$\begin{array}{@{}rcl@{}} {\Delta} m_{12}^{2} & = & {m_{2}^{2}}-{m_{1}^{2}} = (7.45\pm 0.25)\times 10^{-5}\,\text{eV}^{2} \\ {\Delta} m_{31}^{2} & = & {m_{3}^{2}}-{m_{1}^{2}} = (2.55\pm 0.05)\times 10^{-3}\,\text{eV}^{2}. \end{array} $$
(14)

Thus, S N depends, effectively, on the three parameters m A , r ν , and the lightest neutrino mass which we denote as m ν . Finally, our analysis do not depend on the neutrinos mass hierarchy pattern, the same results are obtained if the inverted hierarchy is assumed.

Contrary to the situation of the Higgs boson mass inference carried out in ref. [3], where the Higgs mass was the last independent Standard Model parameter to be determined, in the model under study we have three independent parameters as we just discussed. In principle, MEP could force all these parameters to be fixed at the global maximum of the entropy. However, this is not the case of the Standard Model, for example. The Higgs mass does not correspond to a global maximum of S, but just to a constrained maximum. Some parameters of the Standard Model are related by its symmetries imposing strong constraints on these parameters, for example, the W and Z bosons masses. This fact reveals that not all the parameters of the model might be determined by the MEP. Nevertheless, the success in the Higgs mass prediction suggests that MEP can be useful in inferring the masses of scalar particles when the other parameters are fixed by some different mechanism.

In this work, we remain agnostic about the type of inference that MEP can actually perform, waiting for the experimental evidence to settle that. Therefore, we take two main hypothesis: first, the entropy function is maximized in m A only, with the other parameters considered as prior information, that is, S N S N (m A |m ν , r ν ); second, a more restrict hypothesis: S N S N (m A , m ν , r ν ) where we will show that MEP can be more determinant in the sense that having information about one of the parameters: (i) axion mass, (ii) neutrino mass, and (iii) ratio of the coupling constants, the other two parameters can be uniquely determined by this principle.

The ratio r ν and the lightest neutrino mass m ν are free parameters which should be determined prior to the inference of axion mass. However, it is important to mention that the Shannon entropy in our formulation is not defined as a entropy per particle, a function of an energy per particle, as is usual in Thermodynamics. This does not represent a problem, however. First of all, our weights in Shannon entropy are not Boltzmann weights of some known problem of interacting particles in contact with a thermal reservatory exactly as is studied in Statistical Mechanics. Only in this situation, the Shannon entropy should be equivalent to Boltzmann entropy definition. Nevertheless, even without this equivalence it is important to mention that MEP is a more general and universal method than the Statistical Mechanics approach, and its basis are governed by probability theory. In this context, we can write the entropy of any probability distribution and maximize it in relation to their physical parameters.

We impose the most recent experimental constraints from the neutrino oscillation experiments summarized in ref. [20]

$$\begin{array}{@{}rcl@{}} m_{1}=m_{\nu }\;\;,\;\;m_{2}&=&\sqrt{m_{\nu }^{2}+7.45\times 10^{-5}} \\ m_{3}&=&\sqrt{m_{\nu }^{2}+2.55\times 10^{-3}}\;. \end{array} $$
(15)

The upper bound from the Planck Collaboration measurements of CMB anisotropies [21] for the sum of the neutrino masses, along with (15) for a massless lightest neutrino, translate to the following constraint that will be taken into account in our inference process

$$ 0.059\text{ eV}<\sum\limits_{i=1}^{3}m_{i}<0.23\;\text{eV,} $$
(16)

which implies the interval 0 < m ν < 0.0712 eV, for the lighest neutrino mass. In the next sections we will present our results.

3 Results I: Axions Decaying into the Three Neutrinos and Photons

First, we suppose that the axion is heavy enough to decay into a pair of photons and all the three neutrinos.

The entropy, given by (2), can be written as [22]

$$ S_{N}(m_{A}|m_{\nu },r_{\nu })=S(m_{A}|m_{\nu },r_{\nu })+\frac{3}{2}\ln (2\pi Ne)+O\left( \frac{1}{N}\right) $$
(17)

with

$$ S(m_{A}|m_{\nu },r_{\nu })=\frac{1}{2}\ln (BR_{0}BR_{1}BR_{2}BR_{3}) $$
(18)

where the branching ratios are related as \({\sum }_{i=0}^{3}BR_{i}=1\).

It can be shown that the global maximum of S is obtained when all the branching ratios are equal

$$ BR_{i}(m_{A},m_{\nu },r_{\nu })=\frac{1}{4}\;,\;i=0,1,2,3 $$
(19)

However, it is also possible that this type of solution cannot be attainable if further constraints arise from an UV complete theory in which the parameters do not allow that the (19) be satisfied. If further constraints are absent or if they are weak, then we should expect that nearly equal branching ratios constitute another prediction coming from MEP.

First of all, we investigate the entropy supposing that MEP fixes just the axion mass, i.e., S = S(m A |m ν , r ν ), with the other parameters either previously known or at least bounded by some data. As we have just discussed, this might be the case if the UV complete theory fixes somehow the r ν of the model and possibly other parameters. In our approach, we, therefore, allow m ν to vary according to the constraints of (15), (16). In this case, for each fixed m ν and r ν , we just seek for a solution (m A ) that maximizes S(m A |m ν , r ν ) given by (18). The light green shaded area of Fig. 1 shows the MEP inference for the axion mass for the range of the r ν taking into account the neutrino masses constraints from (16).

Fig. 1
figure 1

Inferred axion mass region from MEP, taking into account the constraints from (16). The gray curves correspond to the axion mass maximizing the entropy for the case in which the maximum lightest neutrino mass (upper curve) and the minimum lightest neutrino mass (lower curve) are assumed. The light green shaded region correspond to all the maximum entropy points, assuming that the lightest neutrino mass is in the interval allowed by 16. The blue curve assumes that the maximum of entropy fixes all the three parameters as discussed in text. The inset plot corresponds to a zoom of the blue line, where the dashed curve is just an example of curve in the allowed region corresponding to an intermediate value of lightest neutrino mass: \(m_{\protect \nu }=0.03\) eV between the bounds. This curve crosses the blue line in (r ν , m A ) ≡ (0.00174,0.12), which is the point of the highest entropy in this dashed line. Finally, the dashed red line shows the limit of acceptable values of r ν for the DFSZ-type model which we present in the Appendix (A), while the dashed dark green line shows the limit from astrophysics for the \(r_{\protect \nu }\) (see the text)

The gray curves in Fig. 1 correspond to axion mass maximizing the entropy for the case in which the maximum lightest neutrino mass (upper curve) and the minimum lightest neutrino mass (lower curve) were assumed. Such curves can be identified as iso-lightest-neutrino-mass curves in the diagram r ν × m A . The following plots (a) and (b), in Fig. 2 show the maximum entropy values assumed in each point of gray lines respectively for the minimum (m ν = 10−8 eV) and the maximum (m ν = 0.0721226 eV) lightest neutrino mass. It is important to mention that even when we used m ν = 10−7 eV, the same result was obtained and the gray line remains the same.

Fig. 2
figure 2

The following plots (a) and (b) show the maximum entropy values assumed in each point of the gray lines (see Fig. 1) respectively for the minimum (\(m_{\protect \nu }=10^{-8}\) eV) and the maximum (\(m_{\protect \nu }=0.0721226\) eV) lightest neutrino mass. Both gray lines cross the blue line (see again Fig. 1) exactly in the points that correspond to the peak presented in the plots (a) and (b)

This suggests that we can look to MEP in a more restrict optimization than just seeking for a solution m A , given m ν and r ν . A stronger inference can be made by optimizing SS(m A , m ν , r ν ), the result is represented by the blue line of Fig. 2, where given one of the parameters, is possible to obtain the other two ones. We give a more detailed explanation as follows. The boundary gray lines cross the blue line exactly in the points that correspond to the peak presented in the plots (a) and (b) in the same Fig. 2. The inset plot in Fig. 1 corresponds to a focused area of the blue line region. Just to give an example, for r ν = 0.00174, the point over the blue line, of the highest entropy in the green region, corresponds to m A = 0.12 eV and m ν = 0.03 eV. In this same inset plot, the dashed black curve in the allowed region corresponds exactly to the intermediate value of lightest neutrino mass: m ν = 0.03 eV. This iso-lightest-neutrino-mass curve is exactly between the limiting lines (gray lines) as expected. Therefore, the light green region can be understood as a family of iso-lightest-neutrino-mass curves of the diagram r ν × m A , which cross the blue line which is composed by the optimal values found in each iso-lightest-neutrino-masses curves. In other words, the blue line is composed by values for which MEP fixes all the three parameters.

The dashed red line in Fig. 1 shows the limit to allowed values of r ν for the DFSZ-type model with right-handed neutrinos which we present in Appendix (A). In the same Fig. 1, the dashed dark green line shows the limit from astrophysics on the coupling (see below).

The inference is stronger for small r ν up to ∼ 0.0017, for higher values, the upper boundary of the Fig. 1 increases toward higher axion masses while the lower boundary remains nearly constant with r ν . Note that, under this hypothesis, we need to know r ν and m ν in order to get m A . The DFSZ model with right-handed neutrinos restricts \(r_{\nu }<0.46\cos ^{2}\beta \), which determines the dashed red line (see Appendix (A)).

We observe that under the consideration of more restrict optimization (blue line in Fig. 1), MEP constrains the axion mass to lie within

$$ 0.1\;\text{eV}<m_{A}<0.2\;\text{eV}. $$
(20)

We observe that the interval in (20) is in the threshold of the projected sensitivity of the IAXO experiment [23]. This shows that our hypothesis can be tested experimentally in the near future.

We should mention that the interval of (20) for the axion mass derived from MEP is compatible with the limits from astrophysics (compilation of the actual astrophysical limits on the axion mass and coupling constants are given in ref. [24]). For example, studies concerning the evolution of stars on the horizontal branching [25] put the constraint m A < 0.5 eV (f A > 1.3 × 107 GeV) on the DFSZ model. There is an astrophysical limit that could potentially impact on the interval in (20), but it also depends on the axion-electron coupling. A bound from red giants in the Galactic globular cluster M5 provided \(m_{A}\,\cos ^{2}\beta <15.3\) meV at 95% CL [26], in the DFSZ model having the axion-electron coupling coefficient \(C_{Ae}=\cos ^{2}\beta /3\) Footnote 1 . Taking into account (20), it means that \(\cos ^{2}\beta <0.0752\) implying that r ν < 0.034, with C A ν = −C A e . This restriction on r ν is shown in Fig. 1 (dark green dashed line) which leads to 0.1 eV < m A < 6.3 eV for the DFSZ model with the right-handed neutrinos which we present in Appendix (A).

4 Results II: Axions Decaying into the Lightest Neutrinos and Photons

Let us suppose now that the axion has only two decay modes: one into a pair of photons, and the other into a pair of the lightest neutrinos. In this case, the two relevant branching ratios are

$$ BR_{0}\,=\,\left[1\,+\,32\pi^{2}\frac{r_{\nu }^{2}}{\alpha^{2}}\frac{m_{\nu }^{2} }{{m_{A}^{2}}}\left( 1\,-\,\frac{4m_{\nu }^{2}}{{m_{A}^{2}}}\right)^{1/2}\right]^{-1}\;\;\text{and}\;\;BR_{1}\,=\,1\,-\,BR_{\gamma \gamma } $$
(21)

The probability of N axions decaying in n 0 photon pairs and n 1 = Nn 0 neutrino pairs is

$$ \Pr (N;n_{0})=\frac{N!}{n_{0}!(N-n_{0})!}BR_{0}^{n_{0}}(1-BR_{0})^{N-n_{0}} $$
(22)

In the limit \(N\rightarrow \infty \), by the central limit theorem, \(\Pr (N;n_{0})\rightarrow \frac {\exp \left [ -\frac {(n_{0}-N\cdot BR_{0})^{2}}{ 2NBR_{0}(1-BR_{0})}\right ] }{\sqrt {2\pi N\cdot BR_{0}(1-BR_{0})}}\) and the entropy can be written as

$$\begin{array}{@{}rcl@{}} S_{N} &=&S_{_{0}}+\frac{1}{2}\ln (2\pi Ne)+O(\frac{1}{N}) \end{array} $$
(23)
$$\begin{array}{@{}rcl@{}} S_{_{0}}(m_{A},m_{\nu },r_{\nu }) &=&\frac{1}{2}\ln \left[ BR_{0}\cdot (1-BR_{0})\right] \;. \end{array} $$
(24)

In this case, the maximum of S N given by (23) occurs for

$$ BR_{0}=BR_{1}=1/2\;, $$
(25)

in close analogy to 19.

In the Appendix (B), we derive the algebraic solutions to this equation in details. An important difference to the previous case is that, as shown in the Appendix (B), the solutions of (25) can only be found in terms of the ratio zm ν /m A . This implies that one parameter remains necessarily free in the inference method. There are two interesting asymptotic regimes which we want to discuss. First z ≈ 1/2, and second z < < 1.

The first one is the threshold regime where m A ≈ 2m ν . This is a type of solution which we also found in the case where the Higgs boson has an additional decay channel to dark matter [3].

The second interesting regime occurs when m ν < < m A , in this case, it is possible to show that (see Appendix B)

$$ \frac{m_{\nu }}{m_{A}}\approx \frac{\sqrt{1-\sqrt{1-\frac{\alpha^{2}}{4\pi^{2}r_{\nu }^{2}}}}}{2} $$
(26)

Moreover, as this relation should be positive, there is a lower bound on r ν given by

$$ r_{\nu }=\frac{\sqrt{3\sqrt{3}}}{4\pi }\alpha =0.00132 $$
(27)

which is a solution of the fourth order equation in z explicit in the Appendix B. This is compatible with the inferred couplings in the case of three neutrinos studied in the previous section.

In the Fig. 3, we show the inferred masses as a function of the axion-neutrino coupling. The upper plot in Fig. 3 displays the z < < 1 regime. The axion and the neutrino masses can be very different depending on r ν which varies from the smallest possible value of 0.00132, for which z is positive, up to the maximum value of 0.46 allowed by the axion model under consideration. In the lower plot, we show the other interesting regime where m A ≈ 2m ν , again allowing the range 0.00132 < r ν < 0.46. In this former case, the mass inference is barely dependent of the coupling constant. In both plots, we also show the region of the axion mass parameter where the axion can constitute, at least, part of the cold dark matter of the Universe and be detected by a haloscope like [27], or other proposed experiments as [28,29,30].

Fig. 3
figure 3

Linear relations predicted by maximizing the Shannon entropy in the case where the axion is allowed to decay just to the lightest neutrino and photons. In the plot (a), we show the z ≈ 1/2 regime, and in the plot (b), the z < < 1 regime

Recently, the axion mass was calculated with lattice QCD methods [31, 32] to lie in the range 10−6 eV < m A < 1.5 × 10−3 eV which fits exactly in the bulk of the region shown inside the bars in Fig. 3. If the axion mass confirms the lattice QCD result, one can use the MEP prediction to bound the lightest neutrino mass and the coupling constant with the results presented in this work. Once confirmed, this would add a strong evidence in favor of MEP as a valuable inference tool in particle physics.

5 Conclusions

In this paper, we employed the maximum entropy principle to a model where axions couple to photons and neutrinos. By demanding that the Shannon entropy of an ensemble of axions decaying to photons and neutrinos be maximized, we made inferences about the masses of the axion taking into account its relationship with neutrinos masses and r ν , which is proportional to the ratio of axion-neutrino coupling constant and axion-photon coupling constant.

In the case where the axion decays into the three neutrino mass eigenstates, and taking the hypothesis that the entropy is assumed to be a function of the axion mass, the lightest neutrino mass, and the r ν which is the ratio of the axion coupling constants, MEP is able to make a sharp prediction: 0.1 eV < m A < 0.2 eV. On the other hand, if r ν and m ν are given as inputs, i.e., S = S(m A |r ν , m ν ), considering the DFSZ model with right-handed neutrinos, MEP jointly with astrophysical bounds, constrain the axion mass to be 0.1 eV < m A < 6.3 eV.

If the axion decays only into a pair of the lightest neutrinos and photons, the inference has two regime of solutions allowing the axion as a dark matter candidate. First, when the axion mass is much larger than the lightest neutrino mass, (z < < 1), the inference of the axion mass has a strong dependence on the ratio of the coupling constant (r ν ) as shown in Fig. 3a. On the other hand, if m A ≈ 2m ν (z ≈ 1/2), the axion mass has a very weak dependence on r ν as expected (see Fig. (3)b). For example: if m A = 10−4 eV, and z ≈ 1/2, the MEP fixes the lightest neutrino mass around 5 × 10−5 eV; However, if z < < 1, the MEP predicts lightest neutrino mass within 10−8eV < m ν < 10−5eV. In this case, for example, if r ν = α, then m ν = 7 × 10−6 eV.

Finally, we would like to stress that MEP can furnish a sharp prediction if the neutrino mass is determined and knowing r ν from some model as shown in our Fig. 1.