1 Introduction

Despite the glorious successes of the standard model of particle physics, the generations of fermion masses remain one of the most fundamental but unsolved problems therein. These masses are treated as free parameters in the standard model, which seem to be rather dispersed and unrelated and can only be determined experimentally. Therefore, it is reasonable to first seek some phenomenological relations of these masses in order to reduce the number of free parameters, and this will significantly help us for the future model buildings in and beyond the standard model.

Among the existing phenomenological mass relations, an empirical formula suggested by Koide three decades ago [13], is one of the most accurate,

$$\begin{aligned} k_l=\frac{m_e+m_\mu +m_\tau }{(\sqrt{m_e}+\sqrt{m_\mu }+\sqrt{m_\tau })^2}=\frac{2}{3}, \end{aligned}$$

where \(m_e\), \(m_\mu \), and \(m_\tau \) are the pole masses of three charged leptons: electron, muon, and tauon, respectively. From the current experimental data [best-fit (\(\pm 1\sigma \))] [4]:

$$\begin{aligned}&m_e=(0.510998928 \pm 0.000000011)~\mathrm{MeV},\\&m_\mu =(105.6583715 \pm 0.0000035)~\mathrm{MeV},\\&m_\tau =(1776.82 \pm 0.16)~\mathrm{MeV}, \end{aligned}$$

it is straightforward to see the amazing precision of this simple formula,

$$\begin{aligned} k_l=\frac{2}{3}\times \left[ 1\pm \mathcal {O}\left( 10^{-5}\right) \right] . \end{aligned}$$

Besides, it is very interesting to see that \(k_l\) lies exactly in the middle of the two extremes of 1/3 (exact democracy in lepton mass spectrum) and 1 (extreme hierarchy) [5]. Furthermore, although the lepton masses are hierarchical, the exclusion of the smallest mass \(m_e\) still makes \(k_l\) notably deviate from 2/3.

This remarkable precision has aroused in both theorists [525] and phenomenologists [2634] a longtime interest in the Koide formula, but unfortunately the underlying physics remains incomplete. The previous studies can be classified into three categories: (1) to explore the possible physical origin of the Koide formula (e.g. in a supersymmetric model [19] or in an effective field theory [2022]); (2) to generalize the Koide formula from charged leptons to neutrinos and quarks (see Refs. [33] and [34] for example); (3) to examine the energy scale dependence of the Koide formula [18] (i.e. to check the stability of the Koide formula against radiative corrections, with the pole masses replaced by the running masses of fermions). In the present paper, we focus on the latter two aspects for two extended Koide formulas suggested by Kartavtsev in Ref. [35]. Before doing so, we briefly review the relevant work on the extensions of the Koide formula from charged leptons to neutrinos and quarks.

In Ref. [33], the Koide formula was extended to the sectors of neutrinos, up-type, and down-type quarks,

$$\begin{aligned}&k_\nu =\frac{m_1+m_2+m_3}{(\sqrt{m_1}+\sqrt{m_2}+\sqrt{m_3})^2},\nonumber \\&k_\mathrm{up}=\frac{m_u+m_c+m_t}{(\sqrt{m_u}+\sqrt{m_c}+\sqrt{m_t})^2}, \nonumber \\&k_\mathrm{down}=\frac{m_d+m_s+m_b}{(\sqrt{m_d}+\sqrt{m_s}+\sqrt{m_b})^2}, \end{aligned}$$
(1)

where \(m_1\), \(m_2\), \(m_3\) are the masses of three neutrinos, and \(m_u\), \(\ldots \), \(m_b\) are the masses of six quarks. It was found that these naive extensions of the Koide formula failed to be valid, as \(k_{\nu }=0.33\sim 0.57\), \(k_\mathrm{up}=0.73\sim 0.93\), and \(k_\mathrm{down}=0.60\sim 0.80\). In Ref. [34], Rodejohann and Zhang generalized the Koide formula to the quark sector in another way, not according to their charges or isospins, but to their masses. For the light (u, d, s) and heavy (c, b, t) quarks, they separately introduced

$$\begin{aligned}&k_\mathrm{light}=\frac{m_u+m_d+m_s}{(\sqrt{m_u}+\sqrt{m_d}+\sqrt{m_s})^2}, \nonumber \\&k_\mathrm{heavy}=\frac{m_c+m_b+m_t}{(\sqrt{m_c}+\sqrt{m_b}+\sqrt{m_t})^2}, \end{aligned}$$
(2)

and found a better accordance with the Koide formula, \(k_\mathrm{light}=0.49\sim 0.65\) and \(k_\mathrm{heavy}=0.66\sim 0.68\).

Recently, Kartavtsev proposed a new extension [35], including the neutrino masses in the original Koide formula,

$$\begin{aligned} K_l&=\frac{\sum _l m_l}{(\sum _l\sqrt{m_l})^2}\nonumber \\&=\frac{m_1+m_2+m_3+m_e+m_\mu +m_\tau }{(\sqrt{m_1}+\sqrt{m_2}+\sqrt{m_3}+\sqrt{m_e}+\sqrt{m_\mu }+\sqrt{m_\tau })^2}, \end{aligned}$$
(3)

where the sum is democratically over all the six leptons. Since the neutrino masses are tiny compared to those of charged leptons, it is quite possible that the exact ratio 2/3 could be attained when they are taken into account. Another motivation for this extension is from the quark sector,

$$\begin{aligned} K_q&=\frac{\sum _q m_q}{(\sum _q\sqrt{m_q})^2}\nonumber \\&=\frac{m_u+m_d+m_s+m_c+m_b+m_t}{(\sqrt{m_u}+\sqrt{m_d}+\sqrt{m_s}+\sqrt{m_c}+\sqrt{m_b}+\sqrt{m_t})^2}. \end{aligned}$$
(4)

It will be shown in Sect. 3 that \(K_q\) is also very close to 2/3 and is almost stable in a large range of energy scales, although the mass of each quark varies with the energy scale obviously.

These preliminary successes enlighten us to investigate Kartavtsev’s extensions of the Koide formula in detail, since the naive estimates in Ref. [35] were still very simple. We should first of all explore phenomenologically the validity of Eqs. (3) and (4) for both leptons and quarks, and these explorations will be greatly helpful for the relevant model building for the theoretical explanation of the extended Koide formulas. This is the purpose of our present paper.

This paper is organized as follows. First, in Sect. 2, we show that the extended Koide formula for leptons is invalid under the constraints from current experimental data. However, if the uncertainty of the most inaccurate tauon mass is relaxed to about \(2\sigma \) confidence level, the extended Koide formula could be satisfied, and the neutrino masses can thus be extracted. Next, in Sect. 3, we first clarify some different definitions of quark masses and then examine the energy scale dependence of the extended Koide formula for the running quark masses. We find that the extended Koide formula holds fairly well in a large range of energy scales. Some relevant conclusions and discussions are shown in Sect. 4.

Table 1 The first 10 minima of \(|K_l-2/3|\) and the corresponding neutrino masses from the extended Koide formula in the normal mass hierarchy. The values of \(|K_l-2/3|\) are greatly larger than the precision of operation, meaning that the extended Koide formula for leptons is unsuccessful under the constraints from current experimental data.

2 Extended Koide formula for leptons

In this section, we examine the extended Koide formula for leptons. However, this can only be performed with the masses of all the six leptons known. Unfortunately, the lack of the absolute masses of neutrinos makes this examination impossible. But on the other hand, thanks to the more and more precise experiments of neutrino oscillations, we have already had two firm constraints of the mass-squared differences of the neutrino mass eigenstates. Therefore, if we assume the validity of the extended Koide formula for leptons (i.e., \(K_l=2/3\) exactly) and regard it as the third constraint, the three neutrino masses may thus be extracted.

Based on three-flavor neutrino mixing, a global analysis [4] of the current experimental data from the oscillations of the solar, atmospheric, reactor, and accelerator neutrinos indicates [best-fit (\(\pm 1\sigma \))]:

$$\begin{aligned}&m_2^2-m_1^2=(7.53\pm 0.18)\times 10^{-5}~\mathrm{eV}^2, \\&m_3^2-m_2^2=(2.44\pm 0.06)\times 10^{-3}~\mathrm{eV}^2 \quad (\mathrm{normal~mass~hierarchy}), \\&m_2^2-m_3^2=(2.52\pm 0.07)\times 10^{-3}~\mathrm{eV}^2 \quad (\mathrm{inverted~mass~hierarchy}). \end{aligned}$$

By normal and inverted mass hierarchies, we mean that neutrinos have the mass spectra \(m_1<m_2<m_3\) and \(m_3<m_1<m_2\), respectively.

Suppose the extended Koide formula is exact for leptons,

$$\begin{aligned}&K_l=\frac{m_1+m_2+m_3+m_e+m_\mu +m_\tau }{(\sqrt{m_1}+\sqrt{m_2}+\sqrt{m_3}+\sqrt{m_e}+\sqrt{m_\mu }+\sqrt{m_\tau })^2}=\frac{2}{3}. \end{aligned}$$
(5)

Solving Eq. (5) with the two constraints of the mass-squared differences, we may first determine one of the three neutrino masses (e.g. \(m_2\)) and then obtain the other two (\(m_1\) and \(m_3\)). But the direct solving of Eq. (5) is rather inconvenient, so we convert this problem to search the local minimum of \(|K_l-2/3|\), and this is a typical numerical nonlinear optimization problem. In the following, we synthetically utilize the Nelder–Mead method (with the shortest runtime) and the random search method (with the highest accuracy) to obtain the neutrino masses with the smallest computational cost.

We start from the normal mass hierarchy. In this circumstance, what we face now is to search the minimum of \(|K_l-2/3|\) as an objective function of six variables (\(m_1\), \(m_2\), \(m_3\), \(m_e\), \(m_\mu \), \(m_\tau \)) under five constraints:

$$\begin{aligned}&(7.53-0.18)\times 10^{-5}~\mathrm{eV}^2\nonumber \\&< m_2^2-m_1^2< (7.53+0.18)\times 10^{-5}~\mathrm{eV}^2, \end{aligned}$$
(6)
$$\begin{aligned}&(2.44-0.06)\times 10^{-3}~\mathrm{eV}^2\nonumber \\&< m_3^2-m_2^2< (2.44+0.06)\times 10^{-3}~\mathrm{eV}^2, \end{aligned}$$
(7)
$$\begin{aligned}&(0.510998928 - 0.000000011)~\mathrm{MeV}< m_e\nonumber \\&<(0.510998928 + 0.000000011)~\mathrm{MeV}, \end{aligned}$$
(8)
$$\begin{aligned}&(105.6583715 - 0.0000035)~\mathrm{MeV}< m_\mu \nonumber \\&<(105.6583715 + 0.0000035)~\mathrm{MeV}, \end{aligned}$$
(9)
$$\begin{aligned}&(1776.82 - 0.16)~\mathrm{MeV}< m_\tau <(1776.82 + 0.16)~\mathrm{MeV}. \end{aligned}$$
(10)

For searching the minimum of \(|K_l-2/3|\), we set the precision of operation to 30 significant figures. Meanwhile, due to the limitations of the Nelder–Mead method, a computing result has a great possibility to oscillate in the vicinity of a convergence point, so we set the maximal number of iterations to 2000 for a single operation. If the iterations are not successful, the point will be discarded, and the results from this point will be excluded by the checking program. From the operating and checking for 500 random initial points, we eventually obtain 49 successful convergence points, and the first 10 minima of \(|K_l-2/3|\) and the corresponding neutrino masses are sorted in Table 1.

From Table 1, we clearly observe that although the values of \(|K_l-2/3|\) are already very small \(\sim \mathcal {O}(10^{-6})\), they are still greatly beyond the computing accuracy that we adopt. Since we have set the precision of operation to 30 significant figures, \(|K_l-2/3|\) should be \(\sim \mathcal {O}(10^{-30})\), if the masses \(m_1\), \(m_2\), \(m_3\) make \(|K_l-2/3|\) converge to zero absolutely. As a result, we conclude that there is no solution for neutrino masses from Eq. (5) under the current experimental constraints in Eqs. (6)–(10). In other words, Kartavtsev’s extended Koide formula for leptons in Eq. (5) is unsuccessful. This situation is the same for the inverted mass hierarchy.

Although the extended Koide formula is invalid for leptons with the current experimental data, the deviations of \(K_l\) from 2 / 3 are still considerably small. This smallness naturally leads us to investigate weather \(K_l=2/3\) exactly, if the uncertainties of lepton masses are slightly larger than the current data. Whereas, we do not need to relax the uncertainty of each lepton mass to test this possibility, because the uncertainty of \(K_l\) is substantially attributed to the uncertainty of the tauon mass \(m_\tau \), since the uncertainties of \(m_1\), \(m_2\), \(m_3\), \(m_e\), and \(m_\mu \) are much smaller. In fact, the measurement of \(m_\tau \) is much more inaccurate than that shown in Eq. (10): \(m_\tau =(1776.82 \pm 0.16)\) MeV, which results from the weighted average of various experiments [4]. Actually, even the central value of \(m_\tau \) varies from 1775 to 1783 MeV, with the error bar of a few MeV [4]. Therefore, in the following, we keep the constraints in Eqs. (6)–(9) and also the central value of \(m_\tau \) as 1776.82 MeV, but only relax its uncertainty, in order to examine the validity of the extended Koide formula. We choose the neutrino mass \(m_2\) and study its dependence on \(m_\tau \), as \(m_2\) appears in both constraints in Eqs. (6) and (7). Our results are shown in Fig. 1.

Fig. 1
figure 1

The neutrino mass \(m_2\) and its dependence on the tauon mass \(m_\tau \). The left and right panels are for the normal and inverted mass hierarchies, respectively. Each point in the figures is a convergence point for \(K_l=2/3\). There is a lower bound 1777.11 MeV for \(m_\tau \) in the normal mass hierarchy and 1777.17 MeV in the inverted case. The neutrino mass \(m_2\) is not very sensitive to \(m_\tau \) in a relative large range, with the central value being \(8.68\times 10^{-3}\) eV in the normal mass hierarchy, and \(5.02\times 10^{-2}\) eV in the inverted case

From Fig. 1, we obviously find that there is a lower bound for the tauon mass \(m_\tau \), such that \(K_l=2/3\) exactly. For the normal mass hierarchy, from the left panel in Fig. 1, we observe

$$\begin{aligned}&m_\tau >1777.11~\mathrm{MeV}, \quad \mathrm{i.e.} \quad m_\tau >(1776.82+1.81\sigma )~\mathrm{MeV}, \end{aligned}$$

meaning that the extended Koide formula for leptons is satisfied, only if the uncertainty of \(m_\tau \) is relaxed to about \(2\sigma \) confidence level. Furthermore, the neutrino mass \(m_2\) is not very sensitive to \(m_\tau \) in a relative large range, and we thus estimate

$$\begin{aligned}&8.58\times 10^{-3}~\mathrm{eV}<m_2< 8.78\times 10^{-3}~\mathrm{eV}. \end{aligned}$$

Therefore, combining Eqs. (6) and (7), we have the masses of the other two neutrinos:

$$\begin{aligned}&0~\mathrm{eV} <m_1<6.64\times 10^{-4}~\mathrm{eV},\\&4.95\times 10^{-2}~\mathrm{eV}< m_3< 5.07\times 10^{-2}~\mathrm{eV}. \end{aligned}$$

The corresponding central values for these three neutrino masses read

$$\begin{aligned}&m_1=2.06\times 10^{-4}~\mathrm{eV}, \quad m_2=8.68\times 10^{-3}~\mathrm{eV}, \\&m_3=5.02\times 10^{-2}~\mathrm{eV}. \end{aligned}$$

Hence, we find a relatively mild hierarchy for the neutrino masses in this case.

Similarly, for the inverted mass hierarchy, from the right panel in Fig. 1, we have

$$\begin{aligned} m_\tau >1777.17~\mathrm{MeV}, \quad \mathrm{i.e.}\quad m_\tau >(1776.82+2.16\sigma )~\mathrm{MeV}, \end{aligned}$$

and

$$\begin{aligned}&0~\mathrm{eV}< m_3< 3.93\times 10^{-3}~\mathrm{eV}, \\&4.87\times 10^{-2}~\mathrm{eV} <m_1<5.02\times 10^{-2}~\mathrm{eV}, \\&4.95\times 10^{-2}~\mathrm{eV} <m_2<5.09\times 10^{-2}~\mathrm{eV}, \end{aligned}$$

with the central values being

$$\begin{aligned}&m_3=2.00\times 10^{-4}~\mathrm{eV}, \\&m_1=4.94\times 10^{-2}~\mathrm{eV}, \quad m_2=5.02\times 10^{-2}~\mathrm{eV}. \end{aligned}$$

In this case, we find the neutrino mass scheme as \(m_3\ll m_1\approx m_2\).

Table 2 The running quark masses and the running parameter \(K_q(\mu )\) in the extended Koide formula at some typical energy scales. The mass of the Higgs boson is taken as 125 GeV, and the cutoff scale for vacuum stability is \(4\times 10^{12}\) GeV (data from Table 1 of Ref. [38]). The running quark masses are found to decrease monotonically, but \(K_q(\mu )\) is almost stable in a sizable range of energy scales from GeV to \(10^{12}\) GeV. Moreover, \(K_q(\mu )>2/3\) at all energy scales, but the deviations are only about 5 %.

Last, we should stress that we only focus on the pole masses of leptons in this section, but not their running masses. Since the lepton mass ratios are rather insensitive to radiative corrections [18], this is not a severe problem.

3 Extended Koide formula for quarks

In this section, we move on to explore the extended Koide formula for quarks (i.e. if \(K_q=2/3\) or not). However, the situation in the quark sector is much more complicated, or even ambiguous, than that of lepton. This complication or ambiguity comes from what masses we mean for quarks. For instance, in Ref. [4], the six quark masses are recommended as

$$\begin{aligned} m_u&=2.3^{+0.7}_{-0.5}~\mathrm{MeV}, \quad m_d=4.8^{+0.5}_{-0.3}~\mathrm{MeV}, \quad m_s=95\pm 5~\mathrm{MeV}, \nonumber \\ m_c&=1.275\pm 0.025~\mathrm{GeV}, \quad m_b=4.18\pm 0.03~\mathrm{GeV}, \nonumber \\ m_t&=173.21\pm 0.51\pm 0.71~\mathrm{GeV}. \end{aligned}$$
(11)

From these data, it is easy to find \(K_q\approx 0.64\). But this trivial result actually makes no sense, as the masses of the light quarks (u, d, s) are estimated as the current quark masses, the masses of relatively heavy quarks (c, b) mean the running quark masses, and the mass of the heaviest t quark is measured as its pole mass. Therefore, it is meaningless to calculate \(K_q\) as a combination of all these six masses without distinction.

Different from leptons, quarks are confined inside hadrons and cannot be observed as physical particles in experiments, so their masses cannot be measured directly. Therefore, concerning quark masses, we should first make clear their quantitative definitions and meanings. For example, the masses of light quarks in chiral perturbation theory always mean the current quark masses. While, in a particular non-relativistic hadron model, we mean the quark masses by the constituent quark masses. Moreover, the quark masses computed directly from lattice quantum chromodynamics (QCD) are the bare quark masses. Whereas, in the Koide formula, the charged lepton masses are the pole (physical) masses, which correspond to the positions of divergence in their propagators in the on-shell renormalization scheme. However, the pole masses of quarks can only be defined in perturbation theory and are not reliable at low energies because of the non-perturbative infrared effects in QCD. Hence, the pole masses of quarks are not well defined, so the extension of the Koide formula for quarks should only refer to their running masses.

The mass parameters in the QCD Lagrangian depend not only on the renormalization scheme adopted to define the theory, but also on the energy scale at which an observation occurs. Therefore, the values of quark masses may alter significantly in different renormalization schemes and at different energy scales. Whereas, one can convert these values between different schemes in perturbation theory and run these values to the demanded energy scales by the renormalization group equations. At high energies, where non-perturbative QCD effects become small, the calculations of quark masses are most commonly performed in the dimensional regularization scheme with the modified minimal subtraction (\(\overline{\mathrm{MS}}\)), to obtain the running (or renormalized) quark masses \(m_q(\mu )\) at a given energy scale \(\mu \). As a result, we should trace the energy scale dependence of the extended Koide formula for the quark sector and clarify the mass parameters in Eq. (4) as the running quark masses,

$$\begin{aligned} K_q(\mu )=\frac{m_u(\mu )+m_d(\mu )+m_s(\mu )+m_c(\mu )+m_b(\mu )+m_t(\mu )}{\left[ \sqrt{m_u(\mu )}+\sqrt{m_d(\mu )}+\sqrt{m_s(\mu )}+\sqrt{m_c(\mu )}+\sqrt{m_b(\mu )}+\sqrt{m_t(\mu )}\right] ^2}. \end{aligned}$$

The running quark masses were systematically calculated in Refs. [36] and [37], and were especially recalculated with a much higher precision in Ref. [38] after the discovery of the Higgs boson. Hence, in this paper, we follow the data in Ref. [38].

In Table 2, we list the running quark masses taken from Table 1 of Ref. [38]. These masses were calculated in the standard model at a number of typical energy scales: for example, \(m_c\) evaluated at the scale equal to its mass, 2 GeV where light quark masses are often quoted in the \(\overline{\mathrm{MS}}\) scheme, the Higgs mass \(m_H\approx 125\) GeV, until the cutoff scale \(\Lambda _\mathrm{VS}\approx 4\times 10^{12}\) GeV, where the vacuum stability in the standard model is lost due to a relatively small Higgs mass. We clearly see that the running quark masses monotonously decrease at large energy scales. The running parameter \(K_q(\mu )\) in the extended Koide formula is also listed in the last column in Table 2.

We find from Table 2 that in a sizable range of energy scales from GeV to \(10^{12}\) GeV, \(K_q(\mu )\) is rather insensitive to the running effects of quark masses. This insensitivity should be substantially attributed to the large mass hierarchy in the quark sector. Moreover, \(K_q(\mu )\) is always slightly larger than 2/3 at all energy scales, indicating the impossibility to extend the Koide formula to the running quark masses. But the discrepancies between \(K_q(\mu )\) and 2/3 are only about 5 %, much smaller than the various attempts in Eqs. (1) or (2). In this sense, Kartavtsev’s extended Koide formula for quarks is a rather good choice. In addition, the uncertainties of \(K_q(\mu )\) at different energy scales are almost the same, but this is not unexpected, as the uncertainty of \(K_q(\mu )\) is mainly from the uncertainty of the heaviest t quark mass. Besides, we do not find that \(K_q(\mu )\) crosses 2/3 at some particular energy scale \(2~\mathrm{GeV}<\mu <m_Z\), as claimed by Kartavtsev in Ref. [35], since this cross was naively estimated from the data in Eq. (11). But as we have explained above, these values of quark masses in Eq. (11) have different definitions and cannot be consulted simultaneously. The stability of \(K_q(\mu )\) against the running effects is also illustrated in Fig. 2.

Fig. 2
figure 2

The running parameter \(K_q(\mu )\) in the extended Koide formula as the function of energy scales. Due to the huge differences between these energy scales, we arrange them equidistantly on the \(\mu \)-axis. \(K_q(\mu )\) is found to be almost stable in a huge range of energy scales from GeV to \(10^{12}\) GeV

4 Conclusions and discussions

The standard model of particle physics has achieved triumphant successes in the last five decades. However, one of the most crucial shortcomings therein is a large number of free parameters, including 12 fermion masses. Any reduction of this number will pave a way for our comprehension of the underlying flavor physics. The Koide formula is one of the appealing attempts in this direction. Unfortunately, this formula only associates the masses of three charged leptons, but not of all the 12 flavors of fermions. However, charged leptons should not be particular in fermions, so the idea to extend the original Koide formula, including all leptons and quarks on an equal footing, is thus very natural and desirable. Kartavtsev’s extensions [35] in Eqs. (3) and (4) treated six leptons and quarks in a totally democratic manner, with a maximal S(6) permutation symmetry, and a preliminary estimate indicated a certain plausibility of these extensions.

In the present paper, we explore Kartavtsev’s extended Koide formulas for both leptons and quarks at length. For the lepton sector, it proves that \(K_l\) cannot be equal to 2/3 exactly with the current experimental data of the charged lepton masses and the mass-squared differences of neutrinos within \(1\sigma \) confidence level. Then our strategy is to assume the rigorous validity of the extended Koide formula for leptons and relax the uncertainty of the most inaccurate tauon mass \(m_\tau \). By this means, the neutrino masses can be extracted from the extended Koide formula, if the uncertainty of \(m_\tau \) is relaxed to about \(2\sigma \) confidence level in both the normal and the inverted mass hierarchies. The central values for three neutrino masses read: \(m_1=2.06\times 10^{-4}\) eV, \(m_2=8.68\times 10^{-3}\) eV, \(m_3=5.02\times 10^{-2}\) eV (normal hierarchy), and \(m_3=2.00\times 10^{-4}\) eV, \(m_1=4.94\times 10^{-2}\) eV, \(m_2=5.02\times 10^{-2}\) eV (inverted hierarchy). These results are consistent with the most stringent upper bound on the sum of neutrino masses from the measurements of the cosmic microwave background temperature spectra from the WMAP and Planck satellite experiments: \(m_1+m_2+m_3<0.66\) eV (95 % confidence level) [39], and also from the data combined with the baryon acoustic oscillations: \(m_1+m_2+m_3<0.23\) eV (95 % confidence level) [39]. It is interesting to note that, even if the neutrino masses increase near this cosmological bound (e.g. \(m_1+m_2+m_3\approx 0.23\) eV), with the constraints in Eqs. (6) and (7), the discrepancies between \(K_l\) and 2 / 3 almost remain unchanged: \(2.0\times 10^{-5}<|K_l-2/3|<3.8\times 10^{-5}\) for the normal mass hierarchy and \(2.0\times 10^{-5}<|K_l-2/3|<3.9\times 10^{-5}\) for the inverted mass hierarchy. This is understandable, as the uncertainty of \(K_l\) mainly comes from the uncertainties of charged leptons.

For the quark sector, the various definitions of quark masses greatly complicate the situation. The pole masses in the Koide formula become ill-defined for the light quarks, due to the non-perturbative effects in QCD at low energies. Therefore, the exploration of the extended Koide formula should only be implemented for the running quark masses. We find that the running parameter \(K_q(\mu )\) is almost stable in a very large range of energy scales from GeV to \(10^{12}\) GeV, mainly as a result of the large mass hierarchy in the quark sector. However, \(K_q(\mu )\) is always slightly larger than 2 / 3, meaning the invalidity of the extended Koide formula for the running quark masses, but this deviation is merely about 5 %. We omit the discussion of the running behavior of the extended Koide formula in the lepton sector, as the running effects are negligibly tiny because the lepton mass ratios are rather insensitive to radiative corrections [18].

Below, we give some general discussions on the Koide formula. The mystery of the Koide formula is twofold. The first is its surprising simplicity and accuracy, but only for charged leptons. The inclusion of neutrinos is a reasonable balance between the charged and uncharged leptons, but this inclusion is meaningful only if the generation mechanism of neutrino masses is the same as that of charged lepton masses (i.e. neutrinos are of Dirac type). On the contrary, if the tiny neutrino masses are generated from the seesaw mechanism [4042], Kartavtsev’s extension of the Koide formula will be pointless due to the Majorana mass term. The second is that the Koide formula consists of the pole masses of fermions, which are the low energy quantities and are defined at different energy scales. This is extremely counterintuitive, since we always expect simple formulas at high energy scales, where some symmetries are restored. Hence, the renormalization effects will not allow the Koide-like formulas for both the pole and the running fermion masses simultaneously.

Finally, we should point out that Kartavtsev’s extension of the Koide formula [35] and our corresponding detailed explorations are still at the phenomenological level. A similar work (the most general extension of the Koide formula), taking all the 12 fermions into account, i.e. \((\sum _f m_f)/(\sum _f\sqrt{m_f})^2=2/3\), is also not quite successful. Therefore, a possible direction for further extensions of the Koide-like formulas is to seek the theoretical basis of these empirical relations, as Koide originally did in a composite model or an extended technicolor-like model [13]. We should incorporate in the extended Koide formulas the elements and the mixing and phase angles in the lepton and quark mixing matrices [4347], and maybe also the fermion charges. This will be the topic for our future research.

We are very grateful to Prof. Jean-Marc Gérard for his stimulating idea and also to Fengjiao Chen for fruitful discussions. This work is supported by the Fundamental Research Funds for the Central Universities of China (No. N140504008).