1 Introduction

The production and detection of bottom quarks play an important rôle in various contexts in LHC physics. Letting aside the very abundant direct production, that is exploited for flavour physics studies, bottom is used to identify top particles and to study their properties. Furthermore, it is the dominant decay mode of the Higgs boson, that can be used to study processes as the associate HV production [1, 2] and the large transverse momentum production [3]. In searches for physics beyond the Standard Model, bottom also appears often produced in association with new-physics objects.

Having a mass much larger than the typical hadronic scales, bottom quark production is calculable in perturbative QCD. In cases when the transverse momentum involved in the production is large compared to its mass, as, for example, in high-energy \(e^+e^-\) annihilation, or in production at large transverse momentum in hadronic collisions, bottom can behave as a light parton, and give rise to a hadronic jet. Techniques for dealing with these regimes have been developed in the past [4], and have been applied to the LHC case [5]. They allow for the computation of the transverse momentum spectrum of promptly produced b quarks at next-to-leading order in QCD, including the resummation of large logarithms of the ratio of the transverse momentum over the bottom mass up to next-to-leading-logarithmic accuracy. These large logarithms can arise both from initial state radiation, when, for instance, an incoming gluon splits into a \(b\bar{b}\) pair, with one of the b undergoing a large-momentum-transfer collision with a parton from the target, and from final state radiation. In the last case, an outgoing gluon can split into a \(b\bar{b}\) pair, or a directly produced b quark can emit a collinear gluon.

The large transverse momentum regime is treated consistently at the leading logarithmic level in parton shower generators. At the most basic level, heavy flavours are treated as light flavours, but with a shower cut-off scale of the order of the heavy quark mass. However, considerable work has been performed to better account for mass effects. In some generators, this is achieved by suitable modifications of the splitting kinematics and splitting kernels [6, 7]. The Sherpa dipole shower [8] makes use of the Catani-Seymour dipoles for massive quarks [9]. The Catani-Seymour formalism is also used in the DIRE shower [10]. In Ref. [11] a final state dipole-antenna shower for massive fermions is proposed, based upon the corresponding antenna subtraction formalism of Ref. [12].

In next-to-leading order (NLO) calculations matched to Shower generators (NLO+PS) for heavy flavour production [13, 14], one generally treats the heavy flavour as being very heavy. The heavy quark mass thus acts as a cut-off on collinear singularities, that are thus not resummed. This approach has in fact proven to be quite viable in heavy flavour production even at relatively large momentum transfer [5]. Consider, for example, heavy quark pair production in a POWHEG framework. By neglecting collinear singularities from heavy quarks, the only singular region that we have to consider has to do with initial state radiation involving only light partons. Since the POWHEG procedure guarantees that the matrix elements are given correctly for up to one hard radiation, gluon splitting, flavour excitation and radiation from the heavy flavour are included, so that the logarithmically enhanced terms are correctly reproduced at first order. Higher order leading logarithms, however, are not treated correctly. In particular, there are reasons to give an adequate treatment to final state radiation from a high transverse momentum bottom quark. In fact, this radiation process is intimately related to the physics of the bottom fragmentation function, and may have important effects in processes of considerable interest, like for example in top decay.

In the POWHEG-BOX framework, a facility for the treatment of collinear radiation from a heavy particle was set up in Ref. [15], in the framework of electroweak corrections to W production. In that context, the purpose was to deal with mass effects in the final state radiation of photon from the lepton in W decays. The same framework is also appropriate for describing radiation from heavy quarks. In particular, it was adopted in Ref. [16], where the ttb_NLO_dec generator was introduced, and in Ref. [17] for the b_bbar_4l generator, for dealing with radiation from bottom quarks in top decays.

The purpose of the present paper is twofold: we present a new algorithm for radiation from a heavy quark, that has proven superior to the old one; furthermore we perform a thorough investigation of the behaviour of this component of the POWHEG generator, also by comparing the two methods, both in the framework of bottom quarks generated in top decay, and in inclusive bottom quark pair production. In the last case, such a study was never carried out.

The paper is organized as follows: in Sect. 2 we describe the new algorithm, in Sect. 3 we illustrate our phenomenological studies, and in Sect. 4 we give our conclusions.

2 Description of the new algorithm

2.1 The POWHEG mapping for the massive emitter case

Let us assume for definiteness to deal with a scattering process involving n partons in the final state at lowest order in perturbation theory. The generic point in the corresponding Born phase space, referred also as Born configuration, will be denoted with barred momenta

$$\begin{aligned} \overline{\varPhi }_n = \{\overline{k}_1, \dots , \overline{k}_n\}. \end{aligned}$$
(1)

The corresponding phase space volume element is given by

$$\begin{aligned} \mathrm {d}\overline{\varPhi }_n= \prod _{i=1}^n\frac{\mathrm {d}^3\mathbf {\overline{k}}_i}{(2\pi )^32\overline{k}_i^0}(2\pi )^4\delta ^{(4)}\left( q-\sum _{i=1}^n\overline{k}_i\right) , \end{aligned}$$
(2)

where q is the total incoming 4-momentum.Footnote 1 At Next-to-Leading order (NLO), one must also include processes of emission of one more real massless extra parton, resulting in a \(n+1\)-body kinematics which we will denote as

$$\begin{aligned} \varPhi _n = \{k_1, \dots , k_{n+1}\}. \end{aligned}$$
(3)

The singular regions of the real phase space are separated by means of suitable projection operators; in each of them, the radiated parton phase space is parametrised in terms of the FKS variables [18] (the notations \(\mathbf {p}\) and \(\underline{p}\) for a generic momentum p denote the tri-impulse and its modulus respectively)

$$\begin{aligned} \xi =\frac{2\underline{k}_{n+1}}{q^0}, \quad y=\frac{\mathbf {k}_{n}\cdot \mathbf {k}_{n+1}}{\underline{k}_n\underline{k}_{n+1}}, \end{aligned}$$
(4)

as shown in Fig. 1, where we have assumed that the emitter and the FKS partons are respectively the n-th and the \(n+1\)-th parton. The rescaled energy \(\xi \) is related to the soft limit (\(\xi \rightarrow 0\)), and the variable y to the collinear one (\(y\rightarrow \pm 1\)). The definition of the azimuthal angle \(\phi \), in the POWHEG framework, departs from the standard FKS definition. It is taken as the polar angle of the splitting around the axis parallel to the momentum of the recoil system, in the rest frame where \(q=(q^0,\mathbf {0})\).

Fig. 1
figure 1

Kinematics for a real configuration: \(k_n\) is the massive emitter, \(k_{n+1}\) is the radiated parton. y denotes the cosine of the angle between the two tri-vectors

In what follows, we will construct a one-to-one map from a real configuration with radiation variables \((\xi ,y,\phi )\) into a Born one. This leads to a factorisation of the real phase space in term of Born and radiation variables.

The mapping can be reduced to the case of the map from a 3-body phase space into a 2-body one. Inserting into the \((n+1)\)-body phase space volume element the identities

$$\begin{aligned} 1=\int \mathrm {d}^4k_{\text {rec}}\delta ^{(4)}\left( k_{\text {rec}}-\sum _{i=1}^{n-1}k_i\right) \end{aligned}$$
(5)

and

$$\begin{aligned} 1=\int \mathrm {d}M^2_{\text {rec}}\delta (M^2_{\text {rec}}-k_{rec}^2), \end{aligned}$$
(6)

the phase space is decomposed into a chain of two consecutive processes. With reference to Fig. 1, they are: the decay of a particle with momentum q into the 3-body system formed by the emitter \(k_n\), the FKS-parton \(k_{n+1}\) and the “recoil” system, with momentum and invariant mass

$$\begin{aligned} k_{\text {rec}}= & {} \sum _{i=1}^{n-1}k_i= q-k_n-k_{n+1}, \nonumber \\ M_{\text {rec}}^2= & {} k_{\text {rec}}^2, \end{aligned}$$
(7)

followed by the decay of the latter into the other \(n-1\) particles. In formula, we have

$$\begin{aligned} \mathrm {d}\varPhi _{n+1}=\mathrm {d}\varPhi _3\mathrm {d}\varPhi _{\text {rec}}, \end{aligned}$$
(8)

where

$$\begin{aligned} \mathrm {d}\varPhi _3= & {} \frac{\mathrm {d}M^2_{\text {rec}}}{2\pi }\frac{\mathrm {d}^3 \mathbf {k}_n}{2k^0_n(2\pi )^3}\frac{\mathrm {d}^3 \mathbf {k}_{n+1}}{2k^0_{n+1}(2\pi )^3}\frac{\mathrm {d}^3 \mathbf {k}_{\text {rec}}}{2k^0_{\text {rec}}(2\pi )^3}\nonumber \\&\times (2\pi )^4\delta ^{(4)}(q-k_n-k_{n+1}-k_{\text {rec}}),\end{aligned}$$
(9)
$$\begin{aligned} \mathrm {d}\varPhi _{\text {rec}}= & {} \prod _{i=1}^{n-1}\frac{\mathrm {d}^3\mathbf {k}_{i}}{2k_i^0(2\pi )^3}(2\pi )^4\delta ^{(4)}\left( k_{\text {rec}}-\sum _{i=1}^{n-1}k_i\right) . \end{aligned}$$
(10)

We now focus on the 3-body process; under the action of the mapping, the \(k_n\) and \(k_{n+1}\) partons will be replaced by a single parton with mass m and momentum \(\overline{k}_n\). We define

$$\begin{aligned} k\equiv k_n+k_{n+1}, \end{aligned}$$
(11)

so that

$$\begin{aligned} k_{\text {rec}}=q-k \implies k^0_{\text {rec}}=q^0-k^0,\quad \mathbf {k}_{\text {rec}}=-\,\mathbf {k}. \end{aligned}$$
(12)

We fix the transformation by demanding \(\mathbf {\overline{k}}_n \parallel \mathbf {k}\). Care must be taken to ensure the conservation of energy-momentum also for the resulting Born configuration. This is accomplished by performing a boost \(\varLambda \) in the direction \(\mathbf {k}\) and defining

$$\begin{aligned} \overline{k}_n=q-\varLambda k_{\text {rec}}, \end{aligned}$$
(13)

We determine the velocity parameter \(\beta \) of the boost transformation from the mass-shell condition

$$\begin{aligned} \overline{k}_n^2 = (q-\varLambda k_{\text {rec}})^2 = m^2. \end{aligned}$$
(14)

We get

$$\begin{aligned} \beta= & {} \frac{-4\underline{k}_{\text {rec}}k^0_{\text {rec}}q^2}{(q^2-m^2+M_{\text {rec}}^2)^2+4\underline{k}_{\text {rec}}^2q^2}\nonumber \\&+\frac{(q^2-m^2+M_{\text {rec}}^2)\sqrt{(q^2-m^2+M_{\text {rec}}^2)^2-4M_{\text {rec}}^2q^2}}{(q^2-m^2+M_{\text {rec}}^2)^2+4\underline{k}_{\text {rec}}^2q^2}.\nonumber \\ \end{aligned}$$
(15)

We define the other barred variables as

$$\begin{aligned} \overline{k}_i= \varLambda k_i, \quad i=1,\dots ,n-1. \end{aligned}$$
(16)

Their mass relations are preserved by the boost transformation and, furthermore, we have

$$\begin{aligned} \sum _{i=1}^{n}\overline{k}_i= & {} \sum _{i=1}^{n-1}\overline{k}_i+\overline{k}_{n}\nonumber \\= & {} q+\sum _{i=1}^{n-1}\varLambda k_{i}-\varLambda k_{\text {rec}} \nonumber \\= & {} q+\varLambda \bigg (\sum _{i}^{n-1}k_i-k_{\text {rec}}\bigg )=q, \end{aligned}$$
(17)

which is the energy-momentum conservation for the Born configuration.

2.2 Inverse map

We now detail the construction of the inverse map, which is what is actually needed in the applications. Suppose that a Born event has been generated, i.e. the barred variables \(\overline{k}_i\) (\(i=1,\ldots ,n\)) are given. Then, \(M_{\text {rec}}^2\) is obtained inverting Eq. (13):

$$\begin{aligned} M_{\text {rec}}^2= & {} (\varLambda k_\text {rec})^2 = (q-\overline{k}_n)^2\nonumber \\= & {} q^2+m^2-2q^0\overline{k}_n^0. \end{aligned}$$
(18)

We want to attach to it a radiation described by the radiation variables \(\xi ,y\) and \(\phi \). For future convenience we introduce the largest allowed value for \(\xi \)

$$\begin{aligned} \xi _{\mathrm{max}}\equiv 1-\frac{(m+M_{\mathrm{rec}})^2}{q^2}. \end{aligned}$$
(19)

The energy of the radiated parton is

$$\begin{aligned} k^0_{n+1}= \underline{k}_{n+1}=\frac{q^0}{2}\xi . \end{aligned}$$
(20)

Energy conservation requires that

$$\begin{aligned} q^0 = k^0_{n+1}+ \sqrt{\underline{k}_n^2+m^2} + \sqrt{\underline{k}_{\mathrm{rec}}^2+M_{\text {rec}}^2}, \end{aligned}$$
(21)

where

$$\begin{aligned} \underline{k}_\mathrm{rec}^2=\underline{k}_n^2+\underline{k}_{n+1}^2+2\underline{k}_n\underline{k}_{n+1}y. \end{aligned}$$
(22)

We can solve Eq. (21) for \(\underline{k}_n\) in a standard way, by bringing in turn each single square root on one side of the equation and squaring both sides. By doing this we actually find the solutions of all of the following equations

$$\begin{aligned} q^0 = k^0_{n+1} \pm \sqrt{\underline{k}_n^2+m^2} \pm \sqrt{\underline{k}_{\mathrm{rec}}^2+M_{\text {rec}}^2}, \end{aligned}$$
(23)

for all possible combinations of the signs in front of the square root. The solutions are given by

$$\begin{aligned} \underline{k}_{n}^{(\pm )}= & {} \frac{-(2\overline{k}_n^0-q^0\xi )\xi y}{(2-\xi )^2-\xi ^2y^2}\nonumber \\&\pm \frac{(2-\xi )\sqrt{(2\overline{k}_n^0-q^0\xi )^2 -m^2\xi ^2(1-y^2)-4m^2(1-\xi )}}{(2-\xi )^2-\xi ^2y^2}.\nonumber \\ \end{aligned}$$
(24)

In order for them to exist, the argument of the square root must be positive. This leads to the bound

$$\begin{aligned} (q^2-m^2+m^2y^2) \xi ^2 - 4(q^0\overline{k}_n^0 -m^2) \xi + 4 \overline{\underline{k}}_n^2 >0, \end{aligned}$$
(25)

with \(\overline{\underline{k}}_n^2=(\overline{k}_n^0)^2-m^2\). Equation (25) is satisfied if either \(\xi >\xi ^{(+)}(y)\) or \(\xi <\xi ^{(-)}(y)\), with

$$\begin{aligned} \xi ^{(\pm )}(y)= & {} 2\frac{\overline{k}_n^0q^0 - m^2 \pm m \sqrt{(q^0-\overline{k}_n^0)^2-\overline{\underline{k}}_n^2y^2}}{q^2-m^2+m^2y^2}, \nonumber \\= & {} \frac{q^2 - m^2 -M_{\mathrm{rec}}^2 \pm 2 m \sqrt{M_\mathrm{rec}^2+\overline{\underline{k}}_n^2(1-y^2)}}{q^2-m^2+m^2y^2} \nonumber \\= & {} \frac{4\overline{\underline{k}}_n^2}{q^2 - m^2 -M_{\mathrm{rec}}^2 \mp 2 m \sqrt{M_{\mathrm{rec}}^2+\overline{\underline{k}}_n^2(1-y^2)}}.\nonumber \\ \end{aligned}$$
(26)

The last equality follows from the fact that

$$\begin{aligned} \xi ^{(+)}\xi ^{(-)}=\frac{4\overline{\underline{k}}_n^2}{q^2-m^2+m^2y^2}. \end{aligned}$$
(27)

We see that \(\xi ^{(+)}\) is a decreasing function of \(y^2\). Thus

$$\begin{aligned} \xi ^{(+)}(y)> \xi ^{(+)}(1)=1-\frac{(m-M_{\mathrm{rec}})^2}{q^2} > \xi _{\mathrm{max}}. \end{aligned}$$
(28)

that is larger than the maximum value allowed by energy conservation. Thus, the corresponding \(\underline{k}_{n}^{(\pm )}\) values should be the solutions of one among Eq. (23) where some minus signs appear. On the other hand, \(\xi ^{(-)}(y)\) is an increasing function of \(y^2\), so

$$\begin{aligned} \xi ^{(-)}(y) < \xi ^{(-)}(1) = 1-\frac{(m+M_{\mathrm{rec}})^2}{q^2}, \end{aligned}$$
(29)

that is perfectly acceptable. Furthermore, in the \(\xi <\xi ^{(-)}(y)\) case the value \(\xi =0\) is allowed, that lead to the solutions \(\underline{k}_{n}^{(\pm )} = \pm \overline{k}_n^0\) satisfying Eq. (21) with the correct signs of the square roots. Since the \(\underline{k}_{n}^{(\pm )}\) must always satisfy one of the Eq. (23), and since they are smooth function of both \(\xi \) and y in their allowed range (that includes the \(\xi =0\) point), we infer by continuity that they satisfy Eq. (23).

Fig. 2
figure 2

Kinematic reconstruction of the real emission kinematics with positive (left) and negative \(\underline{k}_n\) values. The angle \(\theta \) is fixed by \(y=\cos \theta \)

Up to now we have not imposed the positivity of \(\underline{k}_n\). On the other hand, negative \(\underline{k}_n\) values still have a physical interpretation, as illustrated in Fig. 2. Thus, provided we interpret negative values of \(\underline{k}_n\) according to the construction of Fig. 2, we have two solutions of Eq. (21). They are however related, since

$$\begin{aligned} \underline{k}_n^{(+)}(\xi ,y)=-\underline{k}_n^{(-)}(\xi ,-y). \end{aligned}$$
(30)

If we pick just one of them, we have a single-value map from the underlying Born configuration and the radiation variables \(\xi ,y\) and \(\phi \) to a real emission configuration. We pick the solution \(\underline{k}_n^{(+)}(\xi ,y)\), since for \(m=0\) it corresponds to the usual solution in the massless case. Unlike in the massless case, however, \(\underline{k}_n^{(+)}(\xi ,y)\) is not always positive: it is negative in the region

$$\begin{aligned} y> 0 \,,\quad \xi > \xi ^{(-)}(0) = 2\frac{\overline{k}^0_n -m}{q-m}= \frac{(q^0-m)^2-M_{rec}^2}{q^0\,(q^0-m)}.\nonumber \\ \end{aligned}$$
(31)

For continuity, \(\underline{k}_n^{(+)}(\xi ,y)\) vanishes on the boundary line \(y>0,\;\xi =\xi ^{(-)}(0)\) separating the positive and negative regions. The points lying on this curve are degenerate and correspond to the same real configuration with the emitter at rest in the partonic centre-of-mass frame. Apart from them, that constitute a set of zero measure, the map is well defined and bijective. The inverse map is well defined also on the boundary line \(y>0,\, \xi =\xi ^{(-)}(0)\). This means that the corresponding jacobian vanishes on that curve. Then, the inverse map can be safely used both for the integration of the real differential cross section and for the generation of radiation.

Fig. 3
figure 3

Plot of the physical region in the \(\xi y\) plane. The shaded orange region is where \(\underline{k}_n^{(+)}(\xi ,y)\) is negative. It is physically equivalent to the (positive) \(\underline{k}_n^{(-)}(\xi ,-y)\) solution in the dark blue region. If we insisted upon considering only positive \(\underline{k}_n\) solutions, the blue region would be doubly covered, and the dark blue one would not be there

In Fig. 3 we display the \(\xi ,y\) kinematic region. We remark that the negative \(\underline{k}_n^{(+)}(\xi ,y)\) region includes neither soft nor collinear singularities, since \(\xi \) is large, and since the angular separation of the quark and the radiated gluon is larger than \(\pi /2\). From now on we will drop the suffix \((-)\) and will use \(\xi (y)\) and \(\xi (0)\) instead of \(\xi ^{(-)}(y)\) and \(\xi ^{(-)}(0)\).

Fig. 4
figure 4

Dalitz plot for the three-body phase space of the system comprising the heavy flavour, the radiated gluon and the recoiling system

In Fig. 4 we show the partition o the kinematic region represented in the more familiar Dalitz plane. Notice that in the massless limit the physical region in the Dalitz plot develops an acute angle in the lower right, corner corresponding to the gluon being anticollinear with the b quark. Thus, the problematic region \(\xi >\xi (0)\) is not a singular one.

2.3 Full kinematic reconstruction of the real emission

So far, we have got the length of the tri-vectors \(\mathbf {k}_n\) and \(\mathbf {k}_{n+1}\). It is a standard kinematical problem to determine their directions in such a way that their sum \(\mathbf {k}\) is parallel to \(\mathbf {\overline{k}}_n\). We do not enter in further details about it.

The last step is to calculate the \(\beta \) parameter of the boost transformation \(\varLambda \), Eq. (15), and to boost “back” the other barred momenta in the real event

$$\begin{aligned} k_i=\varLambda ^{-1}\overline{k}_i, \quad i= 1,\ldots , n-1. \end{aligned}$$
(32)

The above mapping allows us to write the \((n+1)\)-body phase space element in the factorized form

$$\begin{aligned} \mathrm {d}\varPhi _{n+1}= \mathrm {d}\varPhi _{\text {rad}}\mathrm {d}\overline{\varPhi }_n= J(\xi ,y,\phi )\mathrm {d}\xi \mathrm {d}y \mathrm {d}\phi \mathrm {d}\overline{\varPhi }_n, \end{aligned}$$
(33)

where we have expressed the radiation phase space in terms of the FKS variables with the jacobian function \(J(\xi ,y,\phi )\) taking into account the change of variables involved in the transformation. In order to extract the jacobian, we have to manipulate and compare the l.h.s and the r.h.s of Eq. (33). Recalling Eq. (8), we perform the change of variables

$$\begin{aligned} \mathbf {k}_n \rightarrow \mathbf {k}-\mathbf {k}_{n+1} \end{aligned}$$
(34)

in the three-body phase space, Eq. (9),

$$\begin{aligned} \mathrm {d}\varPhi _3= & {} \frac{\mathrm {d}M^2_{\text {rec}}}{2\pi }\frac{\mathrm {d}^3\mathbf {k}}{2k^0_n(2\pi )^3}\frac{\mathrm {d}^3\mathbf {k}_{n+1}}{2k^0_{n+1}(2\pi )^3}\frac{\mathrm {d}^3\mathbf {k}_{\text {rec}}}{2k^0_{\text {rec}}(2\pi )^3}\nonumber \\&\times (2\pi )^4\delta ^{(4)}(q-k-k_{\text {rec}})\,. \end{aligned}$$
(35)

In polar coordinates, we have

$$\begin{aligned} \mathrm {d}^3\mathbf {k}= \underline{k}^2\mathrm {d}\underline{k}\mathrm {d}\varOmega \end{aligned}$$
(36)

and, using as reference direction that of \(\mathbf {k}\),

$$\begin{aligned} \frac{\mathrm {d}^3\mathbf {k}_{n+1}}{2k^0_{n+1}(2\pi )^3} = \frac{q^2}{(4\pi )^3}\xi \mathrm {d}\xi \mathrm {d}\cos {\alpha }\mathrm {d}\phi \,, \end{aligned}$$
(37)

where \(\alpha \) is the angle between \(\mathbf {k}_{n+1}\) and \(\mathbf {k}\) and \(\phi \) is the azimuthal angle taking \(\mathbf {k}\) as the reference direction. Hence

$$\begin{aligned} \mathrm {d}\varPhi _{n+1}= & {} \frac{q^2}{(4\pi )^3}\xi \mathrm {d}\xi \mathrm {d}\cos {\alpha }\mathrm {d}\phi \frac{\underline{k}^2\mathrm {d}\underline{k}\mathrm {d}\varOmega }{2k^0_n(2\pi )^3}\frac{\mathrm {d}M^2_{\text {rec}}}{2\pi }\nonumber \\&\times \frac{\mathrm {d}^3k_{\text {rec}}}{2k^0_{\text {rec}}(2\pi )^3}(2\pi )^4\delta ^{(4)}(q-k-k_{\text {rec}})\mathrm {d}\varPhi _{\text {rec}}. \end{aligned}$$
(38)

On the other hand, following the same arguments that led to Eq. (8), we can split the barred Born phase space into a two-body phase space and the phase space of the system recoiling against the emitting parton

$$\begin{aligned} \mathrm {d}\overline{\varPhi }_n= & {} \frac{\mathrm {d}\overline{M}^2_{\text {rec}}}{2\pi }\frac{\mathrm {d}^3\overline{\mathbf {k}}_n}{2\overline{k}^0_n(2\pi )^3}\frac{\mathrm {d}^3\overline{\mathbf {k}}_{\text {rec}}}{2\overline{k}^0_{\text {rec}}(2\pi )^3}\nonumber \\&\times (2\pi )^4\delta ^{(4)}(q-\overline{k}_n-\overline{k}_{\text {rec}})\mathrm {d}\overline{\varPhi }_{rec}. \end{aligned}$$
(39)

Since \({\overline{k}}_n=q-\varLambda k_{\text {rec}}\), the delta function in Eq. (39) constrains the value of \(\overline{k}_\text {rec}\) to be

$$\begin{aligned} \overline{k}_{\text {rec}}= \varLambda k_{\text {rec}}. \end{aligned}$$
(40)

Then, exploiting the Lorentz invariance of the phase space element, we have

$$\begin{aligned}&\frac{\mathrm {d}M^2_{\text {rec}}}{2\pi }\frac{\mathrm {d}^3\mathbf {k}_{\text {rec}}}{2k^0_{\text {rec}}(2\pi )^3}(2\pi )^4\delta ^{(4)}(q-k-k_{\text {rec}})\mathrm {d}\varPhi _{\text {rec}}\nonumber \\&\quad =\frac{\mathrm {d}\overline{M}^2_{\text {rec}}}{2\pi }\frac{\mathrm {d}^3\overline{\mathbf {k}}_{\text {rec}}}{2\overline{k}^0_{\text {rec}}(2\pi )^3}(2\pi )^4\delta ^{(4)}(q-\overline{k}_n-\overline{k}_{\text {rec}})\mathrm {d}\overline{\varPhi }_\mathrm{rec},\nonumber \\ \end{aligned}$$
(41)

where the r.h.s and the l.h.s are related by the boost transformation \(\varLambda \). In particular, we observe that

$$\begin{aligned} \varLambda (q-k)= \varLambda k_{\text {rec}} = q-\overline{k}_n, \end{aligned}$$
(42)

so that the boost maps the argument of the delta function in the r.h.s into that of the delta function in the l.h.s. Inserting Eq. (38) and Eq. (39) into Eq. (33) and using Eq. (41), we get

$$\begin{aligned}&\frac{q^2}{(4\pi )^3}\xi \mathrm {d}\xi \, \mathrm {d}\cos {\alpha }\,\mathrm {d}\phi \frac{\underline{k}^2\mathrm {d}\underline{k}\mathrm {d}\varOmega }{2k^0_n(2\pi )^3}\nonumber \\&\quad = J(\xi ,y,\phi )\mathrm {d}\xi \, \mathrm {d}y \,\mathrm {d}\phi \frac{\mathrm {d}^3\overline{k}_n}{2\overline{k}^0_n(2\pi )^3}. \end{aligned}$$
(43)

By virtue of the mapping, the vectors \(\mathbf {k}\) and \(\mathbf {\overline{k}}_{n}\) are parallel so that in polar coordinates their angular elements are equal, \(d\varOmega =d\overline{\varOmega }_{n}\). Then, from Eq. (43) we have

$$\begin{aligned} \frac{q^2}{(4\pi )^3}\xi \frac{\underline{k}^2}{k_n^0}\mathrm {d}\cos {\alpha }\,\mathrm {d}\underline{k}= J(\xi ,y,\phi ) \frac{\overline{\underline{k}}_n^2}{\overline{k}_n^0}\,\mathrm {d}y \,\mathrm {d}\overline{\underline{k}}_n. \end{aligned}$$
(44)

and we are left with the computation of the jacobian of the two-variable-transformation

$$\begin{aligned} J^{(2)} = \begin{vmatrix} \dfrac{\partial \overline{\underline{k}}_n}{\partial \underline{k}}&\dfrac{\partial y}{\partial \underline{k}}\\[1.0em] \dfrac{\partial \overline{\underline{k}}_n}{\partial \cos \alpha }&\dfrac{\partial y}{\partial \cos \alpha } \end{vmatrix} . \end{aligned}$$
(45)

This transformation is implicitly defined by the relations

$$\begin{aligned} \begin{aligned}&\underline{k}_n=\sqrt{\underline{k}^2+\underline{k}^2_{n+1}-2\underline{k}\,\underline{k}_{n+1}\cos \alpha },\\&y=\frac{\underline{k}^2-\underline{k}_n^2-\underline{k}_{n+1}^2}{2\underline{k}_n\,\underline{k}_{n+1}},\\&M^2_{\text {rec}}=(q^0-k_n^0-\underline{k}_{n+1})^2-\underline{k}^2,\\&\overline{\underline{k}}_n=\frac{\lambda ^{{1}/{2}}(q^2,M_{\text {rec}}^2,m^2)}{2q^0} , \end{aligned} \end{aligned}$$
(46)

where \(\lambda \) is the kinematical Kallen function:

$$\begin{aligned} \lambda (x,y,z)= x^2+y^2+z^2-2xy-2xz-2yz. \end{aligned}$$
(47)

Applying the chain-rule for the derivative, it is straightforward to compute the jacobian. We get

$$\begin{aligned} J^{(2)}= & {} \frac{1}{\underline{k}_n^3}\frac{\underline{k}^2}{\overline{\underline{k}}_n} \frac{\overline{k}_n^0}{k_n^0}\big [k_n^0(\overline{k}_n^0-\underline{k}_{n+1})\nonumber \\&-\,m^2(1-{\underline{k}_{n+1}}/{q^0})\big ] \end{aligned}$$
(48)

The final expression for the full jacobian J is thus

$$\begin{aligned} J(\xi ,y,\phi )= & {} \frac{q^2}{(4\pi )^3}\xi \frac{\underline{k}_n^3}{\overline{\underline{k}}_n}\frac{1}{k_n^0(\overline{k}_n^0-\underline{k}_{n+1})-m^2(1-{\underline{k}_{n+1}}/{q^0})}\nonumber \\= & {} \frac{q^2}{(4\pi )^3}\xi \frac{\underline{k}_n^3}{\overline{\underline{k}}_n}\frac{2}{k_n^0(2\overline{k}_n^0-q^0\xi )-m^2(2-\xi )} \end{aligned}$$
(49)

Note that the denominator of J vanishes in two regions:

  • when approaching the curve \(\xi =\xi (0)\) for \(y>0\), behaving as \(\xi (0)-\xi \)

  • when approaching the curve \(\xi =\xi (y)\), as \(\sqrt{\xi (y)-\xi }\).

In the first case, the \(k_n^3\) term in the numerator vanishes simultaneously as \((\xi (0)-\xi )^3\). It follows that the jacobian vanishes as \(J\sim (\xi (0)-\xi )^2\) for \(\xi \rightarrow \xi (0)\) at fixed \(y>0\). This result is coherent with what has been argued above regarding the degenerate points corresponding to the configuration with the emitter parton at rest in the partonic centre-of-mass frame. In the second region, the jacobian develops an integrable singularity, that can be dealt with by importance sampling techniques in Monte Carlo integration.

2.4 Generation of radiation

The POWHEG master formula for the generation of radiation [19, 20] is

$$\begin{aligned} d\sigma _\text {NLO}= & {} \overline{B}(\varPhi _n)d\varPhi _n\bigg [\varDelta _\text {NLO}(\varPhi _n,t_{\mathrm{min}}) \nonumber \\&+\sum _\alpha \frac{\left[ d\varPhi _\text {rad}\varDelta _\text {NLO}(\varPhi _n,K_\perp (\varPhi _{n+1}))R(\varPhi _{n+1})\right] _{\alpha }^{\overline{\varPhi }^\alpha _n=\varPhi _n}}{B(\varPhi _n)}\bigg ],\nonumber \\ \end{aligned}$$
(50)

where \(t_{\mathrm{min}}\) is an infrared cutoff, and the NLO Sudakov form factor is given by

$$\begin{aligned}&\varDelta _\text {NLO}(\varPhi _n,p_T) =\theta (p_T-t_{\mathrm{min}}) \nonumber \\&\quad \times \exp { \left[ -\sum _{\alpha }\int \frac{\left[ d\varPhi _\text {rad}R(\varPhi _{n+1}) \varTheta (K_\perp (\varPhi _{n+1})-p_T)\right] _{\alpha }^{\overline{\varPhi }^\alpha _n= \varPhi _n}}{B(\varPhi _n)}\right] }.\nonumber \\ \end{aligned}$$
(51)

In the case of a massless emitter, \(K_\perp \) is a smooth function of the radiation variables, which is required to reduce to the transverse momentum in approaching the soft and collinear limits. For the massive case, in Ref. [15] the following definition was proposed

$$\begin{aligned} K_\perp ^2=2\frac{k^0}{p^0}p\cdot k = \frac{q^2}{2}\xi ^2(1-\beta y_\text {phy}). \end{aligned}$$
(52)

\(y_\text {phy}\) denotes the cosine of the physical angle between the emitter and the emitted parton.Footnote 2 Eq. (52) has the remarkable property of reducing continuously to the transverse momentum in the massless limit. We assume it as our default scale choice.

According to the standard veto method, we look for a suitable upper bound function U of the integrand in the NLO Sudakov form factor, namely

$$\begin{aligned} U(\xi ,y)d\xi dy \ge \frac{R}{B} J(\xi ,y) d\xi dy. \end{aligned}$$
(53)

For the sake of simplicity, we have omitted the integration on the azimuthal angle \(d\phi \), which results in a constant \(2\pi \) factor.

We model the upper bound function on the asymptotic singular behavior of the real matrix element near the soft and collinear singularities. We recall that the jacobian of the mapping has a divergent behaviour near the curve \(\xi =\xi (y)\). The upper bound function should have a behaviour not weaker than the Jacobian near the singular regions, and furthermore, it should be simple enough to allow us to perform an analytical integration in the constrained radiation phase space given by the cut \(K_T^2>t\).

It is convenient to perform a change of integration variables from \(\xi ,\,y\) to \(\xi ,\,K_T^2\). Indeed, it turns out that \(K_T^2\) is a monotonic decreasing function of y at fixed \(\xi \), i.e. \({\partial K_T^2}/{\partial y} < 0\).

The inversion of this mapping is too complex to be performed analytically,Footnote 3 but easy to perform numerically. We find that the associated jacobian \({\partial K_T^2}/{\partial y}\) has a behaviour similar to that of the jacobian of the mapping J:

$$\begin{aligned}\sim & {} \frac{1}{\sqrt{\xi (y)-\xi }} \;\; \text{ when }\;\; \xi \rightarrow \xi (y); \end{aligned}$$
(54)
$$\begin{aligned}\sim & {} (\xi (0)-\xi )^2 \;\; \text{ when }\;\; \xi \rightarrow \xi (0)\;\;\text{ for }\;\; y\ge 0. \end{aligned}$$
(55)

We now write

$$\begin{aligned} U = \frac{\partial K_T^2}{\partial y} U', \end{aligned}$$
(56)

so that in the new integration variables the integrand becomes \(U'\)

$$\begin{aligned} \int d\xi \, dy\, \varTheta (K_T^2 -t) U = \int d\xi dK_T^2 \varTheta (K_T^2 -t) U'. \end{aligned}$$
(57)

\(U'\) must have a simple form, and must have the appropriate behaviour to act as an upper bound for the soft and collinear singularities of the real matrix element.

2.4.1 Upper bound function

The singular behaviour of the real matrix element squared is universal and can be extracted in a straightforward manner by means of the eikonal approximation. In terms of the radiation variables, we get

$$\begin{aligned} \frac{R}{B} \sim \frac{N}{\xi ^2(1-\beta y_\text {phy})} = \frac{N}{K_T^2}, \end{aligned}$$
(58)

with N a suitable normalization constant. On the other hand, in the soft limit, the jacobian of the mapping behaves as

$$\begin{aligned} J(\xi ,y) \sim N'\xi . \end{aligned}$$
(59)

We must also take into account the behaviour in the soft limit of the jacobian term factorized in U:

$$\begin{aligned} \frac{\partial K_T^2}{\partial y} \sim N''\xi ^2. \end{aligned}$$
(60)

Putting all the three contributions together, we obtain the following expression of the upper bound function \(U'\)

$$\begin{aligned} U'(\xi ,K_T^2) = \frac{1}{K_T^2}\times \xi \times \frac{1}{\xi ^2}=\frac{1}{\xi K_T^2}. \end{aligned}$$
(61)

A more complete analysis shows that mapping J is enhanced (although not divergent) at large \(\xi \) for \(y\rightarrow -1\). In order to get a more efficient upper bound, we add the factor \(\frac{1}{1-{K_T^2}/{q^2}}\) to the previous expression. Hence, our final choice for the upper bound function \(U'\) is

$$\begin{aligned} U'(\xi ,K_T^2) = \frac{1}{\xi K_T^2 (1-{K_T^2}/{q^2})}. \end{aligned}$$
(62)

2.4.2 Integral of the upper bound function

In order to integrate the upper bound function analytically, its domain of integration has to be suitably enlarged. This can be done by interpreting the R / B expression as being defined in the larger domain, but as vanishing outside of the physical domain. Since the veto procedure prescribes that a point generated according to the upper bound function should be accepted with a probability proportional to the value of the radiation function divided by the upper bound function, points generated outside the physical domain should always be vetoed according to the above interpretation. From Eq. (52), we find the upper bound

$$\begin{aligned} K_T^2 < K_\text {max}^2 \equiv \frac{q^2}{2}\xi _\mathrm{max}^2(1+\beta _0), \end{aligned}$$
(63)

where \(\beta _0\) is the velocity of the emitter in the underlying Born configuration (this follows from the fact that we always have \(\beta \le \beta _0\)), and we also find

$$\begin{aligned} \frac{2K_T^2}{(1+\beta _0)}< \xi ^2 < \frac{2K_T^2}{(1-\beta _0)}, \end{aligned}$$
(64)

We thus take as our domain of integration the region in \(K_T\) and \(\xi \) such that Eqs. (63) and (64) are satisfied. We notice that in this way \(\xi \) can even become larger than 1. In practice, however, adding also the \(\xi <1\) or \(\xi <\xi _{\mathrm{max}}\) limit would render the integration more difficult, so we prefer to deal with it by vetoing. Defining

$$\begin{aligned} \xi _{\,{\mathop {m}\limits ^{M}}{}}(K_t^2) \equiv \sqrt{\frac{2K_T^2}{q^2(1 \mp \beta _0)}}, \end{aligned}$$
(65)

the integral of the upper bound function is then

$$\begin{aligned} I(t)= & {} \int _t^{K_{\mathrm{max}}^2} \frac{dK_T^2}{K_T^2(1-{K_T^2}/{q^2}) }\int _{\xi _m(K_t^2)}^{\xi _M(K_t^2)} \frac{d\xi }{\xi }\nonumber \\= & {} \ln \left[ \frac{K_{\mathrm{max}}^2}{q^2-K_\mathrm{max}^2}\frac{q^2-t}{t}\right] \,y_0, \end{aligned}$$
(66)

where \(y_0 \equiv (1/2)\ln [(1+\beta _0)/(1-\beta _0)]\) is the rapidity of the emitter in the underlying Born configuration. Given a number \(0<r<1\), the t value generated by solving the equation \(r=\exp [-2\pi N I(t)]\) is

$$\begin{aligned} t = \frac{A}{1+A}q^2, \quad A=\frac{K_{\mathrm{max}}^2}{q^2-K_\mathrm{max}^2}\exp \left[ \frac{\log {r}}{2\pi N y_0}\right] . \end{aligned}$$
(67)

2.4.3 Generation of radiation kinematics

The algorithm for generating the radiation variables proceeds as follows:

  1. 1.

    We set the initial scale \(t_0=K^2_{\mathrm{max}}\).

  2. 2.

    We generate a uniform random number

    $$\begin{aligned} 0<r<\exp [-2\pi N I(t_0)], \end{aligned}$$

    and get t from Eq. (67). If t is below \(t_{\mathrm{min}}\), no radiation is generated, and the event is emitted as is.

  3. 3.

    We pick a new uniform random number \(0<r'<1\) and we generate a value for \(\xi \) as

    $$\begin{aligned} \xi = \xi _\text {m}(t)\exp (y_0r'). \end{aligned}$$
    (68)

    This is consistent with the distribution of \(\xi \) at fixed \(K_T^2\) according to Eq. (66).

  4. 4.

    If \(\xi >\xi _\text {max}\), we set \(t_0=t\), and go back to the step 2.

  5. 5.

    If the veto condition is passed, given t and \(\xi \), we solve numerically for y the implicit equation

    $$\begin{aligned} K_T^2(\xi ,y) = t. \end{aligned}$$
    (69)

    If a solution does not exist, we set \(t=t_0\) and go back to step 2.

  6. 6.

    Now that \(\xi \) and y are available, we generate a random \(\phi \), and compute the ratio \(R=[R/B J(\xi ,y)]/U(\xi ,y)]\), with U given in terms of \(U'\) in Eq. (56), and generate a new random number \(0<r'''<1\). If \(r'''>R\) we set \(t_0=t\) and go back to the step 2. Otherwise, the event is accepted.

3 Phenomenology

3.1 Comparison in the bb4l case

We have compared results obtained with the new method presented here, with those obtained with the default POWHEG settings for the bb4l generator of Ref. [17]. We found remarkable agreement between the two results for all the distributions that we have examined. Here we show only two of them, to convey the idea of the quality of the agreement. These results were obtained for the 8TeV LHC collider, using the MSTW2008 PDF [21] set for reference only (other sets could be used as well [22, 23]). In our simulations we make the B hadrons stable. Jets are reconstructed using the Fastjet [24] implementation of the anti-\(k_{\mathrm{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } T}}\) algorithm [25] with \(R=0.5\). We denote as B (\({{\bar{B}}}\)) the hardest (i.e. largest \(p_\mathrm{T}\)) b (\(\bar{b}\)) flavoured hadron. The B (\(\bar{B}\)) jet \({j_\mathrm{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } B}} {}\) (\(j_{\bar{B}}\)) is defined to be the jet that contains the hardest B (\(\bar{B}\)). We discard events where the \({j_\mathrm{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } B}} {}\) and \(j_{\bar{B}}\) coincide. The hardest \(e^+\) (\(\mu ^-\)) and the hardest \(\nu _e\) (\(\bar{\nu }_{\mu }\)) are paired to reconstruct the \(W^+\) (\(W^-\)). The reconstructed top (antitop) quark is identified with the corresponding \(W^+j_B\) (\(W^-j_{\bar{B}}\)) pair. We show the invariant mass of the \(W-b\)-jet system (Fig. 5) and the B fragmentation function in top decay (Fig. 6), as defined in Ref. [17], i.e. the the B energy in the reconstructed top rest frame normalized to the maximum value that it can attain at the given top virtuality. In the curves, the alt (for “alternative”) label stands for our new implementation, while def (for “default”) is the current POWHEG default. As one can see, the agreement is very good. This also shows that details in the implementation of radiation from the b quark in top decays do not seem to have important impact on physical observables.

Fig. 5
figure 5

Invariant mass distribution of the reconstructed top quark mass, defined as the mass of the \(W^+j_B\) or \(W^-j_{\bar{B}}\) system, produced with the bb4l generator, at the 8 TeV LHC. The two distributions are obtained with the default implementation of radiation from b quarks (def), and with the new implementation presented here (alt)

Fig. 6
figure 6

B fragmentation function in top quark decay as defined in Ref. [17], produced with the bb4l generator for the 8 TeV LHC. The default and alternative implementation of radiation from b quarks are compared

We found that the efficiency and the generation rate of the new implementation are comparable with those of the POWHEG default.

3.2 b production in hadronic collisions

In this section we study the available POWHEG implementations of radiation from massive quarks for the hvq generator [14], i.e. the default POWHEG implementation and our new one. In spite of the fact that the default formalism has been available for quite some time [15], no such study has been performed so far. We thus discuss it in this work, where we can also compare with our new implementation.

Fig. 7
figure 7

Example diagrams for the three mechanism that give rise to log-enhanced contributions in heavy flavour production: a final state radiation from a quark; b gluon splitting; c flavour excitation

The hvq generator has been available for quite some time as a tool to generate top, bottom and charm pairs in hadronic collisions. It is designed to simulate correctly the production of a heavy flavour pair when the logarithm of the ratio of the transverse momentum of the heavy quark divided by its mass is not too large. This limitation arises because there are three mechanisms, depicted in Fig. 7, involving radiation from the final state quark, production of a heavy quark-antiquark pair via final state gluon splitting and the splitting of an initial state gluon into a heavy quark-antiquark pair (where one of the two quarks is scattered at large transverse momentum), that can generate large logarithms involving the mass of the heavy quark. In the inclusive cross section for the production of a heavy quark with a given \(p_T\), for example, they generate logarithms of \(p_T/m\) (see Ref. [26], Eq. (5.1)). The last two mechanisms are commonly referred to as gluon splitting and flavour excitation. In spite of this, the hvq generator has also been used to model relatively large transverse momentum production of heavy flavours, as in Ref. [5]. There, the transverse momentum distribution of the heavy flavoured hadron in hvq was compared with the more accurate (but less exclusive) FONLL prediction [4]. It was found to be in rather good agreement. However, the large uncertainties related to the non-perturbative fragmentation of the heavy quark leads to the suspect that such agreement is at least in part accidental.

Fig. 8
figure 8

Comparison of alt and def for the transverse momentum distribution of the B hadron at the 8TeV LHC

Fig. 9
figure 9

Comparison of alt and def for the transverse momentum distribution of the b-jet at the 8TeV LHC

Fig. 10
figure 10

Comparison of alt and def for the b-jet mass

We will now compare the results obtained with the default hvq generator, that we will label nol (for “no light”, meaning that the heavy quark is treated as very heavy), that treats as singular regions only the radiation from massless partons (i.e. initial state radiation); hvq with the inclusion of the radiation from the heavy quark as a singular region will be labeled asl (for “as light”, meaning that the heavy quark is treated as a light parton). Furthermore, the default treatment of the heavy quark radiation region will be denoted as def, while the new implementation presented here will be called alt. In Figs. 8, 9 and 10 we show a comparison of def and alt. We can immediately see that we do not find important differences between the two methods, consistently with what was found in the bb4l case. The settings are similar to the bb4l case: we make the B hadrons stable, and define the b (\({{\bar{b}}}\)) jets as the jets containing the hardest b (\({{\bar{b}}}\)) flavoured hadron, with the jets defined as in the bb4l case. However, we do not exclude the case when both hardest b-flavoured hadrons are in the same jet. We perform the calculation for the LHC at 8 TeV, using NNPDF30_nlo_as_0118 pdf set [23]. As one can see, the two implementations are in excellent agreement. Observe the jump at 10 GeV in the \({j_\mathrm{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } B}} \) mass. It is due to the case in which the b and \({{\bar{b}}}\) flavoured hadrons are both in the jet cone. From Fig. 10 we also see that for jet masses above 10 GeV the gluon splitting configuration dominates.

We found that the new implementation has a generation efficiency, which is estimated from the numbers of vetoes in FSR generation, three times greater than the default one. This leads to a generation rate of 1316 events per minute, against the 298 events per minute of the POWHEG default, which corresponds to a gain more than a factor of 4.

We now show in the left panels of Figs. 11, 12 and 13 the comparison among the alt and nol. Here we see considerable differences, especially in the large-momentum tail of the B and \({j_\mathrm{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } B}} \) transverse momentum distribution, the alt ones being much harder. The mass of the b jet is also remarkably different. The large difference above 10 GeV hints to the fact that heavy quark pair production via the splitting of a large transverse momentum gluon is treated in a very different way in the two cases, and that this difference may be the cause of the large discrepancy in the transverse momentum distribution of the b hadron.

Fig. 11
figure 11

Left panel: comparison of alt and nol for the transverse momentum distribution of the B hadron 8TeV LHC. Right panel: same comparison with the treatment of the enhanced regions using remnants, as discussed in the text

Fig. 12
figure 12

Same as in Fig. 11 for the \(p_T\) of the b-jet

Fig. 13
figure 13

Same as in Fig. 11 for the b-jet mass

The difference between the alt and nol cases should not come as a surprise. The generation of radiation is performed in the nol case according to the formula

$$\begin{aligned} \mathrm {d}\sigma= & {} \mathrm {d}\varPhi _B \tilde{B}(\varPhi _B) \exp \left[ \int \frac{R(\varPhi _B,\varPhi '_{\mathrm{rad}})}{B(\varPhi _B)} \theta (k_t'-k_t) \mathrm {d}\varPhi '_{\mathrm{rad}}\right] \nonumber \\&\times \frac{R(\varPhi _B,\varPhi _{\mathrm{rad}})}{B(\varPhi _B)} \mathrm {d}\varPhi _{\mathrm{rad}}\,, \end{aligned}$$
(70)

where \(k_t\) is the transverse momentum of the emitted gluon with respect to the beam axis, since the only singular regions that are considered there are the initial-state radiation (ISR) ones. The strong coupling constant and the parton densities are evaluated by default at a scale equal to the transverse mass of the heavy quark at the level of the underlying Born kinematics

$$\begin{aligned} \mu _f=\mu _r=\sqrt{k_{t,q}^2+m_q^2} \end{aligned}$$
(71)

in the \(\tilde{B}\) function, while they are evaluated at a scale \(k_t\) (or \(k'_t\)) in the R / B ratios appearing in formula (71). Since \(\tilde{B}\) and B are of order \(\alpha _{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } \mathrm S}^2\), while R is of order \(\alpha _{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } \mathrm S}^3\), this means that in practice two powers of the strong coupling are evaluated at the scale of Eq. (71), while one power is evaluated at a scale \(k_t\). The mismatch in the scale used in \(\tilde{B}\) and in the B appearing in the ratios, combined with the exponential, leads as usual to the correct Sudakov form factor for initial state emission.

3.2.1 Problematic regions

In case the transverse momentum of the gluon is small, the scale assignments and the Sudakov form factor describe the process appropriately. It can happen however, that the real emission kinematics is near the gluon splitting, flavour excitation or quark radiation regimes. In these cases the gluon transverse momentum is not small. Furthermore, the numerator R in the integrand may be enhanced with respect to the denominator, thus yielding a damping of the real cross section that is not justified. Also the scale choices are not appropriate. For example, in the case of production of a high transverse momentum heavy quark pair according to the gluon splitting mechanism, the appropriate scale should correspond to two powers of \(\alpha _{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } \mathrm S}\) evaluated at the gluon transverse momentum, and one power of \(\alpha _{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } \mathrm S}\) evaluated at the scale of the order of the invariant mass of the heavy quark pair.

The adoption of the methods illustrated in Ref. [15] and in the present work for dealing with radiation from a heavy quark leads to the correct treatment of the radiation from the heavy, quark provided all remaining regions are treated correctly. This is in fact what happens in the case of the bb4l generator, where there is only one enhanced region, but it is not the case for the asl generator, that does not treat in a proper way the two regions of gluon splitting and flavour excitation. Thus, the nol and the asl generators will end up treating the enhanced regions in different (and in both cases incorrect) ways. In fact, while in the nol case the enhanced regions will all be treated as if they were ISR processes, in the asl case they will be split, and treated in part as ISR processes, and in part as radiation from the heavy quarks. In order to test this hypothesis, and in order to explore possible strategies to deal with this problem, we proceed as follows. It is possible in POWHEG to further separate out the real cross section into two terms, such that only one term has singular behaviour, while the remaining term, being finite, can be integrated independently. In the hvq case, this means

$$\begin{aligned} R=R^{(s)}+R^{(r)}\,. \end{aligned}$$
(72)

Equation (70) is then replaced by

$$\begin{aligned} \mathrm {d}\sigma= & {} \mathrm {d}\varPhi _B \tilde{B}^{(s)}(\varPhi _B) \exp \nonumber \\&\times \left[ \int \frac{R^{(s)}(\varPhi _B,\varPhi '_{\mathrm{rad}})}{B(\varPhi _B)} \theta (k_t'-k_t) \mathrm {d}\varPhi '_{\mathrm{rad}}\right] \nonumber \\&\times \frac{R^{(s)}(\varPhi _B,\varPhi _{\mathrm{rad}})}{B(\varPhi _B)} \mathrm {d}\varPhi _{\mathrm{rad}} \nonumber \\&+\int \mathrm {d}\varPhi _B \mathrm {d}\varPhi _{\mathrm{rad}} R^{(r)}(\varPhi _B,\varPhi _{\mathrm{rad}})\,. \end{aligned}$$
(73)

We can exploit this mechanism in order to separate out the enhanced regions, in such a way that we can treat them in a more uniform way with our generators. In particular, we separate out the gluon splitting and flavour excitation processes in all cases. In the nol case we also separate out the regions of radiation from the heavy quarks, in such a way that they are treated in a more transparent way. Observe that in performing this separation we rely upon the fact that the three enhanced region are not really singular, since the quark mass cuts off the collinear singularities, and thus the remnant term is actually finite.

We define the distance of a real configuration from a given enhanced region as follows

$$\begin{aligned}&d_{\mathrm{isr}} = k_t^2, \quad d_{\mathrm{glsp}} = 2 k_q\cdot k_{{\bar{q}}}\frac{k_q^0 k_{{\bar{q}}}^0}{(k_q^0 + k_{{\bar{q}}}^0)^2}, \nonumber \\&d_q=2 k_q\cdot k\frac{k^0}{k_q^0}+m_q^2, \quad d_{{\bar{q}}}=2 k_{{\bar{q}}}\cdot k\frac{k^0}{k_{{\bar{q}}}^0}+m_q^2, \nonumber \\&d_{q,\mathrm{flex}} = k_{{\bar{q}},\perp }^2 + m_q^2, \quad d_{{\bar{q}},\mathrm{flex}} = {k_{q,\perp }^2} + m_q^2, \end{aligned}$$
(74)

where in the first line the distances for ISR and gluon splitting are given, in the second line those for radiation from the heavy quarks, and in the last line the ones for flavour excitation. We then define, for the nol generator

$$\begin{aligned}&D = \frac{d_{\mathrm{isr}}^{-1}}{d_{\mathrm{isr}}^{-1}+d_{\mathrm{glsp}}^{-1}+ d_q^{-1}+d_{{\bar{q}}}^{-1} +d_{q,\mathrm{flex}}^{-1}+d_{{{\bar{q}}},\mathrm{flex}}^{-1} }, \nonumber \\&R^{(s)}=R D,\qquad R^{(r)}=R (1-D)\,. \end{aligned}$$
(75)

For the alt and def generators, we define

$$\begin{aligned}&D = \frac{d_{\mathrm{isr}}^{-1}+ d_q^{-1}+d_{{\bar{q}}}^{-1}}{d_\mathrm{isr}^{-1}+d_{\mathrm{glsp}}^{-1}+ d_q^{-1}+d_{{\bar{q}}}^{-1} +d_{q,\mathrm{flex}}^{-1}+d_{{{\bar{q}}},\mathrm{flex}}^{-1} } \end{aligned}$$
(76)
$$\begin{aligned}&R_i^{(s)}=R_i D,\qquad R_i^{(r)}=R_i (1-D)\,, \end{aligned}$$
(77)

where the index i labels the three singular regions that POWHEG is handling. In this case, the cross section is damped if the kinematics is near a singular region that is nether ISR nor FSR, i.e. only gluon splitting and flavour excitation kinematics are separated into the (r) component.

There is one more issue that needs to be considered when using a damping factor in POWHEG. By default, when evaluating the \(R^{(r)}\) component (called “real remnant”), the scale choice is the same as for \(\tilde{B}\), i.e. it is Eq. (71) applied to the underlying Born kinematics, that depends upon the considered singular region. This would lead to a different scale choice for the remnants in nol and asl. In order to avoid that, we should set the scale on the basis of the real kinematics. This can be done in POWHEG by setting appropriate flags and by modifying the code that computes the scales for the process. Our scale choice is

$$\begin{aligned} \mu _f=\mu _r=\frac{1}{2}\left[ \sqrt{k_{t,q}^2+m_q^2}+\sqrt{k_{t,{\bar{q}}}^2+m_q^2}+k_t\right] \,, \end{aligned}$$
(78)

that has the correct limit to the underlying Born scale both in the ISR and in the FSR case.

The result of this procedure is shown in the right panels of Figs. 11, 12 and 13. We notice a remarkable improvement in the agreement, although some important differences do remain. This is not unexpected, since in the two cases radiation from the heavy quark is treated in a very different way. It is interesting to notice that the B and the \({j_\mathrm{\mathchoice{\displaystyle }{\scriptstyle }{\scriptscriptstyle }{\scriptscriptstyle } B}} \) spectra computed with the nol without remnants (which is the default in the standard hvq generator), is in fair agreement with the alt one when the enhanced regions are separated using the remnants. Since the default hvq program gives a description of the transverse momentum distribution of B hadrons that is in fair agreement with the FONLL calculation, we infer that also the alt prediction will display a similar agreement, provided the gluon splitting and flavour excitation region are treated separately as remnants.

The alt (or equivalently the def generator), with the remnant separation discussed above, seems to be at this point the generator that may give the best description of b production data at hadron collider. We should not forget, however, that some flexibility still remains in the treatment of the remnant (in this work we have made a definite scale choice for the remnants in order to have a clearer comparison with the nol generator). We also notice from Figs. 11 and 12 that after the remnants are introduced, the B-hadron and b-jet \(p_T\) spectra become softer. This seems to be in contrast with the discussion at the beginning of Sect. 3.2.1. On the other hand, this result may be due to the particular scale choice that we have performed for the real graphs, and that POWHEG applies automatically also to the remnants. This scale turns out to be higher than the typical scale involved in the region discussed at the beginning of Sect. 3.2.1. A better approach would be to introduce the possibility of alternative scale choices in the remnants, including the possibility of performing a different scale choice depending upon which enhanced region one is considering. We refrain here from carrying out such study, since we believe that comparison with data on single inclusive b-hadron and b-jet production (see Refs. [27,28,29] and references therein) and on correlations of \(b\bar{b}\) pairs [30, 31], would be needed in order to make progress in this direction, and this is beyond the scope of the present work. Such study would however be very valuable, and not only for the purpose of testing QCD in bottom production. The production of top pairs is one of the most important background process at the LHC, including also its future high luminosity and eventually high energy developments. An accurate simulation of the \(t\bar{t}\) background would be very valuable for the LHC experimental collaborations, and it is quite clear that a model of b production yielding a good description of the data from low to high transverse momenta should be also well suited to describe top production in all the needed phase space. Furthermore, these studies could lead to an improved description of top production at large transverse momentum, that, as reported currently by CMS and ATLAS (see [32, 33] and references therein) seems to be not well described by theoretical models.

4 Conclusions

In this paper we have presented a method for implementing radiation from a heavy quark in the POWHEG framework. This method is considerably simpler and more transparent than the one presented in Ref. [15], and it has a much better numerical performance. The present method overcomes a problem related to the fact that the most natural map from an underlying Born configuration and a set of FKS-like radiation variables for radiation from a massive quark does not have a unique inverse in the whole kinematic region. The POWHEG inverse map has thus two solutions in some region of phase space, and in the present work it is shown how, by giving up the physical connection of the y variable to the gluon emission angle in a very limited region of phase space one can pick one of the two solutions in such a way that we still have a single valued inverse mapping. We have examined the output of the new method in the framework of the generators of Refs. [17] and [14]. We found that the new method yields results that are very consistent with the previous one, that is at this moment the POWHEG default. This is reassuring, since it shows that details of the implementation do not impact in a visible way the physics result, and also it shows that the previous implementation, in spite of being quite contrived, is in fact correct.

In this work we have also examined for the first time the impact of the inclusion of the singular region associated with radiation from the heavy quark in the case of the hvq generator. We have shown that, unless one separates the enhanced gluon splitting and flavour excitation contribution from the real contribution that are dealt with by the POWHEG radiation formula, and treats them as remnants, one finds results that are in considerable disagreement with the traditional hvq implementation. On the other hand, it seems that performing this separation is the appropriate thing to do for a consistent modeling of the process. We notice that such modeling would be of great interest also for its potential application to top pair production, that is a very important background to many LHC physics studies.