Infrared Problem vs Gauge Choice: Scattering of Classical Dirac Field


We consider the Dirac equation for the classical spinor field placed in an external, time-dependent electromagnetic field of the form typical for scattering settings: \(F=F^\mathrm{ret}+F^\mathrm{in}=F^\mathrm{adv}+F^\mathrm{out}\), where the current producing \(F^{\mathrm{ret}/\mathrm{adv}}\) has past and future asymptotes homogeneous of degree \(-3\), and the free fields \(F^{\mathrm{in}/\mathrm{out}}\) are radiation fields produced by currents with similar asymptotic behavior. We show the existence of the electromagnetic gauges in which the particle has ‘in’ and ‘out’ asymptotic states approaching free field states, with no long-time corrections of the free dynamics. Using a special Cauchy foliation of the spacetime, we show in this context the existence and asymptotic completeness of the wave operators. Moreover, we define a special ‘evolution picture’ in which the free evolution operator has well-defined limits for \(t\rightarrow \pm \infty \); thus the scattering wave operators do not need the free evolution counteraction.


This article is a further step in a program of investigation of the infrared problems in electrodynamics. Among them, the long-time asymptotic behavior of the charged matter fields is one of the key issues. Such questions as understanding what is a charged particle, or how to define scattering operator in quantum electrodynamics, are correlated to this problem.

It is well known that the long-time asymptotics poses problems in theories with long-range interactions. Standard way to deal with that is to modify the asymptotic dynamics of charged particles or fields, by Dollard or similar methods, augmented by some ‘dressing’ of charged particles. The cost is the loss of a clear interpretation of the asymptotic dynamics. Moreover, in physically realistic quantum field theory models this procedure has not resulted, up to now, in a non-perturbational understanding of the issue.Footnote 1

In a series of articles I have put forward the idea, that the long-time asymptotics problem may be relieved by an appropriate choice of gauge of the electromagnetic potential. In a recent article [11] a Schrödinger (nonrelativistic) particle was considered in a time-dependent electromagnetic field, of the form typical for scattering situations. It was found that an appropriate choice of gauge allows the existence of the asymptotic dynamics, with no need for dressing.Footnote 2 We also refer the reader to this article for more extensive description of the context and our motivation.

Here we implement the idea in the case of the classical Dirac field evolving in the external electromagnetic field. The system is not fully self-interacting, but the form of the external electromagnetic field mimics the expected properties of this field in fully interacting system. We consider the evolution of the Dirac field as a unitary evolution in a Hilbert space. However, similarly as in the previously considered nonrelativistic case, the evolution does not take place in standard time, over flat pure space sheets. It turns out, that for our purpose it is convenient to consider a Cauchy foliation supplied by constant \(\tau \) surfaces, where \((\tau ,\mathbf {z})\) form the coordinate systemFootnote 3 defined by:

$$\begin{aligned} x^0=\tau (|\mathbf {z}|^2+1)^\frac{1}{2}\,,\qquad \mathbf {x}=(\tau ^2+1)^\frac{1}{2}\mathbf {z}\,. \end{aligned}$$

This poses technical complication even at the free Dirac equation level, as the unitary evolution is not a unitary one-parameter group (Hamiltonian depends on \(\tau \)).Footnote 4 This is to be contrasted with the nonrelativistic case considered earlier, where the Niederer transformation introducing suitable coordinates leads from the free particle Hamiltonian to that of the harmonic oscillator (a kind of beautiful miracle). Nevertheless, the Schrödinger operator case becomes steeply difficult when the potential with space part is introduced, as then the term \(\mathbf {A}\cdot \mathbf {p}\) prevents the application of the Dyson series method. To deal with that case, we have applied the Kato theorem. Here, for the free Dirac equation we proceed differently, and our method may be of independent interest. The addition of the interaction with the electromagnetic field may be then treated, in contrast to the nonrelativistic Schrödinger case, by a variation of the Dyson method combined with the relativistic causality. For this system we show that an appropriate choice of gauge removes the asymptotic problem. We show the existence and asymptotic completeness of the wave operators, with no need for any modification of the asymptotic free Dirac evolution.Footnote 5

This result will show that the choice of a gauge \(\mathcal {A}(x)\) in this classical field setting has a decisive importance for the asymptotic identification of the incoming/outgoing charged fields. The main qualitative feature of gauges in this appropriate class is that the product \(x\cdot \mathcal {A}(x)\) vanishes in timelike directions, see remarks in Sect. 8 (a property already identified in [8]). We would like to stress that the theory is formulated from the outset in such chosen gauge. Whether such formulation may be carried over to the quantum field theory is a subject for future research.Footnote 6 This prospective investigation should also find contact with the asymptotic algebra of fields in quantum electrodynamics postulated by the author [9] (see also [10]).

Here let us only mention that the use of a gauge in the class anticipated above could lead to some broadening of the scope of external classical, time-dependent electromagnetic fields for which the scattering operator for the classical Dirac field may be lifted to the case of the respective quantum field. As is well known, the mixing of electrons/positrons leads to rather severe restrictions in this respectFootnote 7 (see, e.g., the monograph [17], Sections 2.4 and 2.5). Precisely what gain would be possible is an open question.

The outline of the article is as follows. In Sect. 2 we formulate the Dirac evolution as a unitary evolution, with time-dependent self-adjoint generators, over a rather general family of Cauchy surfaces. Section 3 gives the formulation of the external field problem with the Dyson series method, but with the important use of the relativistic causality. In both free and interacting case the self-adjointness is achieved with the use of the commutator theorem (see Appendix B) and the harmonic oscillator Hamiltonian. In Sect. 4 we specify our choice of coordinates to those mentioned above and transform the evolution to a new ‘picture.’ In this picture the free Dirac evolution on our foliation has well-defined limits as unitary operators. Section 5 specifies the general external problem to the electromagnetic case and discusses the gauge transformation. Section 6 contains our central results formulated for a wide class of electromagnetic fields: the existence and asymptotic completeness of the wave operators. Section 7 gives a theorem showing that the electromagnetic fields typical for scattering contexts described above admit potentials in gauges satisfying the demands of the main theorems of Sect. 6. Section 8 offers some remarks on the implications of our results and on further physically motivated restriction in the class of the obtained electromagnetic gauges.

Large parts of the material are shifted to Appendix. In Appendix A we discuss a spinor transformation needed in Sect. 2. Appendix B describes our method to deal with a class of time-dependent Hamiltonians. A lemma needed for the application of this method to the system considered here is discussed in Appendix C. In Appendix D we gather geometrical facts and relations in our special coordinate system. Appendices E and F recapitulate some properties of the solutions of the free Dirac and wave equations, respectively. Appendix G contains some estimates of the decay of the advanced and retarded solutions of the inhomogeneous wave equation and their differences (radiation fields). The results of Appendices E–G are applied next in Appendix H to the case of the electromagnetic fields typical for scattering contexts. Finally, the necessary decay properties of the special gauge introduced in Sect. 7 are obtained in Appendix I.

Free Dirac Evolution

Throughout the article we set \(\hbar =1\), \(c=1\). We choose a reference point, and then the flat spacetime is identified with the Minkowski vector space. Let m be the mass parameter in the free Dirac equation. To simplify notation, we rescale Minkowski vectors by multiplying them by m, and denote the resulting space by M, and its dimensionless vectors by \(x^a\). The flat (covariant) derivative in the rescaled Minkowski space is denoted by \(\nabla _a\). Also, the electromagnetic interaction to appear later will be introduced by the interaction term \(\mathcal {A}_a\overline{\psi }\gamma ^a\psi \), so to recover the physical units and quantities one should replace \(\mathcal {A}\rightarrow (e/m)\mathcal {A}\), with e the elementary charge.

Let \(\tau :M\mapsto {\mathbb {R}}\) be a smooth surjective function such that the hypersurfaces \(\Sigma _\tau \) of constant \(\tau \) form a Cauchy foliation of the Minkowski spacetime, with \(\tau \) increasing into the future. Consider the Dirac equation, which we write in the form

$$\begin{aligned} (\tfrac{1}{2}[\gamma ^a,i\nabla _a]_+-1)\psi =0 \end{aligned}$$

in our dimensionless coordinates, where \(\gamma ^a\) are the Dirac matrices and \([.,.]_+\) symbolizes the anticommutator. As is well known, the Cauchy problem for this equation with the initial data \(\psi |_{\Sigma _{\tau _0}}=f\) is explicitly solved by the formula

$$\begin{aligned} \psi (x)=i^{-1}\int _{\Sigma _{\tau _0}} S(x-y)\gamma ^a f(y)\, \hbox {d}\sigma _a(y)\,, \end{aligned}$$

where \(\hbox {d}\sigma _a\) is the dual integration element on \(\Sigma _{\tau _0}\) and S(x) is the standard Green function of the free Dirac field, in the dimensionless coordinates

$$\begin{aligned} S(x)=(i\gamma \cdot \nabla +1)D_1(x)\,,\quad D_1(x)=\frac{i}{(2\pi )^3} \int {{\,\mathrm{sgn}\,}}(v^0)\delta (v^2-1)e^{-iv\cdot x}\hbox {d}v\,. \end{aligned}$$

If f is a smooth bi-spinor function on \(\Sigma _{\tau _0}\), with a compact support, then \(\psi (x)\) is a smooth function in M, with compact support on each Cauchy surface \(\Sigma _{\tau }\). Therefore, we obtain a bijective evolution mapping between the spaces of smooth, compactly supported bi-spinor functions on our family of Cauchy surfaces. Moreover, if on each \(\Sigma _\tau \) one defines the scalar product

$$\begin{aligned} (\psi _1,\psi _2)_\tau =\int _{\Sigma _\tau } \overline{\psi _1}\gamma ^a\psi _2\,\hbox {d}\sigma _a\,, \end{aligned}$$

then the evolution is isometric.

Let now \((\zeta ^\mu )=(\zeta ^0,\zeta ^i)=(\tau ,z^i)\equiv (\tau ,\mathbf {z})\in {\mathbb {R}}^4\) (\(i=1,2,3\)), with \(\tau \) described above, be a smooth curvilinear coordinate system, mapping M diffeomorphically onto \({\mathbb {R}}^4\). Denote by \(\eta _{ab}\) and \(C^{a\ldots }{}_{b\ldots }\) the Minkowski spacetime metric tensor, and any other tensor, respectively. Then the geometrical components in the coordinate system \((\xi ^\mu )\) will be denoted by

$$\begin{aligned} \begin{aligned}&\partial _\mu =\frac{\partial }{\partial \zeta ^\mu }=\frac{\partial x^a}{\partial \zeta ^\mu }\nabla _a\,,\quad \partial _\tau =\frac{\partial }{\partial \tau }\,,\quad \partial _i=\frac{\partial }{\partial z^i}\,,\\&\hat{\gamma }^\mu =(\nabla _a\zeta ^\mu )\gamma ^a\,,\quad \hat{C}^{\mu \ldots }{}_{\nu \ldots } =(\nabla _a\zeta ^\mu )\ldots C^{a\ldots }{}_{b\ldots }\frac{\partial x^b}{\partial \zeta ^\nu }\ldots \,,\\&g_{\mu \nu }=\frac{\partial x^a}{\partial \zeta ^\mu }\frac{\partial x^b}{\partial \zeta ^\nu }\eta _{ab}\,,\quad g=\det (g_{\mu \nu })\,,\quad g_{(z)}=\det [(g_{ij})_{i,j\le 3}]\,, \end{aligned} \end{aligned}$$

Coordinates restricted to the indices 1, 2, 3 will be written as \(\hat{\gamma }^i\), \(g_{ij}\) and \(\hat{C}^{i\ldots }{}_{j\ldots }\), and the zeroth coordinate will be indicated by \(\tau \); thus, for instance: \(\mathcal {A}_a\gamma ^a=\hat{\mathcal {A}}_\mu \hat{\gamma }^\mu =\hat{\mathcal {A}}_\tau \hat{\gamma }^\tau +\hat{\mathcal {A}}_i\hat{\gamma }^i\).

In these coordinates, the Dirac equation (2) and the product (3) take the form

$$\begin{aligned}&\Big (\tfrac{i}{2}\Big [|g|^{\frac{1}{2}}\hat{\gamma }^\mu ,|g|^{-\frac{1}{2}} \partial _\mu \Big ]_+ -1\Big )\psi =0\,, \end{aligned}$$
$$\begin{aligned}&(\psi _1,\psi _2)_\tau =\int \overline{\psi _1}\hat{\gamma }^\tau \psi _2 \, |g|^{\frac{1}{2}}d^3z\,. \end{aligned}$$

We now choose a Minkowski reference system \((e_0,\ldots ,e_3)\). Let us denote

$$\begin{aligned} n=[g^{\tau \tau }]^{-\frac{1}{2}}\nabla \tau \end{aligned}$$

and let K be the Lorentz rotation of Dirac spinors in the hyperplane spanned by the pair of timelike unit vectors \((e_0,n)\), such that

$$\begin{aligned} K^{-1}\gamma ^a n_a\,K=\gamma ^0\,,\qquad K^\dagger =\gamma ^0K^{-1}\gamma ^0\,, \end{aligned}$$

where dagger \(\dagger \) denotes the Hermitian conjugation of matrices. The form and further properties of K are discussed in Appendix A. We define the following transformation:

$$\begin{aligned} \psi =T_\tau \chi \,,\qquad T_\tau =(|g|g^{\tau \tau })^{-\frac{1}{4}}K=|g_{(z)}|^{-\frac{1}{4}}K\,. \end{aligned}$$

The product (6) then takes the standard \({\mathbb {C}}^4\otimes L^2({\mathbb {R}}^3)\) form

$$\begin{aligned} (\psi _1,\psi _2)_\tau =\int \chi _1^\dagger \chi _2 d^3z\,. \end{aligned}$$

Setting \(\psi \) into (5) and applying from the left the transformation \((g^{\tau \tau })^{-\frac{1}{2}}T_\tau ^{-1}\) we obtain an equivalent form of the Dirac equation

$$\begin{aligned}&\Big (\tfrac{i}{2}\Big [(g^{\tau \tau })^{-\frac{1}{2}}\hat{\gamma }_K^\mu ,\partial _\mu \Big ]_+ -(g^{\tau \tau })^{-\frac{1}{2}}\Big )\chi \nonumber \\&\quad +\tfrac{i}{2}(g^{\tau \tau })^{-\frac{1}{2}} \Big [\hat{\gamma }_K^\mu ,(K^{-1}\partial _\mu K)\Big ]+\chi =0\,, \end{aligned}$$

where we have introduced notation

$$\begin{aligned} \gamma _K=K^{-1}\gamma K\,. \end{aligned}$$

We now make an additional assumption: \(\Sigma _\tau \) is rotationally symmetric in the chosen Minkowski system, that is \(\tau =\tau (x^0,|\mathbf {x}|)\). We show in Appendix A that in this case the anticommutator in the second line of (9) vanishes. We also note that \((g^{\tau \tau })^{-\frac{1}{2}}\hat{\gamma }_K^\tau =\gamma ^0 \equiv \beta \). This allows us to write the Dirac equation in the form

$$\begin{aligned} i\partial _\tau \chi =\Big (-\tfrac{i}{2}\big [(g^{\tau \tau })^{-\frac{1}{2}}\beta \hat{\gamma }_K^i,\partial _i\big ]_+ +(g^{\tau \tau })^{-\frac{1}{2}}\beta \Big )\chi \,. \end{aligned}$$

We summarize the above discussion.

Theorem 1

(i) Let the smooth function on the Minkowski space \(\tau =\tau (x^0,|\mathbf {x}|)\) determine its foliation by Cauchy surfaces \(\Sigma _\tau \), with \(\tau \) increasing into the future. Denote by \(C_0^\infty (\Sigma _\tau , {\mathbb {C}}^4)\) the space of smooth, compactly supported functions on \(\Sigma _\tau \) with values in \({\mathbb {C}}^4\).

Then the free Dirac evolution of initial data in \(C_0^\infty (\Sigma _\sigma , {\mathbb {C}}^4)\) is determined by a family of bijective linear evolution operators

$$\begin{aligned} U^\Sigma _0(\tau ,\sigma ): C_0^\infty (\Sigma _\sigma , {\mathbb {C}}^4) \mapsto C_0^\infty (\Sigma _\tau , {\mathbb {C}}^4) \end{aligned}$$

isometric with respect to the products (3). The size of the support in \(\Sigma _\tau \) is restricted by the size of the support in \(\Sigma _\sigma \) and the relativistic causality. By continuity, \(U_0^\Sigma (\tau ,\sigma )\) extend to unitary operators

$$\begin{aligned} U^\Sigma _0(\tau ,\sigma ): {\mathcal {H}}(\Sigma _\sigma )\mapsto {\mathcal {H}}(\Sigma _\tau )\,, \end{aligned}$$

where \({\mathcal {H}}(\Sigma _\tau )\) is the Hilbert space of spinor functions on \(\Sigma _\tau \) with the product (3).

(ii) Let \((\tau ,z^i):M\mapsto {\mathbb {R}}^4\) be a smooth diffeomorphism, with notation (4), such that \(\tau \) satisfies the assumptions of (i). Denote

$$\begin{aligned} {\mathcal {H}}={\mathbb {C}}^4&\otimes L^2({\mathbb {R}}^3,d^3z)\,,\nonumber \\ H_0(\tau )=\tfrac{1}{2}[\lambda ^i(\tau ),p_i]_+&+\mu (\tau )\,,\quad h_0=\tfrac{1}{2}(\mathbf {p}^2+\mathbf {z}^2)\,, \nonumber \\ p_i=-i\partial /\partial z^i\,,\qquad \mu (\tau )&=(g^{\tau \tau })^{-\frac{1}{2}}\beta \,, \qquad \lambda ^i(\tau )=\mu (\tau )\hat{\gamma }_K^i\,. \end{aligned}$$

Then the transformation

$$\begin{aligned} T_\tau :{\mathcal {H}}\mapsto {\mathcal {H}}(\Sigma _\tau )\,,\qquad T_\tau =|g_{(z)}|^{-1/4}K\,, \end{aligned}$$

with operator K discussed in Appendix A, is a unitary operator and the family

$$\begin{aligned} U_0(\tau ,\sigma )=T_\tau ^{-1}U_0^\Sigma (\tau ,\sigma )T_\sigma \,, \qquad U_0(\tau ,\sigma ):{\mathcal {H}}\mapsto {\mathcal {H}}\end{aligned}$$

forms a unitary, strongly continuous evolution system, such that the following holds:

  1. (A)

    \(U_0(\tau ,\sigma )C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)=C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) and the relativistic causality is respected.

  2. (B)

    For \(\varphi \in C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) the maps \((\tau ,\sigma )\mapsto H_0(\tau )U_0(\tau ,\sigma )\varphi \) and \((\tau ,\sigma )\mapsto h_0U_0(\tau ,\sigma )\varphi \) are strongly continuous, the map \((\tau ,\sigma )\mapsto U_0(\tau ,\sigma )\varphi \) is in the class \(C^1\) in the strong sense, and the following equations are satisfied

    $$\begin{aligned} i\partial _\tau U_0(\tau ,\sigma )\varphi&=H_0(\tau )U_0(\tau ,\sigma )\varphi \,,\\ i\partial _\sigma U_0(\tau ,\sigma )\varphi&=-U_0(\tau ,\sigma )H_0(\sigma )\varphi \,. \end{aligned}$$
  3. (C)

    The operators \(H_0(\tau )\) are symmetric on \(C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\).


The symmetry of \(H_0(\tau )\) follows from the symmetry of the operators \(\lambda ^i(\tau )\), \(\mu (\tau )\) and \(p_i\), and invariance of \(C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) with respect to them. The other statements of the theorem follow easily from the discussion preceding the theorem. \(\square \)

We now add assumptions which will be satisfied in the context of our application, and which allow us to significantly extend the domains of validity of the results in the last theorem. We use the abbreviation

$$\begin{aligned} \langle z\rangle =(|\mathbf {z}|^2+1)^\frac{1}{2}\,. \end{aligned}$$

Also, the usual multi-index notation will be used.

Theorem 2

Let all the assumptions of Theorem 1 be satisfied. Suppose in addition that the following bounds of matrix norms hold:

$$\begin{aligned} \begin{aligned}&|\lambda ^i(\tau ,\mathbf {z})|\le C(\tau )\langle z\rangle \,,\quad |\partial _z^\alpha \lambda ^i(\tau ,\mathbf {z})|\le C(\tau )\,,\quad 1\le |\alpha |\le 3\\&|\partial _z^\beta \mu (\tau ,\mathbf {z})|\le C(\tau )\langle z\rangle ^{2-|\beta |}\,,\quad |\beta |\le 2\,, \end{aligned} \end{aligned}$$

where \(C(\tau )\) is a continuous function. Denote \(h_0=\tfrac{1}{2}(\mathbf {p}^2+\mathbf {z}^2\)), the self-adjoint harmonic oscillator Hamiltonian.

Then the following holds:

  1. (A)

    \(H_0(\tau )\) are essentially self-adjoint on \(C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) and on each core of \(h_0\), and \({\mathcal {D}}(h_0)\subseteq {\mathcal {D}}(H_0(\tau ))\). Moreover, \(h_0^{-1}H_0(\tau )\) and \(H(\tau )h_0^{-1}\) extend to bounded operators, and as functions of \(\tau \) are strongly continuous.

  2. (B)

    The operator \(h_0U_0(\tau ,\sigma )h_0^{-1}\) is bounded, with the norm

    $$\begin{aligned} \Vert h_0U_0(\tau ,\sigma )h_0^{-1}\Vert \le \exp \left[ \mathrm {const}\int _\sigma ^\tau C(\rho )\hbox {d}\rho \right] \,, \end{aligned}$$

    so in particular

    $$\begin{aligned} U_0(\tau ,\sigma ){\mathcal {D}}(h_0)={\mathcal {D}}(h_0)\,, \end{aligned}$$

    and the map \((\tau ,\sigma )\mapsto h_0U(\tau ,\sigma )h_0^{-1}\) is strongly continuous.

  3. (C)

    For \(\varphi \in {\mathcal {D}}(h_0)\) the vector functions \(h_0U_0(\tau ,\sigma )\varphi \), \(H_0(\tau )U_0(\tau ,\sigma )\varphi \) and \(U_0(\tau ,\sigma )H_0(\sigma )\varphi \) are strongly continuous with respect to parameters, the vector function \(U_0(\tau ,\sigma )\varphi \) is of the class \(C^1\) with respect to parameters in the strong sense and the following equations hold

    $$\begin{aligned} i\partial _\tau U_0(\tau ,\sigma )\varphi&=H_0(\tau ) U_0(\tau ,\sigma )\varphi \,, \\ i\partial _\sigma U_0(\tau ,\sigma )\varphi&=-U_0(\tau ,\sigma )H_0(\sigma )\varphi \,. \end{aligned}$$
  4. (D)

    Relativistic causality is respected by \(U_0(\tau ,\sigma )\). Therefore, if we denote

    $$\begin{aligned}&{\mathcal {H}}\supset {\mathcal {H}}_c\ \text {--the subspace of functions with compact essential support}\,,\\&{\mathcal {D}}_c(\mathbf {p}^2)={\mathcal {H}}_c\cap {\mathcal {D}}(\mathbf {p}^2)\,, \end{aligned}$$


    $$\begin{aligned} U_0(\tau ,\sigma ){\mathcal {H}}_c={\mathcal {H}}_c\,,\quad U_0(\tau ,\sigma ){\mathcal {D}}_c(\mathbf {p}^2)={\mathcal {D}}_c(\mathbf {p}^2)\,. \end{aligned}$$


It follows from the assumptions on \(\lambda ^i\) and \(\mu \) that \(H_0(\tau )\) fulfill all the conditions imposed on the operator h in Lemma 14 in Appendix C. In Theorems 12 and 13 in Appendix B, we now put

$$\begin{aligned} {\mathcal {D}}=C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\,,\quad h(\tau )=H_0(\tau )\,,\quad u(\tau ,\sigma )=U_0(\tau ,\sigma ) \end{aligned}$$

and \(h_0\) as defined in the present assumptions. Then, by the results of Theorem 1, all the assumptions of these theorems are satisfied and the present thesis follows. \(\square \)

Evolution in External Field

We now consider the Dirac equation in an external field, of the form

$$\begin{aligned} (\tfrac{1}{2}[\gamma ^a,i\nabla _a]_+-1-V)\psi =0\,, \end{aligned}$$

where V(x) is a matrix function satisfying the condition

$$\begin{aligned} V(x)^\dagger =\gamma ^0V(x)\gamma ^0\,, \end{aligned}$$

which guarantees the reality of the interaction term \(-\overline{\psi }V\psi \) in the Lagrangian and the conservation of the current \(\overline{\psi }\gamma ^a\psi \). The addition of this interaction term leads to the modification of the free curvilinear version to the equation of the form

$$\begin{aligned} i\partial _\tau \varphi =(\tfrac{1}{2}[\lambda ^i,p_i]_++\mu +W)\varphi \,, \end{aligned}$$


$$\begin{aligned} W(\tau ,\mathbf {z})=\mu K^{-1}V(x)K=W(\tau ,\mathbf {z})^\dagger \,, \end{aligned}$$

Hermicity being equivalent to the property (14) [see (8)].

Using the interaction picture technique, we shall obtain the evolution operators for which Eq. (15) is satisfied. However, thanks to the relativistic causality we can extend the applicability of the technique to the following setting, in which the operators \(W(\tau )\) do not need to be bounded.

Theorem 3

Let all the assumptions of Theorems 1 and 2 be satisfied. Suppose in addition that \(W(\tau ,\mathbf {z})\) is a Hermitian matrix function such that the mappings

$$\begin{aligned} \big \{{\mathbb {R}}\ni \tau \mapsto \partial _z^\alpha W(\tau ,\mathbf {z})\big \} \in C^0({\mathbb {R}}, L_\mathrm {loc}^\infty ({\mathbb {R}}^3))\,,\qquad |\alpha |\le 2\,, \end{aligned}$$


$$\begin{aligned} \Vert \langle z\rangle ^{-2+|\alpha |}\partial _z^\alpha W(\tau ,\mathbf {z})\Vert _\infty \le C(\tau )\,,\qquad |\alpha |\le 1\,, \end{aligned}$$

where \(C(\tau )\) is a continuous function. Denote

$$\begin{aligned} H(\tau )=H_0(\tau )+W(\tau )\,, \end{aligned}$$

with the initial domain \(C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\), and define the formal series

$$\begin{aligned} \begin{aligned} U(\tau ,\sigma )&=\sum _{n=0}^\infty U^{(n)}(\tau ,\sigma )\,, \qquad U^{(0)}(\tau ,\sigma )=\mathbf {1}\,,\\ U^{(n)}(\tau ,\sigma )&=(-i)^n \int \limits _{\tau \ge \tau _{n} \ge \ldots \ge \tau _1\ge \sigma } U_0(\tau ,\tau _n)W(\tau _n)U_0(\tau _n,\tau _{n-1})\ldots \\&\quad \ldots W(\tau _1)U_0(\tau _1,\sigma )\mathrm{d}\tau _n\ldots \mathrm{d}\tau _1\\&=-i\int \limits _\sigma ^\tau U_0(\tau ,\rho )W(\rho )U^{(n-1)}(\rho ,\sigma )\mathrm{d}\rho \\&=-i\int \limits _\sigma ^\tau U^{(n-1)}(\tau ,\rho )W(\rho )U_0(\rho ,\sigma )\mathrm{d}\rho \,, \qquad n\ge 1\,. \end{aligned} \end{aligned}$$

Then the following is true:

  1. (A)

    \(H(\tau )\) are essentially self-adjoint on \(C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) and on each core of \(h_0\), and \({\mathcal {D}}(h_0)\subseteq {\mathcal {D}}(H(\tau ))\).

  2. (B)

    Series \(U(\tau ,\sigma )\) and its conjugation converge strongly on \({\mathcal {H}}_c\), and the limit operator extends to a unitary propagator on \({\mathcal {H}}\), strongly continuous in its parameters.

  3. (C)

    \(U(\tau ,\sigma )\) respects relativistic causality and

    $$\begin{aligned} U(\tau ,\sigma ){\mathcal {D}}_c(\mathbf {p}^2)={\mathcal {D}}_c(\mathbf {p}^2)\,. \end{aligned}$$
  4. (D)

    For \(\varphi \in {\mathcal {D}}_c(\mathbf {p}^2)\) the vector functions \(h_0U(\tau ,\sigma )\varphi \), \(H(\tau )U(\tau ,\sigma )\varphi \) and \(U(\tau ,\sigma )H(\sigma )\varphi \) are strongly continuous with respect to parameters, the function \(U(\tau ,\sigma )\varphi \) is of the class \(C^1\) with respect to parameters in the strong sense and the following equations hold

    $$\begin{aligned} \begin{aligned} i\partial _\tau U(\tau ,\sigma )\varphi&=H(\tau ) U(\tau ,\sigma )\varphi \,,\\ i\partial _\sigma U(\tau ,\sigma )\varphi&=-U(\tau ,\sigma )H(\sigma )\varphi \,. \end{aligned} \end{aligned}$$
  5. (E)

    Unitary operators satisfying (D), with \(U(\sigma ,\sigma )=\mathbf {1}\), are unique.


The proof of (A) is similar as the proof in the free case, statement (A) in Theorem 2. One has to note that the assumption (17) ensures the validity of items (i) and (ii) in Lemma 14; the result of item (iii) is not needed.

Consider the series (18). As each of the operators \(U_0(\tau ,\tau _n)\), \(U_0(\tau _k,\tau _{k-1})\) and \(U_0(\tau _1,\sigma )\) in the definition of \(U^{(n)}(\tau ,\sigma )\) respects causality, and multiplication by \(W(\tau _k,\mathbf {z})\) does not enlarge the support of the function, each of the operators \(U^{(n)}(\tau ,\sigma )\) respects causality. Let, for chosen T and r,

$$\begin{aligned} \tau ,\sigma \in [-T,T]\,,\quad \varphi \in {\mathcal {H}}\,,\quad \mathop {\mathrm{ess}\,\mathrm{supp}}\limits \varphi \subseteq \{\mathbf {z}\mid |\mathbf {z}|\le r\}\,, \end{aligned}$$

which is assumed for the rest of this proof. Then there exists R(Tr) such that the essential support of all functions \(U^{(n)}(\tau ,\sigma )\varphi \) is contained in \(|\mathbf {z}|\le R(T,r)\) (uniformly with respect to \(\tau \), \(\sigma \) and \(\varphi \) in the assumed range). Let \(J(\mathbf {z})\) be a smooth function such that \(J(\mathbf {z})=1\) for \(|\mathbf {z}|\le R\) and \(J(\mathbf {z})=0\) for \(|\mathbf {z}|\ge R+1\). Denote by \(U_J(\tau ,\sigma )\) and \(U_J^{(n)}(\tau ,\sigma )\) the series (18) and its terms in which \(W(\rho ,\mathbf {z})\) have been replaced by \(W_J(\rho ,\mathbf {z})=J(\mathbf {z})W(\rho ,\mathbf {z})\); note that by assumption (16) \(W_J(\rho )\) is a bounded, strongly continuous operator, with \(\Vert W_J(\rho )\Vert \le d_1\) for some constant \(d_1\), uniformly on \(\rho \in [-T,T]\). Using the first of the recursive relations in (18), it is now easy to see that in the given setting

$$\begin{aligned} U^{(n)}(\tau ,\sigma )\varphi =U_J^{(n)}(\tau ,\sigma )\varphi \,. \end{aligned}$$

Arguing as in the proof of the Dyson expansion (see Thm.  X.69 in [15]), one shows that \(U_J(\tau ,\sigma )\) converges uniformly to a strongly continuous unitary propagator. As the space \({\mathcal {H}}_c\) is dense in \({\mathcal {H}}\), the statement (B) is proved. Also, the causality is respected.

To prove (C) (the invariance of the subspace) and (D), we write

$$\begin{aligned} h_0W_J(\rho ) h_0^{-1}=[h_0,W_J(\rho )]h_0^{-1}+W_J(\rho ) \end{aligned}$$

and note that by Theorem 12 applied to \(h(\tau )=W_J(\tau )\) (with Lemma 14) the first term on the rhs is a bounded operator, strongly continuous by the assumption (16). Therefore, the operator \(h_0W_J(\rho )h_0^{-1}\) is also bounded and strongly continuous, and \(\Vert h_0W_J(\rho )h_0^{-1}\Vert \le d_2\) for some constant \(d_2\), uniformly on \(\rho \in [-T,T]\). Taking also into account statement (B) in Theorem 2, we conclude that \(h_0U_J^{(n)}(\tau ,\sigma )h_0^{-1}\) is bounded, strongly continuous, with the norm estimated by

$$\begin{aligned} \Vert h_0 U_J^{(n)}(\tau ,\sigma )h_0^{-1}\Vert \le (n!)^{-1}(d_1+d_2)^n \exp \left[ \mathrm {const}\int _\sigma ^\tau C(\rho )\hbox {d}\rho \right] \,, \end{aligned}$$

This implies that the series \(\sum _nh_0U_J^{(n)}(\tau ,\sigma )h_0^{-1}\) converges uniformly to the strongly continuous function \(h_0U_J(\tau ,\sigma )h_0^{-1}\). Let us now impose on \(\varphi \) a stronger assumption that \(\varphi \in {\mathcal {D}}_c(\mathbf {p}^2)\). Then

$$\begin{aligned} h_0U(\tau ,\sigma )\varphi =h_0U_J(\tau ,\sigma )\varphi =h_0U_J(\tau ,\sigma )h_0^{-1}h_0\varphi \,. \end{aligned}$$

This proves (C), and also the strong continuity of \(h_0U(\tau ,\sigma )\varphi \) in (D). The strong continuity of \(H(\tau )U(\tau ,\sigma )\varphi \) and \(U(\tau ,\sigma )H(\sigma )\varphi \) follows now from the strong continuity of \(H(\tau )h_0^{-1}\), similarly as in Theorem 2, basing on Theorem 12 and assumptions (16) and (17).

Next, we note that

$$\begin{aligned} U_0(0,\tau )U^{(n)}(\tau ,\sigma )\varphi =-i\int _\sigma ^\tau U_0(0,\rho )W(\rho )U^{(n-1)}(\rho ,\sigma )\varphi \, \hbox {d}\rho \,, \end{aligned}$$

so this vector function is continuously differentiable in \(\tau \) in the strong sense and

$$\begin{aligned} i\partial _\tau [U_0(0,\tau )U^{(n)}(\tau ,\sigma )]\varphi = U_0(0,\tau )W(\tau )U^{(n-1)}(\tau ,\sigma )\varphi \end{aligned}$$

(remember that on the rhs W may be replaced by \(W_J\)). Again, using the uniform convergence of \(U_J(\tau ,\sigma )\) we obtain

$$\begin{aligned} i\partial _\tau [U_0(0,\tau )U(\tau ,\sigma )]\varphi =U_0(0,\tau )W(\tau )U(\tau ,\sigma )\varphi \,, \end{aligned}$$

and the rhs is strongly continuous in \((\tau ,\sigma )\). Finally, differentiating

$$\begin{aligned} U(\tau ,\sigma )\varphi =U_0(\tau ,0)[U_0(0,\tau )U(\tau ,\sigma )]\varphi \end{aligned}$$

by the Leibnitz rule and using (C) from Theorem 2 we arrive at the first equation in (19). The uniqueness (E) follows easily from this equation. The proof of the second equation in (19) is similar, with the use of the second of the recursive relations in (18); we omit the details. \(\square \)

The most general matrix field V(x) satisfying condition (14) may be concisely represented in the form

$$\begin{aligned} V=\sum _{k=0}^4 i^{\frac{1}{2}k(k-1)}C^k_{a_1\ldots a_k}\gamma ^{a_1}\ldots \gamma ^{a_k}\,, \end{aligned}$$

where the tensor fields \(C^k_{a_1\ldots a_k}\) are real and antisymmetric. The corresponding form of the Hermitian matrix function W is

$$\begin{aligned} W=\mu \sum _{k=0}^4 i^{\frac{1}{2}k(k-1)} \hat{C}^k_{\mu _1\ldots \mu _k}\hat{\gamma }^{\mu _1}_K\ldots \hat{\gamma }^{\mu _k}_K\,. \end{aligned}$$

This form encompasses the scalar (\(k=0\)) and pseudoscalar (\(k=4\)) potentials, the electromagnetic vector potential (\(k=1\)) and the pseudovector potential (\(k=3\)), the interactions characteristic for anomalous magnetic and electric moments, as well as the linearized gravitation (\(k=2\)).

Special Foliation, New Picture and Free Asymptotics

We now choose the foliation \(\tau \) and the variables \(\mathbf {z}\) by

$$\begin{aligned} x^0=\tau \langle z\rangle \equiv \tau (|\mathbf {z}|^2+1)^{\frac{1}{2}}\,,\qquad \mathbf {x}=\langle \tau \rangle \mathbf {z}\equiv (\tau ^2+1)^{\frac{1}{2}}\mathbf {z}\,, \end{aligned}$$

and we also denote

$$\begin{aligned} \langle \tau |z\rangle =(\tau ^2+|\mathbf {z}|^2+1)^{\frac{1}{2}}\,. \end{aligned}$$

The idea behind this choice is that for \(\tau \) tending to \(\pm \infty \) the Cauchy surfaces should tend, for timelike directions in the spacetime, to the hyperboloids \(x^2=\tau ^2\). All geometrical facts on these curvilinear coordinates needed for our purposes are gathered in Appendix D.

From now on, we adopt the coordinate system (20). Using the properties of this system, it is then easy to see that all the conditions of Theorems 1 and 2 are satisfied. Also, a convenient property of these coordinates is that \(g_{\tau i}=0\). Therefore, we find

$$\begin{aligned} {[}\hat{\gamma }_K^i,\beta ]_+=(g^{\tau \tau })^{-\frac{1}{2}}K^{-1}[\hat{\gamma }^i,\hat{\gamma }^\tau ]_+ K =2(g^{\tau \tau })^{-\frac{1}{2}}g^{i\tau }=0\,, \end{aligned}$$

and then

$$\begin{aligned}&{[}\lambda ^i,\beta ]_+=0=[\lambda ^i,\mu ]_+\,, \end{aligned}$$
$$\begin{aligned}&\tfrac{1}{2}[\lambda ^i,\lambda ^j]_+=-g_{\tau \tau }g^{ij}\mathbf {1}\equiv \rho ^{ij}\mathbf {1}\,; \end{aligned}$$

the symmetric form \(\rho \) is positive definite.

The free evolution \(U_0(\tau ,\sigma )\) may be easily expressed in the coordinate system (20). For this purpose, it is sufficient to consider \(U_0(\tau ,0)\). Using the representation of the Dirac solution (103) in Appendix E and the definition (12), for \(\varphi \in {\mathcal {S}}({\mathbb {R}}^3,{\mathbb {C}}^4)\) we obtain

$$\begin{aligned} {[}U_0(\tau ,0)\varphi ](\mathbf {z})&=\left( \frac{\langle \tau |z\rangle }{\langle \tau \rangle \langle z\rangle }\right) ^{\frac{1}{2}}K(\tau ,\mathbf {z})^{-1}\nonumber \\&\quad \times \left( \frac{\langle \tau \rangle }{2\pi }\right) ^{\frac{3}{2}}\int \left[ e^{-ix(\tau ,\mathbf {z})\cdot v}P_+(v) {-}e^{ix(\tau ,\mathbf {z})\cdot v}P_-(v)\right] [{\mathcal {F}}^{-1}\varphi ](v)\hbox {d}\mu (v)\,, \end{aligned}$$

where \(x(\tau ,\mathbf {z})=(\tau \langle z\rangle ,\langle \tau \rangle \mathbf {z})\). We consider the asymptotic form of this evolution for \(\tau \rightarrow \pm \infty \). Let \(\varphi \in {\mathcal {F}}C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) and restrict \(\mathbf {z}\) to a compact set. Then both \(\mathbf {v}\) (the space part of v) and \(\mathbf {z}\) are restricted to bounded sets and by the stationary phase method (see, e.g., [19]) the leading asymptotic behavior of the second line in (23) is given by

$$\begin{aligned} \mp i\big [e^{-i(\tau \pm \frac{\pi }{4})}P_+(z^0,\pm \mathbf {z}) +e^{i(\tau \pm \frac{\pi }{4})}P_-(z^0,\pm \mathbf {z})\big ] {[}{\mathcal {F}}^{-1}\varphi ](z^0,\pm \mathbf {z}) +O(\langle \tau \rangle ^{-1})\,, \end{aligned}$$

where \(z^0=\langle z\rangle \), the upper/lower signs \(\mp \) and \(\pm \) correspond to the limits \(\tau {\rightarrow } {\pm } {\infty }\), respectively, and the rest is bounded uniformly for \(\mathbf {z}\) in the given set. The limit of the term on the rhs of (23) in the first line is equal to \(\langle z\rangle ^{-\frac{1}{2}}K(\pm \infty ,\mathbf {z})^{-1}\), with

$$\begin{aligned} K(\pm \infty ,\mathbf {z})=2^{-\frac{1}{2}}\big [(1+\langle z\rangle )^{\frac{1}{2}} \pm (1+\langle z\rangle )^{-\frac{1}{2}}\gamma ^0\mathbf {z}\cdot \varvec{\gamma }\big ]=K(\infty ,\pm \mathbf {z})\,, \end{aligned}$$

and again the rest is bounded by \(\mathrm {const}\langle \tau \rangle ^{-1}\), uniformly in the given set. We now note two facts:

$$\begin{aligned}&K(\infty ,\mathbf {v})^{-1} v\cdot \gamma \, K(\infty ,\mathbf {v})=\beta \,,\\&e^{-i\sigma }\tfrac{1}{2}(\mathbf {1}+\beta ) +e^{i\sigma }\tfrac{1}{2}(\mathbf {1}-\beta ) =e^{-i\sigma \beta }\,. \end{aligned}$$

The first identity is the limit form of the first of relations in (8), but it may also be checked directly. The second identity is most easily evaluated on the two complementary eigenspaces of \(\beta \) (with eigenvalues \(\pm 1\)).

Setting the asymptotic forms (24) and (25) into formula (23), and using the above identities (the second one with \(\sigma =\tau \pm \frac{\pi }{4}\)), we obtain

$$\begin{aligned} {[}U_0(\tau ,0)\varphi ](\mathbf {z})&= \mp ie^{-i(\tau \pm \frac{\pi }{4})\beta } \langle z\rangle ^{-\frac{1}{2}}K(\infty ,\pm \mathbf {z})^{-1} [{\mathcal {F}}^{-1}\varphi ](z^0,\pm \mathbf {z})\nonumber \\&\quad +O(\langle \tau \rangle ^{-1})\,. \end{aligned}$$

The last formula suggests the definition of the following unitary transformation:

$$\begin{aligned} \Phi (\tau )=\exp [-i\tau \beta ]\,, \end{aligned}$$

and the associated change of the evolution ‘picture’ (also in the interacting case):

$$\begin{aligned} \begin{aligned}&U_{0\Phi }(\tau ,\sigma )=\Phi (\tau )^*U_0(\tau ,\sigma )\Phi (\sigma )\,, \quad U_\Phi (\tau ,\sigma )=\Phi (\tau )^*U(\tau ,\sigma )\Phi (\sigma )\,,\\&H_{0\Phi }(\tau )=\Phi (\tau )^*H_0(\tau )\Phi (\tau )-\beta \,,\\\&H_{\Phi }(\tau )=H_{0\phi }(\tau )+W_\Phi (\tau )\,, \quad W_\Phi (\tau )=\Phi (\tau )^* W(\tau )\Phi (\tau )\,. \end{aligned} \end{aligned}$$

Remark 4

Under the conditions of Theorem 3, all statements of its thesis remain valid with the replacements defined by Eqs. (27).

This follows quite trivially, as \(\Phi (\tau )\) acts only on the factor \({\mathbb {C}}^4\) in the Hilbert space \({\mathcal {H}}={\mathbb {C}}^4\otimes L^2({\mathbb {R}}^3)\).

After that, we go back to the asymptotics of the free evolution, where we shall need the parity operator

$$\begin{aligned} {[}{\mathcal {P}}\varphi ](\mathbf {z})=\varphi (-\mathbf {z})\,. \end{aligned}$$

Moreover, we observe that the map

$$\begin{aligned} {[}{\mathcal {K}}\varphi ](\mathbf {z})=\langle z\rangle ^{\frac{1}{2}}K(\infty ,\mathbf {z})\varphi (\mathbf {z}) \end{aligned}$$

is a unitary operator \({\mathcal {H}}\mapsto L^2_\gamma (H)\) (with the latter space defined in Appendix E).

Theorem 5

The following strong limits exist as unitary operators in \({\mathcal {H}}\):

$$\begin{aligned}&U_{0\Phi }(+\infty ,0)=\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow +\infty }U_{0\Phi }(\tau ,0) =-ie^{-i\frac{\pi }{4}\beta }{\mathcal {K}}^{-1}{\mathcal {F}}^{-1}\,,\\&U_{0\Phi }(-\infty ,0)=\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow -\infty }U_{0\Phi }(\tau ,0) =ie^{+i\frac{\pi }{4}\beta }{\mathcal {P}} {\mathcal {K}}^{-1}{\mathcal {F}}^{-1}\,,\\&U_{0\Phi }(0,+\infty )=\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow +\infty }U_{0\Phi }(0,\tau ) =i{\mathcal {F}}{\mathcal {K}}e^{+i\frac{\pi }{4}\beta }\,,\\&U_{0\Phi }(0,-\infty )=\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow -\infty }U_{0\Phi }(0,\tau ) =-i{\mathcal {F}}{\mathcal {K}}{\mathcal {P}}e^{-i\frac{\pi }{4}\beta }\,,\\&U_{0\Phi }(0,\pm \infty )=U_{0\Phi }(\pm \infty ,0)^*\,. \end{aligned}$$

Therefore,Footnote 8

$$\begin{aligned} U_{0\Phi }(+\infty ,-\infty )=i\beta {\mathcal {P}}\,. \end{aligned}$$


For \(\psi \in C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) and \(\varphi \in {\mathcal {F}}C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\) formula (26) gives

$$\begin{aligned}&\lim _{\tau \rightarrow +\infty }(\psi ,U_{0\Phi }(\tau ,0)\varphi ) =(\psi ,(-i)e^{-i\frac{\pi }{4}\beta }{\mathcal {K}}^{-1}{\mathcal {F}}^{-1}\varphi )\,,\\&\lim _{\tau \rightarrow -\infty }(\psi ,U_{0\Phi }(\tau ,0)\varphi ) =(\psi ,ie^{+i\frac{\pi }{4}\beta }{\mathcal {P}} {\mathcal {K}}^{-1}{\mathcal {F}}^{-1}\varphi )\,. \end{aligned}$$

Both subspaces are dense in \({\mathcal {H}}\), so the weak operator limits result. But the limit operators are evidently unitary, so the weak limits imply the strong limits of \(U_{0\Phi }(\tau ,0)\), as well as its conjugate. \(\square \)

Electromagnetic Interaction and Gauge Transformation

In the rest of this article we are interested in the standard, minimal coupling electromagnetic interaction. For the electromagnetic field \(F_{ab}\), we reserve notation \(A_a\) for the Lorenz gauge potential (fully specified in what follows). We write \(\mathcal {A}_a\) for the potential in a general gauge to be used in the Dirac equation. Also, we recall our conventions defined in formulas (4) and the following remarks, so \(\hat{F}_{\mu \nu }\), \(\hat{A}_\mu \) and \(\hat{\mathcal {A}}_\mu \) are components of these fields in our coordinate system, and for \(\mu ,\nu =i,j\), etc. the range is restricted to values 1, 2, 3.

Therefore, in the electromagnetic case the field V and its transformed version W are

$$\begin{aligned} V(x)=\mathcal {A}_a(x)\gamma ^a\,,\qquad W(\tau ,\mathbf {z}) =\hat{\mathcal {A}}_\tau (\tau ,\mathbf {z})+\hat{\mathcal {A}}_i(\tau ,\mathbf {z})\lambda ^i(\tau ,\mathbf {z})\,, \end{aligned}$$

and we write the Hamiltonian as

$$\begin{aligned} H=\tfrac{1}{2}[\lambda ^i,\pi _i]_++\mu +\hat{\mathcal {A}}_\tau \,,\qquad \pi _i=p_i+\hat{\mathcal {A}}_i\,. \end{aligned}$$

Theorem 6

The coordinate system \((\tau ,\mathbf {z})\) given by (20) is assumed.

(i) Let the electromagnetic potential \(\mathcal {A}(x)\) have components \(\hat{\mathcal {A}}_\mu (\tau ,\mathbf {z})\) such that for all indices \(\mu =\tau ,1,2,3\) the mappings

$$\begin{aligned} \big \{{\mathbb {R}}\ni \tau \mapsto \partial _z^\alpha \hat{\mathcal {A}}_\mu (\tau ,\mathbf {z})\big \} \in C^0({\mathbb {R}},L^\infty _\mathrm {loc}({\mathbb {R}}^3))\,,\quad |\alpha |\le 2\,, \end{aligned}$$


$$\begin{aligned} \left. \begin{array}{l} \Vert \langle z\rangle ^{-2+|\alpha |}\partial _z^\alpha \hat{\mathcal {A}}_\tau (\tau ,\mathbf {z})\Vert _\infty \\ \Vert \langle z\rangle ^{-1+|\alpha |}\partial _z^\alpha \hat{\mathcal {A}}_i(\tau ,\mathbf {z})\Vert _\infty \end{array}\right\} \le C(\tau )\,,\quad |\alpha |\le 1\,, \end{aligned}$$

where \(C(\tau )\) is a continuous function. Then the conditions of Theorem 3 are fulfilled and the unitary propagator \(U(\tau ,\sigma )\) with the listed properties is obtained.

(ii) Let the potential \(\mathcal {A}(x)\) and the corresponding propagator \(U(\tau ,\sigma )\) be as in (i). Define a new gauge

$$\begin{aligned} \mathcal {A}^{\mathcal {G}}=\mathcal {A}-\nabla {\mathcal {G}}\,, \end{aligned}$$

where \({\mathcal {G}}(x)\) is a gauge function such that for all indices \(\mu =\tau ,1,2,3\) the mappings

$$\begin{aligned} \big \{{\mathbb {R}}\ni \tau \mapsto \partial _z^\alpha \partial _\mu {\mathcal {G}}(\tau ,\mathbf {z})\big \} \in C^0({\mathbb {R}},L^\infty _\mathrm {loc}({\mathbb {R}}^3))\,,\quad |\alpha |\le 1\,, \end{aligned}$$


$$\begin{aligned} \left. \begin{array}{lll} \Vert \langle z\rangle ^{-2+|\alpha |}\partial _z^\alpha \partial _\tau {\mathcal {G}}(\tau ,\mathbf {z})\Vert _\infty \\ \Vert \langle z\rangle ^{-1+|\alpha |}\partial _z^\alpha \partial _i{\mathcal {G}}(\tau ,\mathbf {z})\Vert _\infty \end{array} \right\} \le C(\tau )\,,\quad |\alpha |\le 1\,, \end{aligned}$$

where \(C(\tau )\) is a continuous function. Denote by \(H^{\mathcal {G}}(\tau )\) the interacting Hamiltonian with \(\hat{\mathcal {A}}\) replaced by \(\hat{\mathcal {A}}^{\mathcal {G}}\), and

$$\begin{aligned} U^{\mathcal {G}}(\tau ,\sigma )= e^{i {\mathcal {G}}(\tau )}U(\tau ,\sigma )e^{-i {\mathcal {G}}(\sigma )}\,. \end{aligned}$$

Then for such modified operators the statements (A) and (C)–(E) of Theorem 3 are satisfied.


  1. (i)

    The bounds (13) are satisfied in our coordinate system. Together with the assumed properties of \(\hat{\mathcal {A}}_\mu \), this ensures that the interaction term (28) satisfies the assumptions of Theorem 3, so the thesis follows.

  2. (ii)

    For (A) we note that the assumption (32) implies that the interaction term \(W^{\mathcal {G}}=\hat{\mathcal {A}}^{\mathcal {G}}_\tau +\hat{\mathcal {A}}^{\mathcal {G}}_i\lambda ^i\) satisfies assumptions imposed on M in Lemma 14 (i) and (ii) (not necessarily (iii)). This is sufficient for the conclusion on self-adjointness of \(H^{\mathcal {G}}\) obtained as in Theorem 3. Moreover, it follows from (31) that \(e^{i{\mathcal {G}}(\tau )}{\mathcal {D}}_c(\mathbf {p}^2)={\mathcal {D}}_c(\mathbf {p}^2)\), so (C) is satisfied. Finally, for \(\varphi \in {\mathcal {D}}_c(\mathbf {p}^2)\) we have

    $$\begin{aligned}&i\partial _\tau e^{i{\mathcal {G}}(\tau )}\varphi =-e^{i{\mathcal {G}}(\tau )}\partial _\tau {\mathcal {G}}(\tau )\varphi \,,\\&e^{i{\mathcal {G}}(\tau )}H(\tau )e^{-i{\mathcal {G}}(\tau )}\varphi =[H(\tau )-\lambda ^i(\tau )\partial _i{\mathcal {G}}(\tau )]\varphi \,, \end{aligned}$$

    so the remaining statements easily follow with the use of (31).

\(\square \)


We come here to our main objective in this article, scattering of the Dirac field in an external time-dependent electromagnetic field. With the assumptions of Theorem 6, augmented by some decay conditions formulated in the two assumptions below, we shall obtain the complete description of scattering in terms of the Cauchy surfaces of constant \(\tau \), as to be found in Theorem 7. The existence of the wave operators needs only a rather simple additional Assumption I, but their unitarity and completeness are more demanding, and they follow from Assumption II. It is with regard to this latter question that we need to discuss some further notation and properties.

It is easy to see that the operators

$$\begin{aligned} \tilde{H}=\tfrac{1}{2}[\lambda ^i,\pi _i]_+=H-\mu -\hat{\mathcal {A}}_\tau \,, \quad \tilde{H}_0=\tfrac{1}{2}[\lambda ^i,p_i]_+=H_0-\mu \end{aligned}$$

have similar properties as H and \(H_0\), in particular they are essentially self-adjoint on \(C_{0}^{\infty }({\mathbb {R}}^3, {\mathbb {C}}^4)\), so \({\mathcal {D}}_c(\mathbf {p})={\mathcal {H}}_c\cap {\mathcal {D}}(\mathbf {p})\) is contained in their domains. Therefore, \({\mathcal {D}}_c(\mathbf {p}^2)\) is contained in the domains of \(\tilde{H}^2\) and \(\tilde{H}_0^2\) [as well as \(H^2\) and \(H_0^2\)]. Moreover, we note for later use that

$$\begin{aligned} {[}\tilde{H},\beta ]_+=[\tilde{H}_0,\beta ]_+=0\,, \end{aligned}$$

which is a consequence of (21).

We shall need a more explicit form of \(\tilde{H}^2\) below. We calculate (with \(\rho \) defined in (22))

$$\begin{aligned} \tilde{H}^2&=(\pi _i\lambda ^i+\tfrac{i}{2}\partial \cdot \lambda )(\lambda ^j\pi _j-\tfrac{i}{2}{}\partial \cdot \lambda )\nonumber \\&=\pi _i\rho ^{ij}\pi _j+\tfrac{1}{2}\pi _i[\lambda ^i,\lambda ^j]\pi _j +\tfrac{i}{2}(\partial \cdot \lambda )\lambda ^j\pi _j-\tfrac{i}{2}\pi _j\lambda ^j(\partial \cdot \lambda ) +\tfrac{1}{4}(\partial \cdot \lambda )^2\,. \end{aligned}$$

The second term on the rhs above may be written in two alternative ways:

$$\begin{aligned} \begin{aligned} \tfrac{1}{2}\pi _i[\lambda ^i,\lambda ^j]\pi _j=-\tfrac{i}{2}(\partial _i[\lambda ^i,\lambda ^j])\pi _j +\tfrac{1}{2}[\lambda ^i,\lambda ^j]\pi _i\pi _j\\ =\tfrac{i}{2}\pi _i(\partial _j[\lambda ^i,\lambda ^j]) +\tfrac{1}{2}\pi _i\pi _j[\lambda ^i,\lambda ^j]\,. \end{aligned} \end{aligned}$$

Taking into account that \([\lambda ^i,\lambda ^j]\) is antisymmetric in the indices, one can replace the product \(\pi _i\pi _j\) by \(\frac{1}{2}[\pi _i,\pi _j]=-\frac{i}{2}\hat{F}_{ij}\), and then replace \([\lambda ^i,\lambda ^j]\) multiplying this expression by \(2\lambda ^i\lambda ^j\). Taking now one half of the sum of the two expressions in (36), we find

$$\begin{aligned} \tfrac{1}{2}\pi _i[\lambda ^i,\lambda ^j]\pi _j=\tfrac{i}{4}\big [\pi _j, \partial _i[\lambda ^j,\lambda ^i]\big ]_+ -\tfrac{i}{2}\lambda ^i\lambda ^j\hat{F}_{ij}\,. \end{aligned}$$

For the next two terms on the rhs of (35) we note

$$\begin{aligned} \tfrac{i}{2}(\partial \cdot \lambda )\lambda ^j\pi _j-\tfrac{i}{2}\pi _j\lambda ^j(\partial \cdot \lambda ) =-\tfrac{i}{4}[\pi _j,[\lambda ^j,\partial \cdot \lambda ]]_+-\tfrac{1}{4}\partial _j[\partial \cdot \lambda ,\lambda ^j]_+\,, \end{aligned}$$

which is shown in a somewhat similar way as the former identity. The sum of the first terms on the rhs of (37) and (38) gives \(\tfrac{i}{4}[\pi _j,[\partial _i\lambda ^j,\lambda ^i]]_+\). In this way we obtain

$$\begin{aligned} \tilde{H}^2&=\pi _i\rho ^{ij}\pi _j+\tfrac{i}{4}[\partial _i\lambda ^j,\lambda ^i]\pi _j +\tfrac{i}{4}\pi _j[\partial _i\lambda ^j,\lambda ^i]\\&\quad -\tfrac{1}{4}\partial _j([\lambda ^j,\partial \cdot \lambda ]_+)+\tfrac{1}{4}(\partial \cdot \lambda )^2 -\tfrac{i}{2}\lambda ^i\lambda ^j\hat{F}_{ij}\,. \end{aligned}$$

With the use of further notation

$$\begin{aligned}&\Lambda _j=\tfrac{i}{4}(\rho ^{-1})_{jk}[\partial _i\lambda ^k,\lambda ^i]\,, \quad \pi _{\Lambda i}=\pi _i+\Lambda _i\,, \end{aligned}$$
$$\begin{aligned}&Q=\tfrac{1}{4}\partial _j([\lambda ^j,\partial \cdot \lambda ]_+)-\tfrac{1}{4}(\partial \cdot \lambda )^2\,, \quad N=Q+\Lambda _i\rho ^{ij}\Lambda _j\,,\\&{\mathcal {B}}=\tfrac{i}{2}\lambda ^i\lambda ^j\hat{F}_{ij}\,,\nonumber \end{aligned}$$

we can write

$$\begin{aligned} \tilde{H}^2=\pi _{\Lambda }\rho \pi _{\Lambda }-N-{\mathcal {B}}\,, \end{aligned}$$

where in the first term on the rhs a symbolic notation for summation over indices is used. A straightforward calculation shows that both Q and N are positive numerical functions (times the unit matrix; see Appendix D, formula (96) for Q, and then for N this is obvious). Therefore, if we further denote

$$\begin{aligned} s=\sqrt{\rho }\,, \end{aligned}$$

then for \(\varphi \in {\mathcal {D}}_c(\mathbf {p}^2)\) we have

$$\begin{aligned} \Vert \tilde{H}\varphi \Vert \le \big (\Vert s\pi _\Lambda \varphi \Vert ^2-(\varphi ,{\mathcal {B}}\varphi )\big )^{\frac{1}{2}} \le \Vert s\pi _\Lambda \varphi \Vert +|(\varphi ,{\mathcal {B}}\varphi )|^{\frac{1}{2}} \end{aligned}$$

Next denote

$$\begin{aligned} X=\tfrac{1}{2}\mu ^{-1} \hat{\mathcal {A}}_i\lambda ^i=\tfrac{1}{2}\beta a_i\lambda ^i\,, \qquad a_i=(g^{\tau \tau })^{\frac{1}{2}}\hat{\mathcal {A}}_i\,. \end{aligned}$$

Below we shall need the following identity valid, with our assumptions on \(\hat{\mathcal {A}}_i\), on \({\mathcal {D}}_c(\mathbf {p})\):

$$\begin{aligned} \beta [X,\tilde{H}]=a_i\rho ^{ij}\pi _{\Lambda j}-\tfrac{i}{2}\lambda ^i\lambda ^j\partial _i a_j -\tfrac{i}{2}(\partial _i\rho ^{ij})a_j\,. \end{aligned}$$

To show this, we note that \([X,\tilde{H}]=\tfrac{1}{2} \beta [\lambda ^ia_i,\tilde{H}]_+\), write \(\tilde{H}=\lambda ^j\pi _j -\tfrac{i}{2}\partial \cdot \lambda \), and then the lhs of (45) takes the form

$$\begin{aligned}&\tfrac{1}{2}a_i[\lambda ^i,\lambda ^j]_+\pi _j-\tfrac{i}{2}\lambda ^j\partial _j(\lambda ^i a_i) -\tfrac{i}{4}a_i [\lambda ^i,\partial _j\lambda ^j]_{+}\\&\quad =a_i\rho ^{ij}\pi _{\Lambda j}-\tfrac{i}{2}\lambda ^j\lambda ^i\partial _j a_i -\tfrac{i}{2}a_i\big (\lambda ^j\partial _j\lambda ^i+\tfrac{1}{2}[\lambda ^i,\partial _j\lambda ^j]_+ +\tfrac{1}{2}[\partial _j\lambda ^i,\lambda ^j]\big )\,, \end{aligned}$$

where after the equality sign we have added and subtracted the term \(a_i\rho ^{ij}\Lambda _j\). It is now easy to show that the terms in parentheses multiplying \(a_i\) sum up to \(\tfrac{1}{2}\partial _j[\lambda ^j,\lambda ^i]_+\), which ends the proof of (45).

The spreading of the past and future is characterized in our coordinate system by Lemma 15 in Appendix D. Denote

$$\begin{aligned} {\mathcal {D}}(r,\mathbf {p}^2)=\{\psi \in {\mathcal {D}}(\mathbf {p}^2)\mid \psi (\mathbf {z})=0\ \text {for}\ |\mathbf {z}|\ge r\}\,, \end{aligned}$$

so that

$$\begin{aligned} {\mathcal {D}}_c(\mathbf {p}^2)=\bigcup _{r>0}{\mathcal {D}}(r,\mathbf {p}^2)\,. \end{aligned}$$

We set \(\tau _0=0\) in this lemma, and replace \(r_0\) and r by r and \(r(\tau )\), respectively, so that

$$\begin{aligned} r(\tau )=\langle r \rangle |\tau |+r\langle \tau \rangle \,. \end{aligned}$$

Then, according to this lemma, and statement (C) of Theorem 3, we have

$$\begin{aligned} \psi \in {\mathcal {D}}(r,\mathbf {p}^2)\quad \Longrightarrow \quad U(\tau ,0)\psi \in {\mathcal {D}}(r(\tau ),\mathbf {p}^2)\,. \end{aligned}$$

For a measurable function \(f(\tau ,\mathbf {z})\) we define a semi-norm function

$$\begin{aligned} \tau \mapsto \Vert f\Vert _{r,\tau }=\mathop {\mathrm{ess}\,\mathrm{sup}}\limits _{|\mathbf {z}|\le r(\tau )}|f(\tau ,\mathbf {z})|\,, \end{aligned}$$

and then for \(\psi \) as in (46) we find

$$\begin{aligned} \Vert f(\tau ,.)U(\tau ,0)\psi \Vert \le \Vert f\Vert _{r,\tau }\Vert \psi \Vert \,. \end{aligned}$$

Note that according to Lemma 15 we have

$$\begin{aligned} |\mathbf {z}|\le r(\tau )\ \Longrightarrow \ \langle z\rangle \le \langle r \rangle \langle \tau \rangle +r|\tau |\,, \end{aligned}$$

so in that case

$$\begin{aligned} |\mathbf {z}|\le \langle z\rangle \le 2\langle r \rangle \langle \tau \rangle \,. \end{aligned}$$

For any measurable function \(k(\mathbf {z})\) and \(r>0\) we shall denote

$$\begin{aligned} \Vert k\Vert _r=\mathop {\mathrm{ess}\,\mathrm{sup}}\limits _{|\mathbf {z}|\le r}|k(\mathbf {z})|\,. \end{aligned}$$

For a function \(f(\tau ,\mathbf {z})\) we obviously have \(\Vert f(\tau ,.)\Vert _r\le \Vert f(\tau ,.)\Vert _{r,\tau }\).

Our scattering theorem will apply to potentials satisfying the following conditions of increasing restrictiveness.

Assumption I. Potential \(\hat{\mathcal {A}}_\mu (\tau ,\mathbf {z})\) satisfies the assumptions of Theorem 6, and in addition the mapping \(\tau \mapsto \hat{\mathcal {A}}_i(\tau ,\mathbf {z}) \) is in \(C^1({\mathbb {R}},L^\infty _\mathrm {loc}({\mathbb {R}}^3))\). For each \(r>0\) the following expressions are integrable on \({\mathbb {R}}\) with respect to \(\tau \):

$$\begin{aligned} \Vert \hat{\mathcal {A}}_\tau \Vert _r\,,\quad \langle \tau \rangle ^{-2}\Vert \hat{\mathcal {A}}_i\Vert _r^2\,,\quad \langle \tau \rangle ^{-2}\Vert \partial _i\hat{\mathcal {A}}_j\Vert _r\,,\quad \langle \tau \rangle ^{-1}\Vert \partial _\tau \hat{\mathcal {A}}_i\Vert _r\,. \end{aligned}$$

Assumption II. Potential \(\hat{\mathcal {A}}_\mu (\tau ,\mathbf {z})\) satisfies the assumptions of Theorem 6, and in addition the mapping \( \tau \mapsto \hat{\mathcal {A}}_i(\tau ,\mathbf {z})\) is in \(C^1({\mathbb {R}},L^\infty _\mathrm {loc}({\mathbb {R}}^3))\). For each \(r>0\) the following expressions are integrable on \({\mathbb {R}}\) with respect to \(\tau \):

$$\begin{aligned}&\Vert \hat{\mathcal {A}}_\tau \Vert _{r,\tau }\,,\\&\langle \tau \rangle ^{-2}\big (\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }^2+\Vert z^i\hat{\mathcal {A}}_i\Vert _{r,\tau }^2\big )\,,\\&\langle \tau \rangle ^{-2}\big (\Vert \partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau }+\Vert z^i\partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau } +\Vert z^j\partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau }+\Vert z^iz^j\partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau }\big )\,,\\&\langle \tau \rangle ^{-1}\big (\Vert \partial _\tau \hat{\mathcal {A}}_i\Vert _{r,\tau }+\Vert z^i\partial _\tau \hat{\mathcal {A}}_i\Vert _{r,\tau }\big )\,. \end{aligned}$$

Moreover, let \(\xi :[1,\infty )\mapsto {\mathbb {R}}\) be an appropriately chosen smooth, positive, nondecreasing function, such that \(\xi (1)=1\) and

$$\begin{aligned} \xi '(u)\le \frac{\kappa }{u}\xi (u)\,,\ \kappa \in (0,\tfrac{1}{2})\,, \qquad |\xi ''(u)|\le \frac{\mathrm {const}}{u}\xi (u)\,, \end{aligned}$$

where in the second bound the constant is arbitrary. The following bounds are satisfied

$$\begin{aligned} \Vert \hat{F}_{ij}\Vert _{r,\tau }+\Vert z^i\hat{F}_{ij}\Vert _{r,\tau }&\le \mathrm {const}(r)\frac{\langle \tau \rangle }{\xi (\langle \tau \rangle )}\,,\\ \Vert \hat{F}_{i\tau }\Vert _{r,\tau }+\Vert z^i\hat{F}_{i\tau }\Vert _{r,\tau }&\le \frac{\mathrm {const}(r)}{\xi (\langle \tau \rangle )}\,, \end{aligned}$$

and for each \(r>0\) the following expressions are integrable on \({\mathbb {R}}\)

$$\begin{aligned} \frac{\Vert \xi (\langle z\rangle )\hat{\mathcal {A}}_i\Vert _{r,\tau }}{\langle \tau \rangle \xi (\langle \tau \rangle )}\,, \qquad \frac{\Vert \xi (\langle z\rangle )z^i\hat{\mathcal {A}}_i\Vert _{r,\tau }}{\langle \tau \rangle \xi (\langle \tau \rangle )}\,. \end{aligned}$$

Before stating the theorem, we make a comment on the function \(\xi \) and consider some additional consequences of Assumption II.

If one sets \(\xi (u)=u^\kappa \xi _0(u)\), then the first bound in (48) is equivalent to \(\xi _0'(u)\le 0\). It follows that

$$\begin{aligned} \xi (u)\le u^\kappa \,. \end{aligned}$$

Therefore, \(\xi \) is a slowly increasing, non-oscillating function. Examples include:

$$\begin{aligned}&\xi _1(u)=u^\kappa \,,\\&\xi _2(u)=\Big [1+\frac{\kappa }{m}\log (u)\Big ]^m\,,\quad m>0\,, \end{aligned}$$

where \(\kappa \) is as assumed in (48). A particular choice of \(\xi \) must guarantee the validity of the assumptions (if this is possible).

By the Schwarz inequality, also the following integral is finite:

$$\begin{aligned} \int _{{\mathbb {R}}}\frac{\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }}{\langle \tau \rangle ^2}\hbox {d}\tau \le \sqrt{\pi }\left( \int _{{\mathbb {R}}}\frac{\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }^2}{\langle \tau \rangle ^2}d \tau \right) ^{\frac{1}{2}}<\infty \end{aligned}$$

and similarly for \(z^i\hat{\mathcal {A}}_i\). Moreover, for \(\tau _2\ge \tau _1\) we have

$$\begin{aligned} \frac{\hat{\mathcal {A}}_i(\tau _2,\mathbf {z})}{\langle \tau _2\rangle } -\frac{\hat{\mathcal {A}}_i(\tau _1,\mathbf {z})}{\langle \tau _1\rangle }= \int _{\tau _1}^{\tau _2}\left( \frac{\partial _\sigma \hat{\mathcal {A}}_i(\sigma ,\mathbf {z})}{\langle \sigma \rangle } -\frac{\sigma \hat{\mathcal {A}}_i(\sigma ,\mathbf {z})}{\langle \sigma \rangle ^3}\right) \hbox {d}\sigma \,, \end{aligned}$$

and similarly for \(z^i\hat{\mathcal {A}}_i\). Therefore, by Assumption II the functions

$$\begin{aligned} \langle \tau \rangle ^{-1}\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }\quad \text {and} \quad \langle \tau \rangle ^{-1}\Vert z^i\hat{\mathcal {A}}_i\Vert _{r,\tau } \end{aligned}$$

have limits for \(\tau \rightarrow \pm \infty \). Limits different from zero would contradict other assumptions, so

$$\begin{aligned} \lim _{\tau \rightarrow \pm \infty }\frac{\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }}{\langle \tau \rangle } =\lim _{\tau \rightarrow \pm \infty }\frac{\Vert z^i\hat{\mathcal {A}}_i\Vert _{r,\tau }}{\langle \tau \rangle }=0\,. \end{aligned}$$

For the sake of the proof of the coming theorem, we note the following estimates easily obtained with the use of formula (89):

$$\begin{aligned}&|\lambda ^i\hat{\mathcal {A}}_i|\le \frac{\mathrm {const}}{\langle \tau \rangle } \bigg (\frac{\langle \tau |z\rangle }{\langle \tau \rangle }|\hat{\mathcal {A}}_i|+|z^i\hat{\mathcal {A}}_i|\bigg )\,, \end{aligned}$$
$$\begin{aligned}&|\lambda ^i\lambda ^j\partial _i\hat{\mathcal {A}}_j| \le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\Big [\frac{\langle \tau |z\rangle ^2}{\langle \tau \rangle ^2}|\partial _i\hat{\mathcal {A}}_j| +\frac{\langle \tau |z\rangle }{\langle \tau \rangle }\big (|z^i\partial _i\hat{\mathcal {A}}_j|+|z^j\partial _i\hat{\mathcal {A}}_j|\big ) +|z^iz^j\partial _i\hat{\mathcal {A}}_j|\Big ]\,. \end{aligned}$$

Theorem 7

  1. (i)

    Let \(\hat{\mathcal {A}}_\mu \) satisfy Assumption I. Then the following strong limits exist:

    $$\begin{aligned}&\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }U(0,\tau )U_0(\tau ,0)=\Omega _\mp \,, \end{aligned}$$
    $$\begin{aligned}&\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }U_\Phi (0,\tau )=U_\Phi (0,\pm \infty ) =\Omega _{\mp }U_{0\Phi }(0,\pm \infty )\,. \end{aligned}$$
  2. (ii)

    Let \(\hat{\mathcal {A}}_\mu \) satisfy Assumption II. Then the operators \(\Omega _\mp \) are unitary and also the following strong limits exist:

    $$\begin{aligned}&\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }U_0(0,\tau )U(\tau ,0)=\Omega _\mp ^*\,, \end{aligned}$$
    $$\begin{aligned}&\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }U_\Phi (\tau ,0)=U_\Phi (\pm \infty ,0) =U_{0\Phi }(\pm \infty ,0)\Omega ^*_{\mp }\,. \end{aligned}$$


For the sake of the whole proof we assume that \(\psi \in {\mathcal {D}}(r,\mathbf {p}^2)\) and \(\Vert \psi \Vert =1\).

(i) For \(\tau \rightarrow \infty \), we prove the existence of the limit (53), from which the limit (52) follows with the use of Theorem 5. The case \(\tau \rightarrow -\infty \) is analogous.

We note the identity

$$\begin{aligned} \big (1-\tfrac{1}{2}\beta \tilde{H}\big )\beta -H\big (1-\tfrac{1}{2}\beta \tilde{H}\big ) =(\mu -\beta +\hat{\mathcal {A}}_\tau )\big (\tfrac{1}{2}\beta \tilde{H}-1\big )-\tfrac{1}{2}\beta \tilde{H}^2\,, \end{aligned}$$

where \(\tilde{H}\) is as defined in (33) (to show this one eliminates H with the use of (33) and takes into account the anticommutation relation (34)). Using it, we obtain the evolution equation

$$\begin{aligned}&i\partial _\tau \big [U(0,\tau )\big (1-\tfrac{1}{2}\beta \tilde{H}\big )\Phi (\tau )\big ]\psi \\&\quad =U(0,\tau )\big [-\tfrac{i}{2}\beta (\partial _\tau \tilde{H}) +(\mu -\beta +\hat{\mathcal {A}}_\tau )\big (\tfrac{1}{2}\beta \tilde{H}-1\big ) -\tfrac{1}{2}\beta \tilde{H}^2\big ]\Phi (\tau )\psi \,. \end{aligned}$$

Taking into account the anticommutation relation (34), we can write this in the form

$$\begin{aligned}&i\partial _\tau \big [U_\Phi (0,\tau )-\tfrac{1}{2}U(0,\tau )\Phi (\tau )^*\beta \tilde{H}\big ]\psi \nonumber \\&\quad =-U(0,\tau )\Phi (\tau )\big [\mu -\beta +\hat{\mathcal {A}}_\tau +\tfrac{1}{2}\beta \tilde{H}^2\big ]\psi \nonumber \\&\qquad +\tfrac{1}{2} U(0,\tau )\Phi (\tau )^*\beta \big [-i\dot{\tilde{H}}+(\mu -\beta +\hat{\mathcal {A}}_\tau )\tilde{H}\big ]\psi \,. \end{aligned}$$

We estimate the terms in this equation, starting with the second term in brackets on the lhs, and then going to the successive terms on the rhs. As \(|\mathbf {z}|\le r\) on the support of \(\psi \), each \(\lambda ^i\), \(\partial _i\lambda ^j\) and \(\partial _i\partial _j\lambda ^k\) give a bounding factor \(\mathrm {const}(r)\langle \tau \rangle ^{-1}\), and each \(\partial _\tau \lambda ^i\) a factor \(\mathrm {const}(r)\langle \tau \rangle ^{-2}\), which leads to an easy straightforward estimation:

$$\begin{aligned}&\Vert \tilde{H}\psi \Vert \le \frac{\mathrm {const}(r)}{\langle \tau \rangle }\big (\Vert p_i\psi \Vert +\Vert \hat{\mathcal {A}}_i\Vert _r+1\big )\,,\\&\Vert (\mu -\beta )\psi \Vert \le \frac{r^2}{2\langle \tau \rangle ^2}\,,\qquad \Vert \hat{\mathcal {A}}_\tau \psi \Vert \le \Vert \hat{\mathcal {A}}_\tau \Vert _r\,,\\&\Vert \tilde{H}^2\psi \Vert \le \frac{\mathrm {const}(r)}{\langle \tau \rangle ^2}\Big [\Vert p_ip_j\psi \Vert +\big (\Vert \hat{\mathcal {A}}_i\Vert _r+1\big )\big (\Vert \hat{\mathcal {A}}_i\Vert _r+1+\Vert p_i\psi \Vert \big )\Big ]\,,\\&\Vert \dot{\tilde{H}}\psi \Vert \le \frac{\mathrm {const}(r)}{\langle \tau \rangle ^2}\big (\Vert p_i\psi \Vert +\Vert \hat{\mathcal {A}}_i\Vert _r+1\big ) +\frac{\mathrm {const}(r)}{\langle \tau \rangle }\Vert \partial _\tau \hat{\mathcal {A}}_i\Vert _r\,,\\&\Vert (\mu -\beta +\hat{\mathcal {A}}_\tau )\tilde{H}\psi \Vert \le \frac{\mathrm {const}(r)}{\langle \tau \rangle }\Big (\frac{r^2}{\langle \tau \rangle ^2} +\Vert \hat{\mathcal {A}}_\tau \Vert _r\Big )\big (\Vert p_i\psi \Vert +\Vert \hat{\mathcal {A}}_i\Vert _r+1\big )\,. \end{aligned}$$

Therefore, with the conditions of Assumption I all the terms on the rhs of (56) are integrable on \({\mathbb {R}}\), so the strong limit of \(U_\Phi (0,\tau )\psi -\tfrac{1}{2}U(0,\tau ) \Phi (\tau )^*\beta \tilde{H}\psi \) exists. But the second term vanishes in the limit, so the thesis follows for \(\psi \in {\mathcal {D}}_c(\mathbf {p}^2)\), and then by isometry for all \(\psi \in {\mathcal {H}}\). (For the integrability of \(\langle \tau \rangle ^{-2}\Vert \hat{\mathcal {A}}_i\Vert _r\) and for vanishing of \(\langle \tau \rangle ^{-1}\Vert \hat{\mathcal {A}}_i\Vert _r\) one argues similarly as in the remarks following Assumption II.)

(ii) We prove the existence of the limit (54) for \(\tau \rightarrow \infty \), from which the limit (55) follows. Combined with the existence of the limit (52), this also leads to unitarity and the conjugation relation. The case \(\tau \rightarrow -\infty \) is similar.

We note the identity

$$\begin{aligned} (1+X)H-H_0(1+X)=\hat{\mathcal {A}}_\tau +WX +[X,\tilde{H}]\,, \end{aligned}$$

where W and X are defined in (28) and (44), respectively, and we used the fact that

$$\begin{aligned} {[}X,H-\tilde{H}]=[X,\mu ]=-2\mu X=-\hat{\mathcal {A}}_i\lambda ^i\,. \end{aligned}$$


$$\begin{aligned}&i\partial _\tau \big [U_0(0,\tau )(1+X)U(\tau ,0)\big ]\psi \nonumber \\&\quad =U_0(0,\tau )\Big (i\partial _\tau X+\hat{\mathcal {A}}_\tau +WX +[X,\tilde{H}]\Big )U(\tau ,0)\psi \,. \end{aligned}$$

If we can show that the norm of the rhs of (57) is integrable over \([0,+\infty )\), then the strong limit of \(U_0(0,\tau )(1+X)U(\tau ,0)\psi \) for \(\tau \rightarrow \infty \) exists. But with the use of formula (47), and taking into account (50) and the value of \(g^{\tau \tau }\) to be found in Appendix D, we obtain

$$\begin{aligned} \Vert X(\tau )U(\tau ,0)\psi \Vert \le \Vert X\Vert _{r,\tau } \le \frac{\mathrm {const}}{\langle \tau \rangle }\Big (\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }+\Vert z^i\hat{\mathcal {A}}_i\Vert _{r,\tau }\Big )\,, \end{aligned}$$

so this norm vanishes in the limit, which then implies the desired result.

We estimate the norms of the successive terms on the rhs of (57), again with the use of (47). Differentiating formula (90) with respect to \(\tau \), we find

$$\begin{aligned} \Vert (\partial _\tau X)U(\tau ,0)\psi \Vert&\le \Vert \partial _\tau X\Vert _{r,\tau } \le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\Big (\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }+\Vert z^i\hat{\mathcal {A}}_i\Vert _{r,\tau }\Big )\\&\quad +\frac{\mathrm {const}}{\langle \tau \rangle }\Big (\Vert \partial _\tau \hat{\mathcal {A}}_i\Vert _{r,\tau }+\Vert z^i\partial _\tau \hat{\mathcal {A}}_i\Vert _{r,\tau }\Big )\,, \end{aligned}$$

which is integrable. The norm of the second term is bounded by \(\Vert \hat{\mathcal {A}}_\tau (\tau )\Vert _{r,\tau }\), which is integrable by assumption. Next we note that

$$\begin{aligned} \beta WX=\hat{\mathcal {A}}_\tau \beta X-\tfrac{1}{2} (g^{\tau \tau })^{\frac{1}{2}}\hat{\mathcal {A}}_i\rho ^{ij}\hat{\mathcal {A}}_j\,, \end{aligned}$$

so using the explicit form of \((g^{\tau \tau })^{\frac{1}{2}}\rho ^{ij}\), see (91), we estimate the norm of the third term on the rhs of (57) by

$$\begin{aligned} \Vert WX\Vert _{r,\tau }\le \mathrm {const}\Vert \hat{\mathcal {A}}_\tau \Vert _{r,\tau }\Vert X\Vert _{r,\tau } +\frac{\mathrm {const}\langle r \rangle }{\langle \tau \rangle ^2}\Big (\Vert \hat{\mathcal {A}}_i\Vert ^2_{r,\tau }+\Vert z^i\hat{\mathcal {A}}_i\Vert ^2_{r,\tau }\Big )\,, \end{aligned}$$

which ensures integrability.

To estimate the norm of the fourth term, we use (45), (51) and (93) to find

$$\begin{aligned}&\Vert \lambda ^i\lambda ^j\partial _i a_j+(\partial _i\rho ^{ij})a_j\Vert _{r,\tau } \le \mathrm {const}\frac{\langle r \rangle }{\langle \tau \rangle ^2}\Vert \partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau }\\&\quad +\frac{\mathrm {const}}{\langle \tau \rangle ^2}\Big (\Vert \hat{\mathcal {A}}_i\Vert _{r,\tau }+\Vert z^i\hat{\mathcal {A}}_i\Vert _{r,\tau } +\Vert z^i\partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau }+\Vert z^j\partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau } +\Vert z^iz^j\partial _i\hat{\mathcal {A}}_j\Vert _{r,\tau }\Big )\,, \end{aligned}$$

which again is integrable.

We are now left with the single term \(a\rho \pi _\Lambda U(\tau ,0)\psi \). As it turns out, in this case the methods applied up to now are insufficient and it is here that we make use of the function \(\xi \). We write \(\psi _\tau = U(\tau ,0)\psi \) and note that

$$\begin{aligned} \Vert a\rho \pi _{\Lambda }\psi _\tau \Vert&\le \Vert sa\xi (\langle z\rangle )\Vert _{r,\tau } \Vert \xi (\langle z\rangle )^{-1}s\pi _\Lambda \psi _\tau \Vert \\&\le \langle \tau \rangle ^{-1}\Big (\Vert \xi (\langle z\rangle )\hat{\mathcal {A}}_i\Vert _{r,\tau } +\Vert \xi (\langle z\rangle ) z^i\hat{\mathcal {A}}_i\Vert _{r,\tau }\Big )\Vert \xi (\langle z\rangle )^{-1}s\pi _\Lambda \psi _\tau \Vert \,. \end{aligned}$$

Now, the estimation of \(\Vert \xi (\langle z\rangle )^{-1}s\pi _\Lambda \psi _\tau \Vert \) is the most difficult part of the proof and we shift it to the lemma below. Substituting its result in the above estimate and using Assumption II, one completes the proof of the existence of the limit \(\tau \rightarrow +\infty \). \(\square \)

Lemma 8

Under the conditions of Assumption II, for \(\psi \in {\mathcal {D}}_c(\mathbf {p}^2)\) the following estimate holds

$$\begin{aligned} \Vert \xi (\langle z\rangle )^{-1}s\pi _\Lambda U(\tau ,0)\psi \Vert \le \mathrm {const}(\psi )\xi (\langle \tau \rangle )^{-1}\,. \end{aligned}$$


We assume again that \(\psi \in {\mathcal {D}}(r,\mathbf {p}^2)\) and \(\Vert \psi \Vert =1\). We observe that the lhs of the inequality may be equivalently replaced by \(\Vert s\pi _\Lambda \xi (\langle z\rangle )^{-1}U(\tau ,0)\psi \Vert \). Indeed, we have

$$\begin{aligned} \big \Vert \big [s\pi _\Lambda ,\xi (\langle z\rangle )^{-1}\big ]\big \Vert _\infty = \Big \Vert \frac{\mathbf {z}\xi '(\langle z\rangle )}{\langle \tau \rangle \xi (\langle z\rangle )^2}\Big \Vert _\infty \le \Big \Vert \frac{\kappa \,\mathbf {z}}{\langle \tau \rangle \langle z\rangle \xi (\langle z\rangle )}\Big \Vert _\infty \le \frac{\kappa }{\langle \tau \rangle }\,. \end{aligned}$$

Now we shall estimate the norm squared

$$\begin{aligned} \Vert s\pi _\Lambda \xi (\langle z\rangle )^{-1}\psi _\tau \Vert ^2 =(\pi _\Lambda \xi (\langle z\rangle )^{-1}\psi _\tau ,\rho \pi _\Lambda \xi (\langle z\rangle )^{-1}\psi _\tau ) \end{aligned}$$

by first finding a differential inequality, and then integrating. Preparing for that, we denote

$$\begin{aligned} \nu =i\xi (\langle z\rangle )^{-1}[H,\xi (\langle z\rangle )]=\frac{\lambda ^i\partial _i\xi (\langle z\rangle )}{\xi (\langle z\rangle )} =\frac{\xi '(\langle z\rangle )}{\langle \tau \rangle \xi (\langle z\rangle )}z^i\alpha ^i\,, \end{aligned}$$

with the standard notation \(\alpha ^i=\beta \gamma ^i\), and observe that

$$\begin{aligned} \xi (\langle z\rangle )^{-1} H=\tilde{H}\xi (\langle z\rangle )^{-1}+(\hat{\mathcal {A}}_\tau +\mu -i\nu )\xi (\langle z\rangle )^{-1}\,. \end{aligned}$$

To shorten notation, we shall write \(\psi _\tau ^\xi =\xi (\langle z\rangle )^{-1}\psi _\tau \). Looking at the explicit form of \(\Lambda _i\) (94), we note that

$$\begin{aligned} {[}\Lambda _i,\mu ]=0\,,\qquad [\Lambda _i,\nu ]_+=0\,. \end{aligned}$$

Now calculate

$$\begin{aligned} \partial _\tau [\pi _\Lambda \psi _\tau ^\xi ]&=-i\pi _\Lambda \xi (\langle z\rangle )^{-1} H\psi _\tau +(\partial _\tau \Lambda +\partial _\tau \hat{\mathcal {A}})\psi _\tau ^\xi \\&=-i\pi _\Lambda \tilde{H}\psi _\tau ^\xi +(\dot{\Lambda }+\hat{F}_{\tau .}+\partial (i\nu -\mu )+2\nu \Lambda )\psi _\tau ^\xi -(\nu +i\hat{\mathcal {A}}_\tau +i\mu )\pi _\Lambda \psi _\tau ^\xi \,, \end{aligned}$$


$$\begin{aligned}&\partial _\tau (\pi _\Lambda \psi _\tau ^\xi ,\rho \pi _\Lambda \psi _\tau ^\xi ) -(\pi _\Lambda \psi _\tau ^\xi ,\dot{\rho }\pi _\Lambda \psi _\tau ^\xi ) =-2(\pi _\Lambda \psi _\tau ^\xi ,\rho \nu \pi _\Lambda \psi _\tau ^\xi )\nonumber \\&\quad -i(\pi _\Lambda \rho \pi _\Lambda \psi _\tau ^\xi ,\tilde{H}\psi _\tau ^\xi ) +i(\tilde{H}\psi _\tau ^\xi ,\pi _\Lambda \rho \pi _\Lambda \psi _\tau ^\xi )\nonumber \\&\quad +2{{\,\mathrm{Re}\,}}(\pi _\Lambda \psi _\tau ^\xi ,\rho [\dot{\Lambda } +\hat{F}_{\tau .}+\partial (i\nu -\mu )+2\nu \Lambda ]\psi _\tau ^\xi )\,, \end{aligned}$$

where in both identities \(\hat{F}_{\tau .}\) denotes \(\hat{F}_{\tau i}\) with the index i suppressed (in the second identity summation over this index is implied). In the first identity we have used (60) and commuted \(\pi _\Lambda \) with the term \((\hat{\mathcal {A}}_\tau +\mu -i\nu )\).

From now on we continue the proof for \(\tau \ge 0\); for \(\tau \le 0\) the proof is analogous, but equation (61) has to be multiplied by \(-1\) before continuing. The first term on the rhs of (61) is bounded in absolute value by

$$\begin{aligned} 2\Vert \nu \Vert _\infty (\pi _\Lambda \psi _\tau ^\xi ,\rho \pi _\Lambda \psi _\tau ^\xi ) \le 2\kappa \langle \tau \rangle ^{-1}\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2\,, \end{aligned}$$

where we used the estimate given in (99). With the use of formula (41), the second line of equation (61) takes the form \(2{{\,\mathrm{Re}\,}}i(\tilde{H}\psi _\tau ^\xi ,(N+{\mathcal {B}})\psi _\tau ^\xi )\), and thanks to the estimate (43) is bounded in absolute value by

$$\begin{aligned} 2\big (\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert +|(\psi _\tau ^\xi , {\mathcal {B}}\psi _\tau ^\xi )|^{\frac{1}{2}}\big )\Vert (N+{\mathcal {B}})\psi _\tau ^\xi \Vert \,. \end{aligned}$$

The third line in (61) is bounded by

$$\begin{aligned} 2\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert \Vert s(\dot{\Lambda }+\hat{F}_{\tau \,.}+\partial (i\nu -\mu )+2\nu \Lambda )\psi _\tau ^\xi \Vert \,. \end{aligned}$$

Finally, we observe that for \(\tau >0\) we have [see (92)]

$$\begin{aligned} \dot{\rho }\le -\frac{2\tau }{\langle \tau \rangle ^2}\rho \,, \end{aligned}$$

which allows us to use (61) for the following estimate:

$$\begin{aligned} \partial _\tau \Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2\le -b\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2 +2c\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert +d\,, \end{aligned}$$


$$\begin{aligned}&b=\frac{2\tau }{\langle \tau \rangle ^2}-\frac{2\kappa }{\langle \tau \rangle }\,,\nonumber \\&c=\Vert (N+{\mathcal {B}})\psi _\tau ^\xi \Vert +\Vert s(\dot{\Lambda }+\hat{F}_{\tau \,.}+\partial (i\nu -\mu ) +2\nu \Lambda )\psi _\tau ^\xi \Vert \,,\\&d=2|(\psi _\tau ^\xi ,{\mathcal {B}}\psi _\tau ^\xi )|^{\frac{1}{2}}\Vert (N+{\mathcal {B}})\psi _\tau ^\xi \Vert \,.\nonumber \end{aligned}$$

The second term on the rhs of (62) may be estimated as follows

$$\begin{aligned} 2c\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert&=2\Big [\frac{1-2\kappa }{\langle \tau \rangle }\Big ]^{\frac{1}{2}} \Vert s\pi _\Lambda \psi _\tau ^\xi \Vert \ \Big [\frac{\langle \tau \rangle }{1-2\kappa }\Big ]^{\frac{1}{2}}c\\&\le \frac{1-2\kappa }{\langle \tau \rangle }\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2+\frac{\langle \tau \rangle c^2}{1-2\kappa }\,, \end{aligned}$$

which results in the inequality

$$\begin{aligned} \partial _\tau \Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2\le -b_0\Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2+d_0\,, \end{aligned}$$


$$\begin{aligned} b_0=\frac{2\tau }{\langle \tau \rangle ^2}-\frac{1}{\langle \tau \rangle }\,,\quad d_0=\frac{\langle \tau \rangle c^2}{1-2\kappa }+d\,. \end{aligned}$$

We set

$$\begin{aligned} \Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2=\exp \Big (-\int _0^\tau b_0(\sigma )\hbox {d}\sigma \Big )f(\tau ) =\frac{\tau +\langle \tau \rangle }{\langle \tau \rangle ^2}f(\tau ) \end{aligned}$$

and then (64) takes the form

$$\begin{aligned} \partial _\tau f\le \frac{\langle \tau \rangle ^2}{\tau +\langle \tau \rangle }d_0(\tau )\le \langle \tau \rangle d_0(\tau )\,. \end{aligned}$$

We note that \(f(0)=\Vert [s\pi _\Lambda \psi _\tau ^\xi ]_{\tau =0}\Vert ^2\) and find

$$\begin{aligned} \Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2 \le \frac{2}{\langle \tau \rangle }\bigg [\Vert [s\pi _\Lambda \psi _\tau ^\xi ]_{\tau =0}\Vert ^2 +\int _0^\tau \langle \sigma \rangle d_0(\sigma )\hbox {d}\sigma \bigg ]. \end{aligned}$$

We have to estimate \(d_0\) defined in (65). We start by estimating c. For the terms depending on the electromagnetic field, we have (we use the form of s and estimates of \(\lambda ^i\) given in Appendix D)

$$\begin{aligned}&\Vert {\mathcal {B}}\Vert _{r,\tau }\le \frac{\mathrm {const}\langle r \rangle ^2}{\langle \tau \rangle ^2} \big (\Vert \hat{F}_{ij}\Vert _{r,\tau }+\Vert z^i\hat{F}_{ij}\Vert _{r,\tau }\big ) \le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )}\,,\\&\Vert s\hat{F}_{\tau \,.}\Vert _{r,\tau } \le \frac{\mathrm {const}\langle r \rangle }{\langle \tau \rangle } \big (\Vert \hat{F}_{\tau i}\Vert _{r,\tau }+\Vert z^i\hat{F}_{\tau i}\Vert _{r,\tau }\big ) \le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )}\,.\nonumber \end{aligned}$$

For the term \(s^{ij}\partial _j(i\nu -\mu )\), using the estimates (98) and (100) in Appendix D we findFootnote 9

$$\begin{aligned} \Vert s\partial (i\nu -\mu )\psi _\tau ^\xi \Vert \le \Big \Vert \frac{s\partial (i\nu -\mu )}{\xi (\langle z\rangle )}\Big \Vert _{r,\tau } \le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\Big \Vert \frac{\langle z\rangle }{\xi (\langle z\rangle )}\Big \Vert _{r,\tau } \le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )}\,, \end{aligned}$$

where we used the fact, that both \(u/\xi (u)\) as well as \(\xi (u)\) are increasing, so

$$\begin{aligned} \frac{\langle z\rangle }{\xi (\langle z\rangle )}\le \frac{2\langle r \rangle \langle \tau \rangle }{\xi (2\langle r \rangle \langle \tau \rangle )} \le 2\langle r \rangle \frac{\langle \tau \rangle }{\xi (\langle \tau \rangle )}\,. \end{aligned}$$

The estimation of the other terms in (63) uses the bounds (95), (97) and (99) and gives

$$\begin{aligned} \Vert N\psi _\tau ^\xi \Vert +\Vert s\dot{\Lambda }\psi _\tau ^\xi \Vert +\Vert 2\nu \Lambda \psi _\tau ^\xi \Vert \le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\,, \end{aligned}$$

so summing up, we have

$$\begin{aligned} c\le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )}\,,\qquad \langle \tau \rangle \, c^2\le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )^2}\,. \end{aligned}$$

The use of (67) and (69) shows that also

$$\begin{aligned} d(\tau )\le \frac{\mathrm {const}(r)}{[\langle \tau \rangle \xi (\langle \tau \rangle )]^{\frac{3}{2}}} \le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )^2}\,, \end{aligned}$$

where we used the bound \(u^\frac{1}{2}\ge \xi (u)\ge 1\), see (49). Summing up, we obtain

$$\begin{aligned} d_0(\tau )\le \frac{\mathrm {const}(r)}{\langle \tau \rangle \xi (\langle \tau \rangle )^2}\,. \end{aligned}$$

Now, it follows from (48) that \(u^\kappa /\xi (u)\) is an increasing function. Therefore,

$$\begin{aligned} \int _0^\tau \langle \sigma \rangle d_0(\sigma )\hbox {d}\sigma&\le \mathrm {const}(r)\int _0^\tau \frac{\hbox {d}\sigma }{\xi (\langle \sigma \rangle )^2} \le \mathrm {const}(r) \frac{\langle \tau \rangle ^{2\kappa }}{\xi (\langle \tau \rangle )^2}\int _0^\tau \frac{\hbox {d}\sigma }{\langle \sigma \rangle ^{2\kappa }}\\&\le \mathrm {const}(r)\frac{\langle \tau \rangle }{\xi (\langle \tau \rangle )^2}\,. \end{aligned}$$

This, when used in (66), gives

$$\begin{aligned} \Vert s\pi _\Lambda \psi _\tau ^\xi \Vert ^2 \le \frac{2\Vert [s\pi _\Lambda \psi _\tau ^\xi ]_{\tau =0}\Vert ^2}{\langle \tau \rangle } +\frac{\mathrm {const}(r)}{\xi (\langle \tau \rangle )^2} \le \frac{\mathrm {const}(\psi )}{\xi (\langle \tau \rangle )^2}\,, \end{aligned}$$

where for the second inequality we used (49). \(\square \)

Typical Electromagnetic Field and Its Special Gauges

Assumption II, on which our main theorem on scattering 7 is based, is rather technical and not easy for interpretation. Here we formulate a rather typical situation met in scattering processes.Footnote 10 We show that the electromagnetic field thus identified admits a gauge in which Assumption II is satisfied. Moreover, there is a class of gauges which need not satisfy this assumption, but still assure a similar asymptotic structure.

The retarded and advanced potentials are defined in terms of the source current J in standard way (as \(\varphi _{\mathrm{ret}/\mathrm{adv}}\) is defined in terms of \(\rho \) in (118) in Appendix G). Also, the radiated field of the current J is defined in standard way, \(A_\mathrm{rad}=A_\mathrm{ret}-A_\mathrm{adv}\). The Heaviside step function is denoted by \(\theta \).

Assumption III. The Lorenz potential \(A_a\) of the electromagnetic field \(F_{ab}\) is given by

$$\begin{aligned} A=A_\mathrm{ret}+A_\mathrm{in}=A_\mathrm{adv}+A_\mathrm{out}\,, \end{aligned}$$

where \(A_\mathrm{ret}\) is the retarded potential of a current J satisfying the assumptions listed below, and \(A_\mathrm{in}\) is the radiated potential of another current \(J_\mathrm{in}\) with similar properties as J. Then also \(A_\mathrm{adv}\) is the advanced potential of the current J, and \(A_\mathrm{out}\) is the radiated potential of the current \(J_\mathrm{out}=J+J_\mathrm{in}\), which has similar properties as J and \(J_\mathrm{in}\).

The conserved current J(x) is of class \(C^3\) and for some \(0< \varepsilon < \frac{1}{2}\) satisfies the following estimates:

$$\begin{aligned}&|\nabla ^\alpha J(x)|\le \frac{\mathrm {const}}{(|x|+1)^{3+|\alpha |}}\bigg [\theta (x^2)+\frac{1}{(|x|+1)^\varepsilon } \bigg ]\,,\qquad \text {for}\quad |\alpha |\le 3, \end{aligned}$$
$$\begin{aligned}&|\nabla ^\alpha (x\cdot \nabla +3) J(x)|\le \frac{\mathrm {const}}{(|x|+1)^{3+|\alpha |+\varepsilon }}\,, \qquad \qquad \quad \,\, \text {for}\quad |\alpha |\le 2\,. \end{aligned}$$

The same is assumed for \(J_\mathrm{in}\), and then the same follows for \(J_\mathrm{out}\). The potential A is then of class \(C^3\) on the Minkowski spacetime.

Note that A is a linear combination of retarded and advanced potentials of currents satisfying (70). Therefore, the last statement of Assumption III is a consequence of Lemma 20 (i) in Appendix G.

Theorem 9

Let the electromagnetic field F and its Lorenz potential A satisfy Assumption III. Define a new gauge by

$$\begin{aligned} \mathcal {A}(x)=A(x)-\nabla S(x)\,. \end{aligned}$$

Then the following holds.

(i) For the choice

$$\begin{aligned} S(x)=S_\ell (x)\equiv \log (\langle \tau \rangle \langle z\rangle )\,x\cdot A(x)\,, \end{aligned}$$

the potential (72) fulfills Assumption II, so Theorems 6 and 7 are satisfied.

(ii) Let S(x) be another gauge function, such that the difference

$$\begin{aligned} {\mathcal {G}}(x)=S(x)-S_\ell (x) \end{aligned}$$

satisfies the assumptions of Theorem 6 (ii), so that the thesis of this theorem is true. Suppose, in addition, that there exist point-wise limits

$$\begin{aligned} \lim _{\tau \rightarrow \pm \infty }{\mathcal {G}}(\tau ,\mathbf {z})\equiv {\mathcal {G}}_\pm (\mathbf {z})\,. \end{aligned}$$

Then the potential (72) satisfies the thesis of Theorem 7 (but not necessarily the assumption of this theorem).

(iii) In particular, the gauges defined by:

$$\begin{aligned}&\text {(a)}\qquad S_{\log }(x)=\log \langle \tau \rangle \, x\cdot A(x)\,, \\&\text {(b)}\qquad S_\mathrm{tr}(x)=\int _0^\tau \partial _\sigma x(\sigma ,\mathbf {z})\cdot A(\sigma ,\mathbf {z})\mathrm{d}\sigma \,, \end{aligned}$$

are in the class defined in (ii). In case (b) one has \(\hat{\mathcal {A}}_\tau =0\) and

$$\begin{aligned} \hat{\mathcal {A}}_i(\tau ,\mathbf {z})=\hat{A}_i(0,\mathbf {z})-\int _0^\tau \hat{F}_{i\sigma }(\sigma ,\mathbf {z})\mathrm{d}\sigma \,. \end{aligned}$$

Remark 10

Assumption III, and consequently the validity of Theorem 9, is independent of the choice of the time axis for the definition of the foliation (20).

Proof of Theorem 9

(i) This potential obeys Theorem 23 in Appendix I. As \(\mathcal {A}\) is of class \(C^2\) on the Minkowski spacetime, so the assumption (29) of Theorem 6 and the assumption on continuous differentiability of the mapping \(\tau \mapsto \hat{\mathcal {A}}_i(\tau ,\mathbf {z})\) in Assumption II are clearly satisfied. Denote

$$\begin{aligned} \xi (u)=u^\kappa \,,\qquad \kappa<\varepsilon <\tfrac{1}{2}\,. \end{aligned}$$

The estimates of Theorem 23 imply then the following norm bounds:

$$\begin{aligned}&\displaystyle \Vert \hat{F}_{i\tau }\Vert _\infty \,,\ \Vert z^i\hat{F}_{i\tau }\Vert _\infty \le \frac{\mathrm {const}}{\langle \tau \rangle }\,,\\&\displaystyle \Vert \hat{F}_{ij}\Vert _\infty \,,\ \Vert z^i\hat{F}_{ij}\Vert _\infty \le \mathrm {const}\,,\\&\displaystyle \Vert \hat{\mathcal {A}}_\tau \Vert _{r,\tau }\le \mathrm {const}\Big (\frac{1+\log \langle \tau \rangle }{\langle \tau \rangle ^{1+\varepsilon }} +\frac{\log \langle r \rangle }{\langle \tau \rangle ^3}\Big )\,,\\&\displaystyle \Vert (1+\log \langle z\rangle )^{-1}\hat{\mathcal {A}}_\tau \Vert _\infty +\Vert \partial _i\hat{\mathcal {A}}_\tau \Vert _\infty +\Vert z^i\partial _i\hat{\mathcal {A}}_\tau \Vert _\infty \le \mathrm {const}\frac{1+\log \langle \tau \rangle }{\langle \tau \rangle ^{1+\varepsilon }}\,,\\&\displaystyle \Vert \partial _i\partial _j\hat{\mathcal {A}}_\tau \Vert _\infty \le \mathrm {const}(1+\log \langle \tau \rangle )\,,\\&\displaystyle \Vert \hat{\mathcal {A}}_i\Vert _\infty +\Vert z^i\hat{\mathcal {A}}_i\Vert _\infty \le \Vert \xi (\langle z\rangle )\hat{\mathcal {A}}_i\Vert _\infty +\Vert \xi (\langle z\rangle )z^i\hat{\mathcal {A}}_i\Vert _\infty \le \mathrm {const}(1+\log \langle \tau \rangle )\,,\\&\displaystyle \Vert \partial _\tau \hat{\mathcal {A}}_i\Vert _\infty +\Vert z^i\partial _\tau \hat{\mathcal {A}}_i\Vert _\infty \le \frac{\mathrm {const}}{\langle \tau \rangle }\,,\\&\displaystyle \Vert \partial _i\hat{\mathcal {A}}_j\Vert _\infty +\Vert z^i\partial _i\hat{\mathcal {A}}_j\Vert _\infty +\Vert z^j\partial _i\hat{\mathcal {A}}_j\Vert _\infty +\Vert z^iz^j\partial _i\hat{\mathcal {A}}_j\Vert _\infty \le \mathrm {const}(1+\log \langle \tau \rangle )\,. \end{aligned}$$

It is now easily checked that the potential satisfies assumption (30) in Theorem 6 and all the remaining bounds in Assumption II, from which the thesis follows. (Note that in this case all expressions are bounded in \(L^\infty \)-norm, except for \(\hat{\mathcal {A}}_\tau \).)

(ii) Let the evolution operator \(U(\tau ,\sigma )\) refer to the potential defined in (i), and denote the new potential now considered by \(\hat{\mathcal {A}}^{\mathcal {G}}\). Following further notation used in Theorem 6, we have

$$\begin{aligned}&\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }U^{\mathcal {G}}_\Phi (\tau ,0) =\mathop {\mathrm{s{-}lim}}\limits _{\tau \pm \infty }\Phi ^*(\tau )e^{i {\mathcal {G}}(\tau )}U(\tau ,0)e^{-i{\mathcal {G}}(0)}\\&\quad =\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }e^{i{\mathcal {G}}(\tau )}U_\Phi (\tau ,0)e^{-i{\mathcal {G}}(0)} =e^{i{\mathcal {G}}_\pm }U_\Phi (\pm \infty ,0)e^{-i{\mathcal {G}}(0)}\,, \end{aligned}$$


$$\begin{aligned}&\mathop {\mathrm{s{-}lim}}\limits _{\tau \rightarrow \pm \infty }U_0(0,\tau )U^{\mathcal {G}}(\tau ,0) =U_{0\Phi }(0,\pm \infty )U^{\mathcal {G}}_\Phi (\pm \infty ,0)\\&\quad =U_{0\Phi }(0,\pm \infty )e^{i{\mathcal {G}}_\pm }U_\Phi (\pm \infty ,0) \Omega _\mp e^{-i{\mathcal {G}}(0)} \equiv \Omega ^{\mathcal {G}}_\mp \,. \end{aligned}$$

Similarly for the limits of the conjugated operators.

(iii) Both gauges are easily seen to satisfy condition (31) of Theorem 6; we turn to the estimates (32). In the case (a), with \(C(x)=x\cdot A(x)\), we have

$$\begin{aligned} {\mathcal {G}}(\tau ,\mathbf {z})=-\log \langle z\rangle \,C(\tau ,\mathbf {z})\,, \end{aligned}$$

and the estimates are easily checked with the use of the results of the proof of Theorem 23. Also, it follows from the estimate of \(|\partial _\tau C|\) given there that \(C(\tau ,\mathbf {z})\) has limits for \(\tau \rightarrow \pm \infty \).

In the case (b), it is now sufficient to investigate the difference of the gauge function \(S_\mathrm{tr}(x)\) as compared to the gauge function of case (a):

$$\begin{aligned} {\mathcal {G}}'(x)=S_\mathrm{tr}(x)-S_{\log }(x)\,. \end{aligned}$$

Differentiating and using the form of \(\partial _\tau x\) given in the proof of Theorem 23, we find

$$\begin{aligned}&\partial _\tau {\mathcal {G}}'(\tau ,\mathbf {z})=\langle \tau \rangle ^{-2}\langle z\rangle A_0-\log \langle \tau \rangle \partial _\tau C\,,\\&\partial _i\partial _\tau {\mathcal {G}}'(\tau ,\mathbf {z})=\frac{1}{\langle \tau \rangle ^2}\Big [\frac{z^i}{\langle z\rangle }A_0+\langle z\rangle \partial _i A_0\Big ] -\log \langle \tau \rangle \partial _i\partial _\tau C\,, \end{aligned}$$

Now noting that \(\partial _i{\mathcal {G}}(0,\mathbf {z})=0\), integrating (the last term by parts) and applying the derivative \(\partial _j\), we obtain (fields in the integrand depend on \((\sigma ,\mathbf {z})\))

$$\begin{aligned} \partial _i\partial _j{\mathcal {G}}'(\tau ,\mathbf {z})&=\int _0^\tau \left[ \frac{d^{ij}}{\langle z\rangle }A_0 +\frac{1}{\langle z\rangle }(z^i\partial _j A_0+z^j\partial _i A_0) +\langle z\rangle \partial _i\partial _jA_0+\sigma \partial _i\partial _j C\right] \frac{\hbox {d}\sigma }{\langle \sigma \rangle ^2}\\&\quad -\log \langle \tau \rangle \partial _i\partial _j C\,, \end{aligned}$$

with \(d^{ij}\) defined in (133). With the use of the estimates listed in the proof of Theorem 23, one finds

$$\begin{aligned}&|\partial _\tau {\mathcal {G}}'|\le \mathrm {const}\frac{1+\log \langle \tau \rangle }{\langle \tau \rangle ^{1+\varepsilon }}\,, \qquad |\partial _i\partial _\tau {\mathcal {G}}'|\le \mathrm {const}\frac{1+\log \langle \tau \rangle }{\langle \tau \rangle ^{1+\varepsilon }}\,,\\&|\partial _i\partial _j{\mathcal {G}}'|\le \mathrm {const}(1+\log \langle \tau \rangle )\,,\qquad |\partial _i{\mathcal {G}}'|\le \mathrm {const}\,, \end{aligned}$$

the fourth estimate by the integration of the second one. Thus, the estimates (32) are satisfied. Finally, \(\partial _\tau {\mathcal {G}}'\) is integrable on \({\mathbb {R}}\), so the thesis follows. \(\square \)


There are three questions we want to address in this section:

  1. (i)

    How far is the present analysis from a complete treatment of the Maxwell-Dirac system?

  2. (ii)

    Is there a further physical selection criterion to choose a gauge from the class of gauges obtained in Theorem 9?

  3. (iii)

    Open problems.

With regard to the first of these questions, we note that the form of the charged currents producing electromagnetic fields in Assumption III mimics what one should expect in fully interacting theory. A possible shortage of this assumption rests in the estimates of derivatives of the currents, which would not be satisfied for oscillating terms in asymptotic behavior. The Dirac field current does have such asymptotic terms, but it is quite plausible to predict that oscillations dump the asymptotic behavior of fields produced by them. What would be needed is an appropriately fast vanishing of the leading oscillating asymptotic terms in the neighborhood of the lightcone (which is a quite reasonable prediction). This would presumably lead to similar behavior of electromagnetic potentials as that following from Assumption III. Moreover, we note that our Assumption II, on which our analysis is based, leaves much more room for the types of potentials than Assumption III.

Let us once more at this point recall the work by Flato et al. [7]. These authors do have a theorem on the evolution of the complete system, but in a rather restricted setting and with not much control over the range of validity. The theorem states that in the space of smooth (i.e. \(C^\infty \)) initial data there exists a neighborhood of zero, which gives rise to the Cauchy evolution and completeness. Our analysis is not fully developed to the full interacting case, but gives complete results for the Dirac part of the system, with the electromagnetic fields plausibly guessed.

Our main motivation for the present work was the expectation that the choice of gauge does matter for asymptotic behavior and its interpretation. We have shown that for a class of gauges the asymptotic behavior of the Dirac field approaches that of a free field, without the need for any dynamical corrections.

This brings us to the second question mentioned at the beginning. Theorem 9 identifies a class of gauges for which the situation described above takes place. The concrete gauge defined by (73) was used for technical reasons: this gauge satisfies Assumption II, and it is the only gauge with that property among the gauges explicitly mentioned in Theorem 9. However, we want to argue now, in less precise terms then those of the preceding sections, that a different choice of gauges in the given class has a definite physical interpretation when an extension, as mentioned above, to fully interacting system is considered. Such extension, partly based on conjectures, was considered in [8], and we have to briefly recall a few results of this analysis. It was found that if one uses inside the lightcone the gauge related to the Lorenz gauge A (of similar properties as those of Assumption III) by

$$\begin{aligned} \mathcal {A}_\mathrm {cone}(x)=A(x)-\nabla S_\mathrm{cone}(x)\,, \qquad S_\mathrm{cone}(x)=\log \sqrt{x^2}\,x\cdot A(x)\,, \end{aligned}$$

or another with similar timelike asymptotic behavior, then the asymptotic total four-momentum and angular momentum taken away into timelike infinity have the same functional form as those for the free Dirac field. Moreover, the energy–momentum radiated into null infinity is fully due to the free outgoing electromagnetic field. However, the angular momentum going out into the null infinity, in addition to the free radiated contribution, has mixed adv-out electromagnetic terms. These latter terms may be incorporated into the free Dirac field by a change of the asymptotic limit addition to the gauge function. As a result, both energy–momentum as well as angular momentum are clearly separated into electromagnetic and Dirac parts.Footnote 11 Now we would like to identify in our class of global gauges those for which this separation may be expected. It is not difficult to show that for points inside the future lightcone, represented by \(\lambda v\), with \(\lambda >0\) and v on the future unit hyperboloid, we have

$$\begin{aligned} \lim _{\lambda \rightarrow \infty }[S_{\log }(\lambda v)-S_\mathrm{cone}(\lambda v)]=0 \end{aligned}$$

for the gauge function \(S_{\log }\) defined in Theorem 9 (iii). Therefore, we put forward the selection criterion for the gauge functions S to be used in (72) for the definition of \(\mathcal {A}\):

$$\begin{aligned} S(\tau ,\mathbf {z})=S_{\log }(\tau ,\mathbf {z})+\Delta S(\tau ,\mathbf {z})\,, \end{aligned}$$

where \(\Delta S\) has the limit

$$\begin{aligned} \Delta S(\infty ,\mathbf {z})=\lim _{\tau \rightarrow \infty }\Delta S(\tau ,\mathbf {z}) \end{aligned}$$

which satisfies the condition of the separation of angular momentum as described above (it is easy to see that for \(\Delta S\) sufficiently regular the rhs is equal to \(\lim _{\lambda \rightarrow \infty }\Delta S(\lambda (\langle z\rangle ,\mathbf {z}))\)). Similar conditions should be applied for past infinity.

Gauges in the class thus selected have the property announced in the introduction: \(x\cdot \mathcal {A}(x)\) vanishes asymptotically in timelike directions. This may be checked easily for \(S_{\log }\), and if \(\Delta S(\tau ,\mathbf {z})\) is not oscillating in \(\tau \), the same is true for this part.

The existence and completeness of the wave operators, as indicated in Remark 10, do not depend on the choice of the time axis for the definition of our spacetime foliation. On the other hand, and this is the first of open problems, the precise transformation law from one inertial observer to another needs further investigation. More generally, one can ask what is a general class of Cauchy foliations for which the results could be repeated, and how the results would depend on the choice in the class.

Let us end this discussion by expressing the belief that our scheme could be applied to analogous problems on at least some of the smooth curved spacetime backgrounds. Whether, as inquired by one of the Referees, further extension to black hole type spacetimes would be possible, is a more speculative question.


  1. 1.

    For (generalized) Dollard methods, see the monograph [2]. Recent examples of the use of ‘dressing’ in construction of simplified quantum field models include [5, 13]. A recent attempt at a precise implementation of the Dollard idea in general quantum field theory (in the form proposed much earlier in rather imprecise terms by Kulish and Faddeev) may be found in [4].

  2. 2.

    What remains an open question for this system is the asymptotic completeness. In the article, we expressed the view that the clash between the symmetry groups of the two parts of the system: Lorentz for the Maxwell and Galilean for the Schrödinger equations, might make the issue more problematic. Our present discussion seems to confirm this.

  3. 3.

    I have first proposed the use of these coordinates in the present context in 1999, as a natural extension of the hyperbolic foliation of the inside of the lightcone, to a Cauchy foliation of the whole spacetime. The evolution of the Dirac field over the hyperbolic foliation, and its large hyperbolic time asymptotics, was analyzed in [8] in terms of a Fourier-like transformation, which transformed this evolution to a unitary evolution in the Hilbert space of spinors on the hyperboloid of four velocities. The existence of the wave operators for the Dirac field in external electromagnetic field was established (in appropriate gauge, see below), but no results on asymptotic completeness were obtained. The idea now was to formulate the evolution over the foliation (1) in similar Fourier terms. Piotr Marecki, my student at that time, carried out in his MSc Thesis [12] calculations of this program for the free Dirac field (scattering was only briefly mentioned in this thesis). This method has its assets, but it is inconvenient for the analysis of many further questions like self-adjointness of the generators, the analysis of the domain of validity of the Dirac equation in the differential form, or asymptotic completeness of the interacting case, the questions that have not been considered. Here we use another method, which could be developed much further, and enabled the solution of all these questions.

  4. 4.

    One should also mention that the use of more general, than the flat equal time hypersurfaces, places our problem as a special case of the problem of hyperbolic equations on Lorentzian manifolds, a question intensively studied in the past, see a recent monograph [1]. However, our more specific system allows us to apply more specific Hilbert space methods, and obtain stronger results.

  5. 5.

    The Cauchy problem and the asymptotic completeness of the interacting Maxwell–Dirac system were considered by Flato et al. in [7]. However, their analysis needs strong smoothness and smallness assumptions, the latter not under well-determined control, and uses methods rather not well suited for an application in quantum case (‘nonlinear representation of the Poincaré group’). Also, the authors modify the asymptotic dynamics by a variation of the Dollard method. Our aims are different, as explained above.

  6. 6.

    We postpone to such prospective publication a comparison with the existing discussions of the relevance of gauge choice in QED. Here counts the idea, originated by Dirac [3], of an a posteriori transformation to a ‘gauge-independent gauge,’ developed by many authors, most exhaustively by Steinmann, see his monograph [18], Chapter 12. Also, the expectation (not shared by everyone) that formulations of QED in differing gauges need not be equivalent received a recent support from a mathematical analysis of a simplified model [6] (where literature account may also be found).

  7. 7.

    This is a standard example of the fact that classical external interaction problems for quantum fields create problems of their own, which are not expected to propagate into full closed quantum theory. An even more restrictive question in the case of the quantum Dirac field in an external classical field is whether a unitary evolution operator exists for this system. As shown in [16] (for standard evolution over the flat foliation of the spacetime), this may be possible only if the magnetic field vanishes. This brings to sharp light a rather restricted physical relevance of such questions and models (note that this condition is not even inertial observer-independent; anyway, this problem is not related to infrared questions).

  8. 8.

    Note an analogy with the nonrelativistic case, see Section 2 in [11].

  9. 9.

    The problem of estimation of the term \(s^{ij}\partial _j\mu \) is the ultimate reason for our introduction of the function \(\xi \). Without it, the bound in (68) would have the form \(\mathrm {const}(r)\langle \tau \rangle ^{-1}\), which would be insufficient for our application.

  10. 10.

    Possible oscillating terms in the asymptotic behavior of charged currents are not taken into account. One can expect that fields produced by such terms decay more rapidly than those considered here; see also Discussion.

  11. 11.

    For explicit expressions and more extensive discussion we refer the reader to [8], Section V. Here we would only like to reassure the reader that the problems met for angular momentum in electrodynamics are accounted for in that discussion.

  12. 12.

    Unlike in [11], here we make no use of the Kato theorem.

  13. 13.

    See, e.g., Proposition B.3.3 in [2].

  14. 14.

    It may be remarked that the proof admits some weakening of the assumptions. For instance, in (i) for the derivatives of \(L^i\) it would be sufficient that \(\langle z\rangle ^{-2}\partial _iL^i\) be in \(L^\infty \); in (ii) for the second derivatives of \(L^i\) it suffices that \(\langle z\rangle \Delta L^i\) is in \(L^\infty \). We shall not need such more general results.


  1. 1.

    Bär, C., Ginoux, N., Pfäffle, F.: Wave Equations on Lorentzian Manifolds and Quantization, ESI Lectures in Mathematics and Physics. EMS, Zurich (2007). arxiv:0806.1036 [math.DG]

  2. 2.

    Dereziński, J., Gérard, Ch.: Scattering Theory of Classical and Quantum N-Particle Systems. Springer, Berlin (1997)

    Google Scholar 

  3. 3.

    Dirac, P.A.M.: Gauge-invariant formulation of quantum electrodynamics. Can. J. Phys. 33, 650–660 (1955)

    ADS  MathSciNet  Article  Google Scholar 

  4. 4.

    Duch, P.: Infrared problem in perturbative quantum field theory. arXiv:1906.00940. [math-ph]

  5. 5.

    Dybalski, W.: From Faddeev–Kulish to LSZ. Towards a non-perturbative description of colliding electrons. Nucl. Phys. B 925, 455–469 (2017)

    ADS  MathSciNet  Article  Google Scholar 

  6. 6.

    Dybalski, W., Wegener, B.: Asymptotic charges, large gauge transformations and inequivalence of different gauges in external current QED. JHEP 11, 126 (2019). 12 pp

    ADS  MathSciNet  Article  Google Scholar 

  7. 7.

    Flato, M., Simon, J., Taflin, E.: Asymptotic completeness, global existence and the infrared problem for the Maxwell–Dirac equations. arXiv:hep-th/9502061v1 (1995). Asymptotic Completeness, Global Existence and the Infrared Problem for the Maxwell-Dirac Equations, Memoirs of the American Mathematical Society (Book 606), AMS, 1997

  8. 8.

    Herdegen, A.: Long-range effects in asymptotic fields and angular momentum of classical field electrodynamics. J. Math. Phys. 36, 4044–4086 (1995)

    ADS  MathSciNet  Article  Google Scholar 

  9. 9.

    Herdegen, A.: Semidirect product of CCR and CAR algebras and asymptotic states in quantum electrodynamics. J. Math. Phys. 39, 1788–1817 (1998)

    ADS  MathSciNet  Article  Google Scholar 

  10. 10.

    Herdegen, A.: Asymptotic structure of electrodynamics revisited. Lett. Math. Phys. 107, 1439–1470 (2017)

    ADS  MathSciNet  Article  Google Scholar 

  11. 11.

    Herdegen, A.: Electromagnetic gauge choice for scattering of Schrödinger particle. Ann. H. Poincaré 20, 3633–3650 (2019)

    Article  Google Scholar 

  12. 12.

    Marecki, P.: Electron scattering in long-range electromagnetic field, MSc Thesis, Jagiellonian University, Kraków (2000). (unpublished)

  13. 13.

    Morchio, G. and Strocchi, F.: The infrared problem in QED: A lesson from a model with Coulomb interaction and realistic photon emission, Ann. H. Poincaré 17, 2699-2739 (2016). arXiv: 1410.7289v3 [hep-th]

  14. 14.

    Reed, M., Simon, B.: Methods of Modern Mathematical Physics, vol. I. Academic Press, San Diego (1975)

    Google Scholar 

  15. 15.

    Reed, M., Simon, B.: Methods of Modern Mathematical Physics, vol. II. Academic Press, San Diego (1979)

    Google Scholar 

  16. 16.

    Ruijsenaars, S.N.M.: Charged particles in external fields. I. Classical theory. J. Math. Phys. 18, 720–737 (1977)

    ADS  MathSciNet  Article  Google Scholar 

  17. 17.

    Scharf, G.: Finite Quantum Electrodynamics. The Causal Approach, 2nd edn. Springer, Berlin (1995)

    Google Scholar 

  18. 18.

    Steinmann, O.: Perturbative Quantum Electrodynamics and Axiomatic Field Theory. Springer, Berlin (2000)

    Google Scholar 

  19. 19.

    Vainberg, B.R.: Asymptotic Methods in Equations of Mathematical Physics. Gordon and Breach, New York (1989)

    Google Scholar 

Download references

Author information



Corresponding author

Correspondence to Andrzej Herdegen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Communicated by Jan Derezinski.


Appendix A. Transformation K

The operator of the Lorentz rotation of Dirac spinors in the hyperplane spanned by the timelike, future-pointing unit vectors t and n is given by

$$\begin{aligned} K(\kappa )=\exp (\tfrac{\kappa }{2}[\gamma \cdot n,\gamma \cdot t]) =\cosh (\kappa \sinh \zeta )+\frac{\sinh (\kappa \sinh \zeta )}{2\sinh \zeta }[\gamma \cdot n, \gamma \cdot t]\,, \end{aligned}$$

where \(\kappa \) is the parameter of the rotation, \(\zeta >0\) is such that \(n\cdot t=\cosh \zeta \), and the rhs is obtained by a simple calculation with the use of the relation \(([\gamma \cdot n,\gamma \cdot t])^2=4(\sinh \zeta )^2\). Rotation of \(\gamma \cdot n\) gives

$$\begin{aligned} K(-\kappa )\gamma \cdot n K(\kappa ) =\frac{\sinh (\zeta -2\kappa \sinh \zeta ) \gamma \cdot n +\sinh (2\kappa \sinh \zeta )\gamma \cdot t}{\sinh \zeta }\,. \end{aligned}$$

Demanding that the coefficient at \(\gamma \cdot n\) vanishes, we find \(\kappa _0=\zeta /(2\sinh \zeta )\) and then for \(K=K(\kappa _0)\) we have

$$\begin{aligned}&K=\frac{1+\gamma \cdot n \gamma \cdot t}{\sqrt{2(1+n\cdot t)}} =\gamma \cdot w\,\beta =K^\dagger \,,\qquad w=\frac{t+n}{\sqrt{(t+n)^2}}\,,\\&K^{-1}\gamma \cdot n K=\gamma \cdot t\,.\nonumber \end{aligned}$$

We now put \(t=e_0\), and n as given in (7). With notation introduced in Sect. 2, we now have:

Proposition 11

If \(\Sigma _\tau \) are rotationally symmetric, that is \(\tau =\tau (x^0,|\mathbf {x}|)\), then

$$\begin{aligned} {[}\hat{\gamma }_K^\mu ,(K^{-1}\partial _\mu K)]_+=0\,. \end{aligned}$$


The lhs of (75) is coordinate independent, so we can use Minkowski coordinates. If we use K in the form (74), then we find

$$\begin{aligned} \beta [\gamma _K^a,(K^{-1}\nabla _a K)]_+\beta =(\gamma ^b\gamma ^a\gamma ^c-\gamma ^c\gamma ^a\gamma ^b)w_b\nabla _a w_c =2\gamma ^b\gamma ^a\gamma ^c w_{[b}\nabla _a w_{c]}\,, \end{aligned}$$

where the second equality follows from the easily seen complete antisymmetry of the expression in gamma matrices following the first equality sign. Using the form of w given in (74), we find

$$\begin{aligned} w_{[b}\nabla _a w_{c]}=[(t+n)^2]^{-1}(t_{[b}+n_{[b})\nabla _a n_{c]}\,. \end{aligned}$$

If the assumption of the proposition holds, then \(n_a=(k_1,k_2x^i)\), where \(k_1\), \(k_2\) are functions of \(x^0\) and \(|\mathbf {x}|\). In consequence, the rhs of (76) vanishes and the thesis follows. \(\square \)

Appendix B. An Open System

We discuss here a class of open systems for which the (general) Schrödinger equation may be defined. The main tool for the discussion of this section is the commutator theorem, as formulated in Thm. X.37 in [15].Footnote 12

B.1: Hamiltonians

Theorem 12

Let \(h_0\ge 1\) be a self-adjoint operator, for which the dense subspace \({\mathcal {D}}\subseteq {\mathcal {D}}(h_0)\) is a core. Suppose that for \(\tau \in {\mathbb {R}}\) the following holds:

  1. (a)

    \(h(\tau )\) is a family of symmetric operators on \({\mathcal {D}}\),

  2. (b)

    the commutator \([h(\tau ),h_0]\), defined originally as a quadratic form on \({\mathcal {D}}\), extends to an operator on \({\mathcal {D}}\),

  3. (c)

    for \(\psi \in {\mathcal {D}}\) the maps \(\tau \mapsto h(\tau )\psi \), \(\tau \mapsto [h(\tau ),h_0]\psi \) are strongly continuous and the following bounds are satisfied:

    $$\begin{aligned} \Vert h(\tau )\psi \Vert&\le c_1(\tau )\Vert h_0\psi \Vert \,, \end{aligned}$$
    $$\begin{aligned} |(\psi ,[h(\tau ),h_0]\psi )|&\le c_2(\tau ) (\psi ,h_0\psi )\,, \end{aligned}$$
    $$\begin{aligned} \Vert [h(\tau ),h_0]\psi \Vert&\le c_3(\tau )\Vert h_0\psi \Vert \,, \end{aligned}$$

    where \(c_i(\tau )\) are continuous functions.

Denote by the same symbols the closures of the operators \(h(\tau )\) and \([h(\tau ),h_0]\).

Then the following holds:

  1. (A)

    \({\mathcal {D}}(h_0)\subseteq {\mathcal {D}}(h(\tau ))\), \({\mathcal {D}}(h_0)\subseteq {\mathcal {D}}([h(\tau ),h_0])\), and \(h(\tau )\) are essentially self-adjoint on \({\mathcal {D}}\) and on each core of \(h_0\),

  2. (B)

    operators \(h(\tau )h_0^{-1}\), \(h_0^{-1}h(\tau )\), \([h(\tau ),h_0]h_0^{-1}\) and \(h_0^{-1}[h(\tau ),h_0]\) extend to bounded operators, strongly continuous with respect to \(\tau \), and

    $$\begin{aligned} \begin{aligned}&\Vert h(\tau )h_0^{-1}\Vert =\Vert h_0^{-1}h(\tau )\Vert \le c_1(\tau )\,,\\&\Vert [h(\tau ),h_0]h_0^{-1}\Vert =\Vert h_0^{-1}[h(\tau ),h_0]\Vert \le c_3(\tau )\,, \end{aligned} \end{aligned}$$
  3. (C)

    statement (c) remains true for all \(\psi \in {\mathcal {D}}(h_0)\).


As \({\mathcal {D}}\) is a core for \(h_0\), the domains inclusions in (A) and the extension of inequalities (77)–(79) to \(\psi \in {\mathcal {D}}(h_0)\) follow immediately, if we denote the closures of the operators \(h(\tau )\) and \([h(\tau ),h_0]\) by the same symbols. In addition, the essential self-adjointness of \(h(\tau )\) follows from (77) and (78) by the commutator theorem, Thm. X.37 in [15].

The bounds (80) for the operators \(h(\tau )h_0^{-1}\) and \([h(\tau ),h_0]h_0^{-1}\) follow now immediately. The operators \(h_0^{-1}h(\tau )\) and \(h_0^{-1}i[h(\tau ),h_0]\), initially densely defined on \({\mathcal {D}}(h_0)\), extend to conjugates of \(h(\tau )h_0^{-1}\) and \(i[h(\tau ),h_0]h_0^{-1}\), respectively, so all the bounds (80) are satisfied.

Strong continuity in \(\tau \) of the operators \(h_0^{-1}h(\tau )\) and \(h_0^{-1}[h(\tau ),h_0]\) follows immediately from the bounds (80) and the assumption (c). For the continuity of \(h(\tau )h_0^{-1}\), we note what follows. For \(\psi \in {\mathcal {H}}\) the vector \(h_0^{-1}\psi \) is in \({\mathcal {D}}(h_0)\), so for each \(\epsilon >0\) there exist \(\varphi \in {\mathcal {D}}\) (a core for \(h_0\)) such that \(\Vert \psi -h_0\varphi \Vert \le \epsilon \). We estimate

$$\begin{aligned} \Vert h(\tau )h_0^{-1}\psi -h(\sigma )h_0^{-1}\psi \Vert \le \epsilon [c_1(\tau )+c_1(\sigma )]+\Vert h(\tau )\varphi -h(\sigma )\varphi \Vert \,, \end{aligned}$$

so the strong continuity follows. The case of \([h(\tau ),h_0]h_0^{-1}\) is analogous. This completes the proof of (B), and its consequence is the statement of strong continuity of maps in (C). \(\square \)

B.2: Evolution

Theorem 13

Let all the assumptions of Theorem 12 be satisfied. Moreover, let \((\tau ,\sigma )\mapsto u(\tau ,\sigma )\) be a strongly continuous unitary evolution system in \({\mathcal {H}}\):

$$\begin{aligned} u(\tau ,\rho )u(\rho ,\sigma )=u(\tau ,\sigma )\,,\quad u(\tau ,\tau )=1\,, \end{aligned}$$

such that \(u(\tau ,\sigma ){\mathcal {D}}={\mathcal {D}}\). Suppose that for \(\psi \in {\mathcal {D}}\):

  1. (a)

    the maps \((\tau ,\sigma )\mapsto h(\tau )u(\tau ,\sigma )\psi \) and \((\tau ,\sigma )\mapsto h_0u(\tau ,\sigma )\psi \) are strongly continuous,

  2. (b)

    the map \((\tau ,\sigma )\mapsto u(\tau ,\sigma )\psi \) is of class \(C^1\) in the strong sense and the following equations hold:

    $$\begin{aligned} i\partial _\tau u(\tau ,\sigma )\psi =h(\tau )u(\tau ,\sigma )\psi \,, \quad i\partial _\sigma u(\tau ,\sigma )\psi =-u(\tau ,\sigma )h(\sigma )\psi \,, \end{aligned}$$

Then, the following holds:

  1. (A)

    the operator \(h_0u(\tau ,\sigma )h_0^{-1}\) is bounded, with the norm

    $$\begin{aligned} \Vert h_0u(\tau ,\sigma )h_0^{-1}\Vert \le \exp \left[ \mathrm {const}\int _\sigma ^\tau c_3(\rho )\mathrm{d}\rho \right] \,, \end{aligned}$$

    so in particular

    $$\begin{aligned} u(\tau ,\sigma ){\mathcal {D}}(h_0)={\mathcal {D}}(h_0)\,; \end{aligned}$$
  2. (B)

    the map \((\tau ,\sigma )\mapsto h_0u(\tau ,\sigma )h_0^{-1}\) is strongly continuous;

  3. (C)

    all statements of assumptions (a) and (b) remain true for all \(\psi \in {\mathcal {D}}(h_0)\).


To show (A), we use a standard trick.Footnote 13 It is easy to see that

$$\begin{aligned} \Big [\frac{h_0^2}{1+\epsilon h_0^2},h(\tau )\Big ] =\frac{h_0}{1+\epsilon h_0^2}\big [[h(\tau ),h_0], h_0^{-1}\big ]_+\frac{h_0}{1+\epsilon h_0^2}\,, \end{aligned}$$

so using the second bound in (80), we have for \(\psi \in {\mathcal {D}}\)

$$\begin{aligned} \partial _\tau \Big (u(\tau ,\sigma )\psi , \frac{h_0^2}{1+\epsilon h_0^2}u(\tau ,\sigma )\psi \Big ) \le 2 c_3(\tau )\Big (u(\tau ,\sigma )\psi , \frac{h_0^2}{1+\epsilon h_0^2}u(\tau ,\sigma )\psi \Big )\,, \end{aligned}$$

By Gronwall’s inequality, we obtain

$$\begin{aligned} \Big \Vert \frac{h_0}{(1+\epsilon h_0^2)^\frac{1}{2}}u(\tau ,\sigma )\psi \Big \Vert \le e^{\int _\sigma ^\tau c_3(\rho )\mathrm{d}\rho }\Big \Vert \frac{h_0}{(1+\epsilon h_0^2)^\frac{1}{2}}\psi \Big \Vert \,. \end{aligned}$$

As \(h_0\) is closed, letting \(\epsilon \searrow 0\) we show that \(u(\tau ,\sigma )\psi \in {\mathcal {D}}(h_0)\) for \(\psi \in {\mathcal {D}}\), and

$$\begin{aligned} \Vert h_0u(\tau ,\sigma )\psi \Vert \le e^{\int _\sigma ^\tau c_3(\rho )\hbox {d}\rho }\Vert h_0\psi \Vert \,. \end{aligned}$$

But \({\mathcal {D}}\) is a core for \(h_0\), so this inequality extends to \({\mathcal {D}}(h_0)\). Setting \(\psi =h_0^{-1}\chi \), with arbitrary \(\chi \in {\mathcal {H}}\), we obtain (A).

For statement (B) we note that for \(\psi \in {\mathcal {H}}\) and \(\varphi \in {\mathcal {D}}\) such that \(\Vert \psi -h_0\varphi \Vert \le \epsilon \) (as in the proof of Theorem 12) we have

$$\begin{aligned}&\Vert h_0[u(\tau ,\sigma )-u(\tau ',\sigma ')]h_0^{-1}\psi \Vert \\&\quad \le \Vert h_0[u(\tau ,\sigma )-u(\tau ',\sigma ')]h_0^{-1}\Vert \epsilon +\Vert h_0u(\tau ,\sigma )\varphi -h_0u(\tau ',\sigma ')\varphi \Vert \,, \end{aligned}$$

so the map \((\tau ,\sigma )\mapsto h_0u(\tau ,\sigma )h_0^{-1}\) is strongly continuous.

The extension of (a) to all \(\psi \in {\mathcal {D}}(h_0)\) now follows from the present statement (B), and the statement (B) of Theorem 12. For the extension of (b), we write the integral forms of equations (81) for \(\psi \in {\mathcal {D}}\)

$$\begin{aligned} u(\tau ,\sigma )\psi -\psi =-i\int _\sigma ^\tau h(\rho )u(\rho ,\sigma )\psi \,\hbox {d}\rho =-i\int _\sigma ^\tau u(\rho ,\sigma )h(\rho )\psi \,\hbox {d}\rho \,. \end{aligned}$$

But we already know that both integrands extend to continuous functions of \(\rho \) for \(\psi \in {\mathcal {D}}(h_0)\), so this extension followed by differentiation leads to the extended equations (81). \(\square \)

Appendix C. A Lemma

Here we state a result to be applied in Sects. 2 and 3. The lemma below enables the application of Theorem 12 to our system.

Lemma 14

Let \(L^i(\mathbf {z})\), \(i=1,2,3\), and \(M(\mathbf {z})\) be four \(n\times n\) Hermitian matrix functions on \({\mathbb {R}}^3\), and denote

$$\begin{aligned} h=\tfrac{1}{2}(L^ip_i+p_iL^i)+ M\,, \quad h_0=\tfrac{1}{2}(\mathbf {p}^2+\mathbf {z}^2)\,. \end{aligned}$$

Then for some constants \(C_i\) (\(i=1,2,3\)) and all \(\varphi \in C_0^\infty ({\mathbb {R}}^3, {\mathbb {C}}^n)\) the following inequalities are satisfied:

  1. (i)

    if the functions \(\langle z\rangle ^{-1}L^i(\mathbf {z})\), \(\partial _iL^j(\mathbf {z})\) and \(\langle z\rangle ^{-2}M(\mathbf {z})\) are in \(L^\infty ({\mathbb {R}}^3)\), then

    $$\begin{aligned} \Vert h\varphi \Vert \le C_1\Vert h_0\varphi \Vert \,; \end{aligned}$$
  2. (ii)

    if in addition \(\partial _i\partial _jL^k(\mathbf {z})\) and \(\langle z\rangle ^{-1}\partial _i M(\mathbf {z})\) are in \(L^\infty ({\mathbb {R}}^3)\), then

    $$\begin{aligned} |(h\psi ,h_0\varphi )-(h_0\psi ,h\varphi )| \le C_2\Vert h_0^{\frac{1}{2}}\psi \Vert \Vert h_0^{\frac{1}{2}}\varphi \Vert \,; \end{aligned}$$
  3. (iii)

    if in addition to (i) and (ii) also \(\partial _i\partial _j\partial _k L^l(\mathbf {z})\) and \(\partial _i\partial _j M(\mathbf {z})\) are in \(L^\infty ({\mathbb {R}}^3)\), then

    $$\begin{aligned} \Vert [h,h_0]\varphi \Vert \le C_3\Vert h_0\varphi \Vert \,. \end{aligned}$$


Footnote 14 Before starting the proof of (i)–(iii), we note a few simple facts. For \(\varphi \in C_0^\infty ({\mathbb {R}}^3, {\mathbb {C}}^n)\) we note the identity

$$\begin{aligned} \varphi ^\dagger \mathbf {p}^2\varphi +(\mathbf {p}^2\varphi )^\dag \varphi +\Delta |\varphi |^2=2|\mathbf {p}\varphi |^2\,, \end{aligned}$$

which multiplied by \(\mathbf {z}^2\) and integrated (transfer \(\Delta \) by parts) gives

$$\begin{aligned} {{\,\mathrm{Re}\,}}(\mathbf {p}^2\varphi ,\mathbf {z}^2\varphi )+3\Vert \varphi \Vert ^2=\Vert |\mathbf {z}|\mathbf {p}\varphi \Vert ^2\,. \end{aligned}$$

Also, it follows from \(\mathbf {p}^2+\mathbf {z}^2\ge 3\cdot \mathbf {1}\) that

$$\begin{aligned} \Vert \varphi \Vert \le \tfrac{1}{3}\Vert (\mathbf {p}^2+\mathbf {z}^2)\varphi \Vert \,. \end{aligned}$$

Using (85) and (86), one finds

$$\begin{aligned}&\Vert (\mathbf {p}^2+\mathbf {z}^2)\varphi \Vert ^2 - \Vert \mathbf {p}^2\varphi \Vert ^2-\Vert \mathbf {z}^2\varphi \Vert ^2 =2{{\,\mathrm{Re}\,}}(\mathbf {p}^2\varphi ,\mathbf {z}^2\varphi )\\&\quad \ge -6\Vert \varphi \Vert ^2 \ge -\tfrac{2}{3}\Vert (\mathbf {p}^2+\mathbf {z}^2)\varphi \Vert ^2\,, \end{aligned}$$

which gives the bound:

$$\begin{aligned} \Vert \mathbf {p}^2\varphi \Vert ^2+\Vert \mathbf {z}^2\varphi \Vert ^2\le \tfrac{5}{3}\,\Vert (\mathbf {p}^2+\mathbf {z}^2)\varphi \Vert ^2\,. \end{aligned}$$

We note that the L- and M-parts of the thesis may be proved separately. Consider the L-part first, and assume that \(L^i\) satisfy the assumptions in (i). Then \(N_{\alpha ,i\beta }(\mathbf {z})=\langle z\rangle ^{-1}L^i_{\alpha \beta }(\mathbf {z})\) may be treated as a bounded operator \({\mathbb {C}}^{3n}\otimes L^2({\mathbb {R}}^3)\mapsto {\mathbb {C}}^n\otimes L^2({\mathbb {R}}^3)\), whose norm we denote \(d_1\). Then,

$$\begin{aligned} \Vert L^ip_i\varphi \Vert ^2&\le d_1^2\sum _i\Vert \langle z\rangle p_i\varphi \Vert ^2=d_1^2 \big (\Vert |\mathbf {z}|\mathbf {p}\varphi \Vert ^2+(\varphi ,\mathbf {p}^2\varphi )\big )\\&\le d_1^2 \big (\Vert \mathbf {p}^2\varphi \Vert \Vert \mathbf {z}^2\varphi \Vert +3\Vert \varphi \Vert ^2+\Vert \varphi \Vert \Vert \mathbf {p}^2\varphi \Vert \big )\\&\le \tfrac{1}{2}d_1^2\big (2\Vert \mathbf {p}^2\varphi \Vert ^2+\Vert \mathbf {z}^2\varphi \Vert ^2+7\Vert \varphi \Vert ^2\big )\,, \end{aligned}$$

where in the third step we used identity (85) and the Schwarz inequality. Taking into account that \(\mathbf {L}\cdot \mathbf {p}+\mathbf {p}\cdot \mathbf {L}=2\mathbf {L}\cdot \mathbf {p}-i(\varvec{\partial }\cdot \mathbf {L})\) and \(\Vert \varvec{\partial }\cdot \mathbf {L}\Vert _\infty \le \mathrm {const}\), with the use of (86) and (87) one easily arrives at the estimate (82) for the L-part.

Let now \(L^i\) satisfy the assumptions in (ii) and let \(\varphi \in C_0^\infty ({\mathbb {R}}^3, {\mathbb {C}}^n)\). Then, using the usual rules for calculation of commutators, one finds

$$\begin{aligned} \big [[L^i,p_i]_+,h_0\big ]\varphi =-2iz^iL^i\varphi +\tfrac{i}{2}[p_i,p_j(\partial _jL^i)+(\partial _jL^i)p_j]_+\varphi \,, \end{aligned}$$

which makes sense as a distributional identity (\(p_i\), \(p_j\) treated distributionally). As \(L^i\) are twice differentiable, we can transform the second term on the rhs:

$$\begin{aligned} \tfrac{i}{2}[p_i,p_j(\partial _jL^i)+(\partial _jL^i)p_j]_+\varphi =ip_i\big ((\partial _jL^i)+(\partial _iL^j)\big )p_j\varphi +\tfrac{i}{2}[p_i,\Delta L^i]\varphi \,. \end{aligned}$$

Setting this into (88) and taking the product with \(\psi \in C_0^\infty ({\mathbb {R}}^3, {\mathbb {C}}^n)\), we obtain

$$\begin{aligned}&([L^i,p_i]_+\psi , h_0\varphi ) -(h_0\psi ,[L^i,p_i]_+\varphi ) =i(p_i\psi ,(\partial _iL^j+\partial _jL^i)p_j\varphi )\\&\quad -\tfrac{1}{2}(\psi ,(\Delta L^i)p_i\varphi ) +\tfrac{1}{2}((\Delta L^i)p_i\psi ,\varphi ) -2i(z^i\psi ,L^i\varphi )\,. \end{aligned}$$

We treat \(P_{i\alpha ,j\beta }(\mathbf {z})=\partial _iL^j_{\alpha \beta }(\mathbf {z})+\partial _j L^i_{\alpha \beta }(\mathbf {z})\) as a bounded operator acting in \({\mathbb {C}}^{3n}\otimes L^2({\mathbb {R}}^3)\), whose norm we denote \(d_2\). Then

$$\begin{aligned} |(p_i\psi ,(\partial _iL^j+\partial _jL^i)p_j\varphi )| \le d_2\Big (\sum _i\Vert p_i\psi \Vert ^2\sum _j\Vert p_j\varphi \Vert ^2\Big )^{\frac{1}{2}} \le 2d_2\Vert h_0^{\frac{1}{2}}\psi \Vert \Vert h_0^{\frac{1}{2}}\varphi \Vert \,. \end{aligned}$$

Similarly, \(Q_{\alpha ,i\beta }(\mathbf {z})=\Delta L^i_{\alpha \beta }(\mathbf {z})\) is a bounded operator from \({\mathbb {C}}^{3n}\otimes L^2({\mathbb {R}}^3)\) into \({\mathbb {C}}^n\otimes L^2({\mathbb {R}}^3)\) with the norm \(d_3\), and \(R_{i\alpha ,\beta }(\mathbf {z})=\langle z\rangle ^{-1}L^i_{\alpha \beta }(\mathbf {z})\) a bounded operator from \({\mathbb {C}}^n\otimes L^2({\mathbb {R}}^3)\) into \({\mathbb {C}}^{3n}\otimes L^2({\mathbb {R}}^3)\) with the norm \(d_4\). Therefore

$$\begin{aligned}&|(\psi ,(\Delta L^i)p_i\varphi )| \le d_3\Vert \psi \Vert \Big (\sum _i\Vert p_i\varphi \Vert ^2\Big )^\frac{1}{2} \le \mathrm {const}\Vert h_0^{\frac{1}{2}}\psi \Vert \Vert h_0^{\frac{1}{2}}\varphi \Vert \,,\\&|(z^i\psi ,L^i\varphi )|\le d_4\Vert \langle z\rangle \varphi \Vert \Big (\sum _i\Vert z^i\psi \Vert ^2\Big )^{\frac{1}{2}} \le \mathrm {const}\Vert h_0^{\frac{1}{2}}\psi \Vert \Vert h_0^{\frac{1}{2}}\varphi \Vert \,, \end{aligned}$$

which ends the proof of (83) for the L-part.

If in addition the assumptions in (iii) are satisfied, then on the rhs of (88) all operators \(p_i\) may be commuted to the right, and the commutator is a differential operator with functional coefficients:

$$\begin{aligned}{}[[L^i,p_i]_+,h_0]=2i(\partial _i L^j)p_ip_j +(\Delta L^j+\partial _j\varvec{\partial }\cdot \mathbf {L})p_j -\tfrac{i}{2}(\Delta \varvec{\partial }\cdot \mathbf {L})-2i\mathbf {z}\cdot \mathbf {L}\,. \end{aligned}$$

Using similar techniques as above, we find

$$\begin{aligned}&\Vert (\partial _iL^j)p_ip_j\varphi \Vert \le d_5\Big (\sum _{i,j}\Vert p_ip_j \varphi \Vert ^2\Big )^{\frac{1}{2}} =d_5\Vert \mathbf {p}^2\varphi \Vert \,,\\&\Vert (\Delta L^j+\partial _j\varvec{\partial }\cdot \mathbf {L})p_j\varphi \Vert \le d_6\Big (\sum _j\Vert p_j\varphi \Vert ^2\Big )^{\frac{1}{2}} \le \mathrm {const}\Vert h_0^{\frac{1}{2}}\varphi \Vert \le \mathrm {const}\Vert h_0\varphi \Vert \,,\\&\Vert (\Delta \varvec{\partial }\cdot \mathbf {L})\varphi \Vert \le \mathrm {const}\Vert \varphi \Vert \,,\qquad \Vert \mathbf {z}\cdot \mathbf {L}\varphi \Vert \le \mathrm {const}\Vert \langle z\rangle ^2\varphi \Vert \,. \end{aligned}$$

Again using (86) and (87), one obtains (84) for the L-part.

For the M-part, note that

$$\begin{aligned} \Vert M\varphi \Vert \le \mathrm {const}\Vert \langle z\rangle ^2\varphi \Vert \le \mathrm {const}\Vert (\mathbf {p}^2+\mathbf {z}^2)\varphi \Vert \,, \end{aligned}$$

which proves (82). Consider the commutator

$$\begin{aligned} {[} M,h_0]=\tfrac{i}{2}[(\varvec{\partial }M)\cdot \mathbf {p}+\mathbf {p}\cdot (\varvec{\partial }M)]\,. \end{aligned}$$

If the assumptions in (ii) are satisfied, then

$$\begin{aligned}&2|(M\psi ,h_0\varphi )-(h_0\psi ,M\varphi )|\le |((\partial _iM)\psi ,p_i\varphi )|+|(p_i\psi ,(\partial _iM)\varphi )|\\&\quad \le \mathrm {const}\Vert h_0^{\frac{1}{2}}\psi \Vert \Vert h_0^{\frac{1}{2}}\varphi \Vert \,, \end{aligned}$$

which proves (83). Finally, with the assumption in (iii), (84) for this part is obtained similarly as (82) for the L-part. \(\square \)

Appendix D. Special Variables

Variables \((\tau ,\mathbf {z})\) are defined by:

$$\begin{aligned} x^0=\tau \langle z\rangle \,,\qquad \mathbf {x}=\langle \tau \rangle \mathbf {z}\,, \end{aligned}$$

see (20). Further notation:

$$\begin{aligned} \langle \tau |z\rangle =(\tau ^2+\mathbf {z}^2+1)^\frac{1}{2}\,,\quad \langle \tau \Vert z\rangle =\langle \tau \rangle \langle z\rangle +\langle \tau |z\rangle \,. \end{aligned}$$

At various points of the calculations in this section, the following identities are used

$$\begin{aligned} \tau ^2|\mathbf {z}|^2=\langle \tau \rangle ^2\langle z\rangle ^2-\langle \tau |z\rangle ^2= \langle \tau \Vert z\rangle (\langle \tau \rangle \langle z\rangle -\langle \tau |z\rangle )\,. \end{aligned}$$

The derivative transformation between these coordinates and the Minkowski system is given by

$$\begin{aligned} \frac{\partial (x^0,\mathbf {x})}{\partial (\tau ,\mathbf {z})} =\left( \begin{array}{cc} \langle z\rangle &{}\quad \dfrac{\tau {\mathbf {z}}^{\intercal }}{\langle z\rangle }\\ \dfrac{\tau \mathbf {z}}{\langle \tau \rangle } &{}\quad \langle \tau \rangle \mathbf {1} \end{array}\right) , \quad \frac{\partial (\tau ,\mathbf {z})}{\partial (x^0,\mathbf {x})} =\left( \begin{array}{cc} \dfrac{\langle \tau \rangle ^2\langle z\rangle }{\langle \tau |z\rangle ^2} &{}\quad -\dfrac{\tau \langle \tau \rangle \mathbf {z}^\intercal }{\langle \tau |z\rangle ^2} \\ -\dfrac{\tau \langle z\rangle \mathbf {z}}{\langle \tau |z\rangle ^2} &{}\quad \dfrac{1}{\langle \tau \rangle }\bigg [\mathbf {1}+\dfrac{\tau ^2}{\langle \tau |z\rangle ^2}\mathbf {z}\mathbf {z}^\intercal \bigg ] \end{array}\right) \,. \end{aligned}$$

Using these transformations, one finds the new metric tensor matrices

$$\begin{aligned}&(g_{\mu \nu }) =\left( \begin{array}{cc} \dfrac{\langle \tau |z\rangle ^2}{\langle \tau \rangle ^2} &{}\quad 0\\ 0 &{}\quad -\langle \tau \rangle ^2\mathbf {1}+\dfrac{\tau ^2}{\langle z\rangle ^2}\mathbf {z}\mathbf {z}^\intercal \end{array}\right) \,, \quad g_{(z)}=-\frac{\langle \tau \rangle ^4\langle \tau |z\rangle ^2}{\langle z\rangle ^2}\,,\\&(g^{\mu \nu }) =\left( \begin{array}{cc} \dfrac{\langle \tau \rangle ^2}{\langle \tau |z\rangle ^2} &{} \quad 0 \\ 0 &{}\quad -\dfrac{1}{\langle \tau \rangle ^2}\bigg [\mathbf {1}+\dfrac{\tau ^2}{\langle \tau |z\rangle ^2}\mathbf {z}\mathbf {z}^\intercal \bigg ] \end{array}\right) \end{aligned}$$

and the new vector components of the gamma matrices

$$\begin{aligned} \hat{\gamma }^\tau&=\frac{\langle \tau \rangle }{\langle \tau |z\rangle ^2}\big (\langle \tau \rangle \langle z\rangle \gamma ^0-\tau \mathbf {z}\cdot \varvec{\gamma }\big )\,,\\ \hat{\gamma }^i&=\frac{1}{\langle \tau \rangle }\gamma ^i-\frac{\tau }{\langle \tau \rangle \langle \tau |z\rangle ^2}\big (\langle \tau \rangle \langle z\rangle \gamma ^0-\tau \mathbf {z}\cdot \varvec{\gamma }\big )z^i\,. \end{aligned}$$

The operator K defined by (8) and discussed in Appendix A takes the form

$$\begin{aligned}&K=\frac{1}{\sqrt{2\langle \tau |z\rangle }} \Big [\sqrt{\langle \tau \Vert z\rangle }+\frac{\tau }{\sqrt{\langle \tau \Vert z\rangle }}\gamma ^0\mathbf {z}\cdot \varvec{\gamma }\Big ]=K^\dagger \,,\\&K^{-1}=\gamma ^0K\gamma ^0\,, \end{aligned}$$

which leads to the following form of the transformed gamma matrices (10) and \(\lambda \) matrices introduced by (11):

$$\begin{aligned} \hat{\gamma }_K^i&=\frac{1}{\langle \tau \rangle }\gamma _\bot ^i+\frac{\langle z\rangle }{\langle \tau |z\rangle }\gamma _z\hat{z}^i\,,\nonumber \\ \lambda ^i&=\frac{\langle \tau |z\rangle }{\langle \tau \rangle ^2}\alpha _{\bot }^i+\frac{\langle z\rangle }{\langle \tau \rangle }\alpha _z \hat{z}^i\nonumber \\&=\frac{\langle \tau |z\rangle }{\langle \tau \rangle ^2}\alpha _{\bot }^i+\frac{1}{(\langle z\rangle +|\mathbf {z}|)\langle \tau \rangle }\alpha _z \hat{z}^i +\frac{1}{\langle \tau \rangle }\alpha _z z^i\,, \end{aligned}$$


$$\begin{aligned}&\hat{z}^i=\frac{z^i}{|\mathbf {z}|}\,,\qquad \gamma _z=\mathbf {\hat{z}}\cdot \varvec{\gamma }\,,\qquad \gamma _\bot ^i=\gamma ^i-\gamma _z\hat{z}^i\,,\\&\alpha ^i=\gamma ^0\gamma ^i\,,\qquad \alpha _z=\mathbf {\hat{z}}\cdot \varvec{\alpha }\,,\qquad \alpha _\bot ^i=\alpha ^i-\alpha _z\hat{z}^i\,. \end{aligned}$$

For the estimation of X in (58) it is convenient to have

$$\begin{aligned} (g^{\tau \tau })^{\frac{1}{2}}\lambda ^i =\frac{1}{\langle \tau \rangle }\alpha _{\bot }^i+\frac{1}{(\langle z\rangle +|\mathbf {z}|)\langle \tau |z\rangle }\alpha _z \hat{z}^i +\frac{1}{\langle \tau |z\rangle }\alpha _z z^i\,. \end{aligned}$$

The derivatives of \(\lambda \) matrices are

$$\begin{aligned}&\partial _j\lambda ^i=\frac{|\mathbf {z}|}{\langle \tau \rangle ^2}\bigg [\frac{\hat{z}^j\alpha _{\bot }^i}{\langle \tau |z\rangle } +\frac{\tau ^2\alpha _{\bot }^j\hat{z}^i}{\langle \tau \Vert z\rangle } +\Big (\frac{\tau ^2\delta _{\bot }^{ji}}{\langle \tau \Vert z\rangle } +\frac{\langle \tau \rangle \hat{z}^j\hat{z}^i}{\langle z\rangle }\Big )\alpha _z\bigg ]\,,\\&\partial _i\lambda ^i=\frac{1}{\langle \tau \rangle ^2} \bigg [\frac{2\tau ^2}{\langle \tau \Vert z\rangle }+\frac{\langle \tau \rangle }{\langle z\rangle }\bigg ]\mathbf {z}\cdot \varvec{\alpha }\,, \end{aligned}$$


$$\begin{aligned} \delta ^{ij}_\bot =\delta ^{ij}-\hat{z}^i\hat{z}^j\,, \end{aligned}$$

and their estimates are easily obtained:

$$\begin{aligned} |\lambda ^i|\le \mathrm {const}\frac{\langle z\rangle }{\langle \tau \rangle }\,,\quad |\partial _j\lambda ^i|\le \mathrm {const}\frac{|\mathbf {z}|}{\langle \tau \rangle \langle z\rangle }\,,\quad |\partial _j\partial _i\lambda ^k|\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,. \end{aligned}$$

The \(\mu \) matrix introduced in (11), and its derivative, take the form

$$\begin{aligned} \mu =\frac{\langle \tau |z\rangle }{\langle \tau \rangle }\beta \,,\quad \partial _i\mu =\frac{z^i}{\langle \tau \rangle \langle \tau |z\rangle }\beta \,. \end{aligned}$$

The commutator, and the anti-commutator of matrices \(\lambda ^i\), the latter defining the form \(\rho ^{ij}\) introduced in (22), yield

$$\begin{aligned} {[}\lambda ^i,\lambda ^j]&=\frac{\langle \tau |z\rangle ^2}{\langle \tau \rangle ^4}[\alpha _\bot ^i,\alpha _\bot ^j] +2\frac{\langle \tau |z\rangle \langle z\rangle }{\langle \tau \rangle ^3}\alpha _z(\hat{z}^i\alpha _\bot ^j-\hat{z}^j\alpha _\bot ^i)\,,\nonumber \\ \rho ^{ij}&=\tfrac{1}{2}[\lambda ^i,\lambda ^j]_+=-g_{\tau \tau }g^{ij} =\frac{\langle \tau |z\rangle ^2}{\langle \tau \rangle ^4}\delta _{\bot }^{ij}+\frac{\langle z\rangle ^2}{\langle \tau \rangle ^2}\hat{z}^i\hat{z}^j\nonumber \\&=\frac{1}{\langle \tau \rangle ^2}\Big [\Big (1+\frac{|\mathbf {z}|^2}{\langle \tau \rangle ^2}\Big )\delta _{\bot }^{ij} +\langle z\rangle ^2\hat{z}^i\hat{z}^j\Big ]\,. \end{aligned}$$

Using the latter form, we find for the \(\tau \)-derivative of \(\rho \)

$$\begin{aligned} \dot{\rho }^{ij} =-\frac{2\tau }{\langle \tau \rangle ^2}\rho ^{ij}-\frac{2\tau |\mathbf {z}|^2}{\langle \tau \rangle ^6}\delta _{\bot }^{ij}\,, \end{aligned}$$

from which, in the sense of quadratic forms,

$$\begin{aligned} {{\,\mathrm{sgn}\,}}(\tau )\dot{\rho }^{ij} \le -\frac{2|\tau |}{\langle \tau \rangle ^2}\rho ^{ij}\,, \end{aligned}$$

an inequality needed in Lemma 8. The expression first appearing in (45) and needed in the proof of Theorem 7 takes the form

$$\begin{aligned} \partial _i\rho ^{ij}=2\frac{\tau ^2+\langle \tau \rangle ^2}{\langle \tau \rangle ^4}z^j\,. \end{aligned}$$

The square root of the form \(\rho \), introduced in (42), is explicitly

$$\begin{aligned} s^{ij}=\frac{\langle \tau |z\rangle }{\langle \tau \rangle ^2}\delta _{\bot }^{ij}+\frac{\langle z\rangle }{\langle \tau \rangle }\hat{z}^i\hat{z}^j\,. \end{aligned}$$

The quantity introduced in (39), and its properties, are:

$$\begin{aligned} \Lambda _j&=\frac{|\mathbf {z}|}{2\langle \tau |z\rangle }\Big (\frac{\tau ^2}{\langle \tau \Vert z\rangle } -\frac{\langle \tau \rangle \langle z\rangle }{\langle \tau |z\rangle ^2}\Big )i\alpha _z\alpha _\bot ^j\,, \end{aligned}$$
$$\begin{aligned} s^{ij}\Lambda _j&=\frac{|\mathbf {z}|}{2\langle \tau \rangle ^2}\left( \frac{\tau ^2}{\langle \tau \Vert z\rangle } -\frac{\langle \tau \rangle \langle z\rangle }{\langle \tau |z\rangle ^2}\right) i\alpha _z\alpha _\bot ^i\,,\nonumber \\ |\Lambda _j|&\le \frac{\langle \tau \rangle }{\langle \tau |z\rangle }\,,\quad |s^{ij}\Lambda _j|\le \frac{1}{\langle \tau \rangle }\,, \quad |\dot{\Lambda }_j|\le \frac{\mathrm {const}}{\langle \tau |z\rangle }\,,\quad |s^{ij}\dot{\Lambda }_j|\le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\,. \end{aligned}$$

The calculation of Q defined in (40) is more tedious, and we write down two of the intermediate steps:

$$\begin{aligned} \tfrac{1}{4}\partial _j\big ([\lambda ^j,\partial \cdot \lambda ]_+\big )&=\frac{\tau ^2}{\langle \tau \rangle ^3\langle \tau \Vert z\rangle }\Big (2\langle z\rangle +\frac{|\mathbf {z}|^2}{\langle z\rangle }\Big ) +\frac{\tau ^2}{\langle \tau \rangle ^2\langle \tau |z\rangle \langle \tau \Vert z\rangle }+\frac{3}{2\langle \tau \rangle ^2}\,,\\ \tfrac{1}{4}(\partial \cdot \lambda )^2&=\frac{\tau ^2\langle z\rangle }{\langle \tau \rangle ^3\langle \tau \Vert z\rangle }-\frac{\tau ^2\langle \tau |z\rangle }{\langle \tau \rangle ^4\langle \tau \Vert z\rangle } +\frac{\tau ^2|\mathbf {z}|^2}{\langle \tau \rangle ^3\langle z\rangle \langle \tau \Vert z\rangle }+\frac{|\mathbf {z}|^2}{4\langle \tau \rangle ^2\langle z\rangle ^2}\,, \end{aligned}$$

which substituted in (40) gives

$$\begin{aligned} 0<Q&=\langle \tau \rangle ^{-2}\Big (\frac{\tau ^2}{\langle \tau \rangle ^2}+\frac{\tau ^2}{\langle \tau |z\rangle \langle \tau \Vert z\rangle } +\frac{1}{4\langle z\rangle ^2}+\frac{5}{4}\Big )\le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\,, \end{aligned}$$
$$\begin{aligned} 0<N&\le \frac{\mathrm {const}}{\langle \tau \rangle ^2}\,. \end{aligned}$$

The following estimates are needed in the proof of Lemma 8

$$\begin{aligned} s^{ij}\partial _j\mu =\frac{z^i\langle z\rangle }{\langle \tau \rangle ^2\langle \tau |z\rangle }\beta \,,\qquad |s^{ij}\partial _j\mu |\le \frac{|\mathbf {z}|}{\langle \tau \rangle ^2}\,. \end{aligned}$$

The matrix \(\nu \) defined by (59), with the use of (48) is shown to satisfy the estimates

$$\begin{aligned} |\nu |\le \frac{\kappa }{\langle \tau \rangle }\,,\quad |s^{ij}\Lambda _j\nu |\le \frac{\kappa }{\langle \tau \rangle ^2}\,, \end{aligned}$$

which are needed in the proof of Lemma 8. To simplify notation of its derivative, let us denote \(\chi (u)=\xi (u)^{-1}\xi '(u)\). Then

$$\begin{aligned} \partial _i\nu&=\frac{1}{\langle \tau \rangle }\Big [\chi (\langle z\rangle )\alpha _\bot ^i +\Big (\chi (\langle z\rangle )+\frac{|\mathbf {z}|^2}{\langle z\rangle }\chi '(\langle z\rangle )\Big )\alpha _z\hat{z}^i\Big ]\,,\\ s^{ij}\partial _j\nu&=\frac{\langle \tau |z\rangle }{\langle \tau \rangle ^3}\chi (\langle z\rangle )\alpha _\bot ^i +\frac{1}{\langle \tau \rangle ^2}\Big (\langle z\rangle \chi (\langle z\rangle )+|\mathbf {z}|^2\chi '(\langle z\rangle )\Big )\alpha _z\hat{z}^i\,. \end{aligned}$$

It follows from (48) that \(\chi (u)\le \kappa /u\) and \(|\chi '(u)|\le \mathrm {const}/u\), so

$$\begin{aligned} |s^{ij}\partial _j\nu |\le \mathrm {const}\frac{|\mathbf {z}|}{\langle \tau \rangle ^2}\,. \end{aligned}$$

The spreading of the past and the future in Minkowski space, in the language of the coordinates \((\tau ,\mathbf {z})\), is given by the following lemma. This result is needed for the description of scattering in Sect. 6.

Lemma 15

The past (resp. future) of the set \(\tau =\tau _0\), \(|\mathbf {z}|\le r_0\) consists of the points \((\tau ,\mathbf {z})\) such that \(\tau <\tau _0\) (resp. \(\tau >\tau _0\)) and \(|\mathbf {z}|\le r\), where

$$\begin{aligned} \begin{aligned} r= & {} r_0[\langle \tau _0\rangle \langle \tau \rangle -\tau _0\tau ]+\langle r_0\rangle |\tau \langle \tau _0\rangle -\langle \tau \rangle \tau _0| \,,\\ \langle r \rangle= & {} \langle r_0\rangle [\langle \tau _0\rangle \langle \tau \rangle -\tau _0\tau ]+r_0|\tau \langle \tau _0\rangle -\langle \tau \rangle \tau _0| \,. \end{aligned} \end{aligned}$$

It follows that

$$\begin{aligned} \langle r \rangle \le \langle r_0\rangle \left\{ \begin{array}{ll} (\langle \tau _0\rangle -\tau _0)(\langle \tau \rangle +\tau )\,, &{} \quad \tau \ge \tau _0\,,\\ (\langle \tau _0\rangle +\tau _0)(\langle \tau \rangle -\tau )\,, &{} \quad \tau \le \tau _0\,. \end{array}\right. \end{aligned}$$

In particular, for \(\tau _0=0\) and \(\tau \in {\mathbb {R}}\) we have

$$\begin{aligned} \langle r \rangle \le 2\langle r_0\rangle \langle \tau \rangle \,. \end{aligned}$$


Simple geometrical analysis shows that the radius r is given by the conditions that the vector \((\tau \langle r \rangle -\tau _0\langle r_0\rangle ,0,0,\langle \tau \rangle r-\langle \tau _0\rangle r_0)\) is lightlike and \(\langle \tau \rangle r-\langle \tau _0\rangle r_0>0\), that is

$$\begin{aligned} \epsilon (\tau \langle r \rangle -\tau _0\langle r_0\rangle )=\langle \tau \rangle r-\langle \tau _0\rangle r_0>0\,, \end{aligned}$$

where \(\epsilon =-1\) (resp. \(\epsilon =+1\)) for past (resp. future). We set \(\tau _0=\sinh (T_0)\), \(r_0=\sinh (R_0)\), \(\tau =\sinh (T)\), \(r=\sinh (R)\), where \(R_0>0\), \(R>0\). Then the equality above is easily transformed to \(\sinh (R-\epsilon T)=\sinh (R_0-\epsilon T_0)\), so \(R-R_0=\epsilon (T-T_0)\). Treating \(R_0\), T and \((T-T_0)\) as independent, we put \(R=R_0+\epsilon (T-T_0)\) and \(T_0=T-(T-T_0)\) in the inequality, which is then transformed to \(\epsilon \sinh (T-T_0)\cosh (R_0+\epsilon T)>0\). Thus \(\epsilon ={{\,\mathrm{sgn}\,}}(T-T_0)\) and \(R=R_0+|T-T_0|\). Substituting this in \(r=\sinh (R)\) and \(\langle r \rangle =\cosh (R)\), one obtains (101). Moreover,

$$\begin{aligned} \langle r \rangle \le \cosh (R_0)\exp |T-T_0| ={\left\{ \begin{array}{ll} \langle r_0\rangle \exp (T)\exp (-T_0)\,, &{} T\ge T_0\,,\\ \langle r_0\rangle \exp (T_0)\exp (-T)\,, &{} T\le T_0\,, \end{array}\right. } \end{aligned}$$

which gives (102). \(\square \)

Finally, we state the following inequalities needed for the estimation of electromagnetic fields, discussed in Appendix H.

Lemma 16

For \(x=(\tau \langle z\rangle ,\langle \tau \rangle \mathbf {z})\) the following holds:

$$\begin{aligned} |x^0|+|\mathbf {x}|+1&\ge \langle \tau \rangle \langle z\rangle \,,\\ \big ||x^0|-|\mathbf {x}|\big |+1&\ge \frac{\langle \tau \rangle ^2+\langle z\rangle ^2}{2\langle \tau \rangle \langle z\rangle }\,. \end{aligned}$$


The first inequality follows from

$$\begin{aligned} (|\tau |\langle z\rangle +\langle \tau \rangle |\mathbf {z}|+1)^2\ge |\tau |^2\langle z\rangle ^2+\langle \tau \rangle ^2|\mathbf {z}|^2+1\ge \langle \tau \rangle ^2\langle z\rangle ^2\,, \end{aligned}$$

and the second from

$$\begin{aligned} \big ||\tau |\langle z\rangle -\langle \tau \rangle |\mathbf {z}|\big |+1&=\frac{|\tau ^2\langle z\rangle ^2-\langle \tau \rangle ^2|\mathbf {z}|^2|}{|\tau |\langle z\rangle +\langle \tau \rangle |\mathbf {z}|}+1 \ge \frac{|\langle \tau \rangle ^2-\langle z\rangle ^2|+2\langle \tau \rangle \langle z\rangle }{2\langle \tau \rangle \langle z\rangle }\\&\ge \frac{[(\langle \tau \rangle ^2-\langle z\rangle ^2)^2+4\langle \tau \rangle ^2\langle z\rangle ^2]^{\frac{1}{2}}}{2\langle \tau \rangle \langle z\rangle } =\frac{\langle \tau \rangle ^2+\langle z\rangle ^2}{2\langle \tau \rangle \langle z\rangle }\,. \end{aligned}$$

\(\square \)

Appendix E. Free Dirac Equation: The Fourier Representation

Here, mainly to fix notation, we write down the integral Fourier form of the solution of the free Dirac equation for a given initial condition.

Let H denote the unit hyperboloid \(v^2=1\), \(v^0>0\), with the Lorentz-invariant measure \(\hbox {d}\mu (v)=d^3v/v^0\). Moreover, introduce the projection operators in \({\mathbb {C}}^4\) defined by \(P_\pm (v) =\tfrac{1}{2}(\mathbf {1}\pm v\cdot \gamma )\). For \(f\in {\mathcal {S}}(H,{\mathbb {C}}^4)\) the function

$$\begin{aligned} \psi (x)=(2\pi )^{-\frac{3}{2}}\int \big [e^{-ix\cdot v}P_+(v) -e^{ix\cdot v}P_-(v)\big ]f(v) \hbox {d}\mu (v) \end{aligned}$$

is a smooth solution of the Dirac equation with the initial condition

$$\begin{aligned} \psi (0,\mathbf {x})=(2\pi )^{-\frac{3}{2}}\int \big [e^{i\mathbf {x}\cdot \mathbf {v}}P_+(v) -e^{-i\mathbf {x}\cdot \mathbf {v}}P_-(v)\big ]f(v) \hbox {d}\mu (v) \end{aligned}$$

in the space \({\mathcal {S}}({\mathbb {R}}^3,{\mathbb {C}}^4)\) (where \(\mathbf {x}\cdot \mathbf {v}\) is the Euclidean product of the space parts of x and v). The transformation

$$\begin{aligned} {\mathcal {S}}(H,{\mathbb {C}}^4)\ni f\mapsto {\mathcal {F}}f=\psi (0,.)\in {\mathcal {S}}({\mathbb {R}}^3,{\mathbb {C}}^4) \end{aligned}$$

extends to an isometric transformation from the Hilbert space \(L^2_\gamma (H)\) of four-spinor functions with the scalar product \((f_1,f_2)=\int \overline{f_1(v)}\gamma \cdot v f_2(v) \hbox {d}\mu (v)\) into \({\mathcal {H}}={\mathbb {C}}^4\otimes L^2({\mathbb {R}}^3)\). For \(\psi (0,\mathbf {x})\in {\mathcal {S}}({\mathbb {R}}^3,{\mathbb {C}}^4)\) the inverse transformation is given by

$$\begin{aligned} f(v)=(2\pi )^{-\frac{3}{2}}\int \big [e^{-i\mathbf {v}\cdot \mathbf {x}}P_+(v) +e^{i\mathbf {v}\cdot \mathbf {x}}P_-(v)\big ]\gamma ^0\psi (0,\mathbf {x})d^3x\,, \end{aligned}$$

which again extends to an isometric transformation on \({\mathbb {C}}^4\otimes L^2({\mathbb {R}}^3)\). Therefore,

$$\begin{aligned} {\mathcal {F}}: L^2_\gamma (H)\mapsto {\mathcal {H}}\end{aligned}$$

is a unitary transformation, and again \({\mathcal {F}}^{-1}:{\mathcal {S}}({\mathbb {R}}^3,{\mathbb {C}}^4)\mapsto {\mathcal {S}}(H,{\mathbb {C}}^4)\). Thus, for \(\varphi \in {\mathcal {S}}({\mathbb {R}}^3,{\mathbb {C}}^4)\) the solution of the free Dirac equation with the initial condition \(\psi (0,\mathbf {x})=\varphi (\mathbf {x})\) is represented by

$$\begin{aligned} \psi (x)=(2\pi )^{-\frac{3}{2}}\int \big [e^{-ix\cdot v}P_+(v) -e^{ix\cdot v}P_-(v)\big ][{\mathcal {F}}^{-1}\varphi ](v)\hbox {d}\mu (v)\,. \end{aligned}$$

Appendix F. Solutions of the Wave Equation

In this and the following two sections we shall use the language more extensively explained in [10], where also bibliographic sources are indicated. Moreover, we shall repeatedly make use of the following estimates, which we write down here for the convenience of the reader, and whose proof may be found in [8], Appendix B.

Lemma 17

Let \(a>0\), \(b\ge 0\), \(c>0\) and \(\alpha >0\). Then

$$\begin{aligned} \int _0^c(a+bu)^{-\alpha }du<\left\{ \begin{array}{llll} \frac{\alpha }{\alpha -1}\frac{c}{a^{\alpha -1}(a+bc)}\,,&{} \quad \alpha >1\,,\\ \frac{1}{1-\alpha }\frac{c}{(a+bc)^\alpha }\,,&{} \quad \alpha <1\,. \end{array}\right. \end{aligned}$$

A large class of the solutions of the homogeneous wave equation may be represented in the following form

$$\begin{aligned} \varphi (x)=-\frac{1}{2\pi }\int \dot{F}(x\cdot l,l)d^2l\,, \end{aligned}$$

where \(\dot{F}(s,l)=\partial _sF(s,l)\), with \(F(s,l)\) a homogeneous of degree \(-1\) function of a real variable s and null (light-like), future-pointing vectors l. Consequently, \(\dot{F}(s,l)\) is homogeneous of degree \(-2\), and at this stage this function is all one needs for the above representation, but later the function \(F\) will play a role. For \(\dot{F}\) in the class \(C^0\) the field \(\varphi \) is in \(C^0\) and satisfies the wave equation in the distributional sense. For \(F\) in the class \(C^3\), the field is in the class \(C^2\) and the wave equation is satisfied in the direct sense. The measure \(d^2l\) is the Lorentz invariant measure on the set of null directions, applicable to functions of l, homogeneous of degree \(-2\). If f(l) is such function, and \((e_0,\ldots ,e_3)\) is any Minkowski basis, then

$$\begin{aligned} \int f(l) d^2l= \int f(k)\hbox {d}\Omega (\mathbf {k})\,, \end{aligned}$$

where \(k=l/l^0\), \(\hbox {d}\Omega (\mathbf {k})\) is the spherical angle measure in this coordinate system, and the value of this integral does not depend on the choice of basis. The operators of intrinsic differentiation in the cone are given by \(L_{ab}=l_a(\partial /\partial l^b)-l_b(\partial /\partial l^a)\), and satisfy

$$\begin{aligned} \int L_{ab}f(l)d^2l=0. \end{aligned}$$

Let \(Z^a(l)\) be a vector function, such that \(l\cdot Z(l)=0\). If one extends Z(l) to the neighborhood of the cone with the preservation of this property, then

$$\begin{aligned} L_{0a}[Z^a(l)/l^0]=\partial \cdot Z(l)\,, \end{aligned}$$

independently of any particular extension, and where after differentiation the rhs is restricted to the cone. Therefore, the rhs is well defined and independent of the choice of basis. Moreover, let in addition Z(l) be homogeneous of degree \(-1\), then

$$\begin{aligned} \int \partial \cdot Z(l)\, d^2l=0\,. \end{aligned}$$

We shall use the coordinates \((u,\phi )\) defined by

$$\begin{aligned} l=l^0k\,,\quad k=e_0+e_3u+\sqrt{1-u^2}R(\phi )\,, \quad R(\phi )=e_1\cos \phi +e_2\sin \phi \,. \end{aligned}$$

We note for later use that

$$\begin{aligned} \ddot{R}(\phi )\equiv \partial _\phi ^2 R(\phi )=-R(\phi )\,. \end{aligned}$$

We choose \(e_3=\mathbf {x}/|\mathbf {x}|\), and then

$$\begin{aligned} \varphi (x)=-\frac{1}{2\pi }\int F(x^0-|\mathbf {x}|u,k(u,\phi ))\hbox {d}u\,\hbox {d}\phi \,. \end{aligned}$$

The following theorem gives the properties of decay in spacetime of the solutions (105). In the estimates (110), (112), (113) below, and other of similar type to appear later, one should understand that vectors l are scaled to \(l^0=1\). The form of such estimates does not depend on the choice of a basis, only the bounding constants change. Recall that \(\theta \) is the Heaviside step function.

Theorem 18

Let \(\varphi (x)\) be represented by formula (105), with \(\dot{F}(s,l)\) and \(F(s,l)\) as described there, and let \(\varepsilon \in (0,1)\).

  1. (i)

    If \(\dot{F}(s,l)\) is of class \(C^0\), and for some \(n\in \{0,1,\ldots \}\) is bounded by

    $$\begin{aligned} |\dot{F}(s,l)|\le \frac{\mathrm {const}}{(|s|+1)^{n+\varepsilon }}\,, \end{aligned}$$

    then \(\varphi (x)\) is a \(C^0\) field and the following estimates are satisfied

    $$\begin{aligned} |\varphi (x)|\le \mathrm {const}{\left\{ \begin{array}{ll} \displaystyle \frac{1}{(|x|+1)^\varepsilon }\,,&{} n=0\,,\\ \displaystyle \frac{1}{|x|+1}\bigg [\frac{\theta (x^2)}{(|x^0|-|\mathbf {x}|+1)^{n-1+\varepsilon }} +\theta (-x^2)\bigg ]\,,&{} n\ge 1\,. \end{array}\right. } \end{aligned}$$
  2. (ii)

    If \(F(s,l)\) is of class \(C^1\), and for some \(n\in \{1,2,\ldots \}\) is bounded by

    $$\begin{aligned} |L^\alpha F(s,l)|\le \frac{\mathrm {const}}{(|s|+1)^{n-1+\varepsilon }}\,,\quad |\alpha |\le 1\,, \end{aligned}$$

    then \(\varphi (x)\) is a \(C^0\) field and the following stronger estimates hold

    $$\begin{aligned}&|\varphi (x)|\le \frac{\mathrm {const}}{(|x|+1)(||x^0|-|\mathbf {x}||+1)^\varepsilon }\,, \qquad \varepsilon <\tfrac{1}{2}\,,\quad n=1\,,\\&|\varphi (x)|\le \frac{\mathrm {const}}{|x|+1}\bigg [\frac{1}{(||x^0|-|\mathbf {x}||+1)^{n-1+\varepsilon }} +\frac{\theta (-x^2)}{(|x|+1)^\frac{1}{2}(|\mathbf {x}|-|x^0|+1)^\frac{1}{2}}\bigg ]\,,\\&\qquad n\ge 2\,. \end{aligned}$$
  3. (iii)

    If \(F(s,l)\) is of class \(C^2\), and for some \(n\in \{1,2,\ldots \}\) is bounded by

    $$\begin{aligned} |L^\alpha F(s,l)|\le \frac{\mathrm {const}}{(|s|+1)^{n-1+\varepsilon }}\,,\quad |\alpha |\le 2\,, \end{aligned}$$

    then \(\varphi (x)\) is a \(C^1\) field and the following stronger estimates hold

    $$\begin{aligned} |\varphi (x)|\le {\left\{ \begin{array}{ll} \displaystyle \frac{\mathrm {const}}{(|x|+1)(||x^0|-|\mathbf {x}||+1)^\varepsilon }\,,&{} n=1\,,\\ \displaystyle \frac{\mathrm {const}}{|x|+1}\bigg [\frac{1}{(||x^0|-|\mathbf {x}||+1)^{n-1+\varepsilon }} +\frac{\theta (-x^2)}{|x|+1}\bigg ]\,,&{} n\ge 2\,. \end{array}\right. } \end{aligned}$$

Note that due to the properties of homogeneity of \(\dot{F}\) and \(F\), the bound (110) is satisfied in all three cases (with \(n\ge 2\) in (ii) and \(n\ge 1\) in (iii)). Case (ii) for \(n=1\) could also be treated, but the result is more involved, and we do not need it anyway.


(i) Using the bound (110) and the integral (109), after the change of variables \(\sigma ={{\,\mathrm{sgn}\,}}(x^0)(x^0-|\mathbf {x}|u)\), we obtain

$$\begin{aligned} |\varphi (x)|&\le \frac{\mathrm {const}}{|\mathbf {x}|}\int \limits _{|x^0|-|\mathbf {x}|}^{|x^0|+|\mathbf {x}|} \frac{\hbox {d}\sigma }{(|\sigma |+1)^{n+\varepsilon }}\\&\le \frac{\mathrm {const}}{|\mathbf {x}|} {\left\{ \begin{array}{ll} \displaystyle \int _0^{2|\mathbf {x}|}\frac{\hbox {d}\xi }{(\xi +|x^0|-|\mathbf {x}|+1)^{n+\varepsilon }}\,,&{} x^2\ge 0\,,\\ \displaystyle \int _0^{|x^0|+|\mathbf {x}|}\frac{\hbox {d}\sigma }{(\sigma +1)^{n+\varepsilon }}\,,&{}x^2\le 0\,. \end{array}\right. } \end{aligned}$$

Lemma 17 gives now the bounds (111).

(ii) and (iii) These cases give more restrictive bounds only in the region \(x^2\le 0\), \(|\mathbf {x}|\ge 1\), in which \(|\mathbf {x}|\ge \mathrm {const}(|x|+1)\). We use the equalities (114) and (115), respectively, of Lemma 19 below. To shorten notation, we denote the exponent in (112) and (113) by \(q=n-1+\varepsilon \). For \(x^2\le 0\) we then have

$$\begin{aligned} |\varphi (x)|\le \frac{\mathrm {const}}{|\mathbf {x}|}\bigg [\frac{1}{(|\mathbf {x}|-|x^0|+1)^q} +\int \limits _{-1}^1\left\{ \begin{matrix} (1-u^2)^{-\frac{1}{2}}\\ 1\end{matrix}\right\} \frac{du}{(||x^0|+|\mathbf {x}|u|+1)^q}\bigg ]\,, \end{aligned}$$

where \((1-u^2)^{-\frac{1}{2}}\) applies in case (ii) and 1 in case (iii). Integration in case (iii) is done as in case (i) and the thesis of this case follows. The integral in case (ii), after the substitution \(1-u=t\), is bounded by 2 times the integral

$$\begin{aligned}&\int _0^1 \frac{dt}{f(a,b,t)}\,,\quad f(a,b,t)=t^\frac{1}{2}(|bt-a|+1)^q\,,\\&0\le a=|\mathbf {x}|-|x^0|\le |\mathbf {x}|=b\ge \mathrm {const}(|x|+1)\,. \end{aligned}$$

For \(t\in [0,a/(2b)]\) we have \(f(a,b,t)\ge \mathrm {const}\, t^\frac{1}{2}(a+1)^q\), so

$$\begin{aligned} \int _0^\frac{a}{2b}\frac{dt}{f(a,b,t)} \le \mathrm {const}\,b^{-\frac{1}{2}}(a+1)^{\frac{1}{2}-q} \end{aligned}$$

For \(t\in [a/(2b),2a/b]\) we have \(f(a,b,t)\ge \mathrm {const}\, a^\frac{1}{2}b^{-\frac{1}{2}}(|bt-a|+1)^q\). The integral is now split into intervals [a/(2b), a/b] and [a/b, 2a/b] and evaluated with the use of Lemma 17. This gives

$$\begin{aligned} \int _\frac{a}{2b}^\frac{2a}{b}\frac{dt}{f(a,b,t)} \le \mathrm {const}\,b^{-\frac{1}{2}}\times {\left\{ \begin{array}{ll} \displaystyle (a+1)^{\frac{1}{2}-q}\,, &{} q<1\,\\ \displaystyle (a+1)^{-\frac{1}{2}}\,, &{} q>1\,. \end{array}\right. } \end{aligned}$$

Finally, if \(2a/b\le 1\), then for \(t\in [2a/b,1]\) we have \(f(a,b,2a/b+s)\ge \mathrm {const}\,s^\frac{1}{2}(a+1+bs)^q\), and by changing further the integration variable to \(r=b(a+1)^{-1}s\) one finds

$$\begin{aligned} \int _\frac{2a}{b}^1\frac{dt}{f(a,b,t)} \le \mathrm {const}\times {\left\{ \begin{array}{ll} \displaystyle b^{-q}\,, &{} q<\frac{1}{2}\,,\\ \displaystyle b^{-\frac{1}{2}}(a+1)^{\frac{1}{2}-q}\,, &{} q>\frac{1}{2}\,. \end{array}\right. } \end{aligned}$$

Substituting the values of a and b and noting that \(b^{-\frac{1}{2}}(a+1)^{\frac{1}{2}-q}\le \mathrm {const}\, b^{-q}\) for \(q<\frac{1}{2}\), \(b\ge 1\), one obtains the thesis. \(\square \)

Lemma 19

Let \(F(s,l)\) be homogeneous of degree \(-1\) and choose \(e_3\) in (107) as \(e_3=\mathbf {x}/|\mathbf {x}|\). Then the following identities hold

$$\begin{aligned}&\frac{1}{2\pi }\int \dot{F}(x\cdot l,l)d^2l +\frac{1}{|\mathbf {x}|}\big [F(x^0-|\mathbf {x}|,(1,\mathbf {x}/|\mathbf {x}|))-F(x^0+|\mathbf {x}|,(1,-\mathbf {x}/|\mathbf {x}|))\big ]\nonumber \\&\quad =\frac{1}{2\pi |\mathbf {x}|}\int (l^0)^{-1}\big [L'_{03} -\frac{u}{\sqrt{1-u^2}}R^iL'_{0i}\big ]F(x\cdot l,l)d^2l \end{aligned}$$
$$\begin{aligned}&\quad =\frac{1}{2\pi |\mathbf {x}|}\int (l^0)^{-1}\big [L'_{03}-u\dot{R}^i\dot{R}^j L'_{0i}L'_{0j}\big ]F(x\cdot l,l)d^2l\,, \end{aligned}$$

where prime at \(L'_{ab}\) indicates that these operators act only on the second argument in \(F(x\cdot l,l)\). The equalities (114) and (115) hold for \(F\) of class \(C^1\) and \(C^2\), respectively.


Using the form (107), we find

$$\begin{aligned} \partial _u=\frac{1}{l^0}\frac{\partial l^a}{\partial u}L_{0a}=L_{03}-\frac{u}{\sqrt{1-u^2}}R^iL_{0i}\,. \end{aligned}$$

We apply this identity to \(F(s,l)\), then set \(s=x\cdot l\) and divide by \(l^0\). As \(l^0\) does not depend on u, the result may be written as

$$\begin{aligned} \partial _u\big [(l^0)^{-1}F(x\cdot l,l)\big ] +|\mathbf {x}|\dot{F}(x\cdot l,l) =(l^0)^{-1}\Big [L'_{03}-\frac{u}{\sqrt{1-u^2}}R^iL'_{0i}\Big ]F(x\cdot l,l)\,, \end{aligned}$$

(primes at \(L'\) as in the thesis). We integrate this identity over l’s. The first term on the lhs gives

$$\begin{aligned}&\int \partial _u F\big (x^0-|\mathbf {x}|u,k(u,\phi )\big )\hbox {d}u\,\hbox {d}\phi \\&\quad =2\pi \big [ F\big (x^0-|\mathbf {x}|,(1,\mathbf {x}/|\mathbf {x}|)\big ) -F\big (x^0+|\mathbf {x}|,(1,-\mathbf {x}/|\mathbf {x}|)\big )\big ]\,. \end{aligned}$$

Dividing by \(2\pi |\mathbf {x}|\) and rearranging the terms, we obtain equality (114). Next, using (108) we transform the second term of the integrand as

$$\begin{aligned}&-\frac{u}{l^0\sqrt{1-u^2}}R^iL'_{0i}F(x\cdot l,l) =\partial _\phi \Big [\frac{u}{l^0\sqrt{1-u^2}}\dot{R}^iL'_{0i}F(x\cdot l,l)\Big ]\\&\quad -\frac{u}{l^0\sqrt{1-u^2}}\dot{R}^i\partial '_\phi \big [L'_{0i}F(x\cdot l,l)\big ]\,, \end{aligned}$$

where we have taken into account that \(l^0\) and \(x\cdot l\) do not depend on \(\phi \). The first term on the rhs vanishes when integrated, while in the second term we use another differential identity:

$$\begin{aligned} \partial _\phi =\frac{1}{l^0}\frac{\partial l^a}{\partial \phi }L_{0a}=\sqrt{1-u^2}\dot{R}^aL_{0a}\,. \end{aligned}$$

Substituting the result into the integral, we obtain (115). \(\square \)

Appendix G. Sources

Let \(\rho (x)\) be a continuous field on Minkowski space. We formulate two decay conditions, with some fixed numbers \(\omega >0\) and \(\varepsilon \in (0,1)\):

$$\begin{aligned}&\text {(A)}\qquad |\rho (x)|\le \frac{\mathrm {const}}{(|x|+1)^\omega }\,, \end{aligned}$$
$$\begin{aligned}&\text {(B)}\qquad |\rho (x)|\le \frac{\mathrm {const}}{(|x|+1)^3}\bigg [\theta (x^2) +\frac{1}{(|x|+1)^\varepsilon }\bigg ]\,. \end{aligned}$$

We want to consider the existence and decay properties of the advanced and retarded solutions

$$\begin{aligned} \varphi _{\mathrm{ret}/\mathrm{adv}}(x)=4\pi \int D_{\mathrm{ret}/\mathrm{adv}}(x-y)\rho (y)dy \end{aligned}$$

of the equation

$$\begin{aligned} \Box \,\varphi =4\pi \rho \,, \end{aligned}$$

and of their difference, the radiation field

$$\begin{aligned} \varphi _\mathrm{rad}(x)=\varphi _\mathrm{ret}(x)-\varphi _\mathrm{adv}(x)=4\pi \int D(x-y)\rho (y) \hbox {d}y =-\frac{1}{2\pi }\int \hbox {d}V(x\cdot l,l)d^2l\,, \end{aligned}$$

where (see [10])

$$\begin{aligned} V(s,l)=\int \delta (s-x\cdot l)\rho (x)\hbox {d}x\,. \end{aligned}$$

We use the standard notation

$$\begin{aligned}&D_{\mathrm{ret}/\mathrm{adv}}(x)=\frac{1}{2\pi }\theta (\pm x^0)\delta (x^2)\,,\\&D(x)=D_\mathrm{ret}(x)-D_\mathrm{adv}(x)=\frac{1}{2\pi }{{\,\mathrm{sgn}\,}}(x^0)\delta (x^2)\,. \end{aligned}$$

Lemma 20

  1. (i)

    If \(\rho (x)\) satisfies condition (A) for \(\omega >2\), then \(\varphi _{\mathrm{adv}/\mathrm{ret}}(x)\) are well defined and continuous, and the following bound is satisfied:

    $$\begin{aligned} |\varphi _\mathrm{adv}(x)|\le \frac{\mathrm {const}}{(|x|+1)^{\omega -2}}\qquad \text {for}\quad x^0\ge 0\,; \end{aligned}$$

    similarly for the retarded solution in the half-space \(x^0\le 0\).

  2. (ii)

    If in addition to (i) \(\rho (x)\) is of class \(C^1\), and \(\nabla \rho (x)\) and \(x\cdot \nabla \rho (x)\) satisfy condition (A) for \(\omega >2\), then \(\varphi _{\mathrm{adv}/\mathrm{ret}}(x)\) are of class \(C^1\) and

    $$\begin{aligned} x\cdot \nabla \varphi _{\mathrm{adv}/\mathrm{ret}}(x)=\int D_{\mathrm{adv}/\mathrm{ret}}(x-y)(y\cdot \nabla +2)\rho (y)\,\hbox {d}y\,. \end{aligned}$$
  3. (iii)

    If \(\rho (x)\) satisfies condition (A) for \(\omega >3\), or condition (B), then V(sl) is well defined and continuous and satisfies the bound

    $$\begin{aligned} |V(s,l)|\le \frac{\mathrm {const}}{(|s|+1)^{\omega -3}}\,, \end{aligned}$$

    where \(\omega =3\) in case of condition (B).

  4. (iv)

    If in addition to (iii) \(\rho (x)\) is of class \(C^1\), and \(\nabla \rho (x)\) and \(x\cdot \nabla \rho (x)\) satisfy condition (A) for \(\omega >3\) or condition (B), then V(sl) is of class \(C^1\) and

    $$\begin{aligned} s\partial _s\int \delta (s-x\cdot l)\rho (x)\,\hbox {d}x =\int \delta (s-x\cdot l)(x\cdot \nabla +3)\rho (x)\,\hbox {d}x\,. \end{aligned}$$


(i) The use of the integral formula (118) and the bound (116) gives

$$\begin{aligned} |\varphi _\mathrm{adv}(x)|=\bigg |\int \frac{\rho (x^0+|\mathbf {z}|,\mathbf {x}+\mathbf {z})}{|\mathbf {z}|}d^3z\bigg | \le \mathrm {const}\int _0^\infty I(x^0,|\mathbf {x}|,r)\hbox {d}r\,, \end{aligned}$$

where we have denoted \(r=|\mathbf {z}|\), and

$$\begin{aligned} I(x^0,|\mathbf {x}|,r)=\int _0^2\frac{r\,d u}{[1+(r+x^0)^2+(r-|\mathbf {x}|)^2+2|\mathbf {x}|ru]^{\frac{\omega }{2}}} \end{aligned}$$

with \(u=\cos [\measuredangle (\mathbf {z},\mathbf {x})]+1\). Using the estimate (104), we find

$$\begin{aligned} I(x^0,|\mathbf {x}|,r) \le \frac{\mathrm {const}\,r}{[1+(r+x^0)^2+(r+|\mathbf {x}|)^2][1+(r+x^0)^2 +(r-|\mathbf {x}|)^2]^{\frac{\omega }{2}-1}}\,, \end{aligned}$$

so the integral on the rhs of (122) is well defined, and the continuity of \(\varphi _\mathrm{adv}(x)\) easily follows. Moreover, for \(x^0\ge 0\) it follows

$$\begin{aligned} I(x^0,|\mathbf {x}|,r)\le \frac{\mathrm {const}\,r}{[1+x^0+|\mathbf {x}|+r]^2[1+r+x^0+|r-|\mathbf {x}||]^{\omega -2}}\,, \end{aligned}$$

and then

$$\begin{aligned} \int _0^{|\mathbf {x}|}I(x^0,|\mathbf {x}|,r)\hbox {d}r\le \frac{\mathrm {const}}{[1+x^0+|\mathbf {x}|]^\omega }\int _0^{|\mathbf {x}|}r\hbox {d}r \le \frac{\mathrm {const}}{(|x|+1)^{\omega -2}}\,, \end{aligned}$$

and (with \(\sigma =r-|\mathbf {x}|\))

$$\begin{aligned} \int _{|\mathbf {x}|}^\infty I(x^0,|\mathbf {x}|,r)\hbox {d}r \le \mathrm {const}\int _0^\infty \frac{\hbox {d}\sigma }{(\sigma +x^0+| \mathbf {x}|+1)^{\omega -1}}\le \frac{\mathrm {const}}{(|x|+1)^{\omega -2}}\,, \end{aligned}$$

which ends the proof of (i).

(ii) We integrate the identity

$$\begin{aligned} \frac{x\cdot \nabla _x\rho (x^0+|\mathbf {z}|,\mathbf {x}+\mathbf {z})}{|\mathbf {z}|}&=\frac{(y\cdot \nabla _y+2)\rho (y)_{|y=(x^0+|\mathbf {z}|,\mathbf {x}+\mathbf {z})}}{|\mathbf {z}|}\\&\quad -\frac{\partial }{\partial z^i}\Big (\frac{z^i\rho (x^0+|\mathbf {z}|,\mathbf {x}+\mathbf {z})}{|\mathbf {z}|}\Big ) \end{aligned}$$

over \(|\mathbf {z}|\le R\) and transform the integral of the second term on the rhs to a surface integral. This surface contribution is bounded by \(RI(x^0,|\mathbf {x}|,R)\), so it vanishes in the limit \(R\rightarrow \infty \). Formula (119) follows.

(iii) We choose the space basis in which \(e_3=\mathbf {l}\), introduce new variables \(\alpha =x^0-x^3\), \(\beta =x^0+x^3\), and denote \(\mathbf {x}_\bot =x^1e_1+x^2e_2\). Conditions (116) and (117) may be then written as

$$\begin{aligned}&\text {(A)}\qquad |\rho (x)|\le \frac{\mathrm {const}}{(1+\alpha ^2+\beta ^2+|\mathbf {x}_\bot |^2)^\frac{\omega }{2}}\,,\\&\text {(B)} \qquad |\rho (x)|\le \frac{\mathrm {const}\,\theta (\alpha \beta -|\mathbf {x}_\bot |^2)}{(1+\alpha ^2+\beta ^2 +|\mathbf {x}_\bot |^2)^\frac{3}{2}} +\frac{\mathrm {const}}{(1+\alpha ^2+\beta ^2+|\mathbf {x}_\bot |^2)^\frac{3+\varepsilon }{2}}\,, \end{aligned}$$

and V takes the form \(V(s,l)=\frac{1}{2}\int \rho (\alpha = s,\beta , \mathbf {x}_\bot )\hbox {d}\beta d^2x_\bot \). For the case (A) with \(\omega >3\) we then have

$$\begin{aligned} |V(s,l)|&\le \mathrm {const}\int \frac{\hbox {d}\beta d^2x_\bot }{(1+s^2+\beta ^2+|\mathbf {x}_\bot |^2)^\frac{\omega }{2}}\\&\le \mathrm {const}\int _0^\infty \frac{r^2\hbox {d}r}{(1+|s|+r)^\omega }=\frac{\mathrm {const}}{(1+|s|)^{\omega -3}}\,. \end{aligned}$$

For the case (B) it is now sufficient to restrict integral to the region \(x^2\ge 0\), which leads to the estimation of the integral \(\int K(\beta )\hbox {d}\beta \), with

$$\begin{aligned} K(\beta )&=\int \frac{\theta (s\beta -|\mathbf {x}_\bot |^2)d^2x_\bot }{(1+s^2+\beta ^2+|\mathbf {x}_\bot |^2)^\frac{3}{2}} =\mathrm {const}\,\theta (s\beta )\int _0^{s\beta }\frac{\hbox {d}\xi }{(1+s^2+\beta ^2+\xi )^\frac{3}{2}}\\&\le \mathrm {const}\frac{\theta (s\beta )s\beta }{(1+s^2+\beta ^2)^\frac{3}{2}} \le \mathrm {const}\frac{\theta (s\beta )|s|}{(1+|s|+|\beta |)^2}\,. \end{aligned}$$

This leads to

$$\begin{aligned} \int K(\beta )\hbox {d}\beta \le \mathrm {const}\,|s|\int _0^\infty \frac{\hbox {d}\sigma }{(1+|s|+\sigma )^2}\le \mathrm {const}\,. \end{aligned}$$

(iv) We integrate the identity

$$\begin{aligned} s\partial _s\rho (s,\beta ,\mathbf {x}_\bot )&=(x\cdot \nabla +3)\rho (x)_{|\alpha =s}\\&\quad -\partial _\beta \big (\beta \rho (s,\beta ,\mathbf {x}_\bot )\big ) -\sum _{i=1}^2\partial _{x^i}\big (x^i\rho (s,\beta ,\mathbf {x}_\bot )\big ) \end{aligned}$$

over the region \(|\mathbf {x}_\bot |\le R\), \(|\beta |\le B\). The contribution of the surface terms is bounded by

$$\begin{aligned} \sum _{\epsilon =\pm }B\int _{{\mathbb {R}}^2}|\rho (s,\pm B,\mathbf {x}_\bot )|d^2x_\bot +R^2\int _{{\mathbb {R}}\times S^1}|\rho (s,\beta ,R\mathbf {n})|\hbox {d}\beta \hbox {d}\Omega (\mathbf {n})\,, \end{aligned}$$

where \(\hbox {d}\Omega (\mathbf {n})\) is the angle measure on the unit circle \(S^1\). Condition (A) with \(\omega >3\) is stronger than condition (B), so it is sufficient to consider the latter, in the form given in the proof of (iii) above. Then, the first contribution in (123) is bounded by

$$\begin{aligned}&\mathrm {const}\bigg [\int _0^{|s|B}\frac{B\hbox {d}\zeta }{(1+s^2+B^2+\zeta )^\frac{3}{2}} +\int _0^\infty \frac{B\hbox {d}\zeta }{(1+s^2+B^2+\zeta )^\frac{3+\epsilon }{2}}\bigg ]\\&\quad \le \frac{\mathrm {const}\, |s|B^2}{(1+|s|+B)^3}+\frac{\mathrm {const}\, B}{(1+|s|+B)^{1+\epsilon }}\,, \end{aligned}$$

where we have substituted \(|\mathbf {x}_\bot |^2=\zeta \). The second term in (123) is bounded by

$$\begin{aligned}&\mathrm {const}\bigg [\int _{R^2/|s|}^\infty \frac{R^2 \hbox {d}\beta }{(1+|s|+R+\beta )^3} +\int _0^\infty \frac{R^2 \hbox {d}\beta }{(1+|s|+R+\beta )^{3+\epsilon }}\bigg ]\\&\quad \le \frac{\mathrm {const}\,R^2}{(1+|s|+R+R^2/|s|)^2}+ \frac{\mathrm {const}\,R^2}{(1+|s|+R)^{2+\epsilon }}\,. \end{aligned}$$

All these bounds vanish in the limit \(B,R\rightarrow \infty \), which leads to the identity (121). \(\square \)

Appendix H. Electromagnetic Fields and Lorenz Potentials

We apply now the results of the discussion of the last two sections to the case of electromagnetic fields. We start with the Lorenz potentials of free fields, which we assume to have been produced as radiation fields by some conserved currents.

Theorem 21

Let J(x) satisfy Assumption III in Sect. 7. Then, the Lorenz potential \(A_\mathrm{rad}(x)\) of the radiated field of this current and the scalar field \(C_\mathrm{rad}(x)=x\cdot A_\mathrm{rad}(x)\) satisfy the following decay estimates:

$$\begin{aligned}&\left. \begin{array}{l} |A_\mathrm{rad}(x)|\\ |\nabla C_\mathrm{rad}(x)| \end{array}\ \right\} \le \frac{\mathrm {const}}{|x|+1}\Big [\frac{1}{(||x^0|-|\mathbf {x}||+1)^\varepsilon } +\theta (-x^2)\Big ]\,, \end{aligned}$$
$$\begin{aligned}&|\nabla A_\mathrm{rad}(x)|\le \frac{\mathrm {const}}{|x|+1}\Big [\frac{1}{(||x^0|-|\mathbf {x}||+1)^{1+\varepsilon }} +\frac{\theta (-x^2)}{|x|+1}\Big ]\,, \end{aligned}$$
$$\begin{aligned}&|\nabla \nabla C_\mathrm{rad}(x)|\le \frac{\mathrm {const}}{|x|+1}\bigg [\frac{1}{(||x^0|-|\mathbf {x}||+1)^{1+\varepsilon }} +\frac{\theta (-x^2)}{(|x|+1)^\frac{1}{2}(|\mathbf {x}|-|x^0|+1)^\frac{1}{2}}\bigg ]\,, \end{aligned}$$
$$\begin{aligned}&|\nabla \nabla A_\mathrm{rad}(x)| \le \frac{\mathrm {const}}{|x|+1}\,, \end{aligned}$$
$$\begin{aligned}&\left. \begin{array}{l} |(x\cdot \nabla +1) A_\mathrm{rad}(x)|\\ |\nabla (x\cdot \nabla C_\mathrm{rad})(x)| \end{array} \right\} \le \frac{\mathrm {const}}{(|x|+1)(||x^0|-|\mathbf {x}||+1)^\varepsilon }\,, \end{aligned}$$
$$\begin{aligned}&|(x\cdot \nabla )^k C_\mathrm{rad}(x)| \le \frac{\mathrm {const}}{(|x|+1)^\varepsilon }\,,\quad k=1,2\,. \end{aligned}$$

The estimate (127) could be refined, but this is not needed for our purposes.


It follows from the assumption (70) and Lemma 20 (iii) that the function

$$\begin{aligned} V(s,l)=\int \delta (s-x\cdot l)J(x)dx=\int J \Big (\frac{s+\mathbf {x}\cdot \mathbf {l}}{l^0},\mathbf {x}\Big )\frac{d^3x}{l^0} \end{aligned}$$

is of class \(C^3\). The use of estimate (71), relation (121) and the estimates of the form (120) shows that

$$\begin{aligned} |\partial _s^k L^\alpha \dot{V}(s,l)|\le \frac{\mathrm {const}}{(|s|+1)^{k+1+\varepsilon }}\,, \quad k+|\alpha |\le 2\,. \end{aligned}$$

From the representation

$$\begin{aligned} A_\mathrm{rad}(x)=-\frac{1}{2\pi }\int \dot{V}(x\cdot l,l)d^2l\,, \end{aligned}$$

by the use of the calculation

$$\begin{aligned} \partial \cdot V(x\cdot l,l)=x\cdot \dot{V}(x\cdot l,l)+\partial '\cdot V(x\cdot l,l) \end{aligned}$$

and the integral identity (106), we also find

$$\begin{aligned} C_\mathrm{rad}(x)&=\frac{1}{2\pi }\int \partial '\cdot V(x\cdot l,l)d^2l\,,\\ \nabla _a C_\mathrm{rad}(x)&=\frac{1}{2\pi }\int l_a\partial '\cdot \dot{V}(x\cdot l,l)d^2l\,. \end{aligned}$$

(Similarly as \(L'_{ab}\) in Lemma 19 in Appendix F, \(\partial '\) acts only on the second argument.) All estimates now follow from the appropriate statements of Theorem 18: estimates (124) from (i) with \(n=1\), estimate (125) from (iii) for \(n=2\), estimate (126) from (ii) for \(n=2\), estimate (127) from (i) with \(n=1\). Next, with \(W(s,l)=s\dot{V}(s,l)\), we have

$$\begin{aligned} (x\cdot \nabla +1)A_\mathrm{rad}(x)&=-\frac{1}{2\pi }\int \dot{W}(x\cdot l,l)d^2l\,,\\ \nabla _a(x\cdot \nabla C_\mathrm{rad}(x))&=\frac{1}{2\pi }\int l_a\partial '\cdot \dot{W}(x\cdot l,l)d^2l\,, \end{aligned}$$

so the estimates (128) follow from Theorem 18 (ii) with \(n=1\). Finally, with \(U(s,l)=s\dot{W}(s,l)\), we have

$$\begin{aligned} x\cdot \nabla C_\mathrm{rad}(x)&=\frac{1}{2\pi }\int \partial '\cdot W(x\cdot l,l)d^2l\,,\\ (x\cdot \nabla )^2C_\mathrm{rad}(x)&=\frac{1}{2\pi }\int \partial '\cdot U(x\cdot l,l)d^2l\,, \end{aligned}$$

so the estimates (129) follow from Theorem 18 (i) for \(n=0\). \(\square \)

Free in/out fields of the type discussed above are now augmented by ret/adv fields to produce the total electromagnetic field.

Theorem 22

Let the electromagnetic potential A satisfy Assumption III in Sect. 7. Then, the following estimates are satisfied:

$$\begin{aligned} \left. \begin{array}{l} |\nabla ^\alpha A(x)|\\ |\nabla C(x)| \end{array}\ \right\}&\le \frac{\mathrm {const}}{|x|+1}\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,, \quad |\alpha |=0,2\,,\nonumber \\ |\nabla A(x)|&\le \frac{\mathrm {const}}{|x|+1}\Big [\frac{1}{(||x^0|-|\mathbf {x}||+1)^{1+\varepsilon }} +\frac{1}{|x|+1}\Big ]\nonumber \\&\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\bigg [\frac{1}{\langle \tau \rangle \langle z\rangle } +\Big (\frac{\langle \tau \rangle \langle z\rangle }{\langle \tau \rangle ^2+\langle z\rangle ^2}\Big )^{1+\varepsilon }\bigg ]\,, \end{aligned}$$
$$\begin{aligned} |\nabla \nabla C(x)|&\le \frac{\mathrm {const}}{|x|+1}\Big [\frac{1}{(||x^0|-|\mathbf {x}||+1)^{1+\varepsilon }} +\frac{1}{(|x|+1)^\frac{1}{2}(||x^0|-|\mathbf {x}||+1)^\frac{1}{2}}\Big ]\nonumber \\&\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\bigg [\frac{1}{\langle \tau \rangle ^\frac{1}{2}\langle z\rangle ^\frac{1}{2}} \Big (\frac{\langle \tau \rangle \langle z\rangle }{\langle \tau \rangle ^2+\langle z\rangle ^2}\Big )^\frac{1}{2} +\Big (\frac{\langle \tau \rangle \langle z\rangle }{\langle \tau \rangle ^2+\langle z\rangle ^2}\Big )^{1+\varepsilon }\bigg ]\,, \end{aligned}$$
$$\begin{aligned} \left. \begin{array}{l} |(x\cdot \nabla +1) A(x)|\\ |\nabla (x\cdot \nabla C(x))| \end{array}\ \right\}&\le \frac{\mathrm {const}}{(|x|+1)(||x^0|-|\mathbf {x}||+1)^\varepsilon }\nonumber \\&\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\Big (\frac{\langle \tau \rangle \langle z\rangle }{\langle \tau \rangle ^2+\langle z\rangle ^2}\Big )^\varepsilon \,,\\ |(x\cdot \nabla )^k C(x)|&\le \frac{\mathrm {const}}{(|x|+1)^\varepsilon } \le \frac{\mathrm {const}}{\langle \tau \rangle ^\varepsilon \langle z\rangle ^\varepsilon }\,,\quad k=1,2\,,\nonumber \\ |x^aF_{ab}(x)|&\le \frac{\mathrm {const}}{|x|+1}\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,.\nonumber \end{aligned}$$


To prove the estimates on A and C in the half-space \(x^0\ge 0\) it is sufficient to add the estimates of \(A_\mathrm{out}\) and \(C_\mathrm{out}\), as given by Theorem 21, to the estimates of \(A_\mathrm{adv}\) and \(C_\mathrm{adv}\) following from the assumptions on J and Lemma 20 (i) and (ii). For \(x^0\ge 0\) one has

$$\begin{aligned} |\nabla ^\alpha A_\mathrm{adv}(x)|&\le \frac{\mathrm {const}}{(|x|+1)^{|\alpha |+1}}\,,\quad |\alpha |\le 3\,,\\ |\nabla ^\alpha (x\cdot \nabla +1)A_\mathrm{adv}(x)|&=\bigg |\int D_\mathrm{adv}(x-y)(y\cdot \nabla +3+|\alpha |) \nabla ^\alpha J(y)dy\,\bigg |\\&\le \frac{\mathrm {const}}{(|x|+1)^{1+|\alpha |+\varepsilon }}\,,\quad |\alpha |\le 1\,. \end{aligned}$$


$$\begin{aligned}&x\cdot \nabla C_\mathrm{adv}(x)=x^c(x\cdot \nabla +1)A_{\mathrm{adv}\,c}(x)\,,\\&(x\cdot \nabla )^2 C_\mathrm{adv}(x)=x\cdot \nabla C_\mathrm{adv}(x) +x^cx^d\nabla _d(x\cdot \nabla +1)A_{\mathrm{adv}\,c}(x)\,,\\&x^aF_{ab}(x)=(x\cdot \nabla +1)A_b(x)-\nabla _bC(x)\,. \end{aligned}$$

and all the estimates in the x-form now easily follow. Similarly for \(x^0\le 0\) with the use of \(A_\mathrm{in}\) and \(A_\mathrm{ret}\). The \((\tau ,\mathbf {z})\)-form of the estimates is the consequence of Lemma 16. \(\square \)

Appendix I. Special Gauge in Special Variables

We consider now our field in the special gauge (73), which we write in the form

$$\begin{aligned} \mathcal {A}=A-\nabla S_\ell \,,\quad S_\ell =\log (\langle \tau \rangle \langle z\rangle )C\,,\quad C(x)=x\cdot A\,. \end{aligned}$$

We need the components of this potential, and various expressions involving them, in our special coordinates system. For the sake of this section we introduce the abbreviation

$$\begin{aligned} \ell _{\tau z}=\log (\langle \tau \rangle \langle z\rangle )\,. \end{aligned}$$

Theorem 23

The special gauge potential \(\hat{\mathcal {A}}_\mu (\tau ,\mathbf {z})\) is a \(C^2\) field, and the following estimates are satisfied:

$$\begin{aligned}&|\hat{F}_{i\tau }|\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,,\quad |z^i\hat{F}_{i\tau }|\le \frac{\mathrm {const}}{\langle \tau \rangle }\,, \quad |\hat{F}_{ij}|\le \mathrm {const}\,,\quad |z^i\hat{F}_{ij}|\le \frac{\mathrm {const}}{\langle z\rangle }\,,\\&|\hat{\mathcal {A}}_\tau |\le \mathrm {const}\bigg [\frac{1+\log \langle \tau \rangle }{\langle \tau \rangle ^{1+\varepsilon }} +\frac{\log \langle z\rangle }{\langle \tau \rangle ^3}\bigg ]\,,\\&|\hat{\mathcal {A}}_i|\le \mathrm {const}\frac{1+\log (\langle \tau \rangle \langle z\rangle )}{\langle z\rangle }\,, \quad |z^i\hat{\mathcal {A}}_i|\le \mathrm {const}\frac{1+\log (\langle \tau \rangle \langle z\rangle )}{\langle z\rangle ^\varepsilon }\,,\\&|\partial _i\hat{\mathcal {A}}_\tau |+|z^i\partial _i\hat{\mathcal {A}}_\tau |\le \mathrm {const}\frac{1+\log \langle \tau \rangle }{\langle \tau \rangle ^{1+\varepsilon }}\,,\\&|\partial _\tau \hat{\mathcal {A}}_i|+|z^i\partial _\tau \hat{\mathcal {A}}_i|\le \frac{\mathrm {const}}{\langle \tau \rangle }\,,\\&|\partial _i\hat{\mathcal {A}}_j|\le \mathrm {const}(1+\log \langle \tau \rangle )\,,\\&|z^i\partial _i\hat{\mathcal {A}}_j|+|z^i\partial _j\hat{\mathcal {A}}_i|\le \mathrm {const}\frac{1+\log (\langle \tau \rangle \langle z\rangle )}{\langle z\rangle }\,,\\&|z^iz^j\partial _i\hat{\mathcal {A}}_j|\le \mathrm {const}\frac{1+\log (\langle \tau \rangle \langle z\rangle )}{\langle z\rangle ^\varepsilon }\,. \end{aligned}$$

The proof below allows formulation of more restrictive, but at the same time more sophisticated and less transparent estimates. These given above are sufficient for our purposes.


According to Assumption III the potential A is of class \(C^3\), so \(\mathcal {A}\) is of class \(C^2\). For the calculation of the components and derivatives of A and C, we use the following form of the coordinates change:

$$\begin{aligned} \partial _\tau x=\frac{\tau }{\langle \tau \rangle ^2}x+\frac{\langle z\rangle }{\langle \tau \rangle ^2}(1,0)\,,\qquad \partial _i x=\Big (\tau \frac{z^i}{\langle z\rangle },\langle \tau \rangle \delta ^{i,.}\Big )\,, \end{aligned}$$

which leads further to additional identities

$$\begin{aligned}&z^i\partial _ix=x-\frac{\tau }{\langle z\rangle }(1,0)\,,\\&\partial _i\partial _\tau x=\frac{\tau }{\langle \tau \rangle ^2}\partial _i x+\frac{z^i}{\langle \tau \rangle ^2\langle z\rangle }(1,0)\,, \qquad \partial _i\partial _j x=\frac{\tau }{\langle z\rangle }d^{ij}(1,0)\,, \end{aligned}$$

where \(d^{ij}\) is given by

$$\begin{aligned} d^{ij}=\delta ^{ij}-\frac{z^iz^j}{\langle z\rangle ^2}\,, \end{aligned}$$

and some of its properties are

$$\begin{aligned}&\partial _i\Big (\frac{z^j}{\langle z\rangle }\Big )=\frac{d^{ij}}{\langle z\rangle }\,, \qquad z^i d^{ij}=\frac{z^j}{\langle z\rangle ^2}\,,\\&\partial _kd^{ij}=\frac{-1}{\langle z\rangle ^2}(z^id^{jk}+z^jd^{ik})\,, \qquad z^i\partial _i d^{jk}=-2\frac{z^jz^k}{\langle z\rangle ^4}\,. \end{aligned}$$

A straightforward calculation now gives

$$\begin{aligned} \hat{F}_{i\tau }&=(\partial _i x^a)(\partial _\tau x^b)F_{ab}=(\partial _ix^a)\Big (\frac{\tau }{\langle \tau \rangle ^2}F_{ab}x^b +\frac{\langle z\rangle }{\langle \tau \rangle ^2}F_{a0}\Big )\,,\\ z^i\hat{F}_{i\tau }&=\Big (\frac{\langle z\rangle }{\langle \tau \rangle ^2}+\frac{\tau ^2}{\langle \tau \rangle ^2\langle z\rangle }\Big )x^aF_{a0}\,,\\ \hat{F}_{ij}&=(\partial _ix^a)(\partial _jx^b)F_{ab}\,,\\ z^i\hat{F}_{ij}&=\Big (x^aF_{ab}-\frac{\tau }{\langle z\rangle }F_{0b}\Big )(\partial _jx^b)\,.\\ \hat{\mathcal {A}}_\tau&=\frac{\langle z\rangle }{\langle \tau \rangle ^2}A_0-\ell _{\tau z}\partial _\tau C\,,\\ \hat{\mathcal {A}}_i&=\langle \tau \rangle \, d^{ij}A_j-\ell _{\tau z}\partial _iC\,,\\ z^i\hat{\mathcal {A}}_i&=\frac{\langle \tau \rangle }{\langle z\rangle ^2}z^iA_i-\ell _{\tau z}z^i\partial _iC\\ \partial _i\hat{\mathcal {A}}_\tau&= \frac{z^i}{\langle \tau \rangle ^2\langle z\rangle }A_0+\frac{\langle z\rangle }{\langle \tau \rangle ^2}\partial _i A_0-\frac{z^i}{\langle z\rangle ^2} \partial _\tau C -\ell _{\tau z}\partial _i\partial _\tau C\,, \\ z^i\partial _i\hat{\mathcal {A}}_\tau&=\frac{\mathbf {z}^2}{\langle \tau \rangle ^2\langle z\rangle }A_0+\frac{\langle z\rangle }{\langle \tau \rangle ^2}z^i\partial _i A_0 -\frac{|\mathbf {z}|^2}{\langle z\rangle ^2}\partial _\tau C-\ell _{\tau z}z^i\partial _i\partial _\tau C\,,\\ \partial _i\hat{\mathcal {A}}_j&=\langle \tau \rangle \big [(\partial _id^{jk})A_k+d^{jk}\partial _i A_k\big ] -\frac{z^i}{\langle z\rangle ^2}\partial _j C -\ell _{\tau z}\partial _i\partial _j C\,,\\ z^i\partial _i\hat{\mathcal {A}}_j&=\langle \tau \rangle \Big [-2\frac{z^jz^k}{\langle z\rangle ^4}A_k +d^{jk}z^i\partial _i A_k\Big ] -\frac{|\mathbf {z}|^2}{\langle z\rangle ^2}\partial _jC -\ell _{\tau z}z^i\partial _i\partial _jC\,,\\ z^iz^j\partial _i\hat{\mathcal {A}}_j&=\frac{\langle \tau \rangle }{\langle z\rangle ^2}\Big [-2\frac{|\mathbf {z}|^2}{\langle z\rangle ^2}z^kA_k +z^iz^k\partial _i A_k\Big ] -\frac{|\mathbf {z}|^2}{\langle z\rangle ^2}z^j\partial _jC -\ell _{\tau z}z^iz^j\partial _i\partial _jC\,,\\ \end{aligned}$$

For the estimation of all these field and potential expressions, we use Theorem 22. We note that

$$\begin{aligned} |\partial _i x|\le \mathrm {const}\langle \tau \rangle \,,\qquad |\partial _\tau x|\le \mathrm {const}\langle z\rangle \,,\qquad |\partial _z^\alpha d^{ij}|\le \frac{\mathrm {const}}{\langle z\rangle ^{|\alpha |}}\,,\ |\alpha |\le 2\,. \end{aligned}$$

Moreover, when using the estimates (130), (131) and (132) we shall make use of the obvious inequalities

$$\begin{aligned} \frac{\langle \tau \rangle \langle z\rangle }{\langle \tau \rangle ^2+\langle z\rangle ^2} \le \min \bigg \{\frac{1}{2},\frac{\langle \tau \rangle }{\langle z\rangle },\frac{\langle z\rangle }{\langle \tau \rangle }\bigg \} \le \left\{ \begin{array}{l} \langle \tau \rangle \langle z\rangle ^{-1}\,,\\ \langle \tau \rangle ^{-1}\langle z\rangle \,,\\ 2^{-1} \end{array}\right. \end{aligned}$$

(the choice of one of them will be dictated by a particular need). In this way we find

$$\begin{aligned} |\hat{F}_{i\tau }|&\le \mathrm {const}\Big (|F_{ab}x^b|+\frac{\langle z\rangle }{\langle \tau \rangle }|F_{a0}|\Big ) \le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,,\\ |z^i\hat{F}_{i\tau }|&\le \mathrm {const}\Big (\frac{\langle z\rangle }{\langle \tau \rangle ^2}+\frac{1}{\langle z\rangle }\Big )|x^aF_{a0}| \le \frac{\mathrm {const}}{\langle \tau \rangle }\bigg (\frac{1}{\langle \tau \rangle ^2}+\frac{1}{\langle z\rangle ^2}\bigg )\,,\\ |\hat{F}_{ij}|&\le \mathrm {const}\langle \tau \rangle ^2|F_{ab}|\le \mathrm {const}\,,\\ |z^i\hat{F}_{ij}|&\le \mathrm {const}\Big (\langle \tau \rangle |x^aF_{ab}|+\frac{\langle \tau \rangle ^2}{\langle z\rangle }|F_{0b}|\Big ) \le \frac{\mathrm {const}}{\langle z\rangle }\,, \end{aligned}$$

which exhausts the claims on the field F.

The estimation of the potential expressions is more tedious, and we consider it in steps. In the first step we estimate the \(\tau \)- and \(\mathbf {z}\)-dependent coefficients, which gives

$$\begin{aligned}&|\hat{\mathcal {A}}_\tau |\le \frac{\langle z\rangle }{\langle \tau \rangle ^2}|A_0|+\ell _{\tau z}|\partial _\tau C|\,,\\&|\hat{\mathcal {A}}_i|\le \mathrm {const}\langle \tau \rangle |A_j|+\ell _{\tau z}|\partial _iC|\,,\quad |z^i\hat{\mathcal {A}}_i|\le \mathrm {const}\frac{\langle \tau \rangle }{\langle z\rangle }|A_i|+\ell _{\tau z}|z^i\partial _iC|\\&|\partial _i\hat{\mathcal {A}}_\tau |\le \frac{1}{\langle \tau \rangle ^2}\big (|A_0|+\langle z\rangle |\partial _i A_0|\big ) +\frac{1}{\langle z\rangle }|\partial _\tau C| +\ell _{\tau z}|\partial _i\partial _\tau C|\,,\\&|z^i\partial _i\hat{\mathcal {A}}_\tau |\le \frac{\langle z\rangle }{\langle \tau \rangle ^2}\big (|A_0|+|z^i\partial _i A_0|\big ) +|\partial _\tau C|+\ell _{\tau z}|z^i\partial _i\partial _\tau C|\,,\\&|\partial _i\hat{\mathcal {A}}_j|\le \mathrm {const}\langle \tau \rangle \Big (\frac{1}{\langle z\rangle }|A_k|+|\partial _i A_k|\Big ) +\frac{1}{\langle z\rangle }|\partial _j C|+\ell _{\tau z}|\partial _i\partial _j C|\,,\\&|z^i\partial _i\hat{\mathcal {A}}_j|\le \mathrm {const}\langle \tau \rangle \Big (\frac{1}{\langle z\rangle ^2}|A_k|+|z^i\partial _i A_k|\Big ) +|\partial _jC|+\ell _{\tau z}|z^i\partial _i\partial _jC|\,,\\&|z^iz^j\partial _i\hat{\mathcal {A}}_j|\le \mathrm {const}\frac{\langle \tau \rangle }{\langle z\rangle }(|A_k|+|z^i\partial _i A_k|) +|z^j\partial _jC|+\ell _{\tau z}|z^iz^j\partial _i\partial _jC|\,, \end{aligned}$$

Next, we evaluate the A- and C-terms:

$$\begin{aligned}&\partial _i A_b=(\partial _ix^a)\nabla _aA_b\,,\qquad z^i\partial _i A_b=x\cdot \nabla A_b-\frac{\tau }{\langle z\rangle }\nabla _0 A_b\,,\\&\partial _i\partial _j A_c=(\partial _i x^a)(\partial _j x^b)\nabla _a\nabla _b A_c+\frac{\tau }{\langle z\rangle }d^{ij}\nabla _0 A_c \end{aligned}$$

(the latter term is not needed for the present proof, but its estimate given below is used in the proof of Theorem 9 (iii) (b)),

$$\begin{aligned}&\partial _\tau C=\frac{\tau }{\langle \tau \rangle ^2}x\cdot \nabla C+\frac{\langle z\rangle }{\langle \tau \rangle ^2}\nabla _0 C\,,\\&\partial _iC= (\partial _ix^a)\nabla _a C\,,\qquad z^i\partial _iC= x\cdot \nabla C-\frac{\tau }{\langle z\rangle }\nabla _0C\,,\\&\partial _i\partial _\tau C=(\partial _i x^a)\Big [\frac{\tau }{\langle \tau \rangle ^2}\nabla _a(x\cdot \nabla C) +\frac{\langle z\rangle }{\langle \tau \rangle ^2}\nabla _a\nabla _0 C\Big ]+\frac{z^i}{\langle \tau \rangle ^2\langle z\rangle }\nabla _0 C\,,\\&z^i\partial _i\partial _\tau C=\frac{\tau }{\langle \tau \rangle ^2}\Big [(x\cdot \nabla )^2C-\nabla _0^2 C\Big ] +\Big [\frac{\langle z\rangle }{\langle \tau \rangle ^2}-\frac{\tau ^2}{\langle \tau \rangle ^2\langle z\rangle }\Big ]\nabla _0(x\cdot \nabla C)\\&\quad -\frac{1}{\langle \tau \rangle ^2\langle z\rangle }\nabla _0 C\,,\\&\partial _i\partial _jC=(\partial _ix^a)(\partial _jx^b)\nabla _a\nabla _b C+\frac{\tau }{\langle z\rangle }d^{ij}\nabla _0C\,,\\&z^i\partial _i\partial _jC=(\partial _jx^a)\Big [\nabla _a(x\cdot \nabla -1)C-\frac{\tau }{\langle z\rangle }\nabla _a\nabla _0C\Big ] +\frac{\tau z^j}{\langle z\rangle ^3}\nabla _0C\,,\\&z^iz^j\partial _i\partial _jC=(x\cdot \nabla -1)x\cdot \nabla C-2\frac{\tau }{\langle z\rangle }\nabla _0(x\cdot \nabla C) +\frac{\tau ^2}{\langle z\rangle ^2}\nabla _0^2C\\&\quad +\frac{\tau }{\langle z\rangle }\Big (2+\frac{|\mathbf {z}|^2}{\langle z\rangle ^2}\Big )\nabla _0C\,. \end{aligned}$$

Now we can estimate the A- and C-terms:

$$\begin{aligned}&|A_a|\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,,\qquad |\partial _i A_b|\le \mathrm {const}\,\min \bigg \{\frac{1}{\langle z\rangle }, \frac{1}{\langle \tau \rangle }\bigg \}\,,\\&|z^i\partial _iA_b|\le \frac{\mathrm {const}}{\langle \tau \rangle \langle z\rangle }\,,\qquad |\partial _i\partial _j A_b|\le \mathrm {const}\frac{\langle \tau \rangle }{\langle z\rangle }\,,\\&|\partial _\tau C|\le \mathrm {const}\Big (\frac{1}{\langle \tau \rangle ^{1+\varepsilon }\langle z\rangle ^\varepsilon } +\frac{1}{\langle \tau \rangle ^3}\Big )\,,\\&|\partial _iC|\le \frac{\mathrm {const}}{\langle z\rangle }\,,\qquad |z^i\partial _iC|\le \frac{\mathrm {const}}{\langle z\rangle ^\varepsilon }\,,\\&|\partial _i\partial _\tau C|\le \frac{\mathrm {const}}{\langle \tau \rangle ^{1+\varepsilon }\langle z\rangle ^{1-\varepsilon }}\,, \qquad |z^i\partial _i\partial _\tau C|\le \frac{\mathrm {const}}{\langle \tau \rangle ^{1+\varepsilon }\langle z\rangle ^\varepsilon }\,,\\&|\partial _i\partial _j C|\le \mathrm {const}\,\min \bigg \{1, \frac{\langle \tau \rangle }{\langle z\rangle }\bigg \}\,,\\&|z^i\partial _i\partial _j C|\le \frac{\mathrm {const}}{\langle z\rangle }\,,\qquad |z^iz^j \partial _i\partial _j C|\le \frac{\mathrm {const}}{\langle z\rangle ^\varepsilon }\,. \end{aligned}$$

Substituting these estimates in the bounds on the potential expressions obtained earlier in the proof, one arrives at the estimates given for them in the thesis. The only less obvious estimation on the way is that of the term

$$\begin{aligned} \log (\langle \tau \rangle \langle z\rangle )\min \bigg \{1, \frac{\langle \tau \rangle }{\langle z\rangle }\bigg \} \end{aligned}$$

appearing in \(|\partial _i\hat{\mathcal {A}}_j|\). For \(\langle z\rangle \le \langle \tau \rangle \) this is bounded by \(2\log \langle \tau \rangle \), while for \(\langle z\rangle \ge \langle \tau \rangle \) we write \(\log (\langle \tau \rangle \langle z\rangle )=2\log \langle \tau \rangle +\log (\langle z\rangle /\langle \tau \rangle )\) and estimate the term by

$$\begin{aligned} 2\log \langle \tau \rangle +\Big (\frac{\langle z\rangle }{\langle \tau \rangle }\Big )^{-1}\log \Big (\frac{\langle z\rangle }{\langle \tau \rangle }\Big ) \le \mathrm {const}(1+\log \langle \tau \rangle )\,. \end{aligned}$$

Finally, we write

$$\begin{aligned}&|\partial _\tau \hat{\mathcal {A}}_i|\le |\partial _i\hat{\mathcal {A}}_\tau |+|\hat{F}_{i\tau }|\,,\qquad |z^i\partial _\tau \hat{\mathcal {A}}_i|\le |z^i\partial _i\hat{\mathcal {A}}_\tau |+|z^i\hat{F}_{i\tau }|\\&|z^j\partial _i\hat{\mathcal {A}}_j|\le |z^j\partial _j\hat{\mathcal {A}}_i|+|z^j\hat{F}_{ij}| \end{aligned}$$

to obtain the last missing estimates. \(\square \)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Herdegen, A. Infrared Problem vs Gauge Choice: Scattering of Classical Dirac Field. Ann. Henri Poincaré (2021).

Download citation

Mathematics Subject Classification

  • Primary 81U99
  • Secondary 81V10