The original version of Chapter 8 was inadvertently published with errors. Those errors have been corrected as follows:

Page 324. “8.3.3 Raising and Lowering: How Did Dirac Do It?” should read as follows.

8.3.3 Raising and Lowering: How Did Dirac Do It?

The “operator method” of harmonic oscillator analysis is fundamental to all advanced QM. It foreshadows a very similar method used for angular momentum, and is the basis of a major part of QFT (including quantum EM radiation), which starts with the fields as an infinite set of quantized harmonic oscillators. The method is far from obvious, so the big question on everybody's mind is “How did Dirac do it?” How did he know how to create the creation and annihilation operators? Most references give them as “Lo! And behold!” (much like they present the Schrödinger equation out of thin air). Here is one way to do it logically, and (who knows?) perhaps is similar to how Dirac figured it out. Our goal here is not to provide the simplest derivation, but to show how such an idea might come about in the first place. In addition, this section further ties together the meaning of operators and Dirac notation, which makes this topic worth understanding. This section assumes you understand how the operator method works, but not how to motivate its development.

First, recall the stationary states of the 1D harmonic oscillator ([8], 7.18, p. 144):

$$\displaylines{ |n\rangle \equiv {u_n}(x) = {1 \over {\sqrt {{2^n}n!} }}{\left( {{{m\omega } \over {\hbar \pi }}} \right)^{1/4}}\,{H_n}\left( {\sqrt {{{m\omega } \over \hbar }} x} \right)\exp \left( { - {{m\omega } \over {2\hbar }}{x^2}} \right) = {C_n}{H_n}(X)\exp \left( { - {1 \over 2}{X^2}} \right)} $$

where \(X \equiv \sqrt {{{m\omega } \over \hbar }} x\) is dimensionless position, C n ≡ normalization.

H n (X) ≡ Hermite polynomial of degree n.

Now, consider two recursion relations between the Hermite polynomials, (which imply relations between the harmonic oscillator stationary states), which were well known long before Dirac ([7], 7.19–20, p. 144):

$$\eqalign{ & {d \over {dX}}{H_n}(X) = 2n{H_{n - 1}}(X), & {\rm{and}}\quad 2X\,{H_n}(X) = {H_{n + 1}}(X) + 2n{H_{n - 1}}(X).} $$
(8.7)

In the second equation, we already see the seeds of the well-known relation \(\hat x\, \propto ({\hat a^\dag } + \hat a)\), because it says [after multiplying through by exp(− X 2/2)]:

$$\eqalign{ & \hat x|n\rangle \propto b|n + 1\rangle + c|n - 1\rangle , \cr & where\quad b,c\,{\rm{are}}\,{\rm{as - yet}}\,{\rm{unknown}}\,{\rm{numbers}}. \cr} $$
(8.8)

Also, the first recursion relation tells us something about momentum operators, since \(\hat p \propto d/dx\):

Now Dirac is famous for “Dirac notation,” which is a coordinate-free (i.e., representation-indepent) notation for quantum states, and other kets. We seek some purely conceptual relations between the state kets, which must be independent of representation. Therefore we ask: can we write the Hermite recursion relations in ket notation? We start with the first step: can we write a Hermite polynomial in ket notation? Sure:

$${u_n}(x) = {C_n}{H_n}( \cdot )\exp ( \cdot ) \Rightarrow {H_n}(x) = \frac{1}{{{C_n}}}\exp {\left( { + {X^2}/2} \right)_{{{{u_n}(x)}}{|n\rangle } }} = \frac{1}{{{C_n}}}\exp \left( { + {X^2}/2} \right)|n\rangle .$$

Since the first recursion relation in Eq. (8.7) includes a derivative, what is the derivative of a Hermite polynomial in ket notation? We simply differentiate the above equation:

$${d \over {dX}}{H_n}(X) = {d \over {dX}}{1 \over {{C_n}}}\exp \left( { + {X^2}/2} \right)|n\rangle .$$

In the x-representation,\(|n\rangle \) is a function of x, so we use the product rule on the RHS

$$\begin{array}{rl} {d \over {dX}}{H_n}(X) & = {1 \over {{C_n}}}\left\{ {\left[ {{d \over {dX}}\exp \left( { + {X^2}/2} \right)} \right]|n\rangle + \exp \left( { + {X^2}/2} \right){d \over {dX}}|n\rangle } \right\} \\ & = {1 \over {{C_n}}}\left\{ {\left[ {X\exp \left( { + {X^2}/2} \right)} \right]|n\rangle + \exp \left( { + {X^2}/2} \right){d \over {dX}}|n\rangle } \right\} \\ & = {1 \over {C{_n}}}\exp \left( { + {X^2}/2} \right)\left\{ {X + {d \over {dX}}} \right\}|n\rangle.\\ \end{array}$$

These results show that the first recursion relation relates |n〉 to |n − 1〉; when written in ket notation, this gives us a lowering operator. After canceling the exp(+ X 2/2):

$${1 \over {{C_n}}}\left\{ {X + {d \over {dX}}} \right\}|n\rangle = 2n{1 \over {{C_{n - 1}}}}|n - 1\rangle .$$
(8.9)

We eliminate the normalization factors from:

$${C_n} \propto 1/\sqrt {{2^n}n!} \quad \Rightarrow \quad {C_n}/{C_{n - 1}} = 1/\sqrt {2n} .$$

We now return to the true position variable. Use:

$$X = \sqrt {{{m\omega } \over \hbar }} x,\quad {d \over {dX}} = \sqrt {{\hbar \over {m\omega }}} {d \over {dx}} = \sqrt {{\hbar \over {m\omega }}} {i \over \hbar }\hat p.$$

To be representation-indpendent, we also take x\(\hat x\). Then Eq. (8.9) becomes:

$$\left\{ {\sqrt {{{m\omega } \over \hbar }} \hat x + i\sqrt {{1 \over {\hbar m\omega }}} \hat p} \right\}|n\rangle \,{\rm{or}}\,\left\{ {\sqrt {{{m\omega } \over {2\hbar }}} \hat x + i\sqrt {{1 \over {2\hbar m\omega }}} \hat p} \right\}|n\rangle = \sqrt n |n - 1\rangle .$$

This gives the lowering operator in final form:

$$\hat a|n\rangle = \sqrt n |n - 1\rangle \quad where\quad \hat a = \sqrt {{{m\omega } \over {2\hbar }}} \hat x + i\sqrt {{1 \over {2\hbar m\omega }}} \hat p.$$
(8.10)

As shown later, this is sufficient to also define a raising operator, which is the adjoint of â. However, to continue with our wave-function derivation, we first find the raising operator directly from the recursion relations Eq. (8.7). For a raising operator, we write them with H n+1 on the left, and only H n on the right:

$${H_{n + 1}}(X) = 2X\,{H_n}(X) - {d \over {dX}}{H_n}(X).$$

We already know how to convert this to ket notation. After again canceling all the exp(+ X2/2):

$$\displaylines{ {1 \over {{C_{n + 1}}}}\left| {n + 1} \right\rangle = {2 \over {{C_n}}}\sqrt {{{m\omega } \over \hbar }} \hat x|n\rangle - {1 \over {{C_n}}}\left\{ {\sqrt {{{m\omega } \over \hbar }} \hat x + i\sqrt {{1 \over {\hbar m\omega }}} \hat p} \right\}|n\rangle \cr= {1 \over {{C_n}}}\left\{ {\sqrt {{{m\omega } \over \hbar }} \hat x - i\sqrt {{1 \over {\hbar m\omega }}} \hat p} \right\}|n\rangle \quad {\rm{Use}}:\quad {C_n}/{C_{n + 1}} = \sqrt {2(n + 1)} \cr \sqrt {n + 1} \left| {n + 1} \right\rangle = \left\{ {\sqrt {{{m\omega } \over {2\hbar }}} \hat x - i\sqrt {{1 \over {2\hbar m\omega }}} \hat p} \right\}|n\rangle \quad \Rightarrow \quad {{\hat a}^\dag } = \sqrt {{{m\omega } \over {2\hbar }}} \hat x - i\sqrt {{1 \over {2\hbar m\omega }}} \hat p. \cr} $$
(8.11)

We see explicitly that the raising operator is the adjoint of the lowering operator.

Algebraic derivation of â : As noted ealier, we can derive the raising operator as the adjoint of the lowering operator more simply by using Dirac algebra. Start with Eq. (8.10), and use the definition of adjoint:

$$\hat a\left| n \right\rangle = \sqrt n \left| {n - 1} \right\rangle \quad \Rightarrow \quad \quad \left\langle n \right|{\hat a^\dag } = \sqrt n \left\langle {n - 1} \right|.$$

Since n is arbitrary, we can replace nn + 1:

$$\displaylines{ \left\langle {n + 1} \right|{{\hat a}^\dag } = \sqrt {n + 1} \left\langle n \right| \Rightarrow \quad \left\langle {n + 1} \right|{{\hat a}^\dag }\left| n \right\rangle = \sqrt {n + 1} ,\,{\rm{and}}\,\,\left\langle m \right|{{\hat a}^\dag }\left| n \right\rangle = 0,\,m \ne n + 1 \cr\Rightarrow \quad {{\hat a}^\dag }\left| n \right\rangle = \sqrt {n + 1} \left| {n + 1} \right\rangle . \cr} $$

A key property of â and â is that \(\hat x\) and \(\hat p\) are not just linear combinations of them, but proportional to their sum and difference:

$$\hat x \propto \left( {\hat a + {{\hat a}^\dag }} \right),\quad \hat p \propto \left( {\hat a - {{\hat a}^\dag }} \right).$$

This greatly simplifies computing inner products of \(\hat x\) and \(\hat p\). Also, note that \(\hat a\) and â are real; the appearance of i with \(\hat p\) in Eqs. (8.10) and (8.11) cancels the i in the definition \(\hat p \equiv (\hbar /i)\nabla \).

Of course now that we know these results, we could rederive all this much more quickly. That requires a “lo and behold” approach, where we show that a seemingly random definition of the operator â leads to a useful result. Many references supply such a derivation. Our goal here, though, was to show how one might develop this result from observations about well-known prior results.