Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Lecture 2.5: Generators, Pauli Matrices, and SU(2)

Purdue University

What Is a Generator?

Infinitesimal Rotations

We’ve been working with rotation matrices — R(θ)R(\theta) rotates a vector by angle θ\theta. But rotations can be understood more deeply by asking: what happens when the angle is very, very small?

Start with the 2D rotation matrix:

R(θ)=(cosθsinθsinθcosθ)R(\theta) = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}

For a tiny angle δθ\delta\theta, Taylor expand: cosδθ1\cos\delta\theta \approx 1 and sinδθδθ\sin\delta\theta \approx \delta\theta:

R(δθ)(1δθδθ1)=(1001)+δθ(0110)R(\delta\theta) \approx \begin{pmatrix} 1 & -\delta\theta \\ \delta\theta & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + \delta\theta\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}
R(δθ)I+δθGR(\delta\theta) \approx I + \delta\theta\, G

where:

G=(0110)G = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}

GG is called the generator of rotations. It tells you which direction you move when you begin to rotate. The identity II says “stay where you are,” and δθG\delta\theta\, G is the tiny nudge that starts the rotation.

From Generator to Finite Rotation

How do you build a large rotation from the generator? The same way you walk a mile — one step at a time.

To rotate by a finite angle θ\theta, break it into NN tiny steps, each of size θ/N\theta/N:

R(θ)=R(θ/N)R(θ/N)R(θ/N)=[R(θ/N)]NR(\theta) = R(\theta/N) \cdot R(\theta/N) \cdots R(\theta/N) = \left[R(\theta/N)\right]^N

For large NN, each step is infinitesimal, so R(θ/N)I+θNGR(\theta/N) \approx I + \frac{\theta}{N}G:

R(θ)=limN(I+θNG)NR(\theta) = \lim_{N \to \infty}\left(I + \frac{\theta}{N}G\right)^N

This limit is the definition of the matrix exponential:

R(θ)=eθG\boxed{R(\theta) = e^{\theta G}}

A finite rotation is the exponential of the generator. This is one of the most important ideas in physics: the generator encodes all rotations. You don’t need to memorize the full rotation matrix — just the generator, and exponentiation does the rest.

Evaluating the Exponential

Let’s verify this actually works. We need to compute eθGe^{\theta G} using the Taylor series:

eθG=I+θG+θ22!G2+θ33!G3+θ44!G4+e^{\theta G} = I + \theta G + \frac{\theta^2}{2!}G^2 + \frac{\theta^3}{3!}G^3 + \frac{\theta^4}{4!}G^4 + \cdots

First, what is G2G^2?

G2=(0110)(0110)=(1001)=IG^2 = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = -I

So G2=IG^2 = -I. This is the key property of the generator. The higher powers follow immediately:

G3=G2G=(I)G=G,G4=G2G2=(I)(I)=+I,G5=G4G=GG^3 = G^2 \cdot G = (-I)G = -G, \quad G^4 = G^2 \cdot G^2 = (-I)(-I) = +I, \quad G^5 = G^4 \cdot G = G

The pattern repeats with period 4: I,G,I,G,I,G,I, G, -I, -G, I, G, \ldots

Substituting into the Taylor series:

eθG=I+θGθ22!Iθ33!G+θ44!I+θ55!Ge^{\theta G} = I + \theta G - \frac{\theta^2}{2!}I - \frac{\theta^3}{3!}G + \frac{\theta^4}{4!}I + \frac{\theta^5}{5!}G - \cdots

Group the terms with II and the terms with GG:

=(1θ22!+θ44!)I+(θθ33!+θ55!)G= \left(1 - \frac{\theta^2}{2!} + \frac{\theta^4}{4!} - \cdots\right)I + \left(\theta - \frac{\theta^3}{3!} + \frac{\theta^5}{5!} - \cdots\right)G

These are the Taylor series for cosine and sine:

eθG=cosθI+sinθG\boxed{e^{\theta G} = \cos\theta\, I + \sin\theta\, G}

Let’s check:

eθG=cosθ(1001)+sinθ(0110)=(cosθsinθsinθcosθ)=R(θ)e^{\theta G} = \cos\theta\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + \sin\theta\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix} = R(\theta) \quad \checkmark

It works. The generator GG contains everything we need.


The Generator and the Imaginary Unit

G2=IG^2 = -I and i2=1i^2 = -1

We just found that the rotation generator satisfies G2=IG^2 = -I. Look at this property carefully — it’s the matrix version of i2=1i^2 = -1.

This is not a coincidence. Recall Euler’s formula for complex number rotations:

eiθ=cosθ+isinθe^{i\theta} = \cos\theta + i\sin\theta

Now compare our matrix rotation formula:

eθG=cosθ I+sinθ Ge^{\theta G} = \cos\theta\ I + \sin\theta\ G

The structures are identical. Everywhere that ii appears in Euler’s formula, GG appears in the matrix formula. Everywhere that 1 appears, II appears.

The generator GG plays the role of ii. Both satisfy the same algebraic property (G2=IG^2 = -I and i2=1i^2 = -1), and both generate rotations through exponentiation.

Two Representations of the Same Rotation

We saw last lecture that complex multiplication by eiθe^{i\theta} and matrix multiplication by R(θ)R(\theta) do the same thing to a vector. Now we see why: they are two representations of the same object, built from generators that obey the same algebra.

The complex number ii and the matrix GG are different representations of the same mathematical structure — the generator of 2D rotations.

Complex numbersMatrices
Elementeiθe^{i\theta}R(θ)=eθGR(\theta) = e^{\theta G}
GeneratoriiG=(0110)G = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}
Key propertyi2=1i^2 = -1G2=IG^2 = -I
Rotation formulaeiθ=cosθ+isinθe^{i\theta} = \cos\theta + i\sin\thetaeθG=cosθI+sinθGe^{\theta G} = \cos\theta\, I + \sin\theta\, G
Acts onzCz \in \mathbb{C}(x,y)TR2(x, y)^T \in \mathbb{R}^2

Generators of SO(3)

From 2D to 3D: Things Get Complicated

In 2D there is one rotation axis and one generator. In 3D there are three rotation axes — xx, yy, and zz — so we expect three generators.

The full 3×33\times 3 rotation matrices are:

Rx(θ)=(1000cosθsinθ0sinθcosθ),Ry(θ)=(cosθ0sinθ010sinθ0cosθ),Rz(θ)=(cosθsinθ0sinθcosθ0001)R_x(\theta) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & -\sin\theta \\ 0 & \sin\theta & \cos\theta \end{pmatrix}, \quad R_y(\theta) = \begin{pmatrix} \cos\theta & 0 & \sin\theta \\ 0 & 1 & 0 \\ -\sin\theta & 0 & \cos\theta \end{pmatrix}, \quad R_z(\theta) = \begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}

These are already getting unwieldy. Multiplying two of them together — say to check if they commute — would be a mess of trig identities. And we already established last lecture that 3D rotations don’t commute.

The generator approach cuts through this complexity. Instead of working with the full rotation matrices, we extract the generators — much simpler objects that encode all the essential information.

Extracting the Generators

Taylor expand each rotation matrix for small δθ\delta\theta:

Rx(δθ)I+δθ(000001010),Ry(δθ)I+δθ(001000100),Rz(δθ)I+δθ(010100000)R_x(\delta\theta) \approx I + \delta\theta \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}, \quad R_y(\delta\theta) \approx I + \delta\theta \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix}, \quad R_z(\delta\theta) \approx I + \delta\theta \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}

The three generators are:

Jx=(000001010),Jy=(001000100),Jz=(010100000)J_x = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}, \qquad J_y = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix}, \qquad J_z = \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}

Notice the pattern: each generator has the familiar SO(2) block (0110)\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} embedded in the 2×22\times 2 subspace corresponding to its plane of rotation, with zeros along the rotation axis. For example, JzJ_z rotates in the xx-yy plane, so the block sits in the upper-left corner. JxJ_x rotates in the yy-zz plane, so the block sits in the lower-right corner.

Finite rotations are exponentials:

Rx(θ)=eθJx,Ry(θ)=eθJy,Rz(θ)=eθJzR_x(\theta) = e^{\theta J_x}, \qquad R_y(\theta) = e^{\theta J_y}, \qquad R_z(\theta) = e^{\theta J_z}

SO(3) Commutation Relations

The non-commutativity of 3D rotations — which was messy to show with the full rotation matrices — becomes clean and elegant at the generator level. Let’s compute the commutator [Jx,Jy]=JxJyJyJx[J_x, J_y] = J_x J_y - J_y J_x:

JxJy=(000001010)(001000100)=(000100000)J_x J_y = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}\begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}
JyJx=(001000100)(000001010)=(010000000)J_y J_x = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix}\begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}
[Jx,Jy]=(010100000)=Jz[J_x, J_y] = \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} = J_z

The commutator of two generators gives the third! The full set of commutation relations:

[Jx,Jy]=Jz,[Jy,Jz]=Jx,[Jz,Jx]=Jy\boxed{[J_x, J_y] = J_z, \qquad [J_y, J_z] = J_x, \qquad [J_z, J_x] = J_y}

Compactly: [Ji,Jj]=ϵijkJk[J_i, J_j] = \epsilon_{ijk}J_k, where ϵijk=+1\epsilon_{ijk} = +1 for cyclic permutations (xyz, yzx, zxy), -1 for anti-cyclic, and 0 if any indices repeat.

These commutation relations are profound. They capture the entire structure of 3D rotations in a simple algebraic statement. Any set of three objects satisfying these relations generates a rotation group — even if those objects are 2×22\times 2 matrices instead of 3×33\times 3 matrices. This is the key insight that will connect SO(3) to SU(2).

The physical consequence of non-commuting generators is the uncertainty principle: if two physical quantities are described by non-commuting operators, they cannot both be known simultaneously. We’ll make this precise shortly.


From SO(3) to SU(2)

A Different Kind of Space

SO(3) describes rotations of real 3D vectors — positions, velocities, electric field directions. These are vectors in R3\mathbb{R}^3.

But throughout this course, we’ve been working with a different kind of object: a qubit.

ψ=c00+c11|\psi\rangle = c_0|0\rangle + c_1|1\rangle

This is a two-configuration system with complex amplitudes. The state is a vector in C2\mathbb{C}^2, and we’ve been visualizing it on the Bloch sphere. When we apply gates — Hadamard, phase gates, Pauli matrices — we rotate the state on the Bloch sphere.

We need a group that describes these rotations: transformations of C2\mathbb{C}^2 that preserve the normalization c02+c12=1|c_0|^2 + |c_1|^2 = 1.

Defining SU(2)

This group is SU(2): the group of 2×22\times 2 unitary matrices with determinant 1.

Unitarity is the complex generalization of orthogonality:

SO(n)SU(n)
Acts onReal vectors (Rn\mathbb{R}^n)Complex vectors (Cn\mathbb{C}^n)
PreservesRTR=IR^T R = I (transpose)UU=IU^\dagger U = I (conjugate transpose)
EntriesRealComplex
Determinant11

Both preserve lengths. Orthogonal matrices preserve vTv\vec{v}^T\vec{v}. Unitary matrices preserve vv=c02+c12\vec{v}^\dagger\vec{v} = |c_0|^2 + |c_1|^2 — exactly the normalization condition we need for quantum states.

What We Expect

SU(2) should behave like SO(3) — after all, both describe rotations on a sphere. The Bloch sphere has three axes, so SU(2) should have three generators, and those generators should satisfy the same commutation relations as the SO(3) generators Jx,Jy,JzJ_x, J_y, J_z.

But SU(2) acts on complex space, so the generators will be 2×22\times 2 complex matrices instead of 3×33\times 3 real matrices.


The Pauli Matrices: Generators of SU(2)

The three generators of SU(2) are the Pauli matrices:

σx=(0110),σy=(0ii0),σz=(1001)\boxed{\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}}

The spin angular momentum operators are related by Si=2σiS_i = \frac{\hbar}{2}\sigma_i.

Key Properties

Hermitian: σi=σi\sigma_i^\dagger = \sigma_i. Hermitian matrices represent observables — quantities we can measure. Their eigenvalues are guaranteed to be real.

Traceless: Tr(σx)=Tr(σy)=Tr(σz)=0\text{Tr}(\sigma_x) = \text{Tr}(\sigma_y) = \text{Tr}(\sigma_z) = 0.

Square to identity: σx2=σy2=σz2=I\sigma_x^2 = \sigma_y^2 = \sigma_z^2 = I. Let’s verify one:

σx2=(0110)(0110)=(1001)=I\sigma_x^2 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = I

Since σi2=I\sigma_i^2 = I, the eigenvalues satisfy λ2=1\lambda^2 = 1, giving λ=±1\lambda = \pm 1 for every Pauli matrix.

Eigenstates: The Six Cardinal States

Each Pauli matrix has eigenvalues +1 and -1. The eigenstates are:

σz\sigma_z:

σz0=+10,σz1=11\sigma_z|0\rangle = +1\cdot|0\rangle, \qquad \sigma_z|1\rangle = -1\cdot|1\rangle

σx\sigma_x:

σx+=+1+,σx=1\sigma_x|+\rangle = +1\cdot|+\rangle, \qquad \sigma_x|-\rangle = -1\cdot|-\rangle

where +=12(0+1)|+\rangle = \frac{1}{\sqrt{2}}(|0\rangle + |1\rangle) and =12(01)|-\rangle = \frac{1}{\sqrt{2}}(|0\rangle - |1\rangle).

σy\sigma_y:

σy+i=+1+i,σyi=1i\sigma_y|{+i}\rangle = +1\cdot|{+i}\rangle, \qquad \sigma_y|{-i}\rangle = -1\cdot|{-i}\rangle

where +i=12(0+i1)|{+i}\rangle = \frac{1}{\sqrt{2}}(|0\rangle + i|1\rangle) and i=12(0i1)|{-i}\rangle = \frac{1}{\sqrt{2}}(|0\rangle - i|1\rangle).

These are exactly the six cardinal points on the Bloch sphere:

Pauli+1 eigenstate-1 eigenstateBloch axis
σz\sigma_z0|0\rangle (north pole)1|1\rangle (south pole)z
σx\sigma_x+|+\rangle|-\ranglex
σy\sigma_y+i|{+i}\ranglei|{-i}\rangley

Pauli Commutation Relations

Now the crucial check: do the Pauli matrices satisfy the same commutation relations as the SO(3) generators?

Compute σxσy\sigma_x\sigma_y:

σxσy=(0110)(0ii0)=(i00i)=iσz\sigma_x\sigma_y = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} = i\sigma_z

Compute σyσx\sigma_y\sigma_x:

σyσx=(0ii0)(0110)=(i00i)=iσz\sigma_y\sigma_x = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = \begin{pmatrix} -i & 0 \\ 0 & i \end{pmatrix} = -i\sigma_z

iClicker: Pauli Products

What is σxσy\sigma_x\sigma_y?

(A) $\sigma_z$
(B) $-\sigma_z$
(C) $i\sigma_z$ ✓
(D) $-i\sigma_z$

Commutators

[σx,σy]=σxσyσyσx=iσz(iσz)=2iσz[\sigma_x, \sigma_y] = \sigma_x\sigma_y - \sigma_y\sigma_x = i\sigma_z - (-i\sigma_z) = 2i\sigma_z

The full set:

[σx,σy]=2iσz,[σy,σz]=2iσx,[σz,σx]=2iσy\boxed{[\sigma_x, \sigma_y] = 2i\sigma_z, \qquad [\sigma_y, \sigma_z] = 2i\sigma_x, \qquad [\sigma_z, \sigma_x] = 2i\sigma_y}

Compactly: [σi,σj]=2iϵijkσk[\sigma_i, \sigma_j] = 2i\epsilon_{ijk}\sigma_k.

Compare to SO(3): [Ji,Jj]=ϵijkJk[J_i, J_j] = \epsilon_{ijk}J_k. The same structure — just with a factor of 2i2i. This factor is a convention: the Pauli matrices are Hermitian (so they can be observables), while the SO(3) generators are antisymmetric. The underlying pattern — which commutator gives which generator — is identical.

SU(2) and SO(3) share the same algebraic DNA.

Anticommutators

The anticommutator {A,B}=AB+BA\{A, B\} = AB + BA captures the symmetric part:

{σx,σy}=iσz+(iσz)=0\{\sigma_x, \sigma_y\} = i\sigma_z + (-i\sigma_z) = 0

Different Pauli matrices anticommute to zero. Same Pauli matrices give {σi,σi}=2I\{\sigma_i, \sigma_i\} = 2I. In general:

{σi,σj}=2δijI\boxed{\{\sigma_i, \sigma_j\} = 2\delta_{ij}\,I}

The Master Formula

Adding the commutator and anticommutator:

σiσj=δijI+iϵijkσk\boxed{\sigma_i\sigma_j = \delta_{ij}\,I + i\epsilon_{ijk}\sigma_k}

If i=ji = j: you get II (squaring gives the identity). If iji \neq j: you get ±i\pm i times the third Pauli matrix, with the sign given by cyclic ordering.

Physical Consequence: The Uncertainty Principle

Non-commuting generators mean that the corresponding physical quantities cannot be simultaneously known.

Since [σx,σz]=2iσy0[\sigma_x, \sigma_z] = -2i\sigma_y \neq 0:

You cannot simultaneously know the spin along two perpendicular axes. This is fundamentally different from classical physics, where all components of angular momentum can be known at once. The non-commutativity of the Pauli matrices — the same non-commutativity we saw for 3D rotations — is the mathematical origin of quantum uncertainty.


The Double Cover: What Does R(2π)=IR(2\pi) = -I Mean?

SU(2) \neq SO(3)

SU(2) and SO(3) have the same commutation relations, but they are not the same group. They differ in a fundamental way.

In SO(3), a 2π2\pi rotation around any axis brings you back to the identity:

RSO(3)(2π)=+IR_{SO(3)}(2\pi) = +I

This makes intuitive sense — spin a ball 360°360° and it’s back where it started.

In SU(2), a 2π2\pi rotation gives minus the identity:

RSU(2)(2π)=IR_{SU(2)}(2\pi) = -I

You need to rotate by 4π4\pi — two full turns — to get back to +I+I. SU(2) is the double cover of SO(3): it wraps around twice for every single winding of SO(3).

What Does the -1 Mean Physically?

In quantum mechanics, the state of a system is described by a complex amplitude:

ψ=c00+c11|\psi\rangle = c_0|0\rangle + c_1|1\rangle

If ψψ|\psi\rangle \to -|\psi\rangle, every amplitude picks up a minus sign: c0c0c_0 \to -c_0, c1c1c_1 \to -c_1. That -1 is a phase — specifically, it’s the phase eiπ=1e^{i\pi} = -1.

For a single isolated state, this phase is unobservable. The probabilities c02|c_0|^2 and c12|c_1|^2 don’t change when you multiply by -1.

But in a superposition, the phase matters. Consider a state that is a superposition of a rotated part and an unrotated part. The rotated part picks up -1, the unrotated part doesn’t. Where the two parts used to add constructively (interference maximum), they now add destructively (interference minimum). Where they used to cancel (minimum), they now reinforce (maximum).

The interference pattern shifts. The -1 is physically real.

This has been confirmed experimentally. In neutron interferometry experiments, a neutron beam is split, one path is rotated by 2π2\pi using a magnetic field, and the beams are recombined. The interference pattern shifts — exactly as predicted by the -1 phase from SU(2). The neutron must be rotated by 4π4\pi to restore the original interference pattern.

Fermions vs. Bosons

Nature has two kinds of particles:

Fermions (electrons, quarks, neutrinos — the matter particles) have half-integer spin and transform under SU(2): R(2π)=1R(2\pi) = -1.

Bosons (photons, gluons, the Higgs — the force carriers) have integer spin and transform under SO(3): R(2π)=+1R(2\pi) = +1.

This distinction is connected to the spin-statistics theorem: fermions obey the Pauli exclusion principle (no two can occupy the same state), while bosons don’t. The -1 under 2π2\pi rotation and the exclusion principle are two manifestations of the same deep mathematical structure. More on this later in the course.


Rotations on the Bloch Sphere

The General Rotation

A rotation by angle θ\theta around a unit axis n^=(nx,ny,nz)\hat{n} = (n_x, n_y, n_z) on the Bloch sphere is:

Rn^(θ)=eiθ(n^σ)/2R_{\hat{n}}(\theta) = e^{-i\theta(\hat{n}\cdot\vec{\sigma})/2}

where n^σ=nxσx+nyσy+nzσz\hat{n}\cdot\vec{\sigma} = n_x\sigma_x + n_y\sigma_y + n_z\sigma_z.

The factor of 1/21/2 is because of the double cover — SU(2) matrices make half-angle rotations on the Bloch sphere.

Deriving the Closed Form

We can evaluate this exponential using the same Taylor series technique that worked for SO(2). Define A=n^σA = \hat{n}\cdot\vec{\sigma}.

Step 1: Compute A2A^2 using the master formula σiσj=δijI+iϵijkσk\sigma_i\sigma_j = \delta_{ij}I + i\epsilon_{ijk}\sigma_k:

A2=i,jninjσiσj=i,jninj(δijI+iϵijkσk)A^2 = \sum_{i,j}n_in_j\,\sigma_i\sigma_j = \sum_{i,j}n_in_j(\delta_{ij}I + i\epsilon_{ijk}\sigma_k)

The symmetric piece gives ini2I=n^2I=I\sum_i n_i^2\,I = |\hat{n}|^2\,I = I. The antisymmetric piece ijninjϵijk\sum_{ij}n_in_j\epsilon_{ijk} vanishes because ninjn_in_j is symmetric while ϵijk\epsilon_{ijk} is antisymmetric.

(n^σ)2=I(\hat{n}\cdot\vec{\sigma})^2 = I

Just like σi2=I\sigma_i^2 = I for each individual Pauli, the combination (n^σ)2=I(\hat{n}\cdot\vec\sigma)^2 = I for any unit vector n^\hat{n}.

Step 2: Higher powers cycle: A2=IA^2 = I, A3=AA^3 = A, A4=IA^4 = I, ... — exactly the same pattern as GG in SO(2).

Step 3: Taylor expand with α=θ/2\alpha = \theta/2:

eiαA=I+(iα)A+(iα)22!I+(iα)33!A+(iα)44!I+e^{-i\alpha A} = I + (-i\alpha)A + \frac{(-i\alpha)^2}{2!}I + \frac{(-i\alpha)^3}{3!}A + \frac{(-i\alpha)^4}{4!}I + \cdots

Step 4: Group even powers (II) and odd powers (AA):

=(1α22!+α44!)I+(i)(αα33!+α55!)A= \left(1 - \frac{\alpha^2}{2!} + \frac{\alpha^4}{4!} - \cdots\right)I + (-i)\left(\alpha - \frac{\alpha^3}{3!} + \frac{\alpha^5}{5!} - \cdots\right)A
=cosαIisinαA= \cos\alpha\,I - i\sin\alpha\,A

Step 5: Substitute back α=θ/2\alpha = \theta/2:

Rn^(θ)=cosθ2Iisinθ2(n^σ)\boxed{R_{\hat{n}}(\theta) = \cos\frac{\theta}{2}\,I - i\sin\frac{\theta}{2}\,(\hat{n}\cdot\vec{\sigma})}

Compare to our SO(2) result: eθG=cosθI+sinθGe^{\theta G} = \cos\theta\,I + \sin\theta\,G. The same structure — the generator multiplied by sine, the identity multiplied by cosine. The i-i and θ/2\theta/2 reflect the Hermitian convention and the double cover.

Verification: At θ=2π\theta = 2\pi: Rn^(2π)=cosπIisinπ(n^σ)=IR_{\hat{n}}(2\pi) = \cos\pi\,I - i\sin\pi\,(\hat{n}\cdot\vec\sigma) = -I. ✓


Explicit Rotations and Gate Connections

Rotation Around the zz-Axis

Rz(θ)=cosθ2Iisinθ2σz=(eiθ/200eiθ/2)R_z(\theta) = \cos\frac{\theta}{2}\,I - i\sin\frac{\theta}{2}\,\sigma_z = \begin{pmatrix} e^{-i\theta/2} & 0 \\ 0 & e^{i\theta/2} \end{pmatrix}

Connection to the phase gate:

P(ϕ)=(100eiϕ)=eiϕ/2(eiϕ/200eiϕ/2)=eiϕ/2Rz(ϕ)P(\phi) = \begin{pmatrix} 1 & 0 \\ 0 & e^{i\phi} \end{pmatrix} = e^{i\phi/2}\begin{pmatrix} e^{-i\phi/2} & 0 \\ 0 & e^{i\phi/2} \end{pmatrix} = e^{i\phi/2}\,R_z(\phi)

Up to a global phase (which is unobservable), the phase gate IS a z-rotation. This is why applying P(ϕ)P(\phi) to +|+\rangle traces the equator of the Bloch sphere — it rotates around the z-axis.

Connection to the Z gate:

Z=σz=(1001)=iRz(π)Z = \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} = iR_z(\pi)

The Z gate is a π\pi rotation around the z-axis. It maps +|+\rangle \to |-\rangle and +|-\rangle \to |+\rangle — it flips the equatorial states.

Rotation Around the xx-Axis

Rx(θ)=cosθ2Iisinθ2σx=(cosθ2isinθ2isinθ2cosθ2)R_x(\theta) = \cos\frac{\theta}{2}\,I - i\sin\frac{\theta}{2}\,\sigma_x = \begin{pmatrix} \cos\frac{\theta}{2} & -i\sin\frac{\theta}{2} \\ -i\sin\frac{\theta}{2} & \cos\frac{\theta}{2} \end{pmatrix}

Connection to the X gate:

X=σx=(0110)=iRx(π)X = \sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = iR_x(\pi)

The X gate is a π\pi rotation around the x-axis. It swaps 01|0\rangle \leftrightarrow |1\rangle — the quantum NOT gate, flipping the north and south poles.

Rotation Around the yy-Axis

Ry(θ)=cosθ2Iisinθ2σy=(cosθ2sinθ2sinθ2cosθ2)R_y(\theta) = \cos\frac{\theta}{2}\,I - i\sin\frac{\theta}{2}\,\sigma_y = \begin{pmatrix} \cos\frac{\theta}{2} & -\sin\frac{\theta}{2} \\ \sin\frac{\theta}{2} & \cos\frac{\theta}{2} \end{pmatrix}

Note: Ry(θ)R_y(\theta) is a real matrix. This is because iσy=(0110)-i\sigma_y = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, which is the SO(2) generator GG. So y-rotations on the Bloch sphere look exactly like classical 2D rotations.

Connection to the Y gate:

Y=σy=(0ii0)=iRy(π)Y = \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} = iR_y(\pi)

The Y gate is a π\pi rotation around the y-axis.

The Hadamard Gate

The Hadamard doesn’t correspond to a rotation around a coordinate axis. Instead:

H=12(1111)=12(σx+σz)H = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix} = \frac{1}{\sqrt{2}}(\sigma_x + \sigma_z)

This is a π\pi rotation around the axis n^=(x^+z^)/2\hat{n} = (\hat{x} + \hat{z})/\sqrt{2}, halfway between xx and zz on the Bloch sphere. Geometrically, it swaps the z-axis and the x-axis:

H0=+,H1=,H+=0,H=1H|0\rangle = |+\rangle, \quad H|1\rangle = |-\rangle, \quad H|+\rangle = |0\rangle, \quad H|-\rangle = |1\rangle

The Fundamental Result

Every single-qubit unitary is a rotation on the Bloch sphere.\boxed{\text{Every single-qubit unitary is a rotation on the Bloch sphere.}}

Any UU \in SU(2) can be written as Rn^(θ)=eiθ(n^σ)/2R_{\hat{n}}(\theta) = e^{-i\theta(\hat{n}\cdot\vec\sigma)/2} for some axis n^\hat{n} and angle θ\theta. There are no other single-qubit gates. The three Pauli matrices generate all possible qubit operations through rotation.


Summary

ConceptSO(2)SO(3)SU(2)
Dimension1 axis3 axes3 axes
Acts onR2\mathbb{R}^2R3\mathbb{R}^3C2\mathbb{C}^2
GeneratorsGGJx,Jy,JzJ_x, J_y, J_zσx,σy,σz\sigma_x, \sigma_y, \sigma_z
Key propertyG2=IG^2 = -I[Ji,Jj]=ϵijkJk[J_i,J_j] = \epsilon_{ijk}J_k[σi,σj]=2iϵijkσk[\sigma_i,\sigma_j] = 2i\epsilon_{ijk}\sigma_k
Commutative?YesNoNo
R(2π)R(2\pi)+I+I+I+II-I

Key Results

  1. Generators encode infinitesimal rotations. Exponentiating gives finite rotations: R(θ)=eθGR(\theta) = e^{\theta G}.

  2. GG and ii are the same thing in different representations. G2=IG^2 = -I is the matrix version of i2=1i^2 = -1. Both generate rotations.

  3. SO(3) has three generators Jx,Jy,JzJ_x, J_y, J_z satisfying [Ji,Jj]=ϵijkJk[J_i, J_j] = \epsilon_{ijk}J_k. Non-commutativity of generators → uncertainty principle.

  4. SU(2) is the quantum rotation group. Its generators — the Pauli matrices — satisfy the same commutation relations as SO(3). They are Hermitian (observables), traceless, and square to the identity (eigenvalues ±1\pm 1). Their eigenstates are the six Bloch sphere cardinal states.

  5. The double cover: RSU(2)(2π)=IR_{SU(2)}(2\pi) = -I. The minus sign is a physical phase that shifts interference patterns. Fermions transform under SU(2); bosons under SO(3).

  6. Every qubit gate is a Bloch sphere rotation:

Rn^(θ)=cosθ2Iisinθ2(n^σ)R_{\hat{n}}(\theta) = \cos\frac{\theta}{2}\,I - i\sin\frac{\theta}{2}\,(\hat{n}\cdot\vec\sigma)

Phase gate Rz\sim R_z. X gate Rx(π)\sim R_x(\pi). Hadamard R(x^+z^)/2(π)\sim R_{(\hat{x}+\hat{z})/\sqrt{2}}(\pi).

Next Lecture

The Stern-Gerlach experiment: nature hands us a real physical qubit. Electron spin, measurement in different bases, and the birth of the qubit.


Homework 2.5

Problem 1: A Different Kind of Generator — Lorentz Boosts

Not all generators produce rotations. In special relativity, a Lorentz boost along the x-direction by rapidity β\beta is described by:

B(β)=(coshβsinhβsinhβcoshβ)B(\beta) = \begin{pmatrix} \cosh\beta & \sinh\beta \\ \sinh\beta & \cosh\beta \end{pmatrix}

This matrix mixes space and time coordinates, just as a rotation matrix mixes xx and yy.

(a) Verify that the set of boost matrices {B(β)}\{B(\beta)\} forms a group: show B(β1)B(β2)=B(β1+β2)B(\beta_1)B(\beta_2) = B(\beta_1 + \beta_2), identify the identity element and the inverse of B(β)B(\beta).

Hint: You will need the hyperbolic addition formulas: cosh(α+β)=coshαcoshβ+sinhαsinhβ\cosh(\alpha+\beta) = \cosh\alpha\cosh\beta + \sinh\alpha\sinh\beta and sinh(α+β)=sinhαcoshβ+coshαsinhβ\sinh(\alpha+\beta) = \sinh\alpha\cosh\beta + \cosh\alpha\sinh\beta.

(b) Find the generator KK by expanding B(δβ)B(\delta\beta) for small δβ\delta\beta (use coshδβ1\cosh\delta\beta \approx 1, sinhδβδβ\sinh\delta\beta \approx \delta\beta):

B(δβ)I+δβKB(\delta\beta) \approx I + \delta\beta\, K

Write out KK.

(c) Compute K2K^2. Compare to the rotation generator where G2=IG^2 = -I. What is different?

(d) Using K2=+IK^2 = +I, expand the exponential eβKe^{\beta K} as a Taylor series. Group even and odd powers (as we did in lecture for eθGe^{\theta G}). What functions appear instead of cosine and sine?

(e) Verify that your result matches B(β)B(\beta).

(f) Compute detB(β)\det B(\beta). Compare to detR(θ)=1\det R(\theta) = 1. Is a Lorentz boost a rotation?

(g) Reflect on the difference: G2=IG^2 = -I gives oscillatory behavior (cos,sin\cos, \sin) — rotations go around in circles. K2=+IK^2 = +I gives hyperbolic behavior (cosh,sinh\cosh, \sinh) — boosts grow without bound. What physical difference does this correspond to? (Can you rotate forever? Can you boost forever?)


Problem 2: The Pauli Vector Identity

(a) Using the master formula σiσj=δijI+iϵijkσk\sigma_i\sigma_j = \delta_{ij}I + i\epsilon_{ijk}\sigma_k, prove the identity:

(aσ)(bσ)=(ab)I+i(a×b)σ(\vec{a}\cdot\vec\sigma)(\vec{b}\cdot\vec\sigma) = (\vec{a}\cdot\vec{b})\,I + i(\vec{a}\times\vec{b})\cdot\vec\sigma

where a\vec{a} and b\vec{b} are arbitrary real 3-vectors.

Hint: Write aσ=iaiσi\vec{a}\cdot\vec\sigma = \sum_i a_i\sigma_i and bσ=jbjσj\vec{b}\cdot\vec\sigma = \sum_j b_j\sigma_j, multiply, and use the master formula on σiσj\sigma_i\sigma_j. You will need to recognize iaibi=ab\sum_i a_i b_i = \vec{a}\cdot\vec{b} and ijkϵijkaibj=(a×b)k\sum_{ijk}\epsilon_{ijk}a_i b_j = (\vec{a}\times\vec{b})_k.

(b) As a special case, set a=b=n^\vec{a} = \vec{b} = \hat{n} (a unit vector). Show that (n^σ)2=I(\hat{n}\cdot\vec\sigma)^2 = I.

(c) Set a=x^\vec{a} = \hat{x} and b=y^\vec{b} = \hat{y}. Evaluate both sides of the identity and verify they match.

(d) Set a=b=x^+z^\vec{a} = \vec{b} = \hat{x} + \hat{z} (not a unit vector). What is [(x^+z^)σ]2[(\hat{x}+\hat{z})\cdot\vec\sigma]^2? How does this relate to the Hadamard gate?


Problem 3: Mystery Matrix

Consider the unitary matrix:

U=12(1ii1)U = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & -i \\ -i & 1 \end{pmatrix}

(a) Verify that UU is unitary: show UU=IU^\dagger U = I.

(b) Verify that detU=1\det U = 1, confirming UU \in SU(2).

(c) Every element of SU(2) can be written as:

U=cosθ2Iisinθ2(n^σ)U = \cos\frac{\theta}{2}\,I - i\sin\frac{\theta}{2}\,(\hat{n}\cdot\vec\sigma)

By comparing UU to this formula, determine the rotation angle θ\theta and axis n^\hat{n}.

Hint: The coefficient of II gives cos(θ/2)\cos(\theta/2). The coefficients of σx\sigma_x, σy\sigma_y, σz\sigma_z in the remainder give sin(θ/2)nx\sin(\theta/2)\, n_x, sin(θ/2)ny\sin(\theta/2)\, n_y, sin(θ/2)nz\sin(\theta/2)\, n_z.

(d) Describe geometrically what this rotation does on the Bloch sphere. Where does it send 0|0\rangle? Where does it send +|+\rangle?

(e) Compute U2U^2. What rotation is this? What about U4U^4?


Problem 4: Visualizing Rotations on the Bloch Sphere (Qiskit)

In this problem you’ll use Qiskit to see rotations in action.

Setup: You will need the following imports:

from qiskit import QuantumCircuit
from qiskit.quantum_info import Statevector
from qiskit.visualization import plot_bloch_multivector, plot_bloch_vector
import numpy as np
import matplotlib.pyplot as plt

(a) Start with 0|0\rangle and apply Ry(θ)R_y(\theta) for θ=0,π/6,π/3,π/2,2π/3,5π/6,π\theta = 0, \pi/6, \pi/3, \pi/2, 2\pi/3, 5\pi/6, \pi. For each value of θ\theta: - Create a circuit with qc.ry(theta, 0) - Extract the statevector - Visualize on the Bloch sphere (or record the Bloch coordinates)

Describe the path traced out. What curve on the Bloch sphere is this?

(b) Now start with 0|0\rangle, apply Ry(π/2)R_y(\pi/2) to move to the equator, and then apply Rz(ϕ)R_z(\phi) for ϕ=0,π/4,π/2,3π/4,π,5π/4,3π/2,7π/4\phi = 0, \pi/4, \pi/2, 3\pi/4, \pi, 5\pi/4, 3\pi/2, 7\pi/4. Describe the path.

(c) Starting from 0|0\rangle, apply the following sequences and plot each final state on the Bloch sphere: 1. Ry(π/2)R_y(\pi/2) 2. Ry(π/2)R_y(\pi/2) then Rz(π/2)R_z(\pi/2) 3. Rz(π/2)R_z(\pi/2) then Ry(π/2)R_y(\pi/2)

Are sequences 2 and 3 the same? Relate to what we learned about non-commutativity.

(d) The path of a general rotation. Choose an axis n^=(1,1,1)/3\hat{n} = (1, 1, 1)/\sqrt{3} and apply Rn^(θ)R_{\hat{n}}(\theta) for θ\theta from 0 to 2π2\pi in 20 steps. Start from 0|0\rangle. Plot all 20 states on a single Bloch sphere.

Hint: In Qiskit, use qc.r(theta, phi, 0) where phi is the azimuthal angle of n^\hat{n}, or construct the rotation matrix manually and use qc.unitary(U, 0).


Problem 5: Phase Gate = Z-Rotation (Qiskit)

This problem tests the lecture’s claim that the phase gate P(ϕ)P(\phi) and the rotation Rz(ϕ)R_z(\phi) are the same up to a global phase.

Setup:

from qiskit import QuantumCircuit
from qiskit.quantum_info import Statevector
from qiskit_aer import AerSimulator
import numpy as np

(a) Create two circuits that act on +=H0|+\rangle = H|0\rangle:

Circuit A: Apply HH, then P(π/3)P(\pi/3).

Circuit B: Apply HH, then Rz(π/3)R_z(\pi/3).

Use Statevector.from_instruction(qc) to extract the output states. Print both statevectors. Are they the same?

(b) Compute the ratio of corresponding amplitudes: (state A)0/(state B)0(\text{state A})_0 / (\text{state B})_0 and (state A)1/(state B)1(\text{state A})_1 / (\text{state B})_1. Show that the ratio is the same for both components — this is the global phase eiϕ/2e^{i\phi/2}.

(c) Add measurements to both circuits. Run each on AerSimulator() with 10,000 shots. Compare the measurement statistics. Are they distinguishable?

(d) Repeat parts (a)–(c) for ϕ=π\phi = \pi (where P(π)=ZP(\pi) = Z and Rz(π)=iZR_z(\pi) = -iZ). Verify that the measurement statistics are still identical despite different global phases.

(e) Explain in your own words: if two unitaries differ only by a global phase, why can’t any measurement distinguish them?


Problem 6: The Hadamard Gate as a Rotation

The Hadamard gate is one of the most important single-qubit gates:

H=12(1111)H = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}

(a) Verify that H=12(σx+σz)H = \frac{1}{\sqrt{2}}(\sigma_x + \sigma_z) by writing out the right-hand side explicitly.

(b) The general SU(2) rotation is Rn^(θ)=cos(θ/2)Iisin(θ/2)(n^σ)R_{\hat{n}}(\theta) = \cos(\theta/2)\,I - i\sin(\theta/2)\,(\hat{n}\cdot\vec\sigma). Find the rotation axis n^\hat{n} and angle θ\theta such that H=eiϕRn^(θ)H = e^{i\phi}\,R_{\hat{n}}(\theta), where eiϕe^{i\phi} is a global phase. Determine ϕ\phi as well.

Hint: Since HH is Hermitian and H2=IH^2 = I, what does that tell you about the rotation angle? What direction is σx+σz\sigma_x + \sigma_z?

(c) Verify your answer by computing Rn^(θ)R_{\hat{n}}(\theta) explicitly and comparing to eiϕHe^{-i\phi}H.

(d) Compute H2H^2 directly. Show it equals II. Explain geometrically: why does applying the same rotation twice return to the identity? For what class of rotation angles θ\theta is Rn^(θ)2=IR_{\hat{n}}(\theta)^2 = I (up to global phase)?

(e) Where does HH send each of the six cardinal states? Describe the geometric action of HH on the Bloch sphere in one sentence.