Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Lecture 2.3: Polarization and Measurement

Purdue University

Polarization as a Qubit

Light is an electromagnetic wave. For light traveling along the zz-axis, the electric field oscillates in the transverse xx-yy plane:

E(x,t)=Re[(Exx^+Eyeiϕy^)ei(kxωt)]\vec{E}(x,t) = \text{Re}\left[(E_x \hat{x} + E_y e^{i\phi} \hat{y})\, e^{i(kx - \omega t)}\right]

The amplitudes ExE_x and EyE_y and the relative phase ϕ\phi together determine the polarization — the pattern traced by the electric field vector as the wave passes. Linear polarization, circular polarization, and everything in between are all encoded in these parameters.

The Jones Vector

We can factor out the propagation ei(kxωt)e^{i(kx - \omega t)} and focus on what makes different polarizations different. What remains is the Jones vector:

J=(ExEyeiϕ)\vec{J} = \begin{pmatrix} E_x \\ E_y e^{i\phi} \end{pmatrix}

This is a 2D complex vector — exactly like our qubit state vectors! For normalized states (total intensity = 1):

Ex2+Ey2=1|E_x|^2 + |E_y|^2 = 1

The Six Cardinal Polarization States

We identify horizontal and vertical polarization with our computational basis:

H=(10),V=(01)|H\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad |V\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}

From these we can build the diagonal and anti-diagonal linear polarizations:

D=12(11)=H+V2,A=12(11)=HV2|D\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} = \frac{|H\rangle + |V\rangle}{\sqrt{2}}, \qquad |A\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix} = \frac{|H\rangle - |V\rangle}{\sqrt{2}}

And the circular polarizations, where the relative phase between xx and yy components is ±π/2\pm \pi/2:

R=12(1i)=H+iV2,L=12(1i)=HiV2|R\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix} = \frac{|H\rangle + i|V\rangle}{\sqrt{2}}, \qquad |L\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -i \end{pmatrix} = \frac{|H\rangle - i|V\rangle}{\sqrt{2}}

For circular polarization, the ExE_x and EyE_y components have equal amplitude but are 90° out of phase. The electric field vector traces out a circle as the wave propagates — rotating clockwise (R) or counterclockwise (L) when viewed head-on.

The Polarization–Qubit Dictionary

These six states map directly onto the Bloch sphere:

PolarizationJones VectorQubitBloch Position
Horizontal H|H\rangle(1,0)T(1, 0)^T0|0\rangleNorth pole (+z)
Vertical V|V\rangle(0,1)T(0, 1)^T1|1\rangleSouth pole (−z)
Diagonal D|D\rangle(1,1)T/2(1, 1)^T/\sqrt{2}+|+\rangle+x axis
Anti-diagonal A|A\rangle(1,1)T/2(1, -1)^T/\sqrt{2}|-\rangle−x axis
Right circular R|R\rangle(1,i)T/2(1, i)^T/\sqrt{2}+i|+i\rangle+y axis
Left circular L|L\rangle(1,i)T/2(1, -i)^T/\sqrt{2}i|-i\rangle−y axis

Notice what distinguishes these states. D|D\rangle and A|A\rangle have the same amplitudes — both have Ex=Ey=1/2|E_x| = |E_y| = 1/\sqrt{2} — but differ by a relative phase of π\pi. Similarly, R|R\rangle and L|L\rangle have the same amplitudes but differ by a relative phase of π\pi. Relative phase is physical: it produces completely different polarizations.

The six states form three pairs of orthogonal states, corresponding to three measurement bases:

AxisBasisPhysical Measurement
zH|H\rangle, V|V\rangleHorizontal/Vertical polarizer
xD|D\rangle, A|A\ranglePolarizer at ±45°
yR|R\rangle, L|L\rangleCircular polarizer (quarter-wave plate + linear polarizer)

Polarizers and Projective Measurement

A polarizer transmits light polarized along one direction and blocks the orthogonal component. Let’s figure out two things: how much power comes through, and what is the state of the light afterward.

How Much Power Passes?

Consider a general input state ψ=c0H+c1V|\psi\rangle = c_0|H\rangle + c_1|V\rangle hitting a vertical polarizer. The polarizer only passes the V|V\rangle component.

The amplitude to pass is the overlap with V|V\rangle:

amplitude=Vψ=V(c0H+c1V)=c1\text{amplitude} = \langle V|\psi\rangle = \langle V|(c_0|H\rangle + c_1|V\rangle) = c_1

The power (intensity) detected is the magnitude squared of the amplitude:

power detected=c12=Vψ2\text{power detected} = |c_1|^2 = |\langle V|\psi\rangle|^2

This is Malus’s Law in Dirac notation: the fraction of power transmitted through a polarizer χ|\chi\rangle is χψ2|\langle \chi|\psi\rangle|^2.

What Is the State Afterward?

After the vertical polarizer, the output state is:

ψout=VVψ|\psi_{\text{out}}\rangle = |V\rangle\langle V|\psi\rangle

The polarizer doesn’t just filter — it projects the state onto V|V\rangle. Whatever the input was, what comes out is always proportional to V|V\rangle.

The Projection Operator

We can write this as the action of a projection operator:

PV=VV=(01)(01)=(0001)P_V = |V\rangle\langle V| = \begin{pmatrix} 0 \\ 1 \end{pmatrix}\begin{pmatrix} 0 & 1 \end{pmatrix} = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}

Acting on an arbitrary state:

PVψ=(0001)(c0c1)=(0c1)P_V |\psi\rangle = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}\begin{pmatrix} c_0 \\ c_1 \end{pmatrix} = \begin{pmatrix} 0 \\ c_1 \end{pmatrix}

The HH component is gone; only the VV component survives.

Similarly, the horizontal projector is:

PH=HH=(1000)P_H = |H\rangle\langle H| = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}

Projectors Are Idempotent

A key property of projectors: applying the same projection twice gives the same result as applying it once.

PVPV=VVVV=V(1)V=VV=PVP_V P_V = |V\rangle\langle V|V\rangle\langle V| = |V\rangle(1)\langle V| = |V\rangle\langle V| = P_V
PV2=PVP_V^2 = P_V

And more generally:

PVN=PVP_V^N = P_V

This makes physical sense: once you’ve projected onto V|V\rangle, the state is already V|V\rangle. Running it through the same polarizer again changes nothing.


From Waves to Single Photons

Everything so far has been classical wave optics. Malus’s Law tells us what fraction of the intensity passes through a polarizer. Now let’s ask: what happens when we turn the power way down?

Photons as Discrete Energy Packets

From the blackbody radiation problem (one of the founding puzzles of quantum mechanics), we know that the electromagnetic field carries energy in discrete packets — photons — each with energy:

Ephoton=hνE_{\text{photon}} = h\nu

where h=6.626×1034h = 6.626 \times 10^{-34} J·s is Planck’s constant and ν\nu is the frequency of the light.

Let’s calculate what this energy actually is. For a helium-neon (HeNe) laser at λ=633\lambda = 633 nm:

ν=cλ=3×108 m/s633×109 m4.7×1014 Hz\nu = \frac{c}{\lambda} = \frac{3 \times 10^8 \text{ m/s}}{633 \times 10^{-9} \text{ m}} \approx 4.7 \times 10^{14} \text{ Hz}
Ephoton=hν(6.6×1034)(4.7×1014)3.1×1019 JE_{\text{photon}} = h\nu \approx (6.6 \times 10^{-34})(4.7 \times 10^{14}) \approx 3.1 \times 10^{-19} \text{ J}

That is an incredibly small amount of energy.

How Many Photons in a Laser Pointer?

A typical laser pointer has a power of about P=1P = 1 mW =103= 10^{-3} W. Power is energy per unit time, so the number of photons emitted per second is:

N˙=PEphoton=103 W3.1×1019 J3.2×1015 photons/s\dot{N} = \frac{P}{E_{\text{photon}}} = \frac{10^{-3} \text{ W}}{3.1 \times 10^{-19} \text{ J}} \approx 3.2 \times 10^{15} \text{ photons/s}

That’s about 3 quadrillion photons per second! At this power level, the photons arrive so densely packed that the light appears perfectly continuous. Your detector sees a smooth signal proportional to E2|E|^2.

Turning Down the Power

Now imagine we gradually reduce the power. What do we expect to see on a detector?

At 1 mW, we have 1015\sim 10^{15} photons/s — a smooth, continuous signal.

At 1 nW (10-9 W), we have 109\sim 10^{9} photons/s — still essentially continuous.

But at extremely low power — say, a few photons per second — something qualitatively different happens.

How Do You Actually Detect Single Photons?

A standard photodetector works like this: an incoming photon hits a semiconductor and promotes an electron from the valence band to the conduction band. That electron flows as a current. The current passes through a resistor, and by Ohm’s law (V=IRV = IR), you measure a voltage drop.

The problem is that one photon produces one electron, which is a fantastically tiny current — far too small to measure directly.

The solution is an Avalanche Photodetector (APD). In an APD, a strong reverse bias voltage accelerates the initial photoelectron so that it knocks out additional electrons through impact ionization. Those electrons are also accelerated, knock out more, and so on — creating a cascade, an avalanche. A single absorbed photon triggers a macroscopic current pulse that you can easily measure.

A more modern alternative is the Superconducting Nanowire Single-Photon Detector (SNSPD). An SNSPD consists of a thin superconducting nanowire, cooled well below its critical temperature and biased with a current just below the critical current. When a single photon is absorbed, it breaks Cooper pairs in a small region of the wire, creating a resistive hotspot. The bias current is forced to divert around this hotspot, which pushes the current density in the remaining superconducting cross-section above the critical value. The entire wire cross-section goes normal (resistive), and the sudden appearance of resistance in the circuit produces a measurable voltage pulse. The wire then cools back to superconducting on a timescale of nanoseconds, ready for the next photon.

Regardless of the detection method, the end result is the same: a voltage spike on our oscilloscope, roughly 1 ps to 1 ns wide, that tells us exactly one photon has arrived.


Single Photons in the Mach-Zehnder Interferometer

Now put a single-photon detector at each output port of a Mach-Zehnder interferometer, and send in photons one at a time — on average, about one per second.

Classically, we found that the output intensities depend on the phase difference Δϕ\Delta\phi between the two arms:

ID1=12(1+cosΔϕ),ID2=12(1cosΔϕ)I_{D1} = \frac{1}{2}(1 + \cos\Delta\phi), \qquad I_{D2} = \frac{1}{2}(1 - \cos\Delta\phi)

With single photons, you don’t see continuous intensities. Instead, each photon produces a click at exactly one detector — never both, never neither. The photon is indivisible.

But if you accumulate many clicks, the statistics reproduce the classical interference pattern. If Δϕ=0\Delta\phi = 0 (constructive interference at D1), every photon clicks at D1. If Δϕ=π/2\Delta\phi = \pi/2, each photon has a 50/50 chance of clicking at either detector.

Which Path Did the Photon Take?

This is the central question — the MZI version of the double-slit argument.

After the first beam splitter, the quantum state of the photon is:

ψ=12(arm 1+arm 2)|\psi\rangle = \frac{1}{\sqrt{2}}\big(|\text{arm 1}\rangle + |\text{arm 2}\rangle\big)

The photon is in a superposition of being in both arms. It accumulates phase in each arm, arrives at the second beam splitter, and interference determines which detector clicks.

How can we be confident the photon really went both ways? Because we see interference. If we block one arm, the interference disappears — both detectors click with equal probability regardless of Δϕ\Delta\phi. The fact that Δϕ\Delta\phi controls the outcome means both arms must be contributing amplitudes. The only consistent picture is that the single photon traveled both paths simultaneously, as a wave with complex amplitudes along each arm. This is exactly the quantum description we’ve been building: multiple configurations, each carrying a complex amplitude.

The Wave Picture Works — Until Detection

So let’s trace the photon through the interferometer:

ψ=c1D1+c2D2|\psi\rangle = c_1|\text{D1}\rangle + c_2|\text{D2}\rangle

At every stage, the wave picture is perfectly logical and consistent. But then the photon hits a detector — and we get a click at exactly one location. The wave, which had amplitude at both detectors, seems to “choose” one.

What happened?


The Measurement Problem

This has been a source of a great deal of headache in physics. It’s called the measurement problem: how does a quantum superposition become a definite classical outcome?

Our goal here is not to resolve the full philosophical debate, but to understand the physical process that occurs during detection. The key framework is called open quantum system theory — a formalism that describes what happens to a quantum system when information or energy leaks out into an environment and can never be recovered. We won’t develop the full formalism here, but we can understand the essential physics by following the quantum state step by step through the detection process.

Step 1: Superposition at the Detectors

Just before detection, the photon is in a superposition of arriving at the two detectors:

Ψ=12γD1+12γD2|\Psi\rangle = \frac{1}{\sqrt{2}}|\gamma_{\text{D1}}\rangle + \frac{1}{\sqrt{2}}|\gamma_{\text{D2}}\rangle

where γD1|\gamma_{\text{D1}}\rangle means “photon heading toward detector D1” and likewise for D2.

Step 2: The Photon Interacts with One Electron

The photon hits the semiconductor in one of the detectors and is absorbed, promoting an electron from the valence band to the conduction band. But the photon was in a superposition of which detector it was at, so now the superposition transfers to the electron:

Ψ=12photon absorbed, e excited at D1+12photon absorbed, e excited at D2|\Psi\rangle = \frac{1}{\sqrt{2}}|\text{photon absorbed, } e^-\text{ excited at D1}\rangle + \frac{1}{\sqrt{2}}|\text{photon absorbed, } e^-\text{ excited at D2}\rangle

This is still a perfectly valid quantum superposition. The photon is gone, but its “which-detector” superposition has been handed off to the electron. In principle, if we could carefully isolate just these two electrons — one at D1, one at D2 — we could interfere them and still find evidence that the photon went both ways.

Step 3: The Avalanche

But we don’t isolate the electron. In an APD, that electron is accelerated and collides with other atoms, liberating more electrons. Those electrons hit more atoms. Within nanoseconds, the initial single-electron excitation has spread into an avalanche involving billions of electrons, atoms, and phonons.

Now the state looks something like:

Ψ=12avalancheD1,  quietD2+12quietD1,  avalancheD2|\Psi\rangle = \frac{1}{\sqrt{2}}|\text{avalanche}_{\text{D1}},\; \text{quiet}_{\text{D2}}\rangle + \frac{1}{\sqrt{2}}|\text{quiet}_{\text{D1}},\; \text{avalanche}_{\text{D2}}\rangle

The superposition hasn’t disappeared — it has spread to an enormous number of particles.

Step 4: The Two Branches Become Orthogonal

Here is the crucial point. As the entanglement spreads to more and more particles, the two terms in the superposition diverge. By “diverge” we mean they become orthogonal:

avalancheD1,quietD2    quietD1,avalancheD2    0\langle \text{avalanche}_{\text{D1}},\, \text{quiet}_{\text{D2}} \;|\; \text{quiet}_{\text{D1}},\, \text{avalanche}_{\text{D2}}\rangle \;\approx\; 0

When the overlap between two branches of a superposition reaches zero, they can no longer interfere with each other. For all practical purposes, they have become two completely independent macroscopic events — unaware of each other, unable to ever communicate or recombine. You might picture it as two independent simulations running in parallel: one universe where the photon clicked at D1, and another where it clicked at D2.

The Schrödinger Equation Is Enough

This process is called decoherence, and it is described entirely by the Schrödinger equation. We did not need to add any special “measurement postulate” or “wavefunction collapse” rule to quantum mechanics. The Schrödinger equation, applied to the photon and the detector and the environment together, naturally produces the branching into orthogonal, non-interfering outcomes.

What the Schrödinger equation does not do is select one branch over the other. From the perspective of the equation, both branches continue to exist. But from our perspective — trapped inside one branch — it appears that nature “chose” one outcome. We experience a click at D1 or D2, never both.

This is admittedly unsatisfying. But it is the current state of the art: the Schrödinger equation fully describes the measurement process, including the apparent randomness, without requiring any additional postulates. The “randomness” we experience emerges from our inability to access the other branch once decoherence has made the two branches orthogonal.


Homework

Problem 1: Changing Basis

A photon is prepared in the state H|H\rangle.

(a) Write H|H\rangle in the circular polarization basis {R,L}\{|R\rangle, |L\rangle\}. That is, find coefficients α\alpha and β\beta such that H=αR+βL|H\rangle = \alpha|R\rangle + \beta|L\rangle.

Hint: Use the definitions R=12(H+iV)|R\rangle = \frac{1}{\sqrt{2}}(|H\rangle + i|V\rangle) and L=12(HiV)|L\rangle = \frac{1}{\sqrt{2}}(|H\rangle - i|V\rangle), and solve for H|H\rangle.

(b) If this H|H\rangle-polarized photon passes through a right-circular polarizer, what is the probability it is transmitted? What about a left-circular polarizer?


Problem 2: Sequential Polarizers

(a) A photon is prepared in the state H|H\rangle and sent through a V|V\rangle polarizer. What fraction of the light is transmitted?

(b) Now insert a D|D\rangle polarizer between the H|H\rangle source and the V|V\rangle polarizer. Walk through the sequence step by step: what is the state after each polarizer, and what is the total fraction of light transmitted?

(c) Instead of a D|D\rangle polarizer, insert a polarizer oriented at angle θ\theta from horizontal. The corresponding state is θ=cosθH+sinθV|\theta\rangle = \cos\theta\,|H\rangle + \sin\theta\,|V\rangle. Show that the total transmission through the sequence Hθ polarizerV polarizer|H\rangle \to |\theta\rangle\text{ polarizer} \to |V\rangle\text{ polarizer} is:

T(θ)=cos2θsin2θ=14sin2(2θ)T(\theta) = \cos^2\theta\,\sin^2\theta = \frac{1}{4}\sin^2(2\theta)

What angle θ\theta maximizes the transmission, and what is the maximum?

(d) Now insert nn polarizers evenly spaced in angle between 0° and 90°, at angles θk=k90°n+1\theta_k = k \cdot \frac{90°}{n+1} for k=1,2,,nk = 1, 2, \ldots, n. Each consecutive pair of polarizers differs by Δθ=90°n+1\Delta\theta = \frac{90°}{n+1}. Show that the total transmission is:

T(n)=cos2(n+1) ⁣(π2(n+1))T(n) = \cos^{2(n+1)}\!\left(\frac{\pi}{2(n+1)}\right)

(e) Compute T(n)T(n) numerically for n=1,2,3,5,10,50,100n = 1, 2, 3, 5, 10, 50, 100. What happens as nn \to \infty?

(f) Explain physically why adding more polarizers — each of which can only absorb light — increases the total transmission. What does this have to do with measurement changing the state?


Problem 3: Quantum Measurement Problem

In the Mach-Zehnder interferometer, a single photon enters the first beamsplitter and — based on the interference pattern — appears to travel through both arms simultaneously. Yet when it reaches a detector, it always produces exactly one click at one detector. It is never split between two detectors, and it never fails to arrive.

(a) In your own words, describe the quantum measurement problem. At what point does the photon stop behaving like a wave (traveling both paths, producing interference) and start behaving like a particle (clicking at one detector)? Be specific about what happens at each stage of the experiment. (b) What experimental evidence forces us to conclude the photon travels both paths? What happens to the interference pattern if you block one arm?

(c) Propose a resolution to the measurement problem. There is no single accepted answer — physicists have debated this for a century. Some approaches include: the Copenhagen interpretation, the many-worlds interpretation, decoherence/entanglement with the environment, or objective collapse theories. Pick one (or propose your own) and argue for it. Your answer should address: at what point (if any) does the wave description break down, and why do we only ever experience one outcome?

(d) Is the “collapse of the wavefunction” a physical process, or a mathematical update to our description? Defend your position.

You are welcome (and encouraged) to discuss with classmates, consult textbooks, or use an LLM. If you use an LLM, cite it and briefly note what you used it for. Your final answer must be in your own words and reflect your own understanding — we are looking for evidence that you’ve genuinely thought about this, not for a “correct” answer.