Part I — Dirac Notation & the Completeness Relation
A quantum state lives in a complex vector space — a Hilbert space. Any orthonormal basis $\{|i\rangle\}$ of this space satisfies:
Any state $|a\rangle$ can be expanded in this basis. The expansion coefficient onto basis vector $|i\rangle$ is the inner product:
Substituting the second into the first:
Since this holds for every $|a\rangle$, the operator in parentheses must be the identity:
This is one of the most powerful tools in quantum mechanics. Inserting a complete set of states between any two expressions allows you to change bases, evaluate matrix elements, and derive results without ever writing out components explicitly.
The bra vector $\langle a|$ is the linear functional corresponding to the ket $|a\rangle$. If $|a\rangle = \sum_i \alpha_i|i\rangle$, then $\langle a| = \sum_i \alpha_i^*\langle i|$. The bra is the "dual" of the ket — a row vector of complex conjugates.
Part II — Linear Operators
A linear operator $M$ is a map from the Hilbert space to itself satisfying:
In any orthonormal basis, a linear operator is represented by a matrix. The matrix element $M_{ij}$ is:
This formula is central: it tells us how to construct the matrix representation of any operator in any basis. The row index $i$ and column index $j$ correspond to the "output" and "input" basis states respectively.
The Hermitian Conjugate
The Hermitian conjugate (or adjoint) $M^\dagger$ of an operator $M$ is defined by:
In matrix form: take the transpose and then complex-conjugate every element (or equivalently, complex-conjugate then transpose — the order doesn't matter):
Part III — Eigenvectors and Eigenvalues
A vector $|\lambda\rangle$ is an eigenvector of operator $M$ with eigenvalue $\lambda$ if:
Geometrically: the operator does not rotate this vector — it only scales it by $\lambda$. In quantum mechanics, the eigenvectors of an observable are the states in which that observable has a definite value, and the eigenvalues are the possible measurement outcomes.
Part IV — Hermitian Operators
An operator is Hermitian (or self-adjoint) if it equals its own conjugate:
Hermitian operators are the mathematical objects that represent physical observables in quantum mechanics. There are two fundamental theorems that explain why.
Theorem 1: Eigenvalues of Hermitian Operators Are Real
Proof. Let $M|\lambda\rangle = \lambda|\lambda\rangle$ with $|\lambda\rangle$ normalized ($\langle\lambda|\lambda\rangle = 1$). Compute $\langle\lambda|M|\lambda\rangle$ two ways:
Therefore $\lambda = \lambda^*$, which means $\lambda \in \mathbb{R}$. $\square$
This is why observables must be Hermitian: measurement outcomes are real numbers, and Hermitian operators guarantee real eigenvalues.
Theorem 2: Eigenvectors with Different Eigenvalues Are Orthogonal
Proof. Let $M|a\rangle = \lambda_a|a\rangle$ and $M|b\rangle = \lambda_b|b\rangle$ with $\lambda_a \neq \lambda_b$. Compute $\langle b|M|a\rangle$ two ways:
Subtracting: $(\lambda_a - \lambda_b)\langle b|a\rangle = 0$. Since $\lambda_a \neq \lambda_b$, we have $\langle b|a\rangle = 0$. $\square$
Hermitian operators always have enough eigenvectors to form a complete basis (a fact requiring more work to prove in general). In their own eigenbasis, a Hermitian operator is diagonal:
Part V — The Four Postulates
All of quantum mechanics for isolated systems rests on four postulates. Together they form a complete logical framework — a precise mathematical language in which to express all quantum phenomena.
Consistency: Normalization
The Born rule is consistent with normalization. Using the completeness relation:
Global Phase
Multiplying a state by an overall phase $e^{i\theta}$ changes nothing observable. For any $|\phi\rangle$:
Therefore $e^{i\theta}|\psi\rangle \equiv |\psi\rangle$ as physical states. Quantum states are not individual vectors — they are rays (equivalence classes of vectors differing only by a global phase) in Hilbert space.
Part VI — Deriving the Pauli Matrices
Rather than postulating the Pauli matrices, we can derive them systematically from their eigenstates using the formula $M_{ij} = \langle i|M|j\rangle$.
Deriving $\sigma_z$
The eigenstates are $|u\rangle = (1,0)^T$ with eigenvalue $+1$ and $|d\rangle = (0,1)^T$ with eigenvalue $-1$. Using $\{|u\rangle, |d\rangle\}$ as the basis, compute each matrix element:
Deriving $\sigma_x$
The eigenstates are $|r\rangle = \frac{1}{\sqrt{2}}(1,1)^T$ ($+1$) and $|l\rangle = \frac{1}{\sqrt{2}}(1,-1)^T$ ($-1$). Using the completeness relation $|r\rangle\langle r| + |l\rangle\langle l| = \mathbf{1}$, each matrix element is:
More directly, $\sigma_x = (+1)|r\rangle\langle r| + (-1)|l\rangle\langle l|$. Computing in the $\{|u\rangle, |d\rangle\}$ basis:
Deriving $\sigma_y$
The eigenstates are $|i_s\rangle = \frac{1}{\sqrt{2}}(1,i)^T$ ($+1$) and $|o\rangle = \frac{1}{\sqrt{2}}(1,-i)^T$ ($-1$). Using $\sigma_y = |i_s\rangle\langle i_s| - |o\rangle\langle o|$:
Verification: $\sigma_y|i_s\rangle = \begin{pmatrix}0&-i\\i&0\end{pmatrix}\frac{1}{\sqrt{2}}\begin{pmatrix}1\\i\end{pmatrix} = \frac{1}{\sqrt{2}}\begin{pmatrix}-i\cdot i\\i\cdot 1\end{pmatrix} = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\i\end{pmatrix} = +1\cdot|i_s\rangle$ ✓
Properties of All Three Pauli Matrices
| Property | Meaning |
|---|---|
| Hermitian: $\sigma_i^\dagger = \sigma_i$ | Represent valid observables (real eigenvalues) |
| Unitary: $\sigma_i\sigma_i^\dagger = \mathbf{1}$ | Eigenvalues have unit magnitude |
| Traceless: $\text{tr}(\sigma_i) = 0$ | Eigenvalues sum to zero: $+1 + (-1) = 0$ |
| $\sigma_i^2 = \mathbf{1}$ | Idempotent in a sense; measurement twice = once |
The full Pauli algebra is encoded in one compact formula:
Part VII — Physical Meaning of Matrix Elements
The matrix element $M_{ij} = \langle i|M|j\rangle$ has a direct physical interpretation: it is the amplitude for a system in state $|j\rangle$ to contribute to the observable value when the output is projected onto $|i\rangle$. The squared modulus $|M_{ij}|^2$ gives the corresponding probability (in a suitable sense).
The expectation value of $M$ in state $|a\rangle$ is:
The second form (expanding in the eigenbasis $\{|n\rangle\}$ of $M$) shows that $\langle M\rangle$ is literally the weighted average of eigenvalues, with weights given by the Born rule probabilities. This is the quantum generalization of a classical expectation value.
Part VIII — Degrees of Freedom and the Bloch Sphere
How many real parameters specify a quantum state? Naively, a vector in $\mathbb{C}^N$ has $2N$ real numbers (real and imaginary parts of each component). But physics reduces this count:
| Constraint | Removes | Remaining parameters |
|---|---|---|
| Start with $|a\rangle\in\mathbb{C}^N$ | — | $2N$ |
| Normalization: $\langle a|a\rangle = 1$ | 1 real constraint | $2N - 1$ |
| Global phase: $e^{i\theta}|a\rangle \equiv |a\rangle$ | 1 more | $2N - 2$ |
For a spin-½ system ($N = 2$): $2(2) - 2 = 2$ real parameters. Two real parameters label a point on a sphere — the Bloch sphere. The general qubit state (up to global phase) is:
The expectation values of the three spin components are:
These are exactly the Cartesian components of a unit vector in spherical coordinates $(\theta, \varphi)$. The "Bloch vector" $(\langle\sigma_x\rangle, \langle\sigma_y\rangle, \langle\sigma_z\rangle)$ lies exactly on the unit sphere. Every pure qubit state corresponds to a unique point on the Bloch sphere.