Eigenvalues and Eigenvectors
For a square matrix , an eigenvector is a non-zero vector that the matrix only stretches or shrinks — its direction is preserved (up to sign) under multiplication by . The factor by which it gets scaled is the corresponding eigenvalue.
These pairs reveal how the linear map behaves along its “natural axes”: every other vector twists toward or away from these eigendirections, but the eigenvectors themselves only get rescaled.
Definition
For a square matrix , a non-zero vector is an eigenvector of with eigenvalue if:
The pair is called an eigenpair of . The requirement is essential — the equation is trivially satisfied by the zero vector for any , so without it the definition would be vacuous.
The scalar may be complex even when has real entries, but for the symmetric matrices most often encountered in this material — Hessian matrices, covariance matrices, and so on — every eigenvalue is guaranteed to be real.
Finding Eigenvalues
Start from the eigenvalue equation and move everything to one side:
(The identity matrix sneaks in because — it’s the only way to subtract a scalar from a matrix and keep the equation matrix-shaped.)
The key observation: we’re looking for a non-zero that the matrix sends to . A healthy, invertible matrix never does that — multiplying any non-zero vector by an invertible matrix gives back another non-zero vector. So has to be non-invertible — it must collapse some direction down to . From the determinant properties, this is exactly the condition
which gives a single equation in the unknown .
The characteristic polynomial of a square matrix is
Its roots are exactly the eigenvalues of . For an matrix, is a polynomial of degree in , so has at most eigenvalues (counted with multiplicity).
Picture as a tuning knob you can dial. For most settings, the matrix is perfectly healthy — its determinant is some non-zero number, and the matrix has an inverse. But at a few special, isolated values of — exactly the eigenvalues — the matrix collapses, its determinant snaps to zero, and a direction in space gets crushed to . The characteristic polynomial is just the determinant of written as a function of , and “finding eigenvalues” is the same as hunting for the values where this polynomial crosses zero.
In practice the recipe is three steps:
- Form by subtracting from every diagonal entry of (off-diagonal entries stay put).
- Compute its determinant. Expanding gives a polynomial in — the characteristic polynomial .
- Solve for . Each root is an eigenvalue.
Finding Eigenvectors
Once you have an eigenvalue , the matching eigenvectors are the directions that flattens to zero — and you already know at least one such direction must exist, because that collapsing-a-direction property is exactly what made an eigenvalue in the first place. To pin it down, plug the value of back into and solve
for . This is just a linear system: for a matrix it boils down to a single linear relation between and , leaving one free parameter to slide along.
One thing to expect: if is an eigenvector, so is , , or any other non-zero scalar multiple — multiplying both sides of by a constant doesn’t change anything. That’s not a flaw in the recipe; it just means the eigenvector’s direction is what’s pinned down, not its length. By convention you pick a clean representative with simple integer entries (or unit length).
Worked Example
Find the eigenvalues and eigenvectors of .
Step 1. Form by subtracting from each diagonal entry:
Step 2. Compute its determinant — this is the characteristic polynomial:
Step 3. Set and solve. Factoring is easiest here:
So has two eigenvalues, and .
Step 4. For each eigenvalue, plug it back in and solve to find the matching eigenvector.
For :
For :
Sanity check — both pairs satisfy :
The same recipe extends to larger matrices — for matrices the characteristic polynomial is cubic in and yields up to three eigenvalues, each with its own eigenvector(s) found by the same plug-and-solve step.
Sum and Product of Eigenvalues
For any square matrix, the eigenvalues come paired with two simple bookkeeping identities — they sum to the trace, and they multiply to the determinant.
For a square matrix with eigenvalues (counted with multiplicity), the trace equals the sum of the eigenvalues:
For a square matrix with eigenvalues (counted with multiplicity), the determinant equals the product of the eigenvalues:
Both follow from a single observation: the characteristic polynomial can be expanded directly or read off its roots, and matching coefficients recovers the identities at once. The case spells it out.
Expanding for gives
The same polynomial, with as its roots, also factors as
Matching coefficients (Vieta’s formulas) reads off both identities at once:
For an matrix the picture is the same — match coefficients of the degree- characteristic polynomial with its fully-factored form.
Diagonal Matrices: A Free Lunch
For a diagonal matrix the entire computation collapses to inspection — no characteristic polynomial needed.
If is diagonal with entries along its diagonal, then:
- The eigenvalues of are exactly the diagonal entries: for .
- The corresponding eigenvectors are the standard basis vectors: .
The reason is mechanical: multiplying by picks out the -th column of , and since is diagonal, that column is . So maps to itself, scaled by the matching diagonal entry — the very definition of an eigenpair.
The characteristic polynomial confirms it directly: is also diagonal, with entries along its diagonal, and the determinant of a diagonal matrix is the product of its diagonal entries, so
which is already in factored form — its roots are visibly .
has eigenvalues with corresponding eigenvectors . The characteristic polynomial is
with roots reading straight off the diagonal.