Matrices
A handful of small but recurring pieces of linear algebra used throughout the notes: the transpose of a matrix, the trace of a square matrix, and two named symmetry classes — symmetric and skew-symmetric matrices.
Transpose
The transpose of an matrix , denoted , is the matrix obtained by swapping its rows and columns. If denotes the entry in row , column , then:
Concretely, the -th row of becomes the -th column of :
Geometrically, transposing a square matrix is the same as mirror-reflecting it across its main diagonal — the entries with . Each off-diagonal entry gets sent to position ; the diagonal entries themselves stay put.
Symmetric Matrix
Some square matrices are unchanged by that diagonal reflection — the transpose does nothing to them. They earn a special name.
A square matrix is symmetric if it equals its own transpose:
Equivalently, every entry above the main diagonal mirrors the one below it. Only square matrices can be symmetric — if is with , then has different dimensions () and the equality cannot even be stated.
The matrix on the left is symmetric — mirror-reflecting it across the diagonal leaves it unchanged. The one on the right is not: the entry is while the entry is .
One property worth noting up front: every eigenvalue of a real symmetric matrix is itself a real number. A general real matrix can have complex eigenvalues — the characteristic polynomial need not factor over the reals — but symmetry forces every eigenvalue onto the real line. This is what makes sign-based reasoning about symmetric matrices (positive/negative definiteness, ordering eigenvalues by size, classifying Hessians, and so on) meaningful in the first place.
Skew-Symmetric Matrix
The mirror image of the symmetric case: instead of being unchanged by the diagonal reflection, the matrix flips sign.
A square matrix is skew-symmetric (also called antisymmetric) if its transpose equals its negative:
Setting in the entry-wise condition forces , so every diagonal entry of a skew-symmetric matrix must be zero. Off-diagonal entries come in opposite-sign pairs across the diagonal — whatever sits at has its negative sitting at .
The diagonal is all zeros, and each off-diagonal entry is the negative of its mirror across the diagonal — for instance, the entry is while the entry is .
Diagonal Matrix
Yet another structural class of square matrix — this time defined by where the non-zero entries are allowed to sit.
A square matrix is diagonal if every off-diagonal entry is zero:
Equivalently, has the form
with all entries off the main diagonal forced to zero, and the diagonal entries free to be any scalars.
A few quick observations follow from the definition:
- Every diagonal matrix is automatically symmetric — both and are zero off the diagonal, so holds for free.
- The trace is just — the sum of the only non-zero entries.
- Multiplying two diagonal matrices is entry-wise on the diagonal: if are diagonal with entries and , then is also diagonal, with entries .
is diagonal with . Its trace is .
The most important diagonal matrix is the one whose every diagonal entry is — it shows up so often that it gets its own name and symbol.
The identity matrix is the diagonal matrix with every diagonal entry equal to :
where is the Kronecker delta. It acts as the multiplicative identity for square matrices: for every .
When the size is clear from context, the subscript is dropped — alone refers to the identity matrix of whatever size makes the surrounding expression dimensionally consistent.
Trace
A scalar quantity attached to a square matrix — collapsing the whole block down to a single number by summing along the diagonal.
The trace of a square matrix , denoted , is the sum of its diagonal entries:
Only diagonal entries contribute; off-diagonal entries are ignored entirely. Like the transpose, the trace cares about both rows and columns being indexed by the same range — so it is undefined for non-square matrices.
A few useful properties for and :
- Linearity: and
- Transpose invariance: — transposing preserves the diagonal, so it preserves the sum
- Cyclic property: whenever both products are defined, even though in general