Integration
Integration is the reverse of differentiation. If , then could be — we ran the derivative backward. The same reverse operation also computes areas, total accumulated quantities, and the cumulative effect of any rate of change. This page builds up the toolkit from scratch.
The basic question
Differentiation asks: given , what is its rate of change ?
Integration asks the inverse: given a rate , what function has that rate as its derivative? Such a function is called an antiderivative (or primitive) of , written , with .
The notation for this hunt is
The integral sign is a stretched “S” (for summa), is the integrand, and has two roles: it marks the variable being integrated over, and also stands for an infinitesimal slice along that variable — the role made precise by the Riemann-sum picture further down (and the reason substitution lets us replace with ). The whole expression reads “the antiderivative of with respect to .”
Why the ?
Differentiation throws away constants: , but also , and . So when we run the derivative backward from , we cannot recover which constant was originally there.
Every integration leaves behind a constant of integration : the answer is not a single function but a whole family , all sharing the same derivative.
Building the rules
The power rule, by reversal
Differentiation: — the power drops by one, and the old power comes out front as a coefficient.
To reverse it, we need to raise the power by one (so it lands at ) and divide by the new power (to cancel the coefficient that the derivative would pull out):
Verify by differentiating
The case fails because we would divide by zero. That case has its own answer — see the table below — and it’s the reason the natural logarithm exists.
Linearity
Two easy rules carry over from differentiation directly:
- Constants pass through: .
- Sums split apart: .
Together, these mean integration is linear. You can break any integral of a sum into a sum of integrals, and pull constants out front. Both follow immediately from the corresponding differentiation rules.
Putting it together
The fundamental integration rules are:
| Rule | Formula |
|---|---|
| Constant | |
| Power | |
| Constant multiple | |
| Sum | |
| Substitution |
The substitution rule is more subtle than the others — we’ll build it up in its own section.
Common Integrals
Each row pairs a function with its antiderivative. To verify any row, differentiate the right column — you should get back the left.
| , | |
Two oddities worth noting:
- uses absolute value because is undefined for negative numbers, but has antiderivatives on both sides of zero.
- — the appears because differentiating pulls an out (chain rule), which we have to un-multiply by dividing.
Trigonometric Integrals
Memorize the derivatives of sine and cosine, then everything else follows by reversal:
Be careful with the sign in the second one. The full set:
The integrals of the trigonometric functions are:
The integrands producing inverse trigonometric antiderivatives are worth recognizing too:
Substitution: reversing the chain rule
The chain rule says
Read right to left, this is an integral statement:
In words: if your integrand is a composite function multiplied by the derivative of its inner part , the antiderivative is — no extra work required.
The mechanical technique for using this is called u-substitution (or just substitution): let , compute , and rewrite the integral entirely in terms of :
The diagnostic: what to look for
Substitution is not a general technique — it works only when a specific pattern is present. Before trying it, run this two-part check:
- Find a composite. Does your integrand contain something of the form , where “inner” isn’t just by itself? Call that inner expression .
- Find the inner derivative. Is (or a constant multiple of it) also present as a factor in the integrand?
If both answers are yes, set and write . The factor in the integrand gets replaced by , and the integral collapses into one in alone. If the second answer is no — the derivative is completely absent and not just off by a constant — substitution will not simplify the integral.
Why does the derivative have to be there? When you set , you need to replace every piece of in the integrand, including the . The relation shows that — so if doesn’t cancel with something already in the integrand, it ends up stuck in the denominator and the substitution makes things worse, not better.
For example, looks like a candidate with inner function , but is nowhere in the integrand. Setting gives , and the integral becomes — which still contains , leaving the substitution stuck. This particular integral has no elementary closed form.
Fixing an off-by-constant factor
If the derivative is present but multiplied by the wrong constant, fix it by multiplying and dividing. For example, : the inner function is with derivative , but only appears in the integrand — off by a factor of 3.
Constants only. This multiply-and-divide trick works only when the missing factor is a constant. Linearity lets you pull constants in and out of integrals freely — but not anything involving . Writing is wrong, because is not a constant and cannot leave the integrand. If the missing factor depends on the variable of integration, substitution doesn’t apply — try a different technique.
Examples
. The inner function of is , and its derivative sits right next to it as a factor.
Let , so :
. The inner function is , whose derivative is the constant . A constant is always implicitly present.
Let , so , meaning :
. The denominator is the inner function, and its derivative is the numerator.
Let , so :
Integration by parts: reversing the product rule
The product rule says . Integrating both sides:
Rearranging to solve for one of the two integrals:
For differentiable functions and , integration by parts states:
In short form with and (so and ):
The idea: trade an integral you can’t do for one you can. You pick and from the integrand, hoping that (the derivative of ) and (the antiderivative of ) make the resulting integral easier than what you started with.
Rule of thumb: pick to be the part that gets simpler when differentiated — polynomials, , . Pick to be the part you can easily integrate — exponentials, , .
LIATE mnemonic. When in doubt about which factor should be , try them in this order — pick the first type that appears in the integrand:
| Letter | Function type | Example |
|---|---|---|
| L | Logarithm | |
| I | Inverse trig | , |
| A | Algebraic (polynomials, roots) | , |
| T | Trigonometric | , |
| E | Exponential | , |
The order is roughly “what gets simpler fastest under differentiation” → “what stays the same or grows worse.” For , L beats A, so . For , A beats E, so . The mnemonic is a guide, not a law — when it fails, try the other split.
. Pick (so , simpler) and (so , easy to integrate):
The polynomial factor disappeared after one differentiation — that’s the whole reason this works.
— looks impossible, but the trick is to take and let the entire be :
(so ), (so ):
If the resulting integral isn’t easier, you picked and the wrong way around — try swapping them.
The Definite Integral
So far we’ve computed antiderivatives — families of functions defined up to a constant. The definite integral asks for a single number: the amount accumulated by between two endpoints and .
Geometrically, this number is the signed area between the graph of and the -axis from to — area above the axis counted positive, area below counted negative:
Riemann sums: where the area definition comes from
The “signed area” idea is made precise by chopping into small slices of width , building a rectangle of height over each slice (for some sample point in the slice — see remark below for what the asterisk means), and adding up their areas:
This is a Riemann sum. As , the rectangles shrink and the sum converges to a single number — the definite integral of over .
The definite integral of a function from to is the limit of Riemann sums as the partition is refined:
with and any sample point in the -th subinterval. The number measures the signed area between the graph of and the -axis from to .
What the asterisk on means
The -th slice is an interval, not a single point — so to evaluate for the rectangle’s height, we have to pick one point from inside that interval. The asterisk is a free choice marker: it says “some point we picked from the -th subinterval,” without committing to which one. Common choices are the left endpoint, the right endpoint, or the midpoint — but any point in the slice is fair game. The whole reason this notation works is the theorem that as , which point you pick stops mattering — every valid choice converges to the same limit. The acknowledges the ambiguity up front, then the limit erases it.
This sum-based definition is what “area” actually means — but computing the limit directly is brutal even for simple integrands. The miracle is that we never have to: the limit equals the difference of any antiderivative at the endpoints.
Fundamental theorem of calculus (often abbreviated FTC). If is continuous on and is any antiderivative of , then:
The constant cancels in the difference and does not appear.
We don’t need to add up infinitely many tiny rectangles — we just find any antiderivative and subtract. The link between the two pictures (limit of sums on one side, antiderivative on the other) is what makes calculus the toolkit it is: a question about accumulation answered by a question about rates.
. An antiderivative of is :
The same rules — linearity, substitution, integration by parts — all apply to definite integrals. The only extra adjustment for substitution: when you change variable from to , the limits transform too:
After substituting you can either update the limits to and (faster) or substitute back in at the end before evaluating (safer when first learning).
Arc length: integrating speed
A definite integral doesn’t have to be the area under a graph. The same Riemann-sum machinery — chop a parameter interval, sum a quantity proportional to the slice width, take the limit — measures any quantity that accumulates linearly along a path. The first non-area example is arc length: the total length traced out by a parametrized curve.
Consider a curve in the plane (or in ) given by a parametrization
To approximate the length of , partition into slices, sample points , and add up the straight-line chord lengths between consecutive samples:
For a smooth curve, over a small slice (this is the linearization from the derivative), so each chord length is approximately . The sum becomes a Riemann sum for , and the limit is a definite integral.
Before writing it down, it helps to name what actually is. Think of as time and as the position of a moving particle that traces out the curve as runs from to . Then:
- is the velocity vector at time — direction-and-rate of motion.
- is the speed at time — a single non-negative number, “how fast the particle is moving” with the direction stripped away. (This is the standard physics distinction between velocity and speed.)
- “Speed of the parametrization” just means: how fast the parameter pushes the point along the curve.
What does “trace the same curve faster” actually mean?
“Faster” means changing the parameter, not scaling the components. Take the unit circle on . The reparametrization on traces the same circle — but only needs to reach for one full lap. By the chain rule , so the speed is doubled at every point. The image (the unit circle) is unchanged.
What this is not: scaling the components, like . That’s a circle of radius 2 — a different curve (circumference ), not the same circle traced faster. Scaling components scales the curve itself; reparametrizing the variable scales the speed at which it’s traced.
With that vocabulary, the arc length integral is just the total-distance formula from physics in disguise. The infinitesimal piece reads as “speed time slice = distance covered in that slice” — and integrating over adds up all those tiny distances:
The arc length of a continuously differentiable (i.e., ) curve is
For a 2D curve this expands to
For the graph of a function on — a special case parametrized by — it becomes
Two points worth noticing:
- The integrand is a scalar. Even though is vector-valued, collapses to a single non-negative number at each — the instantaneous speed. Arc length is a 1D integral of that scalar.
- The answer doesn’t depend on the parametrization. The same physical curve traced twice as fast still has the same length: doubling the speed of halves the parameter interval needed to cover it, and doubles, so the integral is unchanged. Arc length is a property of the image (the trace) of , not of the parametrization.
Circle of radius . Parametrize for :
so the speed is constant, and
— the familiar circumference, recovered from the integral.
Arc length is the simplest of a wider family of path integrals — quantities that accumulate along a curve. In every case the recipe is the same: identify what’s being accumulated per unit parameter, multiply by , and integrate. This same pattern is what generalizes to surface area as a double integral over a parametrized surface, where the analog of “speed” becomes the magnitude of the cross product of the two tangent vectors — but that’s for the curves chapter to develop.