Separation of Variables

Integration runs differentiation backward for a plain function. Separation of variables does the same job for a differential equation: given a relation between an unknown function and its own derivative, it recovers the function itself in closed form. It is the most basic technique for solving a first-order ordinary differential equation, as opposed to guessing a solution and checking it by substitution.

It does not work on every ODE — only on those with a particular shape. This page describes that shape, the procedure, and two worked examples drawn from population modeling.

What makes an ODE separable

A first-order ODE is separable if its right-hand side can be written as a product of two factors — one in which only the independent variable appears, and one in which only the unknown function appears:

\frac{dy}{dx} = g(x)\, h(y).

Each factor may be written using only its own variable: $x$ inside $g$ , $y$ inside $h$ .

This factoring is the whole condition the method needs: it is exactly what lets the $x$ -symbols and the $y$ -symbols be sorted onto opposite sides of the equation. A few quick tests:

$\dfrac{dy}{dx} = x y$ is separable: $g(x) = x$ , $h(y) = y$ . Both variables appear, but they multiply, so they pull apart cleanly.
$\dfrac{dy}{dx} = y$ is separable: $g(x) = 1$ (the independent variable isn’t typed in at all) and $h(y) = y$ .
$\dfrac{dy}{dx} = x + y$ is not separable: the right-hand side is a sum, and there is no way to rewrite $x + y$ as one $x$ -only factor times one $y$ -only factor.

The method

Once an equation is in separable form, the solution comes in three moves: separate, integrate, solve.

To solve a separable ODE $\dfrac{dy}{dx} = g(x)\, h(y)$ :

Separate. Divide by $h(y)$ and multiply by $dx$ , collecting each variable on its own side:

\frac{1}{h(y)}\, dy = g(x)\, dx.

Integrate both sides, each against its own variable:

\int \frac{1}{h(y)}\, dy = \int g(x)\, dx.

A single constant of integration $C$ on the right absorbs the constants from both integrals.

Solve the resulting algebraic relation for $y$ , then fix $C$ using the initial condition.

Why splitting

dy

and

dx

is legitimate

Treating $dy$ and $dx$ as quantities to be shuffled across an equation is informal shorthand; the move is justified by the substitution rule for integrals (reversing the chain rule), reading $y$ as a function of $x$ throughout. Start from the separated equation with both sides still functions of $x$ , and integrate against $x$ :

\int \frac{1}{h(y)}\, \frac{dy}{dx}\, dx = \int g(x)\, dx.

The left-hand side is exactly the pattern the substitution rule collapses: with $y = y(x)$ , the factor $\frac{dy}{dx}\, dx$ becomes $dy$ , and the integral turns into $\int \frac{1}{h(y)}\, dy$ . So the ” $dy$ on one side, $dx$ on the other” picture is not an abuse of notation but a faithful record of a substitution.

The one division to watch. Step 1 divides by $h(y)$ , which is only valid where $h(y) \neq 0$ . Any value $y^\ast$ with $h(y^\ast) = 0$ gives a constant solution $y(x) \equiv y^\ast$ — the derivative is zero and the right-hand side is zero, so the equation holds — and dividing it away can drop it from the family the method produces. Check these equilibrium values separately. In the examples below each one is quietly recovered by the final formula, but that is luck worth verifying, not a rule.

Worked example: the Malthus equation

The Malthus model of population growth gives the cleanest possible case. Its population $p(t)$ satisfies

\dot p(t) = \lambda\, p(t),

with growth rate $\lambda$ and initial condition $p(0) = p_0$ .

Fitting it to the general shape. Line the equation up against $\frac{dy}{dx} = g(x)\, h(y)$ and rename. The independent variable $x$ is the time $t$ . The unknown function $y$ is the population, which we call $p$ , not $p(t)$ : the letter $p$ names the function we are solving for, while $p(t)$ is the single number that function returns at one instant. The slot $y$ fills in the general form is the whole function, so $y$ becomes $p$ . With both renamings the general shape reads

\frac{dp}{dt} = g(t)\, h(p),

and our equation $\dot p = \lambda p$ matches it with $g(t) = \lambda$ and $h(p) = p$ . The split is forced: the right-hand side $\lambda p$ must break into a factor written with $t$ alone and a factor written with $p$ alone. The constant $\lambda$ carries no $t$ and no $p$ , so it becomes the $t$ -factor $g(t) = \lambda$ ; the leftover $p$ is the $p$ -factor, already as simple as possible, so $h$ is just the rule “return what you are handed,” i.e. $h(p) = p$ . That is why $h(p) = p$ is enough: nothing further has to be done to $p$ .

Why the $t$ hidden inside $p(t)$ is not a problem. Writing $h(p) = p$ can look as if it forgets that $p$ secretly depends on $t$ . It does not. $h$ is a rule applied to a number: hand it any value and it gives that value straight back. While the equation runs, the value handed in is the current population $p(t)$ , so applying the rule gives $h\big(p(t)\big) = p(t)$ . Spelled out in full, the equation is

\dot p(t) = g(t)\, h\big(p(t)\big) = \lambda\, p(t),

which is just $\dot p = \lambda p$ written out. So $\dfrac{dp}{dt} = g(t)\, h(p)$ and $\dfrac{dp(t)}{dt} = g(t)\, h\big(p(t)\big)$ are the same equation: the short form defines the rule $h$ , the long form applies it at time $t$ . (This is the same one-letter-for-both convention introduced with the ordinary differential equation, where $p$ does double duty for the function and for its value.) The $t$ buried inside $p(t)$ is no obstacle, because $h$ never inspects where its input came from — it only uses the input’s value. Separability asks only that the rule $g$ mention no $p$ and the rule $h$ mention no $t$ , and both hold here.

With the pieces named, run the three steps.

Separate, assuming $p \neq 0$ :

\frac{1}{p}\, dp = \lambda\, dt.

Integrate both sides. The left is the logarithm; the right is linear in $t$ :

\ln|p| = \lambda t + C.

Solve for $p$ . Exponentiating both sides,

|p| = e^{\lambda t + C} = e^{C}\, e^{\lambda t},

so $p(t) = A\, e^{\lambda t}$ , where $A = \pm e^{C}$ is an arbitrary nonzero constant rolling the sign and the $e^{C}$ together. The initial condition pins it down: at $t = 0$ ,

p_0 = A\, e^{0} = A,

so $A = p_0$ , and the solution is the exponential

p(t) = p_0\, e^{\lambda t}.

The division by $p$ assumed $p \neq 0$ , which set aside the constant solution $p(t) \equiv 0$ — the extinct population, the lone zero of $h(p) = p$ . That solution is not lost after all: it is the $p_0 = 0$ case of the formula, which gives $A = 0$ and $p(t) \equiv 0$ . So $p(t) = p_0\, e^{\lambda t}$ holds for every starting population.

Worked example: the saturation equation

The Verhulst saturation model is barely harder. Its population satisfies

\dot p(t) = -m\,\big(p(t) - K\big),

with response rate $m > 0$ , carrying capacity $K$ , and initial condition $p(0) = p_0$ . It is separable with $g(t) = -m$ and $h(p) = p - K$ .

Separate, assuming $p \neq K$ :

\frac{1}{p - K}\, dp = -m\, dt.

Integrate. The left-hand side is again a logarithm, since $\frac{d}{dp}\ln|p - K| = \frac{1}{p - K}$ :

\ln|p - K| = -m t + C.

Solve for $p$ . Exponentiating gives $p - K = A\, e^{-m t}$ with $A = \pm e^{C}$ , so

p(t) = K + A\, e^{-m t}.

The initial condition at $t = 0$ reads $p_0 = K + A$ , hence $A = p_0 - K$ , and the solution is

p(t) = K + (p_0 - K)\, e^{-m t}.

The dropped value this time is $p \equiv K$ , the population sitting exactly at carrying capacity. It returns as the $p_0 = K$ case, where $A = 0$ and $p(t) \equiv K$ holds for all time.

The saturation model can also be solved without separation, by the change of variable $u(t) = p(t) - K$ that turns it into the Malthus equation for the gap $u$ . Separation reaches the same closed form directly, without the substitution — two routes to one answer.

When separation isn’t enough

Separation handles a first-order ODE only when the right-hand side factors as $g(x)\, h(y)$ , and even then only when both resulting integrals can be carried out. Two limits show up immediately in population modeling.

The logistic growth model, with $\dot p = a\, p - b\, p^2 = (a - b\, p)\, p$ , is separable: the right-hand side is a product $g(t) = 1$ times $h(p) = (a - b\, p)\, p$ . But the integral $\int \frac{1}{(a - b\, p)\, p}\, dp$ is not one of the standard antiderivatives. Evaluating it needs partial-fraction decomposition — splitting the single fraction into $\frac{1}{p}$ and $\frac{1}{a - b\, p}$ pieces, each a logarithm — which is a separate technique layered on top of separation.

Systems of coupled ODEs, such as the two-species predator–prey models, are a harder break: there the rate of each population depends on both populations at once, so no single equation can be separated into one-variable pieces. Such systems are studied through their equilibria and stability rather than solved in closed form.