Variance of a sum

Earlier this week, John D. Cook wrote a post about minimizing the variance of a sum of random variables. The sum he looked at was this:

$$Z = tX + (1 - t)Y$$

where X and Y are independent random variables, and t is a deterministic value. The proportion of Z that comes from X is t and the proportion that comes from Y is $1 - t$. The goal is to choose t to minimize the variance of Z. As Cook says, this is weighting the sum to minimize its variance.

The result he gets is

$$t = \frac{\operatorname{Var}(Y)}{\operatorname{Var}(X) + \operatorname{Var}(Y)}$$

and one of the consequences of this is that if X and Y have equal variances, the t that minimizes the variance of Z is t=1/2.
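Before getting into the correlated case, here's a quick numerical check of Cook's result. This is just a sketch in NumPy; the seed, the sample size, and the particular variances (4 and 1) are arbitrary choices of mine, not anything from either post.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 1_000_000

# Independent X and Y with different variances
X = rng.normal(0, 2.0, n)   # Var(X) = 4
Y = rng.normal(0, 1.0, n)   # Var(Y) = 1

# Cook's formula for the optimal weight
t_star = np.var(Y) / (np.var(X) + np.var(Y))   # should be close to 1/5

# Sweep t and see where the sample variance of Z actually bottoms out
ts = np.linspace(0, 1, 101)
variances = [np.var(t*X + (1 - t)*Y) for t in ts]
print(t_star, ts[np.argmin(variances)])        # both near 0.2
```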

You might think that if the variances are equal, it shouldn’t matter what proportions you use for the two random variables, but it does. That’s due in no small part to the independence of X and Y, which is part of the problem’s setup.

A natural question to ask, then, is what happens if X and Y aren’t independent. That’s what we’ll look into here.


First, a little review. The variance of a random variable, X, is defined as

$$\operatorname{Var}(X) = \int_{-\infty}^{\infty} (x - \mu_X)^2\, f_X(x)\, dx$$

where $\mu_X$ is the mean value of X and $f_X(x)$ is its probability density function (PDF). The most familiar PDF is the bell-shaped curve of the normal distribution.

The mean value is defined like this:

$$\mu_X = \int_{-\infty}^{\infty} x\, f_X(x)\, dx$$

People often like to work with the standard deviation $\sigma_X$ instead of the variance. The relationship is

$$\operatorname{Var}(X) = \sigma_X^2$$
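If you want to see these definitions in action, here's a small NumPy sketch that evaluates the mean and variance integrals numerically for a normal distribution. The particular mean and standard deviation are arbitrary choices for illustration.

```python
import numpy as np

mu, sigma = 3.0, 2.0
x = np.linspace(mu - 10*sigma, mu + 10*sigma, 20_001)
dx = x[1] - x[0]

# Normal PDF, written out explicitly
pdf = np.exp(-(x - mu)**2 / (2*sigma**2)) / (sigma*np.sqrt(2*np.pi))

mean = np.sum(x * pdf) * dx                  # ≈ 3.0
var = np.sum((x - mean)**2 * pdf) * dx       # ≈ 4.0, i.e., sigma**2
print(mean, var, np.sqrt(var))               # the last value is the standard deviation
```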

Now let’s consider two random variables, X and Y. They have a joint PDF, $f_{XY}(x, y)$. The covariance of the two is defined like this:

$$\operatorname{Cov}(X, Y) = \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} (x - \mu_X)(y - \mu_Y)\, f_{XY}(x, y)\, dx\, dy$$

It’s common to express the covariance in terms of the standard deviations and the correlation coefficient, ρ:

$$\operatorname{Cov}(X, Y) = \rho\, \sigma_X \sigma_Y$$

If we were going to deal with more random variables, I’d explicitly include the variables as subscripts to ρ, but there’s no need to in the two-variable situation.

The correlation coefficient is a pure number and is always in this range:

$$-1 \le \rho \le 1$$

A positive value of ρ means that the two variables tend to be above or below their respective mean values at the same time. A negative value of ρ means that when one variable is above its mean, the other tends to be below its mean, and vice versa.
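Here's a short NumPy illustration of these two ideas, using a bivariate normal with unit variances. The correlation value of 0.6 is an arbitrary choice; np.cov and np.corrcoef return the sample covariance and correlation coefficient.

```python
import numpy as np

rng = np.random.default_rng(1)
rho = 0.6                                    # arbitrary correlation for illustration
cov_matrix = [[1.0, rho], [rho, 1.0]]        # unit variances, correlation rho
X, Y = rng.multivariate_normal([0, 0], cov_matrix, size=500_000).T

print(np.cov(X, Y)[0, 1])        # ≈ rho * sigma_X * sigma_Y = 0.6
print(np.corrcoef(X, Y)[0, 1])   # ≈ 0.6
```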

If X and Y are independent, their joint PDF can be expressed as the product of two individual PDFs:

$$f_{XY}(x, y) = f_X(x)\, f_Y(y)$$

which means

$$\begin{aligned}
\operatorname{Cov}(X, Y) &= \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} (x - \mu_X)(y - \mu_Y)\, f_X(x)\, f_Y(y)\, dx\, dy \\
&= \int_{-\infty}^{\infty} (x - \mu_X)\, f_X(x)\, dx \int_{-\infty}^{\infty} (y - \mu_Y)\, f_Y(y)\, dy \\
&= 0
\end{aligned}$$

because of the definition of the mean given above. Cook took advantage of this in his analysis to simplify his equations. We won’t be doing that.


Going back to our definition of Z,

$$Z = tX + (1 - t)Y$$

the variance of Z is

$$\sigma_Z^2 = t^2 \sigma_X^2 + 2t(1 - t)\,\rho\, \sigma_X \sigma_Y + (1 - t)^2 \sigma_Y^2$$

To get the value of t that minimizes the variance, we take the derivative with respect to t and set that equal to zero. This leads to

$$t = \frac{\sigma_Y^2 - \rho\, \sigma_X \sigma_Y}{\sigma_X^2 - 2\rho\, \sigma_X \sigma_Y + \sigma_Y^2}$$

This reduces to Cook’s equation when $\rho = 0$, which is what we’d expect.

At this value of t, the variance of the sum is

$$\sigma_Z^2 = \frac{(1 - \rho^2)\, \sigma_X^2 \sigma_Y^2}{\sigma_X^2 - 2\rho\, \sigma_X \sigma_Y + \sigma_Y^2}$$
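If you'd rather not grind through the calculus by hand, the whole derivation can be checked symbolically. Here's a SymPy sketch; the symbol names are mine, and the printed results may be factored a little differently than the fractions above.

```python
import sympy as sp

t, rho = sp.symbols('t rho', real=True)
sx, sy = sp.symbols('sigma_X sigma_Y', positive=True)

# Variance of Z = t*X + (1 - t)*Y with correlation rho
var_Z = t**2*sx**2 + 2*t*(1 - t)*rho*sx*sy + (1 - t)**2*sy**2

# Set the derivative with respect to t to zero and solve
t_star = sp.solve(sp.diff(var_Z, t), t)[0]
print(sp.simplify(t_star))                  # matches the fraction for t above
print(sp.simplify(t_star.subs(rho, 0)))     # Cook's result: sigma_Y**2/(sigma_X**2 + sigma_Y**2)

# Substitute back and compare with the minimum-variance expression above
min_var = sp.simplify(var_Z.subs(t, t_star))
target = (1 - rho**2)*sx**2*sy**2 / (sx**2 - 2*rho*sx*sy + sy**2)
print(sp.simplify(min_var - target))        # 0
```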

Considering now the situation where $\sigma_Y = \sigma_X$, the value of t that minimizes the variance is

$$t = \frac{\sigma_X^2 - \rho\, \sigma_X^2}{2\sigma_X^2 - 2\rho\, \sigma_X^2} = \frac{1}{2}$$

which is the same result as before. In other words, when the variances of X and Y are equal, the variance of their sum is minimized by having equal amounts of both, regardless of their correlation. I don’t know about you, but I wasn’t expecting that.
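Since I found this surprising, here's a Monte Carlo sanity check: for several correlation values (arbitrary choices of mine) and equal unit variances, the sample variance of Z bottoms out at t ≈ 1/2 every time.

```python
import numpy as np

rng = np.random.default_rng(7)
ts = np.linspace(0, 1, 201)

for rho in (-0.8, -0.3, 0.0, 0.4, 0.9):
    cov = [[1.0, rho], [rho, 1.0]]                    # equal (unit) variances
    X, Y = rng.multivariate_normal([0, 0], cov, size=200_000).T
    variances = [np.var(t*X + (1 - t)*Y) for t in ts]
    print(rho, ts[np.argmin(variances)])              # hovers around 0.5
```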

Although the minimizing value of t doesn’t depend on the correlation coefficient, the minimum variance itself does. The minimum variance of Z when $\sigma_Y = \sigma_X$ is

$$\sigma_Z^2 = \frac{1}{2}(1 + \rho)\, \sigma_X^2$$

A pretty simple result and one that I did expect. When X and Y are positively correlated, their extremes tend to reinforce each other and the variance of Z goes up. When X and Y are negatively correlated, their extremes tend to balance out, and Z stays closer to its mean value.
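And to close the loop, here's a quick check of that formula at t = 1/2. The particular standard deviation and correlation are arbitrary choices; the sample variance of Z should land close to the predicted ½(1 + ρ)σ².

```python
import numpy as np

rng = np.random.default_rng(3)
sigma, rho = 1.5, -0.6                       # arbitrary values for the check
cov = [[sigma**2, rho*sigma**2],
       [rho*sigma**2, sigma**2]]             # equal variances, correlation rho
X, Y = rng.multivariate_normal([0, 0], cov, size=1_000_000).T

Z = 0.5*X + 0.5*Y                            # t = 1/2
print(np.var(Z))                             # sample variance of Z
print(0.5*(1 + rho)*sigma**2)                # predicted minimum: 0.45
```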