EnglandFurther MathsSyllabus dot point

How do you describe a discrete random variable, and how do you find its expectation and variance?

Probability distributions of discrete random variables, the expectation and variance, the effect of linear coding, and expectation and variance of functions of a random variable.

A focused answer to the AQA A-Level Further Mathematics discrete random variables content, covering probability distributions, the expectation and variance, the effect of linear coding, and the expectation and variance of functions of a random variable.

Generated by Claude Opus 4.810 min answerUpdated 2026-06-02

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this dot point is asking
Probability distributions
Expectation and variance
Linear coding and functions of a variable
Putting the steps together

What this dot point is asking

AQA wants you to work with the probability distribution of a discrete random variable, compute its expectation (mean) and variance, apply linear coding to find the mean and variance of $aX + b$ , and find the expectation and variance of a function of the variable such as $X^2$ .

Probability distributions

A discrete random variable takes a finite or countable set of values, each with an assigned probability. Two conditions define a valid distribution: every probability is non-negative, and the probabilities sum to exactly $1$ . The summing condition is the lever for almost every opening part of an exam question, because it lets you solve for an unknown constant in a probability formula or fill in a missing entry in a table. Once the distribution is fully known, every other quantity (expectation, variance, the probability of an event) follows by summation over the values.

Expectation and variance

The expectation is the long-run average value, a weighted mean of the outcomes with the probabilities as weights. The variance measures how spread out the values are about that mean. The formula $\operatorname{Var}(X) = E(X^2) - [E(X)]^2$ is almost always preferred over the defining form $E[(X - \mu)^2]$ because it needs only the two sums $E(X)$ and $E(X^2)$ , which are quick to tabulate. A common rearrangement, $E(X^2) = \operatorname{Var}(X) + [E(X)]^2$ , lets you recover $E(X^2)$ when the mean and variance are given.

Find the mean and variance

A variable $X$ takes values $1, 2, 3$ with probabilities $0.2, 0.5, 0.3$ .

Step 1: Find the expectation

The expectation is the probability-weighted average of all the values. Multiply each outcome by its probability and sum the results.

E(X) = 1(0.2) + 2(0.5) + 3(0.3) = 0.2 + 1.0 + 0.9 = 2.1.

Step 2: Find the expectation of the squares

To apply the shortcut variance formula we need $E(X^2)$ , found by replacing each $x$ with $x^2$ in the same weighted sum. We do not square the probabilities, only the values.

E(X^2) = 1^2(0.2) + 2^2(0.5) + 3^2(0.3) = 0.2 + 2.0 + 2.7 = 4.9.

Step 3: Apply the variance formula

The variance measures how spread out the distribution is about its mean. The shortcut formula $\operatorname{Var}(X) = E(X^2) - [E(X)]^2$ subtracts the square of the mean from the mean of the squares; this is algebraically equivalent to the defining form $E[(X - \mu)^2]$ but requires no extra subtraction inside a squared term.

\operatorname{Var}(X) = E(X^2) - [E(X)]^2 = 4.9 - 2.1^2 = 4.9 - 4.41 = 0.49.

Final answer: $E(X) = 2.1$ and $\operatorname{Var}(X) = 0.49$ .

Linear coding and functions of a variable

Linear coding turns an awkward set of values into convenient ones, then transforms the answers back. The key insight in the variance rule is that adding a constant slides the whole distribution along without changing how spread out it is, so $b$ drops out, while multiplying by $a$ stretches the spread by a factor of $a$ , which squares to $a^2$ in the variance. For a more general function $g(X)$ the expectation is found directly as $E(g(X)) = \sum g(x) P(X = x)$ , summing the transformed values against the original probabilities. A frequent special case is $g(X) = X^2$ , whose expectation is exactly the $E(X^2)$ used in the variance formula.

A subtle point that examiners probe is that the expectation of a non-linear function is not the function of the expectation. In general $E(g(X)) \neq g(E(X))$ , and the variance formula is the clearest example: $E(X^2)$ is almost never equal to $[E(X)]^2$ , and their difference is precisely the variance, which is positive for any genuinely random variable. The linear rules are special because a straight line is the one shape of function for which working through the expectation gives the same answer either way, which is why $E(aX + b) = aE(X) + b$ holds exactly. For any curved function you must average the transformed values, not transform the average.

Putting the steps together

A typical exam question chains these ideas in a fixed order, and recognising the chain saves time. First use the total-probability condition to find any unknown constant or missing probability. Next tabulate the products $xP(X = x)$ and sum them for $E(X)$ . Then tabulate $x^2 P(X = x)$ and sum them for $E(X^2)$ , from which the variance follows as $E(X^2) - [E(X)]^2$ . Finally apply any coding or function in the last part, using the rules $E(aX + b) = aE(X) + b$ and $\operatorname{Var}(aX + b) = a^2\operatorname{Var}(X)$ , or the direct sum for a non-linear function. Laying the work out as a table with a row for each value and columns for $x$ , $P(X = x)$ , $xP(X = x)$ and $x^2 P(X = x)$ keeps the arithmetic organised and makes slips easy to spot.

Exam-style practice questions

Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AQA 20197 marksA discrete random variable

X

has the probability distribution

P(X = x) = kx

for

x = 1, 2, 3, 4

, where

k

is a constant. Find the value of

k

, then calculate

E(X)

and

\operatorname{Var}(X)

Show worked answer →

The probabilities must sum to $1$ . So $k(1) + k(2) + k(3) + k(4) = 1$ , giving $10k = 1$ and $k = 0.1$ .

The distribution is therefore $P(X = 1) = 0.1$ , $P(X = 2) = 0.2$ , $P(X = 3) = 0.3$ , $P(X = 4) = 0.4$ .

Expectation: $E(X) = \sum x P(X = x) = 1(0.1) + 2(0.2) + 3(0.3) + 4(0.4) = 0.1 + 0.4 + 0.9 + 1.6 = 3.0$ .

For the variance, first find $E(X^2) = 1(0.1) + 4(0.2) + 9(0.3) + 16(0.4) = 0.1 + 0.8 + 2.7 + 6.4 = 10.0$ .

Then $\operatorname{Var}(X) = E(X^2) - [E(X)]^2 = 10.0 - 3.0^2 = 10.0 - 9.0 = 1.0$ .

Markers reward using the total probability to find $k$ , the expectation, $E(X^2)$ , and the variance formula.

AQA 20215 marksA discrete random variable

X

has

E(X) = 5

and

\operatorname{Var}(X) = 4

. The variable

Y

is defined by

Y = 3X - 2

. Find

E(Y)

and

\operatorname{Var}(Y)

, and calculate

E(X^2)

Show worked answer →

Use the linear coding rules. For $Y = aX + b$ , $E(Y) = aE(X) + b$ and $\operatorname{Var}(Y) = a^2\operatorname{Var}(X)$ .

Here $a = 3$ , $b = -2$ . So $E(Y) = 3(5) - 2 = 13$ .

$\operatorname{Var}(Y) = 3^2 \times 4 = 9 \times 4 = 36$ . Note the additive constant $-2$ does not affect the variance, because variance measures spread, not location.

For $E(X^2)$ , rearrange the variance formula $\operatorname{Var}(X) = E(X^2) - [E(X)]^2$ : $E(X^2) = \operatorname{Var}(X) + [E(X)]^2 = 4 + 25 = 29$ .

Markers reward both coding results, noting that $b$ leaves variance unchanged, and rearranging the variance formula to get $E(X^2)$ .

Related dot points

Sources & how we know this

AQA A-level Further Mathematics (7367) specification — AQA (2017)