Skip to main content
EnglandFurther MathsSyllabus dot point

How do you describe a discrete random variable, and how do you find its expectation and variance?

Probability distributions of discrete random variables, the expectation and variance, the effect of linear coding, and expectation and variance of functions of a random variable.

A focused answer to the AQA A-Level Further Mathematics discrete random variables content, covering probability distributions, the expectation and variance, the effect of linear coding, and the expectation and variance of functions of a random variable.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Probability distributions
  3. Expectation and variance
  4. Linear coding and functions of a variable
  5. Putting the steps together

What this dot point is asking

AQA wants you to work with the probability distribution of a discrete random variable, compute its expectation (mean) and variance, apply linear coding to find the mean and variance of aX+baX + b, and find the expectation and variance of a function of the variable such as X2X^2.

Probability distributions

A discrete random variable takes a finite or countable set of values, each with an assigned probability. Two conditions define a valid distribution: every probability is non-negative, and the probabilities sum to exactly 11. The summing condition is the lever for almost every opening part of an exam question, because it lets you solve for an unknown constant in a probability formula or fill in a missing entry in a table. Once the distribution is fully known, every other quantity (expectation, variance, the probability of an event) follows by summation over the values.

Expectation and variance

The expectation is the long-run average value, a weighted mean of the outcomes with the probabilities as weights. The variance measures how spread out the values are about that mean. The formula Var(X)=E(X2)[E(X)]2\operatorname{Var}(X) = E(X^2) - [E(X)]^2 is almost always preferred over the defining form E[(Xμ)2]E[(X - \mu)^2] because it needs only the two sums E(X)E(X) and E(X2)E(X^2), which are quick to tabulate. A common rearrangement, E(X2)=Var(X)+[E(X)]2E(X^2) = \operatorname{Var}(X) + [E(X)]^2, lets you recover E(X2)E(X^2) when the mean and variance are given.

Linear coding and functions of a variable

Linear coding turns an awkward set of values into convenient ones, then transforms the answers back. The key insight in the variance rule is that adding a constant slides the whole distribution along without changing how spread out it is, so bb drops out, while multiplying by aa stretches the spread by a factor of aa, which squares to a2a^2 in the variance. For a more general function g(X)g(X) the expectation is found directly as E(g(X))=g(x)P(X=x)E(g(X)) = \sum g(x) P(X = x), summing the transformed values against the original probabilities. A frequent special case is g(X)=X2g(X) = X^2, whose expectation is exactly the E(X2)E(X^2) used in the variance formula.

A subtle point that examiners probe is that the expectation of a non-linear function is not the function of the expectation. In general E(g(X))g(E(X))E(g(X)) \neq g(E(X)), and the variance formula is the clearest example: E(X2)E(X^2) is almost never equal to [E(X)]2[E(X)]^2, and their difference is precisely the variance, which is positive for any genuinely random variable. The linear rules are special because a straight line is the one shape of function for which working through the expectation gives the same answer either way, which is why E(aX+b)=aE(X)+bE(aX + b) = aE(X) + b holds exactly. For any curved function you must average the transformed values, not transform the average.

Putting the steps together

A typical exam question chains these ideas in a fixed order, and recognising the chain saves time. First use the total-probability condition to find any unknown constant or missing probability. Next tabulate the products xP(X=x)xP(X = x) and sum them for E(X)E(X). Then tabulate x2P(X=x)x^2 P(X = x) and sum them for E(X2)E(X^2), from which the variance follows as E(X2)[E(X)]2E(X^2) - [E(X)]^2. Finally apply any coding or function in the last part, using the rules E(aX+b)=aE(X)+bE(aX + b) = aE(X) + b and Var(aX+b)=a2Var(X)\operatorname{Var}(aX + b) = a^2\operatorname{Var}(X), or the direct sum for a non-linear function. Laying the work out as a table with a row for each value and columns for xx, P(X=x)P(X = x), xP(X=x)xP(X = x) and x2P(X=x)x^2 P(X = x) keeps the arithmetic organised and makes slips easy to spot.

Exam-style practice questions

Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AQA 20197 marksA discrete random variable XX has the probability distribution P(X=x)=kxP(X = x) = kx for x=1,2,3,4x = 1, 2, 3, 4, where kk is a constant. Find the value of kk, then calculate E(X)E(X) and Var(X)\operatorname{Var}(X).
Show worked answer →

The probabilities must sum to 11. So k(1)+k(2)+k(3)+k(4)=1k(1) + k(2) + k(3) + k(4) = 1, giving 10k=110k = 1 and k=0.1k = 0.1.

The distribution is therefore P(X=1)=0.1P(X = 1) = 0.1, P(X=2)=0.2P(X = 2) = 0.2, P(X=3)=0.3P(X = 3) = 0.3, P(X=4)=0.4P(X = 4) = 0.4.

Expectation: E(X)=xP(X=x)=1(0.1)+2(0.2)+3(0.3)+4(0.4)=0.1+0.4+0.9+1.6=3.0E(X) = \sum x P(X = x) = 1(0.1) + 2(0.2) + 3(0.3) + 4(0.4) = 0.1 + 0.4 + 0.9 + 1.6 = 3.0.

For the variance, first find E(X2)=1(0.1)+4(0.2)+9(0.3)+16(0.4)=0.1+0.8+2.7+6.4=10.0E(X^2) = 1(0.1) + 4(0.2) + 9(0.3) + 16(0.4) = 0.1 + 0.8 + 2.7 + 6.4 = 10.0.

Then Var(X)=E(X2)[E(X)]2=10.03.02=10.09.0=1.0\operatorname{Var}(X) = E(X^2) - [E(X)]^2 = 10.0 - 3.0^2 = 10.0 - 9.0 = 1.0.

Markers reward using the total probability to find kk, the expectation, E(X2)E(X^2), and the variance formula.

AQA 20215 marksA discrete random variable XX has E(X)=5E(X) = 5 and Var(X)=4\operatorname{Var}(X) = 4. The variable YY is defined by Y=3X2Y = 3X - 2. Find E(Y)E(Y) and Var(Y)\operatorname{Var}(Y), and calculate E(X2)E(X^2).
Show worked answer →

Use the linear coding rules. For Y=aX+bY = aX + b, E(Y)=aE(X)+bE(Y) = aE(X) + b and Var(Y)=a2Var(X)\operatorname{Var}(Y) = a^2\operatorname{Var}(X).

Here a=3a = 3, b=2b = -2. So E(Y)=3(5)2=13E(Y) = 3(5) - 2 = 13.

Var(Y)=32×4=9×4=36\operatorname{Var}(Y) = 3^2 \times 4 = 9 \times 4 = 36. Note the additive constant 2-2 does not affect the variance, because variance measures spread, not location.

For E(X2)E(X^2), rearrange the variance formula Var(X)=E(X2)[E(X)]2\operatorname{Var}(X) = E(X^2) - [E(X)]^2: E(X2)=Var(X)+[E(X)]2=4+25=29E(X^2) = \operatorname{Var}(X) + [E(X)]^2 = 4 + 25 = 29.

Markers reward both coding results, noting that bb leaves variance unchanged, and rearranging the variance formula to get E(X2)E(X^2).

Related dot points

Sources & how we know this