Skip to main content
ScotlandStatisticsSyllabus dot point

How does the sample mean vary from sample to sample, and why is it approximately normal?

Describe the sampling distribution of the sample mean, calculate its mean and standard error, and state and apply the central limit theorem to find probabilities for a sample mean.

A focused answer to the SQA Advanced Higher Statistics sampling distributions content: the sampling distribution of the sample mean, its expected value and standard error, the central limit theorem, and how to find probabilities for a sample mean by standardising.

Generated by Claude Opus 4.812 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The sampling distribution of the sample mean
  3. The central limit theorem
  4. Try this

What this dot point is asking

A statistic such as the sample mean is itself random: a fresh sample gives a fresh value. The SQA wants you to describe how the sample mean varies, to compute its centre and spread (the standard error), and to use the central limit theorem to treat it as approximately normal so you can find probabilities. This result is the engine behind every confidence interval and hypothesis test in the course.

The sampling distribution of the sample mean

Imagine taking every possible sample of size nn and recording each sample mean; the distribution of those means is the sampling distribution of Xˉ\bar{X}.

The n\sqrt{n} in the denominator is why precision improves only slowly: to halve the standard error you must quadruple the sample size.

The central limit theorem

The central limit theorem is the most important result in the course because it makes the normal distribution apply to almost any sample mean.

Two consequences matter for the exam. First, the result holds regardless of the parent distribution's shape, so even a skewed or oddly shaped population gives an approximately normal sample mean once nn is large (a sample size of about 3030 is the usual rule of thumb). Second, if the parent population is already normal, then Xˉ\bar{X} is exactly normal for every nn, however small.

How quickly the approximation kicks in depends on the parent shape: a nearly symmetric population needs only a small nn, whereas a strongly skewed one needs a larger nn before the sample mean looks normal. This is why the rule of thumb is a guide, not a guarantee, and why you should note the parent shape when you justify using the CLT. The total of nn observations behaves the same way, since X=nXˉ\sum X = n\bar{X} is also approximately normal, with mean nμn\mu and variance nσ2n\sigma^2.

Try this

Q1. A population has σ=20\sigma = 20. What sample size gives a standard error of 44? [2 marks]

  • Cue. 20n=4n=5n=25\dfrac{20}{\sqrt{n}} = 4 \Rightarrow \sqrt{n} = 5 \Rightarrow n = 25.

Q2. State the distribution of the sample mean of size 99 from a normal population N(100,36)N(100, 36). [2 marks]

  • Cue. A normal parent gives an exactly normal sample mean: XˉN ⁣(100,369)=N(100,4)\bar{X} \sim N\!\left(100, \dfrac{36}{9}\right) = N(100, 4), standard error 22.

Exam-style practice questions

Practice questions written in the style of SQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AH style: standard error3 marksA population has mean μ=70\mu = 70 and standard deviation σ=12\sigma = 12. Samples of size 3636 are taken. State the mean and standard error of the sample mean Xˉ\bar{X}, and find P(Xˉ>73)P(\bar{X} > 73).
Show worked answer →

Mean of the sample mean: E(Xˉ)=μ=70E(\bar{X}) = \mu = 70; standard error: σn=1236=126=2\dfrac{\sigma}{\sqrt{n}} = \dfrac{12}{\sqrt{36}} = \dfrac{12}{6} = 2 (1 mark).

By the central limit theorem XˉN(70,22)\bar{X} \approx N(70, 2^2). Standardise: Z=73702=1.5Z = \dfrac{73 - 70}{2} = 1.5 (1 mark).

P(Xˉ>73)=P(Z>1.5)=10.9332=0.0668P(\bar{X} > 73) = P(Z > 1.5) = 1 - 0.9332 = 0.0668 (1 mark). Markers reward the standard error, the standardisation using it, and the tail probability.

AH style: CLT statement3 marksState the central limit theorem and explain why it lets you treat the mean of a large sample from a skewed population as approximately normal.
Show worked answer →

The central limit theorem states that for a sample of size nn from any population with mean μ\mu and finite standard deviation σ\sigma, the distribution of the sample mean Xˉ\bar{X} approaches N ⁣(μ,σ2n)N\!\left(\mu, \dfrac{\sigma^2}{n}\right) as nn becomes large (2 marks).

Because the result holds whatever the shape of the parent population, a large enough sample mean is approximately normal even when the population itself is skewed; the skew of the parent is "averaged out" as nn grows (1 mark). Markers reward a correct statement including the mean and variance and the point that the parent distribution need not be normal.

Related dot points

Sources & how we know this