How does the sample mean vary from sample to sample, and why is it approximately normal?
Describe the sampling distribution of the sample mean, calculate its mean and standard error, and state and apply the central limit theorem to find probabilities for a sample mean.
A focused answer to the SQA Advanced Higher Statistics sampling distributions content: the sampling distribution of the sample mean, its expected value and standard error, the central limit theorem, and how to find probabilities for a sample mean by standardising.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
A statistic such as the sample mean is itself random: a fresh sample gives a fresh value. The SQA wants you to describe how the sample mean varies, to compute its centre and spread (the standard error), and to use the central limit theorem to treat it as approximately normal so you can find probabilities. This result is the engine behind every confidence interval and hypothesis test in the course.
The sampling distribution of the sample mean
Imagine taking every possible sample of size and recording each sample mean; the distribution of those means is the sampling distribution of .
The in the denominator is why precision improves only slowly: to halve the standard error you must quadruple the sample size.
The central limit theorem
The central limit theorem is the most important result in the course because it makes the normal distribution apply to almost any sample mean.
Two consequences matter for the exam. First, the result holds regardless of the parent distribution's shape, so even a skewed or oddly shaped population gives an approximately normal sample mean once is large (a sample size of about is the usual rule of thumb). Second, if the parent population is already normal, then is exactly normal for every , however small.
How quickly the approximation kicks in depends on the parent shape: a nearly symmetric population needs only a small , whereas a strongly skewed one needs a larger before the sample mean looks normal. This is why the rule of thumb is a guide, not a guarantee, and why you should note the parent shape when you justify using the CLT. The total of observations behaves the same way, since is also approximately normal, with mean and variance .
Try this
Q1. A population has . What sample size gives a standard error of ? [2 marks]
- Cue. .
Q2. State the distribution of the sample mean of size from a normal population . [2 marks]
- Cue. A normal parent gives an exactly normal sample mean: , standard error .
Exam-style practice questions
Practice questions written in the style of SQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AH style: standard error3 marksA population has mean and standard deviation . Samples of size are taken. State the mean and standard error of the sample mean , and find .Show worked answer →
Mean of the sample mean: ; standard error: (1 mark).
By the central limit theorem . Standardise: (1 mark).
(1 mark). Markers reward the standard error, the standardisation using it, and the tail probability.
AH style: CLT statement3 marksState the central limit theorem and explain why it lets you treat the mean of a large sample from a skewed population as approximately normal.Show worked answer →
The central limit theorem states that for a sample of size from any population with mean and finite standard deviation , the distribution of the sample mean approaches as becomes large (2 marks).
Because the result holds whatever the shape of the parent population, a large enough sample mean is approximately normal even when the population itself is skewed; the skew of the parent is "averaged out" as grows (1 mark). Markers reward a correct statement including the mean and variance and the point that the parent distribution need not be normal.
Related dot points
- Describe and apply the main sampling methods, including simple random, systematic and stratified sampling, distinguish a sample from a population and a statistic from a parameter, and explain how a poor sampling method introduces bias.
A focused answer to the SQA Advanced Higher Statistics sampling content: the difference between a population and a sample and a parameter and a statistic, simple random, systematic and stratified sampling, how to carry each out, and how a poor sampling frame or method introduces bias.
- Calculate point estimates of a population mean and variance, construct and interpret confidence intervals for a population mean using the normal and Student's t-distributions, and construct a confidence interval for a population proportion.
A focused answer to the SQA Advanced Higher Statistics estimation content: point estimates of the population mean and variance, confidence intervals for a mean using the normal distribution and Student's t-distribution, the role of degrees of freedom, and confidence intervals for a population proportion.
- Work with continuous random variables and the normal distribution, standardise to find probabilities, combine independent normal variables, and use the normal approximation to the binomial and Poisson distributions with a continuity correction.
A focused answer to the SQA Advanced Higher Statistics continuous random variables content: the normal distribution, standardising to the Z-distribution, finding probabilities and quantiles, combining independent normal variables, and the normal approximation to the binomial and Poisson with a continuity correction.
- Carry out the one-sample, two-sample (independent) and paired t-tests for population means, stating the hypotheses, computing the test statistic, using degrees of freedom, and interpreting the result, while checking the normality assumption.
A focused answer to the SQA Advanced Higher Statistics t-test content: the one-sample t-test, the two-sample (independent) t-test and the paired t-test, with the test statistics, the degrees of freedom, the normality assumption and how to interpret the outcome.
Sources & how we know this
- SQA Advanced Higher Statistics Course Specification (C803 77) — SQA (2023)
- SQA Advanced Higher Statistics Data Booklet — SQA (2019)