AQA GCSE Statistics Scatter diagrams and correlation: a complete overview of correlation, causation and lines of best fit
A deep-dive AQA GCSE Statistics guide to Scatter diagrams and correlation. Covers plotting scatter diagrams, types and strength of correlation, correlation versus causation, lines of best fit and Spearman's rank correlation coefficient, with the exam patterns AQA repeats.
Reviewed by: AI editorial process; not yet individually human-reviewed
Jump to a section
What this module demands
Scatter diagrams and correlation is about relationships between two variables. AQA tests whether you can describe correlation precisely, resist the trap of claiming causation, fit a line and use it to predict safely, and rank data with Spearman's coefficient. Interpretation in context carries most of the marks.
This guide walks through the three topics in specification order, then sets out the exam patterns AQA repeats. Each topic has a matching dot-point page with practice questions; this overview ties them together.
Scatter diagrams
The module opens with scatter diagrams: plotting bivariate data, describing the type of correlation (positive, negative or none) and its strength (strong or weak), and spotting outliers. Always describe both type and strength together.
Correlation and causation
Correlation and causation covers why correlation does not prove cause, spurious correlation, and confounding variables. The classic example is ice cream sales and drowning, both driven by hot weather rather than each other.
Lines of best fit and regression
Lines of best fit and regression covers drawing the line through the mean point, the equation , interpolation and extrapolation, and Spearman's rank correlation coefficient, .
How this module is examined
A typical AQA profile for this module:
- Scatter diagrams. Describing the type and strength of correlation and spotting outliers.
- Causation. Explaining why correlation does not prove cause and naming a confounding variable.
- Lines of best fit. Predicting by interpolation and judging when extrapolation is unsafe.
- Spearman's rank. Calculating and interpreting the coefficient.
Check your knowledge
A mix of recall and calculation questions covering this module. Attempt them under timed conditions, then check against the solutions.
- Points slope upwards and lie close to a line. Describe the correlation. (2 marks)
- Explain why correlation does not prove causation. (2 marks)
- Name a confounding variable for the link between ice cream sales and drowning. (1 mark)
- A line of best fit is . Predict when . (1 mark)
- What is extrapolation, and why is it unreliable? (2 marks)
- For pairs, . Find Spearman's rank correlation coefficient. (2 marks)
- What point should a line of best fit pass through? (1 mark)
- What does a Spearman value near mean? (1 mark)
Sources & how we know this
- AQA GCSE Statistics (8382) specification β AQA (2017)