OCR GCSE Mathematics Statistics: a complete overview of sampling, averages, charts and correlation
A deep-dive OCR GCSE Mathematics guide to the Statistics content. Covers sampling and types of data, averages and spread, statistical charts and graphs, and scatter graphs and correlation, with the methods and exam patterns OCR repeats across Foundation and Higher tier.
Reviewed by: AI editorial process; not yet individually human-reviewed
Jump to a section
What the Statistics content demands
Statistics applies number and reasoning to data, and it rewards both accurate calculation and clear interpretation. The content runs from sampling and types of data through averages and spread to charts and scatter graphs. Because so many questions ask you to "describe", "compare" or "explain", this area carries a large share of the AO2 communication marks, and the data-handling contexts often test AO3 problem solving.
This guide walks through the four areas of the Statistics content and ties together the matching dot-point pages, each of which has its own practice questions.
Sampling and types of data
Data is qualitative (categories) or quantitative (numbers), and quantitative data is discrete (counted) or continuous (measured). A population is everyone of interest; a sample is a manageable subset. A random sample gives every member an equal chance; a stratified sample takes the same fraction from each group, so groups are represented in proportion. A sample is biased if it is not representative, and explaining precisely why earns the marks.
Averages and spread
The mean is the total divided by the count, the median is the middle of the ordered data, the mode is the most common value, and the range (largest minus smallest) measures spread. From a frequency table, the mean is ; for grouped data, use class midpoints to estimate it. Compare two data sets using both an average (location) and the range or, at Higher tier, the interquartile range (spread).
Statistical charts and graphs
Bar charts compare categories; pie charts show proportions as angles out of . Frequency polygons join class midpoints, stem-and-leaf diagrams keep the raw values, and box plots show the five-number summary. At Higher tier, histograms use frequency density (), so the area of each bar, not its height, represents the frequency, which matters for unequal class widths.
Scatter graphs and correlation
A scatter graph plots two variables together. Correlation is positive (both rise), negative (one rises as the other falls) or none. A line of best fit follows the trend and is used to estimate one variable from the other. Estimating within the data range (interpolation) is reliable; beyond it (extrapolation) is not. Correlation does not prove causation, because a third factor may drive both variables.
Check your knowledge
A mix of sampling, average, chart and correlation questions. Attempt them under timed conditions, then check against the solutions.
- State whether shoe size is discrete or continuous data. (1 mark)
- Find the median of . (2 marks)
- Find the mean of . (2 marks)
- A school of takes a stratified sample of . How many from a year group of ? (2 marks)
- In a pie chart of people, a sector is . How many people does it represent? (2 marks)
- A histogram class has frequency density . Find the frequency. (2 marks)
- Describe the correlation expected between a car's age and its value. (1 mark)
- State whether estimating beyond the data range is interpolation or extrapolation. (1 mark)
Sources & how we know this
- OCR GCSE (9-1) Mathematics (J560) specification — OCR (2015)