How do you measure how spread out a data set is?
Range, interquartile range, percentiles, the effect of outliers, and choosing a measure of spread.
A focused answer to AQA GCSE Statistics on measures of spread, covering the range, interquartile range, percentiles, how outliers affect spread, and how to choose a suitable measure of spread.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
AQA wants you to measure the spread of data using the range, interquartile range and percentiles, understand how outliers affect each measure, and choose a suitable measure of spread for a situation. Spread questions almost always pair with an average: examiners reward candidates who quote one average and one spread, both in context.
The range
The range is the crudest measure of spread because a single unusually large or small value sets it entirely. It is fine for small, clean data sets, or for stating the full extent of variation (for example the range of daily temperatures), but it tells you nothing about how the bulk of the data is distributed.
The interquartile range
Because it cuts off the bottom and top quarters, the interquartile range is resistant to outliers, which makes it the standard companion to the median for skewed data. A small interquartile range means the central half of the data is tightly grouped; a large one means even the middle values are widely spread. On AQA box plot and cumulative frequency questions, the interquartile range is read directly from and .
Percentiles
To find the position of the th percentile in a list of ordered values, AQA uses . For the th percentile is at position , so you read off the nd value (or interpolate from a cumulative frequency graph for grouped data). Interpercentile ranges are popular in exams because they let you describe spread while deliberately discarding the volatile tails.
The effect of outliers and choosing a measure
The choice rule examiners reward: use the range for a quick measure of total spread on clean data, the interquartile range when there are outliers or the data is skewed (pair it with the median), and the standard deviation when the data is roughly symmetrical and you want a measure that uses every value (pair it with the mean).
The reason this matters is that a measure of spread is only useful if it reflects the typical variation rather than one freak value. The range uses only the two most extreme values, so it is the least robust. The interquartile range and interpercentile ranges deliberately discard the tails, so they are robust but ignore some information. The standard deviation uses every value, so it is the most informative, but it pays for that by being sensitive to outliers. Choosing the right one, and pairing it with the matching average, is exactly the judgement examiners test in "compare the data" questions, where you must quote one average and one spread and justify the choice in context.
Exam-style practice questions
Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AQA 20194 marksThe waiting times (minutes) at a clinic were . (a) Calculate the range. (b) The lower quartile is and the upper quartile is . Calculate the interquartile range and explain why it is a better measure of spread for this data.Show worked answer β
(a) Range minutes.
(b) Interquartile range minutes.
The value is an outlier that inflates the range to , far larger than the typical spread. The interquartile range uses only the middle half of the data, so it ignores the and better describes the spread of a usual visit.
Markers reward the two calculations and a clear, contextual reason that the interquartile range is resistant to the outlier.
AQA 20223 marksA data set of values has its th percentile at and its th percentile at . (a) State how many values lie below the th percentile. (b) Calculate the th to th interpercentile range.Show worked answer β
(a) The th percentile has of the data below it: values.
(b) Interpercentile range .
Markers reward for part (a) (using of ) and the subtraction for part (b). This range ignores the most extreme tenth at each end, so it is robust like the interquartile range but uses a wider central band.
Related dot points
- Mean, median and mode, averages from frequency tables, estimated mean from grouped data, and weighted means.
A focused answer to AQA GCSE Statistics on measures of central tendency, covering the mean, median and mode, averages from frequency tables, the estimated mean from grouped data using midpoints, and weighted means.
- Finding quartiles from a list and from cumulative frequency, the interquartile range, percentiles and identifying outliers.
A focused answer to AQA GCSE Statistics on quartiles and the interquartile range, covering how to find quartiles from a list and from a cumulative frequency curve, calculate the interquartile range, and use the 1.5 times IQR rule to identify outliers.
- Variance and standard deviation, calculating standard deviation from a list and a frequency table, and interpreting it.
A focused answer to AQA GCSE Statistics on standard deviation, covering variance, calculating standard deviation from a list and from a frequency table, and interpreting standard deviation as spread around the mean.
- Comparing distributions using an average and a measure of spread, skewness, and writing comparisons in context.
A focused answer to AQA GCSE Statistics on comparing distributions, covering how to compare two data sets using an average and a measure of spread, describe skewness from the mean, median and mode, and write comparisons in context.
Sources & how we know this
- AQA GCSE Statistics (8382) specification β AQA (2017)