Skip to main content
EnglandStatisticsSyllabus dot point

How do you calculate and choose the right average for a data set?

Mode, median and mean for discrete and grouped data; estimating the mean of grouped data with midpoints; linear interpolation for the median; weighted and geometric mean; effect of changes and transformations on averages.

A focused answer to Edexcel GCSE Statistics on averages, covering mode, median and mean for discrete and grouped data, estimating the mean with class midpoints, linear interpolation for the median, weighted and geometric mean at Higher tier, and the effect of changes and transformations.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Mode, median and mean
  3. Grouped data: estimating the mean
  4. Grouped data: median by interpolation
  5. Weighted and geometric mean
  6. Effect of changes and transformations

What this dot point is asking

Edexcel codes 2b.01 to 2b.03 require you to calculate the mode, median and arithmetic mean for discrete and grouped data, estimate the mean of grouped data using class midpoints, find the median of grouped data by linear interpolation, and (Higher tier) calculate the weighted mean and geometric mean. You must also justify which average suits a context and understand the effect on each average of adding or removing a value, or of simple transformations (scaling and translating the data).

Mode, median and mean

For a small ordered list of nn values, the median is at position n+12\frac{n+1}{2}. For frequency data the mean is

The most common error is dividing by the number of categories instead of f\sum f, the total frequency.

Grouped data: estimating the mean

For grouped data you do not know the individual values, so you assume each lies at its class midpoint xx. The mean is then estimated as fxf\frac{\sum fx}{\sum f} using those midpoints. Because of this assumption the answer is an estimate, and Edexcel regularly asks you to state why. Use equal class widths at Foundation; Higher tier may use unequal widths.

Grouped data: median by interpolation

The median of grouped data is found by linear interpolation within the class that contains the middle value. Locate the n2\frac{n}{2}th value, find which class it falls in, then assume the values are evenly spread across that class:

medianL+n2Ff×w,\text{median} \approx L + \frac{\frac{n}{2} - F}{f} \times w,

where LL is the lower boundary of the median class, FF the cumulative frequency before it, ff the median class frequency and ww its width.

Weighted and geometric mean

The weighted mean gives different importance to different values:

It is used when components count unequally (for example a grade made of 70%70\% exam and 30%30\% coursework). The geometric mean of nn values is the nnth root of their product, x1×x2××xnn\sqrt[n]{x_1 \times x_2 \times \cdots \times x_n}, and suits averaging growth rates or ratios, where the arithmetic mean would mislead.

Effect of changes and transformations

Edexcel expects you to predict how averages respond to changes. Adding a value above the mean raises the mean; removing the largest value lowers the mean and may change the median. Under a transformation (Higher tier), if every value is multiplied by kk and then increased by cc, the mean, mode and median are all transformed the same way: new average =k×(old average)+c= k \times (\text{old average}) + c.

Exam-style practice questions

Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Edexcel 1ST0 20194 marksThe table shows the number of goals scored in 4040 matches. 00 goals: 88 matches; 11: 1414; 22: 1010; 33: 66; 44: 22. Calculate the mean number of goals per match.
Show worked answer →

Mean =fxf= \frac{\sum fx}{\sum f} where xx is the number of goals and ff the frequency.

fx=(0×8)+(1×14)+(2×10)+(3×6)+(4×2)=0+14+20+18+8=60\sum fx = (0 \times 8) + (1 \times 14) + (2 \times 10) + (3 \times 6) + (4 \times 2) = 0 + 14 + 20 + 18 + 8 = 60.

f=40\sum f = 40, so mean =6040=1.5= \frac{60}{40} = 1.5 goals per match.

Markers reward the fx\sum fx total, dividing by the total frequency 4040, and the answer 1.51.5. A common slip is dividing by 55 (the number of categories) instead of 4040.

Edexcel 1ST0 20224 marksThe table shows the heights, in cm, of 5050 plants. 0<h100 < h \le 10: 66; 10<h2010 < h \le 20: 1414; 20<h3020 < h \le 30: 2020; 30<h4030 < h \le 40: 1010. Estimate the mean height, and explain why your answer is only an estimate.
Show worked answer →

Use class midpoints x=5,15,25,35x = 5, 15, 25, 35.

fx=(5×6)+(15×14)+(25×20)+(35×10)=30+210+500+350=1090\sum fx = (5 \times 6) + (15 \times 14) + (25 \times 20) + (35 \times 10) = 30 + 210 + 500 + 350 = 1090.

f=50\sum f = 50, so estimated mean =109050=21.8= \frac{1090}{50} = 21.8 cm.

It is only an estimate because the data is grouped: the exact heights are unknown, so each value is assumed to lie at its class midpoint.

Markers reward using midpoints, the fx\sum fx total, the mean 21.821.8 cm, and the reason (grouped data, midpoints assumed).

Related dot points

Sources & how we know this