Skip to main content
EnglandStatisticsSyllabus dot point

How do you find expected frequencies, and what are absolute and relative risk?

Expected frequency from probability; absolute and relative risk expressed as expected frequencies; comparing experimental data with theoretical predictions to detect bias in the design.

A focused answer to Edexcel GCSE Statistics on risk and expected frequency, covering calculating expected frequency from a probability, absolute and relative risk expressed as expected frequencies, and comparing experimental data with theoretical predictions to identify bias in an experiment.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Expected frequency
  3. Absolute and relative risk
  4. Comparing experimental with theoretical results
  5. Why risk is reported as expected frequencies

What this dot point is asking

Edexcel codes 3p.03 to 3p.05 require you to use probability to calculate expected frequency, to determine and interpret absolute and relative risk (expressing them as expected frequencies in groups), and to compare experimental data with theoretical predictions to identify possible bias in an experiment. These ideas connect probability back to real data and to the statistical enquiry cycle.

Expected frequency

For example, if a fair dice is rolled 300300 times, the expected number of sixes is 16×300=50\frac{1}{6} \times 300 = 50. Expected frequency is a prediction; the actual count will vary around it because of chance, but it tells you what is typical. Given a total frequency, you can also work backwards: the proportion (probability) times the total gives the expected count in a category.

Absolute and relative risk

A relative risk of 11 means the outcome is equally likely in both groups; greater than 11 means more likely in group A; less than 11 means less likely. Edexcel expects you to express risks as expected frequencies too (for example "out of 200200 learners with each instructor, 120120 would pass with A but only 8080 with B"), which makes the comparison vivid and concrete.

Comparing experimental with theoretical results

Code 3p.05 asks you to compare what actually happened with what the theory predicts, to judge whether an experiment or device is biased. If a dice is fair, each face should come up about 16\frac{1}{6} of the time; if one face appears far more often than its expected frequency over many rolls, that suggests the dice is biased. The judgement is informal at GCSE (no formal significance test), but you should compare observed and expected frequencies and comment sensibly, remembering that some variation is normal by chance and that more trials make the comparison more reliable.

A neat way to set this out is a small table with three rows: the outcome, the expected frequency (probability times number of trials), and the observed frequency from the experiment. Comparing the two rows shows at a glance which outcomes are over or under represented. A close match supports the theoretical model (for example that the dice is fair); a large, consistent mismatch is evidence that the model is wrong or the device is biased.

Why risk is reported as expected frequencies

Risks expressed as bare probabilities (such as 0.60.6 versus 0.40.4) can be hard for people to grasp, so Edexcel encourages expressing them as expected frequencies in groups, which is also how risks are reported in real life (in medicine and the media). Saying "out of every 100100 learners, 6060 pass with instructor A but only 4040 with instructor B" conveys the same information as a relative risk of 1.51.5 but is far more intuitive. Being able to move between probability, relative risk and expected frequency, and to choose the clearest form for the audience, is exactly the kind of communication skill the qualification rewards.

Exam-style practice questions

Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Edexcel 1ST0 20203 marksThe probability that a particular machine produces a faulty part is 0.040.04. In a batch of 15001500 parts, (a) calculate the expected number of faulty parts. (b) The batch actually contained 9090 faulty parts. Comment on whether this suggests a problem.
Show worked answer →

(a) Expected frequency == probability ×\times number of trials =0.04×1500=60= 0.04 \times 1500 = 60 faulty parts.

(b) The actual number 9090 is much higher than the expected 6060, which suggests the machine may be producing more faults than usual (a possible problem with the process), though some variation is expected by chance.

Markers reward expected frequency =0.04×1500=60= 0.04 \times 1500 = 60, and a comment that 9090 is higher than expected, suggesting a possible problem.

Edexcel 1ST0 20224 marksDriving instructor A has a pass rate of 0.60.6; instructor B has a pass rate of 0.40.4. (a) State the absolute risk (probability) of passing with each instructor. (b) Calculate the relative risk of passing with A compared with B, and interpret it.
Show worked answer →

(a) Absolute risk of passing: with A it is 0.60.6; with B it is 0.40.4. (These are just the probabilities of passing.)

(b) Relative risk =risk with Arisk with B=0.60.4=1.5= \frac{\text{risk with A}}{\text{risk with B}} = \frac{0.6}{0.4} = 1.5.

A relative risk of 1.51.5 means a learner is 1.51.5 times as likely to pass with instructor A as with instructor B.

Markers reward stating each absolute risk, the relative risk 1.51.5, and interpreting it as "1.5 times as likely to pass with A".

Related dot points

Sources & how we know this