Skip to main content
EnglandStatisticsSyllabus dot point

How do you measure probability, and how does relative frequency estimate it?

The probability scale and language of likelihood; calculating theoretical probability; estimating probability from data using relative frequency; experimental probability tending to theoretical as trials increase.

A focused answer to Edexcel GCSE Statistics on probability basics, covering the probability scale and language of likelihood, theoretical probability, estimating probability from data using relative frequency, and why experimental probability tends towards theoretical probability as the number of trials increases.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The probability scale and likelihood
  3. Theoretical probability
  4. Relative frequency
  5. Experimental tends to theoretical
  6. Mutually exclusive outcomes and the total

What this dot point is asking

Edexcel codes 3p.01, 3p.02 and 3p.06 require you to use the probability scale and the language of likelihood, to calculate theoretical probability, to estimate probability from collected data using relative frequency, and to understand that experimental probability tends towards theoretical probability as the number of trials increases. This is the foundation for every later probability topic, including the distributions.

The probability scale and likelihood

You should be able to place events on the scale and use the language of likelihood: impossible, very unlikely, unlikely, evens, likely, very likely, certain. The probabilities of all the possible outcomes of an event add up to 11, which gives the complement rule: P(not A)=1P(A)P(\text{not } A) = 1 - P(A).

Theoretical probability

When all outcomes are equally likely, the theoretical probability of an event is

For example, a fair six-sided dice has P(even)=36=12P(\text{even}) = \frac{3}{6} = \frac{1}{2}. This only works when the outcomes really are equally likely; for a biased dice or a real-world event you cannot count outcomes this way, so you estimate from data instead.

Relative frequency

This is how Edexcel expects you to estimate probabilities for biased or real situations: roll the dice many times, record the results, and divide. Relative frequency is the bridge between collected data (the rest of the course) and probability.

Experimental tends to theoretical

A key idea (code 3p.06) is that as the number of trials increases, the relative frequency gets closer to the true probability, provided the trials are random. A handful of trials can give a misleading estimate because of random variation, but thousands of trials average this out. This is why a larger experiment gives a more reliable estimate, and why you can use a long run of data to estimate the probability of a biased event with confidence.

If you plot the relative frequency against the number of trials, the graph swings about wildly at first and then settles down towards a horizontal level: that level is the estimate of the true probability. Edexcel may show you such a graph and ask what the probability is converging to, or ask you to read off the most reliable estimate (the value at the largest number of trials). The same principle explains why estimates of population parameters improve with larger samples, linking probability back to the rest of the course.

Mutually exclusive outcomes and the total

For a single event, the probabilities of all the separate outcomes must add up to 11, because one of them is certain to happen. This lets you find a missing probability: if a spinner can land on red, blue or green with P(red)=0.5P(\text{red}) = 0.5 and P(blue)=0.2P(\text{blue}) = 0.2, then P(green)=10.50.2=0.3P(\text{green}) = 1 - 0.5 - 0.2 = 0.3. It also underpins the complement rule, and it is the quickest route to "at least one" style questions, where P(at least one)=1P(none)P(\text{at least one}) = 1 - P(\text{none}). Checking that your probabilities sum to 11 is a fast way to catch arithmetic slips.

Exam-style practice questions

Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Edexcel 1ST0 20194 marksA biased dice is rolled 200200 times. It lands on six 5050 times. (a) Estimate the probability of rolling a six with this dice. (b) Explain why this estimate is more reliable than one based on 2020 rolls.
Show worked answer →

(a) Relative frequency =number of sixesnumber of rolls=50200=0.25= \frac{\text{number of sixes}}{\text{number of rolls}} = \frac{50}{200} = 0.25.

(b) A larger number of trials gives a more reliable estimate, because experimental probability tends towards the true (theoretical) probability as the number of trials increases. With only 2020 rolls, random variation has a much bigger effect on the estimate.

Markers reward the relative frequency 0.250.25 and the explanation that more trials give a more reliable estimate (experimental tends to theoretical).

Edexcel 1ST0 20213 marksA bag contains 55 red, 33 blue and 22 green counters. One counter is taken at random. (a) Write down the probability it is blue. (b) On a probability scale from 00 to 11, describe the likelihood of taking a counter that is not green.
Show worked answer →

(a) P(blue)=310=0.3P(\text{blue}) = \frac{3}{10} = 0.3 (there are 1010 counters in total).

(b) P(not green)=1P(green)=1210=810=0.8P(\text{not green}) = 1 - P(\text{green}) = 1 - \frac{2}{10} = \frac{8}{10} = 0.8. On the scale this is close to 11, so taking a counter that is not green is "likely".

Markers reward 310\frac{3}{10}, using the complement for "not green" to get 0.80.8, and describing it as likely (near 11).

Related dot points

Sources & how we know this