How does the capture-recapture method estimate the size of a population you cannot count directly?
The Petersen capture-recapture formula to estimate a population size; the assumptions the method relies on and their appropriateness; the role of sample size in the reliability of the estimate.
A focused answer to Edexcel GCSE Statistics (Higher tier) on the capture-recapture method, covering the Petersen formula to estimate a population size, the assumptions it relies on and their appropriateness, and how sample size affects the reliability of the estimate.
Reviewed by: AI editorial process; not yet individually human-reviewed
What this dot point is asking
Edexcel Higher tier code 2h.02 requires you to apply the Petersen capture-recapture formula to estimate the size of a population that cannot be counted directly, and to know the assumptions the method relies on and judge their appropriateness in practice. It is the classic technique for estimating wildlife populations, and it connects sampling, proportion and inference.
The capture-recapture idea
The logic is a proportion: if a known number of individuals are marked and released, then the fraction of the second sample that is marked should equal the fraction of the whole population that is marked. Turning that equality round gives the population estimate.
The Petersen formula
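Write $M$ for the number of individuals marked and released, $n$ for the size of the second sample (the recapture), $m$ for the number of marked individuals in the recapture, and $N$ for the population size. So for $M$ fish marked, a recapture of $n$ fish containing $m$ marked, $N = \frac{M \times n}{m}$. Set up the proportion carefully (the marked fraction of the recapture equals the marked fraction of the population, so $\frac{m}{n} = \frac{M}{N}$), then solve for $N$.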
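As a minimal sketch, the formula can be written as a small Python function. The function name, parameter names and the numbers in the call are illustrative assumptions, not taken from any exam paper.

```python
def petersen_estimate(marked_first, second_sample, marked_in_second):
    """Petersen estimate: (number marked) x (second sample size) / (marked recaptures)."""
    if marked_in_second <= 0:
        # With no marked recaptures the formula would divide by zero.
        raise ValueError("need at least one marked individual in the second sample")
    if marked_in_second > min(marked_first, second_sample):
        raise ValueError("marked recaptures cannot exceed the number marked or the sample size")
    return marked_first * second_sample / marked_in_second


# Hypothetical data: 40 marked, then 50 caught, of which 10 are marked.
print(petersen_estimate(40, 50, 10))  # 200.0
```

Here 10 out of 50 (one fifth) of the recapture is marked, so the 40 marked individuals are taken to be one fifth of the population, giving 200.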
The assumptions
The estimate is only valid if the method's assumptions hold:
- The population is closed: no births, deaths, immigration or emigration between the two samples.
- Marked individuals mix back evenly with the rest of the population before the second sample.
- The marks do not come off and do not change an individual's chance of being caught (or its survival).
- Every individual is equally likely to be caught in each sample.
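Edexcel expects you to state these assumptions and to comment on whether they are reasonable in a given context. For example, if marking makes fish easier for predators to catch, marked fish die at a higher rate than unmarked fish, so the closed-population assumption and the assumption that marks do not affect survival both break down. Fewer marked fish then turn up in the recapture, and the estimate comes out too high.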
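A rough simulation can illustrate what a broken assumption does to the estimate. The sketch below assumes a hypothetical lake of 1000 fish with 200 marked, and compares the case where every marked fish survives with a case where each marked fish survives to the second sample with probability 0.5. All numbers, names and the survival model are assumptions made for illustration.

```python
import random


def simulate_estimate(population, marked, second_sample, marked_survival, rng):
    """Run one capture-recapture trial; return the Petersen estimate, or None if m = 0."""
    # Each marked individual survives to the second sample with probability marked_survival.
    surviving_marked = sum(rng.random() < marked_survival for _ in range(marked))
    unmarked = population - marked
    # Individuals alive at the second sample: True means marked.
    pool = [True] * surviving_marked + [False] * unmarked
    recaptured_marked = sum(rng.sample(pool, second_sample))
    if recaptured_marked == 0:
        return None
    return marked * second_sample / recaptured_marked


def mean_estimate(marked_survival, trials=500, seed=1):
    """Average the estimate over many simulated trials (trials with m = 0 are skipped)."""
    rng = random.Random(seed)
    results = [simulate_estimate(1000, 200, 200, marked_survival, rng) for _ in range(trials)]
    results = [r for r in results if r is not None]
    return sum(results) / len(results)


fair = mean_estimate(1.0)    # marks have no effect on survival
biased = mean_estimate(0.5)  # hypothetical: half the marked fish are lost to predators
print(round(fair), round(biased))
```

In this sketch the first average should land close to the true 1000, while the second should be far higher, because the recapture contains fewer marked fish than the formula expects.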
Sample size and reliability
As with all estimation, larger samples give a more reliable result. A bigger second sample contains more individuals, so the proportion of marked ones is estimated more accurately and random variation has less effect on the estimate $N$. A very small recapture (few marked individuals) makes the estimate volatile, because a change of one or two marked recaptures swings $N$ a long way.
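The effect of the second sample size can be explored with a short simulation. This is a sketch under assumed conditions: a hypothetical population of 1000 with 100 marked, all assumptions of the method holding, and second samples of 50, 200 and 500. Trials in which no marked individual is recaptured are skipped, since the formula cannot be used there.

```python
import random
import statistics


def petersen_runs(population, marked, second_sample, trials, rng):
    """Repeat the second sample many times and collect the Petersen estimates."""
    pool = [True] * marked + [False] * (population - marked)  # True means marked
    estimates = []
    for _ in range(trials):
        recaptured_marked = sum(rng.sample(pool, second_sample))
        if recaptured_marked > 0:
            estimates.append(marked * second_sample / recaptured_marked)
    return estimates


rng = random.Random(42)
spread = {}
for second_sample in (50, 200, 500):
    runs = petersen_runs(1000, 100, second_sample, 1000, rng)
    spread[second_sample] = statistics.stdev(runs)
    print(second_sample, round(statistics.mean(runs)), round(spread[second_sample]))
```

The last number on each printed line is the standard deviation of the estimates, which should shrink as the second sample grows.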
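You can see this sensitivity directly in the formula $N = \frac{M \times n}{m}$, where $M$ is the number marked, $n$ is the size of the second sample and $m$ is the number of marked individuals recaptured: because $m$ is in the denominator, a small $m$ makes $N$ change sharply if $m$ moves by even one. For instance, with $M$ and $n$ fixed, a drop from $m = 2$ to $m = 1$ doubles the estimate, whereas a drop from $m = 20$ to $m = 19$ raises it by only about 5%. Catching a larger second sample (so $m$ tends to be larger) reduces this instability and narrows the likely range of the estimate.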
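The same sensitivity can be checked numerically. The sketch below uses hypothetical values (60 marked, second sample of 80) and reports how much the estimate rises, as a percentage, when the number of marked recaptures falls by one.

```python
def petersen_estimate(marked, second_sample, marked_in_second):
    """Petersen estimate: marked x second sample size / marked recaptures."""
    return marked * second_sample / marked_in_second


# Hypothetical values chosen for illustration only.
MARKED, SECOND_SAMPLE = 60, 80


def relative_jump(marked_in_second):
    """Fractional rise in the estimate when the marked recaptures drop by one."""
    before = petersen_estimate(MARKED, SECOND_SAMPLE, marked_in_second)
    after = petersen_estimate(MARKED, SECOND_SAMPLE, marked_in_second - 1)
    return after / before - 1


for m in (3, 6, 12, 24):
    print(m, round(100 * relative_jump(m), 1))
```

Losing one marked recapture raises the estimate by a fraction 1/(m - 1) of its previous value, so the jump is 50% at m = 3 but only a few percent at m = 24.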
Why capture-recapture is used
Capture-recapture matters because many populations cannot be counted directly: you cannot line up and count every fish in a lake, every bird in a wood or every insect in a field. By marking and recapturing, you turn an impossible census into a manageable pair of samples, then use proportion to infer the whole. The same idea is used in real ecology and conservation to monitor wildlife numbers over time. Understanding both how to apply the formula and when its assumptions make it trustworthy is what the qualification rewards, because a method is only as good as the assumptions behind it.
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel 1ST0 2021, 4 marks. To estimate the number of fish in a lake, $M$ fish are caught, marked and released. Later $n$ fish are caught, of which $m$ are marked. Use the capture-recapture method to estimate the number of fish in the lake.
The Petersen estimate uses $\frac{m}{n} = \frac{M}{N}$, where $N$ is the number of fish in the lake.
So $m \times N = M \times n$, giving $N = \frac{M \times n}{m}$.
The estimated number of fish in the lake is $N = \frac{M \times n}{m}$.
Markers reward setting up the proportion (marked fraction in the recapture equals the marked fraction of the population), and the estimate $N$.
Edexcel 1ST0 2022, 4 marks. A student uses capture-recapture to estimate a population of birds. (a) State two assumptions the method relies on. (b) Explain why catching a larger second sample would improve the estimate.
(a) Any two assumptions: the population is closed (no births, deaths, immigration or emigration between the two samples); marked individuals mix back evenly with the population; the marks do not come off and do not affect the chance of being caught; every individual is equally likely to be caught.
(b) A larger second sample includes more individuals, so the proportion of marked ones is estimated more accurately, reducing the effect of random variation and making the population estimate more reliable.
Markers reward two valid assumptions and the explanation that a larger sample gives a more reliable estimate.
Related dot points
- Using summary statistics to estimate population characteristics; estimating the population mean from a sample; predicting population proportions; the effect of sample size on reliability and replication.
A focused answer to Edexcel GCSE Statistics on statistical inference, covering using summary statistics to estimate population characteristics, estimating the population mean from a sample, predicting population proportions, and how sample size affects reliability and replication.
- Population, sampling frame and sample; simple random, systematic, stratified, quota, cluster, judgement and opportunity sampling; selecting random members; calculating strata sizes.
A focused answer to Edexcel GCSE Statistics on sampling, covering population, sampling frame and sample, simple random, systematic, stratified, quota, cluster, judgement and opportunity sampling, selecting random members electronically, and calculating stratified sample sizes.
- The probability scale and language of likelihood; calculating theoretical probability; estimating probability from data using relative frequency; experimental probability tending to theoretical as trials increase.
A focused answer to Edexcel GCSE Statistics on probability basics, covering the probability scale and language of likelihood, theoretical probability, estimating probability from data using relative frequency, and why experimental probability tends towards theoretical probability as the number of trials increases.
- Mode, median and mean for discrete and grouped data; estimating the mean of grouped data with midpoints; linear interpolation for the median; weighted and geometric mean; effect of changes and transformations on averages.
A focused answer to Edexcel GCSE Statistics on averages, covering mode, median and mean for discrete and grouped data, estimating the mean with class midpoints, linear interpolation for the median, weighted and geometric mean at Higher tier, and the effect of changes and transformations.
Sources & how we know this
- Pearson Edexcel GCSE (9-1) Statistics (1ST0) specification — Pearson Edexcel (2017)