Skip to main content
EnglandStatisticsSyllabus dot point

How does the capture-recapture method estimate the size of a population you cannot count directly?

The Petersen capture-recapture formula to estimate a population size; the assumptions the method relies on and their appropriateness; the role of sample size in the reliability of the estimate.

A focused answer to Edexcel GCSE Statistics (Higher tier) on the capture-recapture method, covering the Petersen formula to estimate a population size, the assumptions it relies on and their appropriateness, and how sample size affects the reliability of the estimate.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The capture-recapture idea
  3. The Petersen formula
  4. The assumptions
  5. Sample size and reliability
  6. Why capture-recapture is used

What this dot point is asking

Edexcel Higher tier code 2h.02 requires you to apply the Petersen capture-recapture formula to estimate the size of a population that cannot be counted directly, and to know the assumptions the method relies on and judge their appropriateness in practice. It is the classic technique for estimating wildlife populations, and it connects sampling, proportion and inference.

The capture-recapture idea

The logic is a proportion: if a known number of individuals are marked and released, then the fraction of the second sample that is marked should equal the fraction of the whole population that is marked. Turning that equality round gives the population estimate.

The Petersen formula

So for 6060 fish marked, a recapture of 8080 fish containing 1212 marked, N=60×8012=400N = \frac{60 \times 80}{12} = 400. Set up the proportion carefully (the marked fraction of the recapture equals the marked fraction of the population), then solve for NN.

The assumptions

The estimate is only valid if the method's assumptions hold:

  1. The population is closed: no births, deaths, immigration or emigration between the two samples.
  2. Marked individuals mix back evenly with the rest of the population before the second sample.
  3. The marks do not come off and do not change an individual's chance of being caught (or its survival).
  4. Every individual is equally likely to be caught in each sample.

Edexcel expects you to state these assumptions and to comment on whether they are reasonable in a given context. For example, if marking makes fish easier for predators to catch, the closed-population and equal-catch assumptions break down, and the estimate becomes unreliable.

Sample size and reliability

As with all estimation, larger samples give a more reliable result. A bigger second sample contains more individuals, so the proportion of marked ones is estimated more accurately and random variation has less effect on NN. A very small recapture (few marked individuals) makes the estimate volatile, because a change of one or two marked recaptures swings NN a long way.

You can see this sensitivity directly in the formula N=M×nmN = \frac{M \times n}{m}: because mm is in the denominator, a small mm makes NN change sharply if mm moves by even one. For instance, with M=60M = 60 and n=80n = 80, a recapture of m=12m = 12 gives N=400N = 400, but m=10m = 10 gives N=480N = 480 and m=15m = 15 gives N=320N = 320. Catching a larger second sample (so mm is larger) reduces this instability and narrows the likely range of the estimate.

Why capture-recapture is used

Capture-recapture matters because many populations cannot be counted directly: you cannot line up and count every fish in a lake, every bird in a wood or every insect in a field. By marking and recapturing, you turn an impossible census into a manageable pair of samples, then use proportion to infer the whole. The same idea is used in real ecology and conservation to monitor wildlife numbers over time. Understanding both how to apply the formula and when its assumptions make it trustworthy is what the qualification rewards, because a method is only as good as the assumptions behind it.

Exam-style practice questions

Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Edexcel 1ST0 20214 marksTo estimate the number of fish in a lake, 6060 fish are caught, marked and released. Later 8080 fish are caught, of which 1212 are marked. Use the capture-recapture method to estimate the number of fish in the lake.
Show worked answer →

The Petersen estimate uses marked in second sampletotal second sample=total markedpopulation\frac{\text{marked in second sample}}{\text{total second sample}} = \frac{\text{total marked}}{\text{population}}.

So 1280=60N\frac{12}{80} = \frac{60}{N}, giving N=60×8012=480012=400N = \frac{60 \times 80}{12} = \frac{4800}{12} = 400.

The estimated number of fish in the lake is 400400.

Markers reward setting up the proportion (marked fraction in the recapture equals the marked fraction of the population), and the estimate N=400N = 400.

Edexcel 1ST0 20224 marksA student uses capture-recapture to estimate a population of birds. (a) State two assumptions the method relies on. (b) Explain why catching a larger second sample would improve the estimate.
Show worked answer →

(a) Any two assumptions: the population is closed (no births, deaths, immigration or emigration between the two samples); marked individuals mix back evenly with the population; the marks do not come off and do not affect the chance of being caught; every individual is equally likely to be caught.

(b) A larger second sample includes more individuals, so the proportion of marked ones is estimated more accurately, reducing the effect of random variation and making the population estimate more reliable.

Markers reward two valid assumptions and the explanation that a larger sample gives a more reliable estimate.

Related dot points

Sources & how we know this