EnglandFurther MathsSyllabus dot point

How do you carry out hypothesis tests on a Poisson mean and on a population mean, and what do Type I and Type II errors mean?

Hypothesis tests for the mean of a Poisson distribution, tests for a population mean using the normal distribution, one-tailed and two-tailed tests, and the meaning of Type I and Type II errors.

A focused answer to the AQA A-Level Further Mathematics hypothesis testing content, covering tests for the mean of a Poisson distribution, tests for a population mean using the normal distribution, one-tailed and two-tailed tests, and the meaning of Type I and Type II errors.

Generated by Claude Opus 4.811 min answerUpdated 2026-06-02

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this dot point is asking
Setting up a test
Test for a Poisson mean
Test for a population mean
Type I and Type II errors

What this dot point is asking

AQA wants you to set up and carry out hypothesis tests for the mean of a Poisson distribution and for a population mean using the normal distribution, to distinguish one-tailed and two-tailed tests, to compare a test statistic with a critical value or a probability with the significance level, and to explain Type I and Type II errors.

Setting up a test

Every test follows the same skeleton, and marks are lost by skipping a step rather than by hard calculation. State the null hypothesis $H_0$ (the value being assumed) and the alternative $H_1$ (what you are testing for). Choose the significance level, usually $5\%$ or $1\%$ . Decide on one tail or two: a claim that a parameter has increased or decreased is one-tailed, while a claim that it has merely changed is two-tailed, splitting the significance level between the two tails. Then either find the critical region in advance and check whether the observation falls in it, or compute the probability of a result as extreme as observed and compare it with the significance level. Finally, state the conclusion in the context of the original problem, not just as accept or reject.

Test for a Poisson mean

For a Poisson test the test statistic is the observed count itself, and you work with exact Poisson probabilities or cumulative tables rather than a continuous approximation. For an upper-tail test the relevant probability is $P(X \geq \text{observed})$ , computed as $1 - P(X \leq \text{observed} - 1)$ from cumulative tables; for a lower-tail test it is $P(X \leq \text{observed})$ directly.

Poisson hypothesis test

A process has historic rate $\text{Po}(4)$ . After a change, $9$ events are observed in one interval. Test at the $5\%$ level whether the rate has increased. Use $P(X \leq 8) = 0.9786$ for $X \sim \text{Po}(4)$ .

State the hypotheses

$H_0: \lambda = 4$ and $H_1: \lambda > 4$ . The wording "increased" makes this one-tailed.

Find the probability of the observed result

Under $H_0$ , $P(X \geq 9) = 1 - P(X \leq 8) = 1 - 0.9786 = 0.0214$ .

Compare with the significance level

$0.0214 < 0.05$ , so the observed result falls in the critical region.

Conclude in context

Reject $H_0$ : there is evidence at the $5\%$ level that the rate of events has increased.

Test for a population mean

This $Z$ test relies on the sampling distribution of the mean. If the population is normal with known variance, the sample mean $\bar{X}$ is normally distributed with mean $\mu$ and standard error $\frac{\sigma}{\sqrt{n}}$ , so standardising gives the $Z$ statistic. By the central limit theorem the same test is approximately valid for a large sample from any population, even one that is not normal, which is why it appears so widely. A large absolute value of $Z$ means the observed sample mean is many standard errors away from the hypothesised mean, which is unlikely if $H_0$ is true, so $H_0$ is rejected.

Type I and Type II errors

The two error types pull against each other, which is the heart of choosing a significance level. A very small significance level makes you reluctant to reject $H_0$ , so you rarely raise a false alarm (low Type I rate) but more often miss a genuine effect (high Type II rate). Increasing the sample size is the only way to reduce both at once, because a larger sample sharpens the sampling distribution and separates the competing hypotheses more cleanly. In context, a Type I error is a false positive (acting on a change that is not real) and a Type II error is a false negative (missing a change that is real); which is worse depends on the practical consequences.

Exam-style practice questions

Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AQA 20207 marksThe number of flaws in rolls of fabric follows a Poisson distribution with historic mean

2

flaws per roll. After a change to the process, a single roll is inspected and found to contain

6

flaws. Test at the

5\%

significance level whether the mean number of flaws has increased. Use

P(X \leq 5) = 0.9834

for

X \sim \text{Po}(2)

Show worked answer →

State the hypotheses. $H_0: \lambda = 2$ , $H_1: \lambda > 2$ (one-tailed, because the question asks about an increase).

The test is one-tailed at $5\%$ . Find the probability of a result as extreme as observed, assuming $H_0$ is true: $P(X \geq 6) = 1 - P(X \leq 5) = 1 - 0.9834 = 0.0166$ .

Compare with the significance level: $0.0166 < 0.05$ .

Since the probability of the observed result (or more extreme) is less than $5\%$ , the result lies in the critical region, so reject $H_0$ .

Conclusion in context: there is evidence at the $5\%$ level that the mean number of flaws per roll has increased.

Markers reward the one-tailed hypotheses, computing $P(X \geq 6)$ as $1 - P(X \leq 5)$ , the comparison with $0.05$ , and the contextual conclusion.

AQA 20226 marksThe weights of bags filled by a machine are normally distributed with known standard deviation

\sigma = 4

g and supposed mean

500

g. A random sample of

16

bags has mean weight

497.5

g. Test at the

5\%

level whether the mean weight differs from

500

g, and explain what a Type I error would mean in this context.

Show worked answer →

State the hypotheses for a two-tailed test. $H_0: \mu = 500$ , $H_1: \mu \neq 500$ .

Compute the test statistic $Z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} = \frac{497.5 - 500}{4 / \sqrt{16}} = \frac{-2.5}{1} = -2.5$ .

For a two-tailed $5\%$ test the critical values are $\pm 1.96$ . Since $-2.5 < -1.96$ , the test statistic lies in the critical region.

Reject $H_0$ : there is evidence at the $5\%$ level that the mean weight differs from $500$ g.

A Type I error here would be concluding the mean weight has changed (rejecting $H_0$ ) when in fact the machine is still correctly set at $500$ g. Its probability equals the significance level, $5\%$ .

Markers reward the two-tailed hypotheses, the $Z$ statistic, comparison with $\pm 1.96$ , the conclusion, and a correct contextual Type I error description.

Related dot points

Sources & how we know this

AQA A-level Further Mathematics (7367) specification — AQA (2017)