Skip to main content
ScotlandStatisticsSyllabus dot point

How do you set up a hypothesis test and decide whether to reject the null hypothesis?

Set up null and alternative hypotheses, choose a significance level, compute and use a test statistic and p-value, decide between one- and two-tailed tests, identify the critical region, and distinguish Type I and Type II errors.

A focused answer to the SQA Advanced Higher Statistics hypothesis testing framework: forming null and alternative hypotheses, the significance level, the test statistic, the p-value and critical region, one- and two-tailed tests, and Type I and Type II errors.

Generated by Claude Opus 4.812 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Setting up the hypotheses
  3. The significance level, test statistic and p-value
  4. One- and two-tailed tests and the critical region
  5. Type I and Type II errors
  6. Try this

What this dot point is asking

Hypothesis testing is the formal procedure for deciding whether sample data give convincing evidence against a claim. The SQA wants you to state hypotheses correctly, fix a significance level, compute a test statistic and p-value, choose between a one- and two-tailed test, identify the critical region, and reach a conclusion while understanding the two kinds of error you might make. This framework is shared by every named test in the area.

Setting up the hypotheses

Every test begins with a pair of hypotheses about a population parameter.

You never "prove" H0H_0; you either reject it (the evidence is strong enough) or fail to reject it (the evidence is insufficient). This asymmetry is deliberate: the test protects H0H_0 unless the data are convincing.

The significance level, test statistic and p-value

The decision rests on comparing a p-value with a pre-chosen significance level.

One- and two-tailed tests and the critical region

Whether you look in one tail or both depends entirely on the alternative hypothesis.

Type I and Type II errors

Because the decision is based on a sample, two mistakes are possible.

Try this

Q1. A researcher tests H0:μ=20H_0: \mu = 20 against H1:μ>20H_1: \mu > 20 at α=0.05\alpha = 0.05 and obtains a p-value of 0.080.08. State the conclusion. [1 mark]

  • Cue. Since 0.08>0.050.08 > 0.05, do not reject H0H_0: there is insufficient evidence at the 5%5\% level that the mean exceeds 2020.

Q2. State which error becomes more likely if the significance level is lowered from 5%5\% to 1%1\%, keeping the sample size fixed. [1 mark]

  • Cue. A Type II error becomes more likely, because a stricter rejection threshold makes a true effect harder to detect.

Exam-style practice questions

Practice questions written in the style of SQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AH style: set up test3 marksA manufacturer claims the mean lifetime of a bulb is 10001000 hours. A consumer group suspects it is less. State the null and alternative hypotheses, say whether the test is one- or two-tailed, and explain how the significance level is used.
Show worked answer →

Hypotheses: H0:μ=1000H_0: \mu = 1000 against H1:μ<1000H_1: \mu < 1000, where μ\mu is the true mean lifetime (1 mark).

The test is one-tailed (lower tail) because the suspicion is specifically that the mean is less than 10001000, not merely different (1 mark).

The significance level α\alpha (for example 0.050.05) is the probability of rejecting H0H_0 when it is in fact true; if the p-value is less than α\alpha, or the test statistic falls in the critical region, H0H_0 is rejected (1 mark). Markers reward the correctly directed hypotheses, the one-tailed identification and the role of α\alpha.

AH style: errors3 marksDefine a Type I and a Type II error in hypothesis testing, and state what reducing the significance level does to each.
Show worked answer →

A Type I error is rejecting H0H_0 when it is actually true (a false positive); its probability is the significance level α\alpha (1 mark).

A Type II error is failing to reject H0H_0 when it is actually false (a false negative); its probability is denoted β\beta (1 mark).

Reducing α\alpha lowers the chance of a Type I error but, for a fixed sample size, raises the chance of a Type II error, because a stricter rejection rule makes it harder to detect a real effect (1 mark). Markers reward both definitions and the trade-off between the two error types.

Related dot points

Sources & how we know this