How do you test goodness of fit and independence using the chi-squared distribution?
Goodness of fit tests, contingency tables and tests for independence using the chi-squared statistic, expected frequencies, degrees of freedom, and Yates' correction.
A focused answer to the Edexcel A-Level Further Mathematics Further Statistics content on chi-squared tests, covering goodness of fit tests, contingency tables and tests for independence, calculating expected frequencies, choosing degrees of freedom, and applying Yates' correction.
Reviewed by: AI editorial process; not yet individually human-reviewed
What this dot point is asking
Edexcel Further Statistics wants you to carry out chi-squared goodness of fit tests against a proposed distribution, test for independence using a contingency table, compute expected frequencies, choose the correct degrees of freedom, apply Yates' correction where required, and reach a conclusion in context. A full hypothesis-test structure (hypotheses, statistic, comparison, contextual conclusion) is expected for the marks.
The chi-squared statistic
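Both the goodness of fit test and the test of independence compare observed counts with the counts expected under a null hypothesis, using the same statistic, $\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}$, where $O_i$ is the observed frequency and $E_i$ the expected frequency in class $i$. A large value means observed and expected diverge sharply, giving evidence against the null hypothesis; a small value is consistent with it. The statistic is then compared with a tabulated critical value at the chosen significance level and degrees of freedom, and the null hypothesis is rejected if the statistic exceeds that critical value.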
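As a rough illustration of how the statistic is built up class by class, here is a minimal Python sketch. The function name `chi_squared_statistic` and the data are hypothetical, chosen only to show the arithmetic.

```python
def chi_squared_statistic(observed, expected):
    """Return the sum over all classes of (O - E)^2 / E."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 observations in 4 classes, tested against a uniform model.
observed = [18, 12, 20, 10]
expected = [15, 15, 15, 15]

# (9 + 9 + 25 + 25) / 15 = 68 / 15, roughly 4.53
statistic = chi_squared_statistic(observed, expected)
print(round(statistic, 3))
```

Each class contributes its own squared difference scaled by the expected count, so a class with a small expected frequency can dominate the total.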
Goodness of fit and degrees of freedom
A goodness of fit test checks whether data are consistent with a proposed distribution (uniform, binomial, Poisson and so on). The degrees of freedom start as the number of classes minus one (because the totals must agree), then lose one further degree for each parameter you estimated from the data, such as a Poisson mean estimated from the sample.
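The counting rule above can be written as a one-line calculation. This is a sketch only; the helper name `goodness_of_fit_df` and the class counts in the examples are assumptions for illustration.

```python
def goodness_of_fit_df(num_classes, num_estimated_parameters=0):
    """Degrees of freedom: classes - 1 - parameters estimated from the data."""
    df = num_classes - 1 - num_estimated_parameters
    if df < 1:
        raise ValueError("too few classes for the number of estimated parameters")
    return df


# Hypothetical cases:
# a fully specified model with 6 classes (no parameters estimated)
uniform_df = goodness_of_fit_df(6)
# a Poisson model with 7 classes whose mean was estimated from the sample
poisson_df = goodness_of_fit_df(7, num_estimated_parameters=1)
print(uniform_df, poisson_df)
```

The count of classes should be taken after any classes have been combined, not before.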
Contingency tables and independence
A contingency table cross-classifies a sample by two factors, and the chi-squared test of independence checks whether the two factors are associated. The expected frequency in each cell, assuming independence, is the product of its row and column totals divided by the grand total.
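The expected-frequency rule can be sketched in Python as below. The function names and the 2 by 2 table are hypothetical. The `yates` option applies the usual form of Yates' correction, subtracting 0.5 from each $|O - E|$ before squaring, which is normally reserved for 2 by 2 tables; treat that usage as an assumption to check against your own course notes.

```python
def expected_frequencies(table):
    """Expected count per cell: row total * column total / grand total."""
    row_totals = [sum(row) for row in table]
    column_totals = [sum(column) for column in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in column_totals] for r in row_totals]


def independence_df(table):
    """Degrees of freedom: (rows - 1) * (columns - 1)."""
    return (len(table) - 1) * (len(table[0]) - 1)


def independence_statistic(table, yates=False):
    """Chi-squared statistic for independence, optionally with Yates' correction."""
    expected = expected_frequencies(table)
    total = 0.0
    for observed_row, expected_row in zip(table, expected):
        for o, e in zip(observed_row, expected_row):
            difference = abs(o - e)
            if yates:
                difference -= 0.5  # continuity correction
            total += difference ** 2 / e
    return total


# Hypothetical 2 by 2 table of observed counts.
table = [[30, 20], [10, 40]]
print(expected_frequencies(table))  # [[20.0, 30.0], [20.0, 30.0]]
print(independence_df(table))       # 1
print(independence_statistic(table), independence_statistic(table, yates=True))
```

The correction always shrinks the statistic, which makes the test slightly more conservative.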
Examples in context
Chi-squared tests are the inferential capstone of Further Statistics, drawing on the distributions studied earlier. Goodness of fit tests are most often applied to the Poisson and binomial models covered in the Poisson and binomial dot point (with the mean estimated from the data, costing a degree of freedom) and to the discrete distributions whose expectations you compute elsewhere. The expected-frequency calculation for contingency tables uses the multiplication rule for probabilities of independent events. Classes with small expected frequencies are combined because the chi-squared distribution is only an approximation to the discrete sampling distribution of the statistic, and that approximation is unreliable when expected frequencies are small.
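One way to picture the combining of small classes is the sketch below. It assumes the common rule of thumb that every expected frequency should be at least 5 (a threshold not stated in the text above) and it pools adjacent classes from left to right, which is only one of several reasonable pooling orders. The function name and data are hypothetical.

```python
def pool_small_classes(observed, expected, minimum=5):
    """Merge adjacent classes until each pooled expected frequency is >= minimum."""
    pooled_observed, pooled_expected = [], []
    running_observed, running_expected = 0, 0.0
    pending = False  # True while a partial class is waiting to be closed
    for o, e in zip(observed, expected):
        running_observed += o
        running_expected += e
        pending = True
        if running_expected >= minimum:
            pooled_observed.append(running_observed)
            pooled_expected.append(running_expected)
            running_observed, running_expected = 0, 0.0
            pending = False
    if pending:
        if pooled_expected:
            # Leftover tail is too small on its own: fold it into the last class.
            pooled_observed[-1] += running_observed
            pooled_expected[-1] += running_expected
        else:
            pooled_observed.append(running_observed)
            pooled_expected.append(running_expected)
    return pooled_observed, pooled_expected


# Hypothetical frequencies: the last two classes have expected counts below 5.
observed = [12, 20, 15, 8, 3, 2]
expected = [10.5, 18.0, 16.5, 9.0, 4.0, 2.0]
pooled_observed, pooled_expected = pool_small_classes(observed, expected)
print(pooled_observed, pooled_expected)
```

After pooling, the degrees of freedom are counted from the reduced number of classes, here 5 rather than 6.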
Try this
Q1. State the degrees of freedom for a contingency table. [1 mark]
- Cue. $(r - 1)(c - 1)$, where $r$ is the number of rows and $c$ the number of columns.
Q2. Write down the chi-squared test statistic formula. [1 mark]
- Cue. $\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}$.
Q3. A Poisson goodness of fit test has classes and the mean was estimated from the data. State the degrees of freedom. [2 marks]
- Cue. Number of classes minus 2 (one lost for the total, one for the estimated mean).
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel 2019, 8 marks
A die is rolled times. The observed frequencies of the scores to are . Test, at the significance level, whether the die is fair. The critical value of $\chi^2$ with degrees of freedom at is .
State hypotheses, compute expected frequencies, the test statistic, and compare with the critical value.
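$H_0$: the die is fair (each score equally likely); $H_1$: the die is not fair (B1). Under $H_0$, each expected frequency is the total number of rolls divided by the number of possible scores (M1 A1).
The test statistic is $\chi^2 = \sum \frac{(O - E)^2}{E}$, summed over the scores, with $O$ the observed and $E$ the expected frequency (M1 A1 A1).
Degrees of freedom: the number of classes minus one, since no parameter is estimated from the data. Since the test statistic is less than the critical value, we do not reject $H_0$ (M1). There is insufficient evidence at the stated significance level to conclude the die is unfair (A1).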
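The same sequence of steps can be sketched end to end in Python. The observed frequencies below are hypothetical and are not the data from the question above; the critical value 11.070 is the standard tabulated 5% value for 5 degrees of freedom, and the 5% level is itself an assumption made for this illustration.

```python
def fair_die_test(observed, critical_value):
    """Goodness of fit test of a uniform model; returns (statistic, df, reject)."""
    total = sum(observed)
    expected = total / len(observed)  # equal expected frequency for every score
    statistic = sum((o - expected) ** 2 / expected for o in observed)
    degrees_of_freedom = len(observed) - 1  # no parameters estimated
    reject_null = statistic > critical_value
    return statistic, degrees_of_freedom, reject_null


# Hypothetical frequencies for scores 1 to 6 from 60 rolls (expected 10 each).
observed = [8, 12, 9, 11, 14, 6]
statistic, degrees_of_freedom, reject_null = fair_die_test(observed, 11.070)

# (4 + 4 + 1 + 1 + 16 + 16) / 10 = 4.2, which is below 11.070
print(round(statistic, 2), degrees_of_freedom, reject_null)
```

With these illustrative counts the statistic falls below the critical value, so the null hypothesis of a fair die would not be rejected.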
Edexcel 2022, 7 marks
A survey of people records gender against preferred drink in a contingency table. Explain how to find the expected frequencies and state the degrees of freedom. Given the test statistic is and the critical value at is , state the conclusion.
Describe the expected-frequency rule, give the degrees of freedom, then compare and conclude.
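$H_0$: drink preference is independent of gender; $H_1$: they are not independent (B1).
Each expected frequency is $\frac{\text{row total} \times \text{column total}}{\text{grand total}}$ (M1 A1).
Degrees of freedom $= (\text{rows} - 1)(\text{columns} - 1)$ (M1 A1).
Since the test statistic exceeds the critical value, reject $H_0$ (M1). There is evidence at the stated significance level that drink preference is associated with gender (A1).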
Related dot points
- The Poisson distribution as a model for random events, its mean and variance, the binomial distribution, the additive property of Poisson variables, and the Poisson approximation to the binomial.
A focused answer to the Edexcel A-Level Further Mathematics Further Statistics content on the Poisson and binomial distributions, covering the Poisson model and its mean and variance, the binomial distribution, the additive property of independent Poisson variables, and the Poisson approximation to the binomial.
- Discrete random variables and probability distributions, expectation and variance, the effect of linear coding, and expectation and variance of functions of a discrete variable.
A focused answer to the Edexcel A-Level Further Mathematics Further Statistics content on discrete probability distributions, covering discrete random variables and their distributions, expectation and variance, the effect of linear coding, and the expectation and variance of functions of a discrete random variable.
- The geometric distribution as a model for the trial of the first success, the negative binomial distribution for the rth success, and their means and variances.
A focused answer to the Edexcel A-Level Further Mathematics Further Statistics content on the geometric and negative binomial distributions, covering the geometric model for the trial of the first success, the negative binomial model for the rth success, and the means and variances of both distributions.
- Summing series of powers of integers, relationships between roots and coefficients of polynomials, transforming equations with new roots, and the method of differences.
A focused answer to the Edexcel A-Level Further Mathematics further algebra content, covering standard summation formulae for powers of integers, the relationships between roots and coefficients of polynomials, forming equations with transformed roots, and the method of differences.
Sources & how we know this
- Pearson Edexcel A-Level Further Mathematics (9FM0) specification — Pearson Edexcel (2017)