Skip to main content
ScotlandApplications of Mathematics

SQA Higher Applications of Mathematics Statistics and Probability: diagrams, correlation, hypothesis testing and expectation

A deep-dive SQA Higher Applications of Mathematics guide to Statistics and Probability. Covers statistical diagrams and sampling, Pearson's correlation and linear regression, hypothesis tests with p-values and confidence intervals, and probability with expected value.

Generated by Claude Opus 4.815 min readHigher

Reviewed by: AI editorial process; not yet individually human-reviewed

Jump to a section
  1. What Statistics and Probability actually demands
  2. Statistical diagrams and sampling
  3. Correlation and regression
  4. Hypothesis testing and confidence intervals
  5. Probability and expectation
  6. How Statistics and Probability is examined
  7. Check your knowledge

What Statistics and Probability actually demands

Statistics is the largest single strand of the course and the basis of the project, which is a statistics report. The examiners test whether you can describe and compare data, measure relationships, make and judge inferences, and reason about uncertainty. Software produces the numbers; the marks are for choosing the right tool, interpreting the output, and concluding in context. This guide ties together the four dot-point pages of the module.

Statistical diagrams and sampling

The descriptive foundation: choose a diagram to suit the data (a box plot for comparing distributions, a histogram for grouped continuous data), and compare two data sets by one measure of centre (median or mean) and one of spread (interquartile range or standard deviation). An outlier lies beyond Q11.5×IQRQ_1 - 1.5 \times \text{IQR} or Q3+1.5×IQRQ_3 + 1.5 \times \text{IQR}. Watch for misleading graphs such as a non-zero axis, and sample without bias using a method such as random, systematic or stratified sampling.

Correlation and regression

Pearson's correlation coefficient rr measures linear association from 1-1 to +1+1, stating strength and direction. The least-squares regression line y=a+bxy = a + bx has slope bb (change in yy per unit of xx) and intercept aa (the value of yy at x=0x = 0), and predicts yy from xx. Trust interpolation within the data and treat extrapolation beyond it with caution. A strong correlation never proves causation: allow for a confounding variable.

Hypothesis testing and confidence intervals

A hypothesis test weighs evidence against a null hypothesis H0H_0. Software returns a p-value, the probability of a result at least as extreme as observed if H0H_0 is true; reject H0H_0 when the p-value is below the significance level. A t-test is used when the population standard deviation is estimated from the sample (the usual case) and a z-test when it is known. A confidence interval gives a plausible range for a population value; a higher confidence level widens it and a larger sample narrows it.

Probability and expectation

A probability lies between 00 and 11. Combine events with the multiplication rule for "and" (independent events) and the addition rule for "or", and organise multi-stage events with tree diagrams, multiplying along paths and adding across them. Without replacement, later probabilities are conditional. The expected value E=(value×probability)E = \sum (\text{value} \times \text{probability}) is the long-run average outcome and the link to decision making under risk.

How Statistics and Probability is examined

A typical SQA profile for this area:

  • Describing data. Choosing and reading diagrams, comparing centre and spread, flagging outliers and misleading graphs.
  • Relationships. Interpreting rr and a regression line, predicting, and cautioning about causation.
  • Inference. Stating hypotheses, interpreting a p-value and a confidence interval, choosing a t-test or z-test.
  • Uncertainty. Combining probabilities, tree diagrams, and expected value.

Check your knowledge

A mix of recall and method questions covering the module. Attempt them, then check against the solutions.

  1. A data set has Q1=20Q_1 = 20 and Q3=32Q_3 = 32. Find the IQR and the upper outlier boundary. (2 marks)
  2. A correlation coefficient is r=0.92r = 0.92. Describe the relationship. (1 mark)
  3. A test gives a p-value of 0.030.03 at the 5%5\% level. State the conclusion. (1 mark)
  4. Interpret a 95%95\% confidence interval of (7.2,8.8)(7.2, 8.8) for a mean. (1 mark)
  5. A prize pays £20\pounds 20 with probability 0.050.05 and nothing otherwise. Find the expected value. (2 marks)

Sources & how we know this

  • applications-of-mathematics
  • sqa-higher
  • sqa-apps-maths
  • statistics-and-probability
  • higher
  • correlation
  • hypothesis-testing
  • confidence-intervals
  • expected-value