How do psychologists decide whether a result is significant, and how do they analyse qualitative data?
Inferential statistics and qualitative analysis: probability and significance, the five Edexcel inferential tests, Type 1 and Type 2 errors, the normal and skewed distributions, and the analysis of qualitative data through thematic analysis and grounded theory.
An Edexcel A-Level Psychology answer to Paper 3 data analysis, covering probability and significance at p < 0.05, the five inferential tests (sign, Wilcoxon, Mann-Whitney, Spearman, chi-squared), Type 1 and Type 2 errors, the normal and skewed distributions, and qualitative analysis through thematic analysis and grounded theory.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Paper 3 (Psychological Skills) requires you to handle data: to explain probability and significance, choose and interpret the correct inferential test, recognise Type 1 and Type 2 errors, read the normal and skewed distributions, and analyse qualitative data through thematic analysis and grounded theory. These are synoptic skills tested across novel scenarios drawn from any topic.
The answer
Probability and significance
A null hypothesis predicts no difference or no relationship; the alternative (experimental) hypothesis predicts one. An inferential test produces a calculated value, which is compared with a critical value from a table (using , whether the test is one- or two-tailed, and the significance level). If the result passes the test, we reject the null hypothesis. A one-tailed hypothesis predicts the direction of the effect; a two-tailed hypothesis does not, and uses a more conservative critical value.
Choosing the inferential test
| Test | Looking for | Design | Data |
|---|---|---|---|
| Sign test | Difference | Related | Nominal |
| Wilcoxon signed-rank | Difference | Related | Ordinal |
| Mann-Whitney U | Difference | Unrelated | Ordinal |
| Spearman's rho | Correlation | (pairs of scores) | Ordinal |
| Chi-squared () | Difference / association | Unrelated | Nominal |
For the sign test, Wilcoxon and Mann-Whitney, the result is significant when the calculated value is equal to or less than the critical value. For chi-squared and Spearman, the result is significant when the calculated value is equal to or greater than the critical value. Mixing up these two rules is the most common error.
Type 1 and Type 2 errors
The significance level controls the trade-off. A lenient level (for example ) makes a Type 1 error more likely; a strict level (for example ) reduces Type 1 risk but raises Type 2 risk. The conventional balances the two.
The normal and skewed distributions
The normal distribution is a symmetrical, bell-shaped curve where the mean, median and mode coincide at the centre, with most scores clustered around the mean and the tails thinning symmetrically. Many psychological variables (IQ, reaction time) approximate it.
A distribution is skewed when scores bunch at one end:
- Positive (right) skew. A long tail to the right (high scores). The mode is lowest, the mean is pulled highest by the tail, so .
- Negative (left) skew. A long tail to the left (low scores), so .
Skew matters because the mean is distorted by extreme scores, so the median is a better measure of central tendency for skewed data.
Analysis of qualitative data
Qualitative data (interviews, open questions) are analysed for meaning rather than counted:
- Thematic analysis identifies patterns of meaning. The researcher reads the data, assigns codes to segments, then groups codes into broader themes that capture something important about the data, illustrated with quotations. It is flexible and rich but can be subjective, so reliability is checked through a second coder.
- Grounded theory builds theory inductively from the data rather than testing a pre-set hypothesis. Through constant comparison (comparing each new piece of data with earlier codes) and successive coding, categories emerge until theoretical saturation (no new categories appear), at which point a theory grounded in the data is proposed.
Evaluation (GRAVE)
- Generalisability. Inferential statistics let researchers generalise from a sample to a population, but only if sampling was sound.
- Reliability. Standardised tests and decision rules make quantitative analysis highly reliable and replicable; qualitative analysis needs inter-rater checks to be reliable.
- Application. Choosing the right test underpins every Paper 1 and Paper 2 data question and real published research.
- Validity. Thematic analysis and grounded theory preserve the richness and meaning of data, raising validity where reducing experience to numbers would lose it.
- Ethics. Researchers must not "p-hack" (run many tests to find a significant one), which inflates Type 1 errors and is a form of misreporting.
Examples in context
Example 1. Reading reaction-time data that are skewed. Reaction times usually show positive skew, because a few slow trials create a long right tail. Reporting the mean would overstate the typical response time, so a psychologist reports the median instead, and may treat the data as ordinal, pushing the analysis towards a non-parametric test such as Mann-Whitney. This shows how the shape of the distribution drives both the descriptive measure and the inferential test.
Example 2. Thematic analysis of interviews about exam stress. A researcher interviews 15 students, codes phrases such as "I couldn't sleep" and "my mind went blank", then groups them into themes like "physical symptoms" and "cognitive disruption", illustrating each with a quotation. A second researcher codes a sample independently to check agreement. The output is a structured account of how students experience stress that a questionnaire's numbers could not capture, demonstrating the value (and the subjectivity) of qualitative analysis.
Try this
Q1. State the significance level used in psychology and explain what it means. [2 marks]
- Cue. : there is less than a 5 per cent probability the result is due to chance if the null hypothesis is true.
Q2. A study correlates two sets of ranked scores. Name and justify the inferential test. [3 marks]
- Cue. Spearman's rho, because the data are ordinal and the aim is to measure a relationship (correlation) rather than a difference.
Q3. Outline the difference between thematic analysis and grounded theory. [4 marks]
- Cue. Both code qualitative data, but thematic analysis identifies patterns (themes) within data, while grounded theory builds a new theory inductively from the data through constant comparison until saturation.
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel style6 marksA researcher uses a repeated measures design and ordinal data to test for a difference between two conditions. Identify the appropriate inferential test, justify your choice, and explain how to decide whether the result is significant. [6 marks]Show worked answer →
Work from level of measurement and design to the test, then state the decision rule.
Test: Wilcoxon signed-rank test. Justification: the design is related (repeated measures, the same participants in both conditions), the data are ordinal, and the aim is to test for a difference. Wilcoxon is the Edexcel test of difference for a related design with ordinal data (Mann-Whitney would be wrong because it is for unrelated designs).
Deciding significance: compare the calculated value (T) with the critical value from the table for the relevant N and significance level. For Wilcoxon, like Mann-Whitney, the result is significant if the calculated value is equal to or less than the critical value at p < 0.05. If it is, reject the null hypothesis and accept a significant difference; if not, retain the null hypothesis.
Markers reward the correct named test, a justification using level of measurement and related-or-unrelated design, and the correct decision rule (calculated equal to or less than critical for Wilcoxon).
Edexcel style4 marksExplain what is meant by a Type 1 error and a Type 2 error in inferential testing. [4 marks]Show worked answer →
Define each error precisely and link it to the significance level.
A Type 1 error is a false positive: rejecting the null hypothesis when it is actually true, so the researcher claims a significant effect that does not really exist. Using a lenient significance level such as p < 0.10 raises the risk of a Type 1 error.
A Type 2 error is a false negative: retaining the null hypothesis when it is actually false, so the researcher misses a real effect. Using a very strict level such as p < 0.01 lowers the Type 1 risk but raises the Type 2 risk.
The conventional p < 0.05 is a compromise that balances the two. Markers reward a correct definition of each error and the point that the choice of significance level trades one risk against the other.
Related dot points
- Research methods: experiments and other methods, sampling, experimental design, variables and hypotheses, descriptive and inferential statistics, and the chosen inferential tests.
An Edexcel A-Level Psychology answer to research methods, covering experimental and non-experimental methods, sampling, experimental design, variables and hypotheses, descriptive statistics, levels of measurement, GRAVE evaluation and the Edexcel inferential tests including Mann-Whitney, Wilcoxon, Spearman and chi-square.
- Key studies and classic research: the named classic studies across topics, how to evaluate studies methodologically and ethically, and reviewing and synthesising research evidence.
An Edexcel A-Level Psychology answer to the key studies and classic research, covering the named classic studies (Milgram, Sherif, Baddeley, Watson and Rayner, Raine, Rosenhan), how to evaluate methodology and ethics with GRAVE, and how to review and synthesise evidence for Paper 3.
- Issues and debates: nature-nurture, free will and determinism, reductionism and holism, ethics and social control, gender and cultural bias, and the use of psychology in the real world.
An Edexcel A-Level Psychology answer to issues and debates, covering nature-nurture, free will and determinism, reductionism and holism, ethics and social control, gender and cultural bias, GRAVE evaluation and the practical and social implications of psychological research.
- Cognitive psychology: the multi-store model, the working memory model, the reconstructive nature of memory, theories of forgetting, and key cognitive studies.
An Edexcel A-Level Psychology answer to cognitive psychology, covering the multi-store model, the working memory model, reconstructive memory and Bartlett, theories of forgetting including interference and retrieval failure, and the named cognitive studies with GRAVE evaluation.
- Criminological or health psychology: explanations of the chosen application (offending or health behaviour), biological and social factors, treatments or interventions, and the named application studies.
An Edexcel A-Level Psychology answer to the application option, covering criminological psychology (biological and social explanations of offending, eyewitness testimony, the cognitive interview and treatments) and health psychology (theories of addiction and interventions), with GRAVE evaluation and named application studies.
Sources & how we know this
- Pearson Edexcel A-Level Psychology (9PS0) specification — Pearson Edexcel (2015)