Skip to main content
EnglandPsychologySyllabus dot point

How do psychologists design studies, analyse data and decide whether results are significant?

Research methods: experiments and other methods, sampling, experimental design, variables and hypotheses, descriptive and inferential statistics, and the chosen inferential tests.

An Edexcel A-Level Psychology answer to research methods, covering experimental and non-experimental methods, sampling, experimental design, variables and hypotheses, descriptive statistics, levels of measurement, GRAVE evaluation and the Edexcel inferential tests including Mann-Whitney, Wilcoxon, Spearman and chi-square.

Generated by Claude Opus 4.815 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The answer
  3. Examples in context
  4. Try this

What this dot point is asking

Edexcel weaves research methods through every topic, so you must choose appropriate methods, design valid studies, handle data, and select and justify the correct inferential test from the Edexcel list. The skills here are tested across all three papers, especially in data-handling questions.

The answer

Methods, variables and hypotheses

Beyond the experiment (laboratory, field, natural and quasi), psychologists use observations (naturalistic or controlled, structured or unstructured), self-report (questionnaires with open and closed questions, structured and unstructured interviews), correlations (measuring the relationship between two co-variables) and case studies (in-depth study of one person or group). Each balances control against ecological validity.

Sampling and experimental design

Descriptive and inferential statistics

Descriptive statistics summarise data: measures of central tendency (mean, median, mode) and dispersion (range and standard deviation), plus graphs (bar charts, histograms, scattergrams) and tables. The standard deviation measures the average spread of scores around the mean.

Reliability, validity and ethics (GRAVE)

  • Generalisability. A representative sample (favoured by random or stratified sampling) allows findings to generalise; biased samples (volunteer, opportunity) limit it.
  • Reliability. Standardised procedures, test-retest and inter-rater checks make a study consistent and replicable.
  • Application. Sound methods let psychology be applied with confidence, for example to design effective health campaigns or therapies.
  • Validity. Internal validity (the IV, not a confound, caused the change) and ecological validity (findings apply outside the artificial setting) must both be considered.
  • Ethics. The BPS code requires informed consent, no unnecessary deception, the right to withdraw, confidentiality and protection from harm, with debriefing where deception is used.

Examples in context

Example 1. Reading a chi-square (χ2\chi^2) result. Suppose a study categorises participants by whether they were primed with a word (yes or no) and whether they then recalled a target (recalled or not). These are nominal frequency counts in an unrelated design, so chi-square is the correct test. With a 2×22 \times 2 contingency table, the degrees of freedom are df=(2−1)(2−1)=1df = (2 - 1)(2 - 1) = 1, and the critical value at p<0.05p < 0.05 is 3.843.84. If the calculated χ2=5.20\chi^2 = 5.20, then because 5.20>3.845.20 > 3.84 the result is significant: there is a significant association between priming and recall, with less than a 5 per cent probability the result is due to chance. Note that chi-square is one of the few tests where the calculated value must be larger than the critical value.

Example 2. Spearman's rho for a correlation. A researcher measures hours of revision and exam mark for 20 students and finds Spearman's rho=+0.78rho = +0.78 (p<0.05p < 0.05). Spearman is the correct test because the data are ordinal and the aim is to measure a relationship rather than a difference. The coefficient +0.78+0.78 is a strong positive correlation: more revision is associated with higher marks. The significance (p<0.05p < 0.05) means the relationship is unlikely to be chance, but it does not prove revision causes higher marks, because correlations cannot establish causation (a third variable such as motivation could drive both). This shows how the choice of test follows from the research aim and data type.

Try this

Q1. A study compares recall scores (ordinal data) between two separate groups. Name and justify the inferential test. [3 marks]

  • Cue. Mann-Whitney U, because the data are ordinal and the design is unrelated (independent groups), testing for a difference.

Q2. Explain what is meant by a significant result at p<0.05p < 0.05. [2 marks]

  • Cue. There is a 5 per cent or less probability that the result is due to chance, so the null hypothesis can be rejected.

Q3. Explain the difference between a directional and a non-directional hypothesis, using an example. [4 marks]

  • Cue. A directional (one-tailed) hypothesis states the direction of difference (group A recalls more than group B); a non-directional (two-tailed) hypothesis states only that there is a difference (group A and B differ in recall).

Exam-style practice questions

Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Edexcel 20196 marksA researcher compares recall scores (treated as ordinal) between two separate groups. Identify the appropriate inferential test, justify your choice, and explain how to decide whether the result is significant. [6 marks]
Show worked answer →

A methods item testing test choice and significance reasoning (AO2 and AO3).

Test: Mann-Whitney U. Justification: the design is unrelated (two separate, independent groups), the data are ordinal, and the researcher is testing for a difference. Mann-Whitney is the Edexcel test of difference for an unrelated design with ordinal data (Wilcoxon would be wrong because it is for related designs).

Deciding significance: compare the calculated U value with the critical value from the statistical table for the relevant NN and significance level. For Mann-Whitney, the result is significant if the calculated value is equal to or less than the critical value at p<0.05p < 0.05. If it is, reject the null hypothesis and accept that there is a significant difference; if not, retain the null hypothesis.

Markers reward the correct named test, a justification referring to level of measurement and related-or-unrelated design, and the correct rule (calculated equal to or less than critical for Mann-Whitney).

Edexcel 20226 marksA class recorded reaction times (ms) for 55 participants: 220,240,260,280,300220, 240, 260, 280, 300. Calculate the mean and the standard deviation, and explain what the standard deviation tells you about the data. [6 marks]
Show worked answer →

A full worked calculation (AO2) then interpretation (AO3).

Mean: xˉ=220+240+260+280+3005=13005=260\bar{x} = \frac{220 + 240 + 260 + 280 + 300}{5} = \frac{1300}{5} = 260 ms.

Standard deviation (using the sample formula, dividing by n−1=4n - 1 = 4): deviations from the mean are −40,−20,0,+20,+40-40, -20, 0, +20, +40; squared they are 1600,400,0,400,16001600, 400, 0, 400, 1600, summing to 40004000. Variance =40004=1000= \frac{4000}{4} = 1000; standard deviation =1000≈31.6= \sqrt{1000} \approx 31.6 ms.

Interpretation: the standard deviation (≈31.6\approx 31.6 ms) measures the average spread of scores around the mean. A relatively small standard deviation would mean scores cluster tightly near the mean (a consistent group); a larger one would mean scores are widely spread (more individual variation). Here the scores are evenly spread either side of 260260 ms, so the standard deviation summarises that typical departure from the mean.

Markers reward the correct mean (260260), a correct method for standard deviation (deviations, square, sum, divide by n−1n-1, square root) giving about 31.631.6, and a correct statement that SD measures dispersion around the mean.

Related dot points

Sources & how we know this