How do you design a methodology and gather data for the language investigation, and what makes the method sound?
Investigation methodology and data: choosing a method (quantitative, qualitative or mixed), selecting and gathering data, sampling, transcription, ethics, and writing a transparent, repeatable methodology (AO1 and AO3 in H470/03 Task 1).
How to design a methodology and gather data for the OCR A-Level English Language language investigation (H470/03 Task 1): choosing a quantitative, qualitative or mixed method, sampling, transcription, ethics, and writing a transparent, repeatable methodology that underpins a sound study.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
A language investigation is only as good as its method: the data must suit the question, be gathered soundly and ethically, and be analysed in a transparent, repeatable way. OCR Component 03 assesses the methodology within the AO1 and AO3 of Task 1. This dot point covers choosing a method (quantitative, qualitative or mixed), selecting and gathering data, sampling, transcription, ethics, and writing a methodology that underpins a sound study.
The answer
The methodology succeeds when it is justified, ethical, transparent and fit for the research question. The unifying idea is soundness: a study's conclusions are only as trustworthy as the method that produced them, so the methodology must show that the data genuinely bears on the question and was gathered and analysed properly. A clear, justified method is itself part of the AO1 mark and grounds the AO3 reading of context.
Choosing a method
The method must suit the question, and naming the approach is part of the rationale.
- Quantitative. Counting and measuring features (frequencies of a form, lengths, ratios), suited to questions about how much or how often, and to comparison.
- Qualitative. Close analysis of a smaller data set, suited to questions about how meaning is made and how features function in context.
- Mixed methods. Combining the two, for example counting a feature and then analysing examples closely, often the most powerful for a small investigation.
Selecting and gathering data
The data is the foundation, and its selection must be justified.
- Suitability. The data must actually bear on the research question; choose a data type (transcripts, written texts, online posts, historical texts) that can answer it.
- Sampling. How and why the particular data was chosen, and the limits of the sample (a small or convenient sample cannot support broad claims).
- Transcription. Where spoken data is used, the conventions for transcribing it (and the detail required, including prosody where relevant).
Write a transparent, repeatable methodology
The methodology should be written so that a reader understands exactly what was done and could, in principle, repeat it. State the question, the data and its rationale, the sampling, the analytical approach, and the ethical and practical decisions, and be honest about the limits. This transparency is part of the AO1 mark and grounds the AO3 reading of how context shapes the data.
Examples in context
The investigation is the student's own, so the moves below are illustrative.
A model methodology rationale. "An investigation into the language of a sports commentary might adopt a mixed method: a quantitative count of dynamic verbs and present-tense forms across a set period of commentary, to establish a pattern, followed by qualitative close analysis of selected moments to read how the language constructs excitement. The data, a transcribed broadcast segment, suits a question about commentary style, and the methodology would state the sampling (which match, which minutes, and why), the transcription conventions, and the limits of a single-segment sample." This justifies a method matched to the question.
A model ethics paragraph. "Where an investigation records private conversation, the methodology must address ethics directly: it would explain that participants gave informed consent to be recorded and that names and identifying details were anonymised in the transcript, and acknowledge that recording may itself affect how naturally people speak (the observer's paradox), a limit on the data. Treating ethics as part of the design, not a formality, both meets OCR's requirements and strengthens the study." This addresses ethics as design.
Try this
Q1. What is the difference between quantitative and qualitative methods? [2 marks]
- Cue. Quantitative methods count and measure features (suited to how much or how often); qualitative methods closely analyse a smaller data set (suited to how meaning is made).
Q2. Why is ethics part of the investigation's design? [2 marks]
- Cue. Where data involves people, consent and anonymisation are required, and the ethics shape what data can be gathered and its limits; it is not an afterthought.
Q3. Design and justify a methodology for an investigation into a chosen area of language, explaining your data selection and analytical approach. [18 marks]
- What the marker wants. A justified method matched to the research question, a clear and transparent account of data selection, sampling, analysis and ethics, written so the study could be repeated.
A note on the NEA
This guide is AI-written and not individually human-reviewed. The NEA requirements and the ethical guidance are set by OCR and administered by your centre; confirm them against the current H470 specification and the NEA guidance, and agree your method with your teacher before collecting data.
Exam-style practice questions
Practice questions written in the style of OCR exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
OCR H470/03 NEA18 marksDesign and justify a methodology for an investigation into a chosen area of language, explaining your data selection and analytical approach. [NEA Task 1 methodology, assessed within AO1 and AO3]Show worked answer →
This models the methodology dimension of the investigation. The methodology is assessed within the AO1 and AO3 of Task 1, because a sound method is part of a coherent, well-grounded study.
A strong methodology states the research question, the type of data and why it suits the question, how the data was sampled and gathered, the analytical approach (which language levels and concepts), and the ethical and practical decisions. It justifies its choices: why this data, this size, this method, for this question. It is written so the study could be understood and, in principle, repeated.
Reward a transparent, justified, repeatable method that suits the question, and the reading of how context shapes the data (AO3). Weaker methodologies gather data with no rationale, choose a method that cannot answer the question, ignore ethics, or describe the method so vaguely it could not be repeated.
OCR H470/03 NEA18 marksExplain the ethical and practical considerations in collecting your data and how you addressed them. [NEA Task 1 methodology, assessed within AO1 and AO3]Show worked answer →
A methodology task foregrounding ethics and practicality. Assessed within AO1 and AO3.
A high-band answer addresses the ethics of data collection: informed consent where people are recorded, anonymisation of participants, the sensitivity of the data, and the practical constraints (access, size, time) and how they were managed. It explains how these considerations shaped the data and the limits they impose.
Reward a careful, honest account of ethics and practicality and its effect on the data and the study's claims. Weaker answers ignore consent and anonymisation, overlook the limits a small or convenient sample imposes, or treat ethics as a box to tick rather than something that shapes the research.
Related dot points
- The language investigation (H470/03 Task 1, 30 marks): the independent 2000 to 2500 word study, its structure (introduction, methodology, analysis, conclusion), and how to integrate analysis (AO1), concepts (AO2) and context (AO3).
What the OCR A-Level English Language language investigation is (H470/03 Task 1, 30 marks): the independent 2000 to 2500 word study, its structure (introduction, methodology, analysis, conclusion), and how to integrate analysis (AO1), linguistic concepts (AO2) and context (AO3) into an evidenced research report.
- The academic poster (H470/03 Task 2, 10 marks): the 750 to 1000 word poster presenting the investigation to a non-specialist audience, assessed for AO5 only, and how to craft it for clarity, accessibility and effective communication.
What the OCR A-Level English Language academic poster is (H470/03 Task 2, 10 marks): the 750 to 1000 word poster presenting the investigation to a non-specialist audience, assessed for AO5 only, and how to craft it for clarity, accessibility and effective communication.
- Choosing an investigation topic: identifying a workable area of language, narrowing it to a focused and answerable research question, ensuring data is gatherable, and avoiding common pitfalls (the NEA planning stage for H470/03 Task 1).
How to choose a good investigation topic for the OCR A-Level English Language NEA (H470/03 Task 1): identifying a workable area of language, narrowing it to a focused and answerable research question, ensuring data is gatherable, and avoiding the common pitfalls that sink investigations.
- Phonetics, phonology and prosody: speech sounds and the IPA, phonological patterning (alliteration, rhythm, sound symbolism), and prosodic features in transcripts (intonation, stress, pitch, pace, pause), and reading their effect (AO1 and AO3 across H470).
How to analyse a text at the level of phonetics, phonology and prosody for OCR A-Level English Language (H470): speech sounds and the IPA, phonological patterning, and prosodic features in spoken transcripts (intonation, stress, pitch, pace, pause), with the move from a sound feature to its effect on meaning.
- Context, audience, purpose and mode: how contextual factors shape language, the spoken-written mode continuum, and using context to analyse the construction of meaning (AO3, the dominant analytical objective across H470).
How contextual factors shape language for OCR A-Level English Language (H470): audience, purpose, genre and the spoken-written mode continuum, and how to use context to drive AO3 analysis of the construction of meaning, the analytical objective that underpins every task.
Sources & how we know this
- OCR A-Level English Language (H470) specification — OCR (2015)
- OCR H470/03 Independent language research moderators' report — OCR (2023)