Skip to main content
EnglandEnglish LanguageSyllabus dot point

How do you collect and prepare data for a language investigation, and how do you write a sound, ethical methodology?

Methodology and data collection (Component 4 NEA): selecting and gathering a workable data set, qualitative and quantitative approaches, ethical considerations (consent, anonymity), preparing and presenting data, and writing a transparent methodology that justifies the research design.

How to collect and prepare data and write a methodology for the Eduqas A-Level English Language (A700) Component 4 NEA: selecting a workable data set, qualitative and quantitative approaches, ethics (consent, anonymity), preparing data, and writing a transparent methodology that justifies the research design (AO1, AO3).

Generated by Claude Opus 4.813 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The answer
  3. Examples in context
  4. Try this
  5. A note on the NEA

What this dot point is asking

Methodology and data collection is the design stage of the Component 4 NEA: gathering a workable data set, choosing an analytical approach, handling ethics, and writing a transparent methodology that justifies the research design. It asks you to collect the right data in the right way and to explain and justify how you did it. In Eduqas English Language this underpins the AO1 rigour and AO3 context of the investigation. This dot point covers data collection, ethics, and writing the methodology.

The answer

This stage succeeds when you gather an appropriate, ethically collected data set and write a methodology that justifies the design. The unifying idea is rigour and transparency: a sound investigation rests on data suited to its question, collected ethically, and a methodology clear enough that a reader understands and could in principle repeat the study. Your task is both practical (collect good data) and written (justify the design), and the quality of the analysis depends on getting the data right.

Selecting and gathering the data

The data set must fit the research question: the right kind of language data (spoken, written, digital), from the right source, in an amount you can analyse closely at this length. A focused question needs a focused data set, often quite small, analysed in depth, rather than a large set skimmed. Decide your source, how you will sample from it, and how much you need, and ensure the data is rich enough to show the features your question targets.

Choosing the analytical approach

Decide how you will analyse the data. Most language investigations are primarily qualitative: close analysis of features using the linguistic frameworks. Some support this with a quantitative element (counting occurrences of a feature, comparing frequencies), which can strengthen a claim if the data set allows it. Choose the approach that suits the question and the data, and be honest about the limits of a small data set, which cannot support broad statistical claims.

Writing the methodology

The methodology section explains and justifies the design, not just describes it. It states how the data was selected and gathered (source, sampling, amount), the analytical approach and why it suits the question, and the ethical decisions taken. It is written transparently, so a reader could understand and in principle repeat the study. The justification, why this design answers this question, is what lifts the methodology above a bare description.

Examples in context

The investigation is the student's own, so the moves below are illustrative.

A model data-and-approach fit. "For a question on how gendered identity is constructed in online reviews, an appropriate design might gather a focused set of reviews from a single platform, sampled to allow comparison, and analyse them qualitatively using the frameworks (lexis, pragmatics, modality), perhaps with a small quantitative count of evaluative terms. The data set is the right kind and size to answer the question, and the qualitative-led approach suits the close analysis the question needs." This shows a design fitting the question.

A model ethics statement. "Where an investigation uses recorded spoken data, the methodology would record that informed consent was obtained from the participants, that they were told how the data would be used, and that all names and identifying details were anonymised in the transcription and write-up. Stating these decisions, rather than omitting them, shows the ethical rigour the NEA requires and protects the participants." This shows ethics built into the methodology.

Try this

Q1. Why must the data set fit the research question? [2 marks]

  • Cue. The data must be the right kind and amount to show the features the question targets; data that cannot answer the question undermines the investigation.

Q2. What ethical steps are required when collecting data from identifiable people? [2 marks]

  • Cue. Obtaining informed consent and anonymising participants (no real names or identifying details), and respecting privacy.

Q3. Describe and justify the methodology for a language investigation, including data collection and ethics. [16 marks]

  • What the marker wants. A transparent, justified account of how the data was selected and gathered, the analytical approach and why it fits the question, and the ethical decisions taken, written so the study could in principle be repeated.

A note on the NEA

This guide is AI-written and not individually human-reviewed. The methodology requirements, the word count and the ethical guidance are set by Eduqas and administered by your centre; confirm them against the current A700 specification and NEA guidance, and agree your data collection and ethics with your teacher before you begin. Follow your centre's ethical guidance for all data collection.

Exam-style practice questions

Practice questions written in the style of WJEC Eduqas exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Eduqas A700 Component 4 NEA16 marksDescribe and justify the methodology for your language investigation, including how you gathered your data and your ethical considerations. [NEA methodology; AO1, AO3]
Show worked answer →

The methodology is a key part of the investigation, assessed within the AO1 (structure and rigour) and AO3 (context) marks. This models writing it.

A strong methodology explains and justifies the research design: how the data was selected and gathered (the source, the sampling, the amount), the analytical approach (qualitative close analysis, any quantitative counting), and the ethical decisions (consent for spoken data, anonymisation of participants). It is written so the study is transparent and, in principle, repeatable.

The discipline is justification and transparency, not just description. Reward a methodology that explains why the design suits the question and addresses ethics; penalise a vague account, an unjustified design, or a study that ignores consent and anonymity. The methodology underpins the validity of the analysis.

Eduqas A700 Component 4 NEA14 marksExplain how you ensured your data collection was ethical and your data set was appropriate to your research question. [NEA methodology; ethics and design focus]
Show worked answer →

This isolates ethics and the fit between data and question. It is assessed within AO1 and AO3.

A strong answer shows the data set is appropriate (the right kind and amount of data to answer the question) and that collection was ethical: informed consent for any data from identifiable people, anonymisation of participants, and respect for privacy, especially with online or spoken data. It explains the practical choices made to secure suitable, ethically gathered data.

Reward a clear account of an appropriate, ethically collected data set; penalise a mismatch between data and question, or a study that gathers data from identifiable people without consent or anonymisation. Ethics is a requirement, not an optional extra.

Related dot points

Sources & how we know this