How do you collect and prepare data for a language investigation, and how do you write a sound, ethical methodology?
Methodology and data collection (Component 4 NEA): selecting and gathering a workable data set, qualitative and quantitative approaches, ethical considerations (consent, anonymity), preparing and presenting data, and writing a transparent methodology that justifies the research design.
How to collect and prepare data and write a methodology for the Eduqas A-Level English Language (A700) Component 4 NEA: selecting a workable data set, qualitative and quantitative approaches, ethics (consent, anonymity), preparing data, and writing a transparent methodology that justifies the research design (AO1, AO3).
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Methodology and data collection is the design stage of the Component 4 NEA: gathering a workable data set, choosing an analytical approach, handling ethics, and writing a transparent methodology that justifies the research design. It asks you to collect the right data in the right way and to explain and justify how you did it. In Eduqas English Language this underpins the AO1 rigour and AO3 context of the investigation. This dot point covers data collection, ethics, and writing the methodology.
The answer
This stage succeeds when you gather an appropriate, ethically collected data set and write a methodology that justifies the design. The unifying idea is rigour and transparency: a sound investigation rests on data suited to its question, collected ethically, and a methodology clear enough that a reader understands and could in principle repeat the study. Your task is both practical (collect good data) and written (justify the design), and the quality of the analysis depends on getting the data right.
Selecting and gathering the data
The data set must fit the research question: the right kind of language data (spoken, written, digital), from the right source, in an amount you can analyse closely at this length. A focused question needs a focused data set, often quite small, analysed in depth, rather than a large set skimmed. Decide your source, how you will sample from it, and how much you need, and ensure the data is rich enough to show the features your question targets.
Choosing the analytical approach
Decide how you will analyse the data. Most language investigations are primarily qualitative: close analysis of features using the linguistic frameworks. Some support this with a quantitative element (counting occurrences of a feature, comparing frequencies), which can strengthen a claim if the data set allows it. Choose the approach that suits the question and the data, and be honest about the limits of a small data set, which cannot support broad statistical claims.
Writing the methodology
The methodology section explains and justifies the design, not just describes it. It states how the data was selected and gathered (source, sampling, amount), the analytical approach and why it suits the question, and the ethical decisions taken. It is written transparently, so a reader could understand and in principle repeat the study. The justification, why this design answers this question, is what lifts the methodology above a bare description.
Examples in context
The investigation is the student's own, so the moves below are illustrative.
A model data-and-approach fit. "For a question on how gendered identity is constructed in online reviews, an appropriate design might gather a focused set of reviews from a single platform, sampled to allow comparison, and analyse them qualitatively using the frameworks (lexis, pragmatics, modality), perhaps with a small quantitative count of evaluative terms. The data set is the right kind and size to answer the question, and the qualitative-led approach suits the close analysis the question needs." This shows a design fitting the question.
A model ethics statement. "Where an investigation uses recorded spoken data, the methodology would record that informed consent was obtained from the participants, that they were told how the data would be used, and that all names and identifying details were anonymised in the transcription and write-up. Stating these decisions, rather than omitting them, shows the ethical rigour the NEA requires and protects the participants." This shows ethics built into the methodology.
Try this
Q1. Why must the data set fit the research question? [2 marks]
- Cue. The data must be the right kind and amount to show the features the question targets; data that cannot answer the question undermines the investigation.
Q2. What ethical steps are required when collecting data from identifiable people? [2 marks]
- Cue. Obtaining informed consent and anonymising participants (no real names or identifying details), and respecting privacy.
Q3. Describe and justify the methodology for a language investigation, including data collection and ethics. [16 marks]
- What the marker wants. A transparent, justified account of how the data was selected and gathered, the analytical approach and why it fits the question, and the ethical decisions taken, written so the study could in principle be repeated.
A note on the NEA
This guide is AI-written and not individually human-reviewed. The methodology requirements, the word count and the ethical guidance are set by Eduqas and administered by your centre; confirm them against the current A700 specification and NEA guidance, and agree your data collection and ethics with your teacher before you begin. Follow your centre's ethical guidance for all data collection.
Exam-style practice questions
Practice questions written in the style of WJEC Eduqas exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Eduqas A700 Component 4 NEA16 marksDescribe and justify the methodology for your language investigation, including how you gathered your data and your ethical considerations. [NEA methodology; AO1, AO3]Show worked answer →
The methodology is a key part of the investigation, assessed within the AO1 (structure and rigour) and AO3 (context) marks. This models writing it.
A strong methodology explains and justifies the research design: how the data was selected and gathered (the source, the sampling, the amount), the analytical approach (qualitative close analysis, any quantitative counting), and the ethical decisions (consent for spoken data, anonymisation of participants). It is written so the study is transparent and, in principle, repeatable.
The discipline is justification and transparency, not just description. Reward a methodology that explains why the design suits the question and addresses ethics; penalise a vague account, an unjustified design, or a study that ignores consent and anonymity. The methodology underpins the validity of the analysis.
Eduqas A700 Component 4 NEA14 marksExplain how you ensured your data collection was ethical and your data set was appropriate to your research question. [NEA methodology; ethics and design focus]Show worked answer →
This isolates ethics and the fit between data and question. It is assessed within AO1 and AO3.
A strong answer shows the data set is appropriate (the right kind and amount of data to answer the question) and that collection was ethical: informed consent for any data from identifiable people, anonymisation of participants, and respect for privacy, especially with online or spoken data. It explains the practical choices made to secure suitable, ethically gathered data.
Reward a clear account of an appropriate, ethically collected data set; penalise a mismatch between data and question, or a study that gathers data from identifiable people without consent or anonymisation. Ethics is a requirement, not an optional extra.
Related dot points
- The Language and Identity investigation (Component 4 NEA): the independent 2,500 to 3,500 word language investigation on a language and identity topic, its structure (introduction, methodology, analysis, conclusion), the prescribed areas, and how it is assessed (AO1, AO2 and AO3) and moderated.
What the Eduqas A-Level English Language (A700) Component 4 Language and Identity non-exam assessment is: the independent 2,500 to 3,500 word language investigation on a language and identity topic, its structure, the prescribed areas, and how it is assessed for AO1, AO2 and AO3 and moderated by Eduqas.
- Choosing an investigation area (Component 4 NEA): selecting a language and identity topic (self-representation, gender, culture, diversity), narrowing it to a focused, answerable research question, ensuring a workable data set, and the concepts and theories that frame each area.
How to choose a language and identity topic and frame a research question for the Eduqas A-Level English Language (A700) Component 4 NEA: the prescribed areas (self-representation, gender, culture, diversity), narrowing a topic to an answerable question, ensuring a workable data set, and the concepts that frame each area.
- Analysis and frameworks in the NEA (Component 4): applying the linguistic frameworks to your data (AO1), integrating identity concepts, theories and research (AO2), reading context (AO3), and building a sustained, evaluative analysis that answers the research question rather than describing the data.
How to analyse your data in the Eduqas A-Level English Language (A700) Component 4 NEA: applying the linguistic frameworks (AO1), integrating identity concepts and research (AO2), reading context (AO3), and building a sustained, evaluative analysis that answers the research question rather than describing the data.
- Writing up the investigation (Component 4 NEA): structuring the research report, writing in academic register, drawing evidenced conclusions that answer the research question and acknowledge limitations, and referencing sources and data correctly within the 2,500 to 3,500 word limit.
How to write up the Eduqas A-Level English Language (A700) Component 4 NEA: structuring the research report, writing in academic register, drawing evidenced conclusions that answer the research question and acknowledge limitations, and referencing sources and data correctly within the word limit.
- The spoken language question (Component 1 Section A): analysing at least two transcriptions of real spoken language across the linguistic frameworks, reading transcript notation, and moving from feature to effect to construct an argument about the talk (AO1 and AO3).
How to answer the Eduqas A-Level English Language (A700) Component 1 Section A spoken language question: analysing at least two transcripts of real talk across the frameworks, reading the transcript's notation, and moving from feature to effect to build an argument about the interaction, the core AO1 and AO3 task of the paper.
Sources & how we know this
- Eduqas A-Level English Language (A700) specification — Eduqas (2015)
- Eduqas A-Level English Language sample assessment materials — Eduqas (2017)