How is data organised in files and databases, and why does normalising a database remove redundancy and anomalies?
Describe files, fields and records, relational databases, normalisation, basic SQL, and validation and verification.
A focused answer to WJEC A-Level Computer Science Unit 1 organisation of data, covering files, fields and records, relational databases and keys, normalisation to remove redundancy, basic SQL, and validation and verification.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
WJEC wants you to describe how data is held in files and in relational databases, to explain primary and foreign keys, to normalise a simple database to remove redundancy and anomalies, to write basic SQL, and to distinguish validation from verification. Databases are a substantial slice of Unit 1 and link forward to the deeper database content in Unit 4. Expect a normalisation question and a keys question, both of which reward precise vocabulary.
The answer
Files, fields and records
The move from a single flat file to a structured database is the central theme of this dot point, because separating data into tables is what makes it consistent and maintainable.
Relational databases and keys
Designing the tables and their relationships, with one fact stored in one place, is what gives a database integrity and avoids the contradictions a flat file invites.
Normalisation
Normalisation is the process of organising tables to remove redundancy (repeated data) and the anomalies redundancy causes: the insert anomaly (cannot add data without unrelated data), the update anomaly (a change must be made in many places) and the delete anomaly (removing one fact loses another). It splits repeating or dependent data into separate, linked tables.
SQL, validation and verification
Validation checks that entered data is reasonable, using checks such as range, type, presence, format and length. Verification checks that data has been entered or transferred correctly, for example by double entry or a visual check. Validation cannot guarantee correctness, only plausibility.
Examples in context
- Example 1. A library catalogue
- A library separates Books, Members and Loans into three tables. The Loans table holds a foreign key to Books and a foreign key to Members, so each loan links a member to a book without copying the book title or member name. This is normalisation in action: each fact is stored once, in one place.
- Example 2. The cost of a flat-file address
- A shop keeps customer addresses in its Orders flat file, repeating the address on every order. When a customer moves, staff must edit dozens of rows, and one missed row leaves the database self-contradicting. Splitting customers into their own table fixes this update anomaly, showing concretely why normalisation matters.
- Example 3. Validation versus verification at sign-up
- A web form rejects an email without an at sign (validation: a format check), then asks the user to type the email twice and compares the two (verification: a double-entry check). Both run, because a correctly formatted email can still be the wrong one, which is exactly the distinction examiners test.
Try this
Q1. State the SQL keyword used to filter the rows returned by a query. [1 mark]
- Cue. WHERE.
Q2. Give one example of a validation check and state what it tests. [2 marks]
- Cue. For example, a range check tests that a value falls between a stated minimum and maximum.
Exam-style practice questions
Practice questions written in the style of WJEC exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
WJEC 20184 marksExplain what is meant by a primary key and a foreign key in a relational database, and state why a foreign key is used.Show worked answer →
Define each key, then explain the purpose of the foreign key.
A primary key is a field (or combination of fields) that uniquely identifies each record in a table; no two records share the same primary key value.
A foreign key is a field in one table that refers to the primary key of another table.
A foreign key is used to link the two tables, establishing a relationship between them so that related data can be joined without being duplicated. For example, a CustomerID foreign key in an Orders table links each order to a record in the Customers table.
Markers reward the unique-identifier definition of a primary key, the foreign key as a reference to another table's primary key, and the linking purpose.
WJEC 20224 marksA table stores, for each order, the customer's name and address repeated on every order. Explain one problem this causes and how normalisation resolves it.Show worked answer →
Identify a redundancy or anomaly, then show how splitting the data fixes it.
Problem: the customer's name and address are stored redundantly on every order. If the customer moves, the address must be updated on every one of their orders; missing one leaves inconsistent data (an update anomaly), and the repetition also wastes storage.
Resolution: normalisation splits the data into a Customers table (one record per customer, holding the name and address once) and an Orders table that references the customer by a foreign key. The address is then stored only once and updated in only one place.
Markers reward identifying the redundancy or update anomaly and explaining that normalisation removes it by separating the data into linked tables.
Related dot points
- Describe and use arrays, records, lists, stacks, queues, trees and hash tables, and explain their operations and uses.
A focused answer to WJEC A-Level Computer Science Unit 1 data structures, covering arrays and records, the abstract data types stack and queue, lists, binary trees and hash tables, and when each is the right choice.
- Describe systems, application and utility software, the functions of the operating system, translators, and modes of operation.
A focused answer to WJEC A-Level Computer Science Unit 1 software and systems, covering system, application and utility software, operating system functions, compilers, interpreters and assemblers, and modes of operation.
- Represent numbers in binary, hexadecimal and two's complement, perform binary arithmetic, and represent characters, sound and images as binary data.
A focused answer to WJEC A-Level Computer Science Unit 1 data representation, covering binary and hexadecimal, two's complement, binary arithmetic and shifts, and how characters, sound and images are stored as binary.
- Describe data security threats and protection, encryption, data integrity, and the legislation and ethics governing computer use.
A focused answer to WJEC A-Level Computer Science Unit 1 security and the law, covering threats such as malware and hacking, protection measures, encryption, data integrity, and the relevant legislation and ethics.
- Describe networks and topologies, transmission media and methods, protocols and the TCP/IP stack, and how the internet works.
A focused answer to WJEC A-Level Computer Science Unit 1 communication, covering LANs and WANs, network topologies, transmission methods, protocols and the TCP/IP stack, packet switching, and how the internet works.