How is data storage measured in binary multiples, and why and how is data compressed?
Understand that data storage is measured in binary multiples (bit, nibble, byte, kibibyte, mebibyte, gibibyte, tebibyte), construct expressions to calculate file sizes and data capacity, and understand the need for and methods of compression (lossless, lossy).
A focused answer to Edexcel GCSE Computer Science 2.3.1 and 2.3.2, covering the binary storage multiples (bit, nibble, byte, kibibyte up to tebibyte), file-size and capacity calculations, and lossy and lossless compression.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Edexcel wants you to know the binary units of storage (bit, nibble, byte and the binary multiples kibibyte, mebibyte, gibibyte, tebibyte), to build expressions for file sizes and storage capacity, and to explain why data is compressed and the difference between lossless and lossy compression.
The binary storage multiples
This is an Edexcel-specific point worth stressing: the board uses the binary multiples (kibibyte, mebibyte, and so on), where each unit is of the one below, because fits binary storage exactly. So 1 kibibyte bytes bits. Getting the factor right (1024) is essential for capacity calculations.
File-size and capacity expressions
The reliable method for capacity questions is: (1) work out the size of one file; (2) convert the storage capacity into the same unit; (3) divide. Conversions use the factor 1024 for each binary step (and 8 for bits to bytes). For instance, a 1 gibibyte drive holds mebibytes, so it stores files that are 4 mebibytes each.
Why and how data is compressed
The need for compression is practical: smaller files cost less to store and stream, and load faster on web pages, which matters for images, audio and video that would otherwise be very large.
The trade-off is size against fidelity. Lossy achieves far smaller files but loses quality and is irreversible, so it is wrong for a program file (a single changed bit could break it). Lossless is reversible and safe for any data, but the size saving is more modest. Choosing the right type for the data, and justifying it, is what compression questions reward.
Try this
Q1. State how many bytes are in a kibibyte. [1 mark]
- Cue. 1024 bytes.
Q2. State one reason data is compressed before being sent over a network. [1 mark]
- Cue. A smaller file transfers faster and uses less bandwidth (and less storage).
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel 20233 marksA memory card has a capacity of 2 gibibytes. Each photo is 4 mebibytes. Calculate how many photos can be stored on the card. Show your working.Show worked answer →
Convert the capacity to the same unit as the photos. Binary multiples step up by 1024 ().
2 gibibytes mebibytes. Number of photos photos.
Markers reward converting gibibytes to mebibytes using the factor 1024, dividing by the photo size, and the answer 512. Using 1000 instead of 1024 for binary multiples is the usual error.
Edexcel 20224 marksExplain the difference between lossless and lossy compression, and give one suitable use of each.Show worked answer →
Lossless compression reduces file size with no loss of data, so the original can be reconstructed exactly; it is suitable where every bit matters, such as text documents, spreadsheets or program files.
Lossy compression permanently removes some data (often detail the human eye or ear is least likely to notice) to achieve much smaller files, so the original cannot be perfectly restored; it is suitable for photographs (JPEG) and music (MP3), where a small quality loss is acceptable for a big size saving.
Markers reward the key distinction (lossless keeps all data and is reversible; lossy permanently discards data and is not) plus one valid use of each that fits the trade-off.
Related dot points
- Understand how bitmap images are represented in binary (pixels, resolution, colour depth), how analogue sound is represented in binary (amplitude, sample rate, bit depth, sample interval), and the limitations of binary representation when constrained by the number of available bits.
A focused answer to Edexcel GCSE Computer Science 2.2.2, 2.2.3 and 2.2.4, covering bitmap images (pixels, resolution, colour depth), sampled sound (amplitude, sample rate, bit depth, sample interval), and the limitations of binary representation.
- Understand how computers encode characters using 7-bit ASCII.
A focused answer to Edexcel GCSE Computer Science 2.2.1, covering how computers encode characters using a character set, the 7-bit ASCII system, its 128 codes, and the ordering of letters and digits.
- Understand that computers use binary to represent data (numbers, text, sound, graphics) and program instructions, and determine the maximum number of states that can be represented by a binary pattern of a given length.
A focused answer to Edexcel GCSE Computer Science 2.1.1, covering why computers use binary to represent all data and instructions, and how to calculate the maximum number of states a binary pattern of a given length can represent.
- Understand why hexadecimal notation is used and convert between hexadecimal and binary.
A focused answer to Edexcel GCSE Computer Science 2.1.6, covering why hexadecimal notation is used as a shorthand for binary and how to convert between hexadecimal, binary and denary.
- Understand how the characteristics of wired and wireless connectivity impact on performance (speed, range, latency, bandwidth), that network speeds are measured in bits per second, and construct expressions involving file size, transmission rate and time.
A focused answer to Edexcel GCSE Computer Science 4.1.4 and 4.1.5, covering how wired and wireless connectivity affect speed, range, latency and bandwidth, network speed units, and calculating file size, transmission rate and time.
Sources & how we know this
- Pearson Edexcel GCSE (9-1) Computer Science (1CP2) specification — Pearson (2020)