Why is data compressed, and what is the difference between lossy and lossless?
The need for compression and the difference between lossy and lossless compression, with their typical uses.
An OCR J277 1.2.5 answer on the need for data compression and the difference between lossy and lossless compression, including how each works and their typical uses.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
OCR wants you to explain why files are compressed and to distinguish lossy from lossless compression, including how each works and where each is used. The lossy-versus-lossless comparison is a reliable exam question, so learn the contrast and a typical use for each precisely.
Why data is compressed
Lossy compression
Lossless compression
Comparing the two
The trade-off is quality versus size. Lossy achieves much smaller files but loses some data permanently, which is acceptable for media where the human eye or ear will not notice. Lossless preserves everything, so it cannot shrink files as much, but it is essential where any change would matter. A common exam point is that you can compress a photo with lossy methods because a viewer will not spot the removed detail, but you must use lossless for a document or program because even one altered character could break it.
Try this
Q1. State two reasons a file might be compressed. [2 marks]
- Cue. Any two of: smaller size / less storage, faster transfer / less bandwidth, fits a size limit.
Q2. State one difference between lossy and lossless compression. [1 mark]
- Cue. Lossy permanently removes data and cannot be reversed exactly; lossless keeps all data and can be reconstructed perfectly.
Q3. Give a suitable use for lossless compression. [1 mark]
- Cue. Compressing text documents, program code or any file where every bit must be preserved (for example a ZIP archive or PNG image).
Exam-style practice questions
Practice questions written in the style of OCR exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
OCR 20204 marksExplain the difference between lossy and lossless compression, and state a suitable use for each.Show worked answer →
Lossy compression permanently removes some of the data, usually detail that humans are less likely to notice, to make the file much smaller; the original cannot be exactly restored. Lossless compression reduces the file size without removing any data, so the original can be perfectly reconstructed; it does this by recording the data more efficiently (for example storing repeated patterns once).
Uses (one mark each): lossy suits photos, music and video streaming, where a smaller file matters more than perfect detail (for example JPEG images or MP3 audio). Lossless suits text documents, program code, spreadsheets and any file where every bit must be preserved (for example a ZIP archive or a PNG image).
Markers reward "lossy removes data / cannot be reversed" versus "lossless keeps all data / can be reversed", plus a sensible use for each.
OCR 20223 marksGive three reasons why a user might compress a file before sending it as an email attachment.Show worked answer →
Award one mark per valid reason, up to three: a compressed file is smaller, so it takes up less storage space; it transfers faster and uses less bandwidth, so the email sends and downloads more quickly; it may fit within an email provider's attachment size limit that the original would exceed; and several files can be combined into one compressed archive for convenience.
Markers reward distinct reasons (smaller size, faster transfer / less bandwidth, fits size limits). Repeating the same idea in different words counts once.
Related dot points
- Why data must be represented in binary, the units of information (bit, nibble, byte, kB, MB, GB, TB, PB) and how to convert between them.
An OCR J277 1.2.3 answer on why computers use binary, the units of information from bit and nibble up to petabyte, and how to convert between units of data capacity.
- Representing characters with ASCII and Unicode; representing images with pixels, colour depth, resolution and metadata; representing sound with sample rate, sample resolution and bit rate; and the effect on file size and quality.
An OCR J277 1.2.4 answer on representing characters (ASCII, Unicode), images (pixels, colour depth, resolution, metadata) and sound (sample rate, sample resolution, bit rate), and the effect of each on file size and quality.
- The need for secondary storage, the common types (optical, magnetic, solid state) and how to choose a suitable device using capacity, speed, portability, durability, reliability and cost.
An OCR J277 1.2.2 answer on the need for secondary storage, the three common types (optical, magnetic, solid state), and how to choose a suitable device by capacity, speed, portability, durability, reliability and cost.
- The purpose of utility software, and the purpose of encryption software, defragmentation software, data compression and backup utilities (full and incremental).
An OCR J277 1.5.2 answer on the purpose of utility software and the specific roles of encryption software, defragmentation software, data compression and backup utilities, including full and incremental backups.
- Why hexadecimal is used to represent numbers, and how to convert between binary, denary and hexadecimal.
An OCR J277 1.2.4 answer on why hexadecimal is used in computing and how to convert between binary, denary and hexadecimal, with worked conversions in both directions.
Sources & how we know this
- OCR GCSE (9-1) Computer Science (J277) specification — OCR (2020)