ZIP often shrinks text and office-style files a lot, but it usually barely reduces photos, videos, and other already-compressed formats.
You click “Compress,” the archive finishes, and the size number drops. Or it barely moves. That swing is normal. ZIP is lossless, so it can only squeeze out patterns that already exist in the bytes.
This piece breaks down what ZIP compression looks like in real use, why results change by file type, and how to predict the outcome before you spend time zipping a big folder.
What ZIP Compression Is Doing
A ZIP archive is a container that stores many files as one package. It can also compress each file inside the archive using a method like Deflate. Deflate mixes repeated-string matching with Huffman coding.
Plain-English version: ZIP hunts for repetition. If your data repeats, ZIP can store a shorter description of it. If the data already looks “random,” ZIP has little to grab.
Why ZIP Compression Ratios Vary So Much
People expect one answer like “ZIP cuts files in half.” Real folders don’t behave that neatly. The same set can zip down to a small slice of its size, or it can even grow a bit. ZIP adds headers, names, timestamps, and a directory listing, so a tiny file set can end up slightly larger.
The biggest driver is the file format. Text, source code, CSV, and logs have lots of repeating patterns. JPEG photos, MP4 videos, MP3 audio, and many modern app exports already use their own compression, so ZIP can’t squeeze much more out of them.
Compression Ratio Vs. Space Saved
These get mixed up all the time. Space saved is the size difference. Compression ratio is the zipped size divided by the original size.
- 50% saved means the ZIP is half the size of the original.
- Ratio 0.50 means the ZIP is half the size of the original.
- Ratio 0.95 means the ZIP is 95% of the original size, so you saved 5%.
The Data Has To Be Redundant
ZIP can’t invent new information. It can only store the same information more efficiently. That’s why a folder of raw text can shrink hard, while a folder of photos barely moves.
This also explains why PDFs can go either way. A PDF made from scanned pages behaves like images and zips poorly. A PDF that’s mostly selectable text often zips better.
How Much Does ZIP Compress Files?
In day-to-day use, ZIP savings often land in these bands:
- Text and code: often 60–90% smaller.
- Office-style content: often 10–60% smaller, with wide swings by what’s inside.
- Mixed folders: the zip result tracks the “worst” files, since already-compressed items can dominate total bytes.
- Photos, videos, audio: often 0–10% smaller, sometimes larger.
These are expectations, not promises. ZIP responds to what the bytes look like, not what the file icon looks like.
ZIP File Compression Amounts For Common File Types
If you want a fast mental estimate, ask one question: “Did this format already compress the data?” If yes, ZIP will act more like a bundling wrapper than a size reducer.
File Types That Commonly Zip Well
- TXT, CSV, JSON, XML, logs
- SQL dumps and plain-text exports
- Source code and project folders
- Large spreadsheets with lots of repeated values
- Raw bitmap images (BMP) and uncompressed audio (WAV)
File Types That Commonly Zip Poorly
- JPG/JPEG, PNG, WebP, HEIC
- MP4/MKV/MOV, MP3/AAC
- Existing archives: ZIP, RAR, 7Z, TAR.GZ
- Many installers, game packs, and app bundles
What’s Inside ZIP: Methods, Levels, And Overhead
ZIP is a file format plus a menu of compression methods. Many tools follow the published specification, and the most common method you’ll run into is Deflate.
If you want the formal write-ups, see the IETF RFC 1951 (DEFLATE format) and the PKWARE APPNOTE ZIP specification.
Compression Level Changes Time More Than Size
Most ZIP tools offer levels like fast, normal, and maximum. Higher levels spend more CPU time searching for longer matches and better symbol codes. On many file types, the size difference between normal and maximum is modest, but the time cost can jump.
On folders full of text or logs, maximum can still pay off. On media files, changing levels rarely moves the needle.
Why Small Files Sometimes Get Bigger
ZIP needs metadata for each entry. When the payload is tiny, metadata can be a big share of the total. Zipping a handful of small icons, config files, or thumbnails can add more bytes than it removes.
What To Expect In Common Real-World Scenarios
When you zip a single file, the result reflects that file’s compressibility. When you zip a folder, the result reflects the whole mix. One large video can drown out hundreds of small text files.
Source Code Repo For Sharing
Code and text assets usually zip well. Many repos drop to 20–40% of their original size. Large binary blobs like images, fonts, and compiled builds can erase a lot of that win.
Office Work Folder
Modern Office formats (DOCX, XLSX, PPTX) are already ZIP containers under the hood. Zipping them again often yields little size change. A ZIP still helps when you want one download that preserves folder structure.
Photo Album Or Video Dump
These are already compressed. Zipping is still useful for packaging and keeping filenames tidy, but size savings tend to be slim. If your ZIP grows, that can still be normal, since ZIP overhead lands on every file.
Table: Typical ZIP Savings By Data Type
The ranges below work as a planning yardstick. Your results can land outside these bands, yet this is a solid starting point when you need to guess “Will this zip down enough to send?”
| Data Type | Common Size After ZIP | Why It Lands There |
|---|---|---|
| Plain text (TXT, logs) | 10–40% of original | Lots of repeating words and patterns |
| Structured text (CSV, JSON, XML) | 15–50% of original | Repeated field names, separators, and values |
| Source code folders | 20–50% of original | Repetition across files and templates |
| Spreadsheets with repeated values | 30–80% of original | Swings with text density and embedded media |
| PDF (text-heavy) | 60–90% of original | Some PDFs store text and fonts efficiently already |
| PDF (scan-heavy) | 90–105% of original | Acts like images, so ZIP adds overhead |
| Photos (JPEG/PNG/WebP) | 90–102% of original | Already compressed; little redundancy left |
| Video (MP4/MKV/MOV) | 98–102% of original | Compressed streams look random to ZIP |
| Existing archives (ZIP, 7Z, RAR) | 99–103% of original | Double-compressing adds headers, not savings |
How To Check ZIP Compression Before You Commit
If you are zipping for email or a strict upload limit, a fast test saves time. Zip a small sample first. Keep the same mix of file types and folder depth as the full set, then compare sizes.
Many archive tools also show per-file compression inside the archive view. That per-file view is great for spotting which items are dragging the overall ratio down.
Use The Right Tool For The Job
The built-in ZIP tool in Windows and macOS is fine for packaging. If you need tighter compression, tools like 7-Zip can squeeze a bit more on text-heavy sets. The trade is more CPU time and slower unzip on older devices.
Beware Of “ZIP Shrinks Everything” Myths
If a file came from a camera, a streaming export, or a modern app save, odds are it is already compressed. ZIP is still useful for bundling, but it won’t act like a magic shrink ray.
Practical Ways To Get Smaller ZIP Files
If the goal is fewer bytes on disk or fewer bytes on the wire, focus on the inputs, not just the ZIP setting slider.
Strip Stuff That Doesn’t Need To Ship
- Delete caches, build folders, and temp exports.
- Remove duplicate copies and old versions.
- Exclude big dependency folders when the receiver can reinstall.
Convert Before You Zip When It Fits
- Turn many small logs into one combined log, then zip it.
- Export a database to plain text, then zip the export.
- If images are huge, resize them first, then zip the resized set.
ZIP is lossless. If you want a major size cut on photos, the cut comes from resizing or re-encoding, not from the ZIP step.
Pick Solid Archives Only When You Accept The Trade
Some tools can compress across files as one stream (“solid” mode). It can help when many files share patterns, like logs or code. It also makes random access slower, so pulling one file out can take longer.
Table: What Changes ZIP Results And What To Do
Think of ZIP outcomes as a mix of three forces: file format, repetition, and overhead. This table shows the knobs you can turn and what each one buys you.
| Factor | What You’ll See | Action That Helps |
|---|---|---|
| Lots of text repetition | Big size drop | Use maximum level if time is fine |
| Already-compressed media | Tiny drop or slight growth | Zip for bundling, not shrinking |
| Many tiny files | Overhead becomes visible | Group related items into fewer larger files when possible |
| Mixed folder with one huge video | Overall ratio looks weak | Send media separately or accept little savings |
| Office formats that are ZIP already | Little change | Zip only to combine files into one download |
| High compression level | Slower zip, small extra savings | Use it on text-heavy sets, skip it on media |
| Encryption enabled | Sometimes worse ratio | Compress first, then encrypt in the archive tool |
When ZIP Is The Wrong Tool
If you need the smallest archive, ZIP is not always the winner. Formats like 7Z or Zstandard-based tar archives can beat ZIP on text-heavy folders. For long-term storage, you may also care about error recovery and metadata handling.
If you need “open everywhere” compatibility, ZIP stays hard to beat. Phones, tablets, and operating systems can open ZIP without extra installs. That can matter more than shaving off a few extra megabytes.
A Simple Checklist Before You Zip
- Scan the folder and spot media files that won’t shrink.
- Remove duplicates and junk outputs.
- Zip a small sample and check the size result.
- Raise compression level only for text-heavy sets.
- If you need a major size cut, resize or re-encode inputs first.
Putting It All Together
ZIP compression is pattern hunting. When data repeats, ZIP can slash the size. When data is already compressed, ZIP is mostly a neat wrapper that keeps files together.
If you keep one idea, keep this: file type sets the ceiling. Your best wins come from zipping text-heavy data and from cleaning a folder before you archive it.
References & Sources
- IETF.“RFC 1951: DEFLATE Compressed Data Format Specification (v1.3).”Defines the Deflate format used by many ZIP tools for lossless compression.
- PKWARE.“APPNOTE.TXT – .ZIP File Format Specification.”Documents the ZIP container format, headers, and compression method IDs used in ZIP archives.
