Document types, formats: file size, part 0

Written by

in

,

Looking at relative file sizes for the same content.

The following is by my understanding.

One issue that’s long interested me is file size. I opened a plain-text document, then typed Hello world! and saved it. Next, I did the same for a Word (docx) document. Then, I also saved the docx as a pdf in Word. Finally, I compressed both the docx and the pdf to respective zip files.

Here are the relevant file sizes:

  • .txt file: 12 bytes
  • .docx file: 13 kilobytes
  • .pdf file: 71.9 kilobytes
  • compressed .docx file: 10.4 kilobytes
  • compressed .pdf file: 60.6 kilobytes

Today I looked into the .docx format specifically, why it might be so much bigger than its plain-text version.

It turns out that the .docx format consists of numerous .xml files that describe said document. Said files, one might suspect, amount to overhead of about 13 kilobytes.

Interesting, eh?

Source:

agiledocumentation.co.uk

-JS

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *