Word (.doc)

application/msword

Extensions
.doc .dot
Compressible
no
Status
IANA-registered

The binary Word format of 1997-2003, and the type people still reach for when they mean any Word file — a .docx served as application/msword is the single most common Office mislabel. Inside is an OLE2 compound file, a little filesystem-in-a-file specified by [MS-CFB], and its d0 cf 11 e0 signature is shared byte-for-byte by binary Excel, binary PowerPoint, MSI installers and everything else built on the container. The magic listed here therefore proves only compound file, not Word; identifying Word specifically means finding the WordDocument stream inside.

.dot templates share the type; the modern .docx is a different type entirely.

Magic bytes

d0 cf 11 e0 a1 b1 1a e1 · offset 0

Per [MS-CFB] — Compound File Binary format · test a file against it.

Defined by [MS-DOC] — Word (.doc) binary file format · IANA registration. Registry facts from the IANA media-types registry via mime-db.