format-corpus

May 2, 2026 ยท View on GitHub

An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.

All items are CC0 licenced unless otherwise stated.

A recent summary of the contents of the repository can be found here.

How to Contribute

See the wiki for more information.

See metadata-template.ext.md for a simple per-file metadata template.

Pooled Signatures

As well as pooling example files, we also pool format signatures:

More details here: http://wiki.curatecamp.org/index.php/Improving_format_ID_coverage