# BPE Public Benchmarks
## Compression ratio (chars / tokens)
| GPT-2 (reported) | — | — | — |
| tiktoken (GPT-2 vocab) | — | — | — |
| Your BPE v1 | — | — | — |
| Your BPE v2 | — | — | — |
## Fertility score (avg tokens per word)
| GPT-2 (reported) | — | — | — |
| tiktoken (GPT-2 vocab) | — | — | — |
| Your BPE v1 | — | — | — |
| Your BPE v2 | — | — | — |
## OOV rate (% unknown tokens)
| GPT-2 (reported) | — | — | — |
| tiktoken (GPT-2 vocab) | — | — | — |
| Your BPE v1 | — | — | — |
| Your BPE v2 | — | — | — |
## Encode throughput (tokens / sec)
| GPT-2 (reported) | — | — | — |
| tiktoken (GPT-2 vocab) | — | — | — |
| Your BPE v1 | — | — | — |
| Your BPE v2 | — | — | — |