anamnesis

ἀνάμνησις — Parse any format, recover any precision.

⚠️ This crate is under active development. See ROADMAP.md for the plan and CHANGELOG.md for current progress.

Tested Models

Cross-validated against PyTorch on 7 real FP8 models from 5 quantization tools. Bit-exact output (0 ULP difference). Auto-vectorized: SSE2 on any x86-64, AVX2 with target-cpu=native.

Model	Quantizer	Scheme	Scales	vs PyTorch (AVX2)
EXAONE-4.0-1.2B-FP8	LG AI	Fine-grained	BF16	6.0x faster
Qwen3-1.7B-FP8	Qwen	Fine-grained	BF16	3.9x faster
Qwen3-4B-Instruct-2507-FP8	Qwen	Fine-grained	F16	3.0x faster
Ministral-3-3B-Instruct-2512	Mistral	Per-tensor	BF16	9.7x faster
Llama-3.2-1B-Instruct-FP8	RedHat	Per-tensor	BF16	3.9x faster
Llama-3.2-1B-Instruct-FP8-dynamic	RedHat	Per-channel	BF16	2.7x faster
Llama-3.1-8B-Instruct-FP8	NVIDIA	Per-tensor	F32	6.3x faster

Development

Exclusively developed with Claude Code (dev) and Augment Code (review)
Git workflow managed with Fork
All code follows CONVENTIONS.md, derived from Amphigraphic-Strict's Grit — a strict Rust subset designed to improve AI coding accuracy.

anamnesis 0.1.0

anamnesis

Tested Models

Development