mnem-bench 0.1.6

v1.0 benchmark harness for mnem - download, run, score LongMemEval / LoCoMo against the cpu-local adapter.
Documentation
# mnem-bench


Benchmark harness for mnem. Ships the runner, dataset cache,
LongMemEval / LoCoMo / ConvoMem / MemBench / hybrid-v4 scorers, the
cpu-local mnem adapter, the bundled ONNX MiniLM-L6-v2 embedder, and
the `mnem bench` interactive TUI.

## Status


- 6 benches: LongMemEval, LoCoMo, ConvoMem, MemBench (simple-roles +
  highlevel-movie), LongMemEval-hybrid-v4. All shipped, end-to-end.
- Adapter: in-process `mnem` (`Repo::open_in_memory`).
- Run mode: cpu-local (single-threaded, in-process).
- Embedders: ONNX MiniLM (default, bundled, matches headline numbers)
  and bag-of-tokens (offline / CI fallback).

See ``
for the design rationale.

## Quickstart


```bash
mnem bench               # interactive TUI
mnem bench list          # JSON of available benches
mnem bench fetch longmemeval    # cache LongMemEval (~264 MB, HF)
mnem bench fetch locomo         # cache LoCoMo (~3 MB, GitHub raw)
mnem bench fetch                # fetch every bench
mnem bench run --benches longmemeval --with mnem --mode cpu-local --out ./out
mnem bench results ./out        # re-render RESULTS.md
```

## Smoke test


```bash
cargo run --example smoke -p mnem-bench
```

A 5-question LongMemEval canary against a synthetic in-process
mnem repo. Exits non-zero if `recall@5 == 0`.