tandem-eval 0.6.3

Evaluation harness and regression tooling for Tandem