tandem-eval 0.6.6

Evaluation harness and regression tooling for Tandem