minillm 0.1.1

A mini inference engine for running transformer language models
Documentation