Raw bindings to llama.cpp with CUDA support.
Originally created and built by [utilityai/llama-cpp-rs](https://github.com/utilityai/llama-cpp-rs), which is released under the MIT/Apache licenses.
This version was derived from that project and broken down into internal crates, both to maintain ownership of the code and to match usage to the desired configuration.