mistralrs-quant 0.7.0

Fast, flexible LLM inference.
Documentation

mistralrs-quant

There is very little structured metadata to build this page from currently. You should check the main library docs, readme, or Cargo.toml in case the author documented the features in them.

This version has 20 feature flags, 0 of them enabled by default.

accelerate

cuda

cuda-11040

This feature flag does not enable additional features.

cuda-11050

This feature flag does not enable additional features.

cuda-11060

This feature flag does not enable additional features.

cuda-11070

This feature flag does not enable additional features.

cuda-11080

This feature flag does not enable additional features.

cuda-12000

This feature flag does not enable additional features.

cuda-12010

This feature flag does not enable additional features.

cuda-12020

This feature flag does not enable additional features.

cuda-12030

This feature flag does not enable additional features.

cuda-12040

This feature flag does not enable additional features.

cuda-12050

This feature flag does not enable additional features.

cuda-12060

This feature flag does not enable additional features.

cuda-12080

This feature flag does not enable additional features.

cuda-12090

This feature flag does not enable additional features.

cuda-13000

This feature flag does not enable additional features.

metal

nccl

ring

This feature flag does not enable additional features.