ferrum-models 0.6.0

Model architectures (LLaMA, Qwen, BERT) for Ferrum inference

There is very little structured metadata to build this page from currently. You should check the main library docs, readme, or Cargo.toml in case the author documented the features in them.

This version has 8 feature flags, 0 of them enabled by default.
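Because no feature is enabled by default, a dependent crate has to opt in to each flag it needs. A minimal sketch of a Cargo.toml entry follows, assuming the `cuda` flag is the one wanted. That choice is only illustrative; this page does not document what each flag does or which flags can be combined.

```toml
[dependencies]
# Opt in to the `cuda` feature flag (illustrative choice; swap in any flag listed on this page).
ferrum-models = { version = "0.6.0", features = ["cuda"] }
```

The same effect can be had from the command line with `cargo add ferrum-models@0.6.0 --features cuda`.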

default

This feature flag does not enable additional features.

accelerate

cuda

integration-tests

This feature flag does not enable additional features.

marlin

metal

registry

This feature flag does not enable additional features.

runtime

tensor-parallel