ferrum-models 0.4.0

Model architectures (LLaMA, Qwen, BERT) for Ferrum inference

Very little structured metadata is currently available to build this page from. Check the main library docs, the README, or Cargo.toml in case the author documented the features there.

This version has 6 feature flags, 0 of them enabled by default.

default

This feature flag does not enable additional features.

cuda

integration-tests

This feature flag does not enable additional features.

marlin

registry

This feature flag does not enable additional features.

runtime

tensor-parallel
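
Because none of these flags is enabled by default, any gated functionality has to be opted into explicitly. As a rough illustration of how Cargo features surface in Rust code, the sketch below uses the standard `cfg!` macro to report which of the listed feature names are active for the crate being compiled. The function name `enabled_features` is hypothetical, and whether ferrum-models gates its code this way is an assumption; only the feature names are taken from the listing above.

```rust
/// Hypothetical helper: returns the listed feature names that are enabled
/// for the crate this code is compiled in. `cfg!(feature = "...")` is
/// evaluated at compile time against that crate's own Cargo features.
#[allow(unexpected_cfgs)] // feature names may not be declared when built standalone
fn enabled_features() -> Vec<&'static str> {
    let mut enabled = Vec::new();
    if cfg!(feature = "cuda") {
        enabled.push("cuda");
    }
    if cfg!(feature = "integration-tests") {
        enabled.push("integration-tests");
    }
    if cfg!(feature = "marlin") {
        enabled.push("marlin");
    }
    if cfg!(feature = "registry") {
        enabled.push("registry");
    }
    if cfg!(feature = "runtime") {
        enabled.push("runtime");
    }
    if cfg!(feature = "tensor-parallel") {
        enabled.push("tensor-parallel");
    }
    enabled
}

fn main() {
    let enabled = enabled_features();
    if enabled.is_empty() {
        println!("no optional features enabled");
    } else {
        println!("enabled features: {}", enabled.join(", "));
    }
}
```

A dependent crate would normally opt in through the `features` key of its ferrum-models dependency entry in Cargo.toml, or with `cargo build --features` when building the crate that declares them.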