rlx-models-core 0.2.1

Shared config, weight loading, and compile helpers for RLX model crates

Coverage
68.73%
255 out of 371 items documented1 out of 254 items with examples
Size
Source code size: 211.67 kB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 5.97 MB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 18s Average build duration of successful builds.
all releases: 17s Average build duration of successful builds in releases after 2024-10-23.
Links
Homepage
Repository
crates.io
Dependencies
Versions
- 0.2.1 (2026-05-29)
- 0.2.0 (2026-05-29)
Owners

rlx-models-core

Shared config, weight loading, compile profiles, and packed GGUF prefill helpers for RLX model crates (published on crates.io as rlx-models-core; import as rlx_core).

Version 0.2.1 adds packed GGUF support:

API	Role
`packed_gguf_compile_guard`	Metal `RLX_DISABLE_MPSGRAPH`, MLX `RLX_MLX_MODE=eager` during compile
`compile_options_for_packed_gguf_prefill_with_profile`	Fusion off on wgpu/CUDA/ROCm for `FusedResidualRmsNorm` gaps
`packed_gguf_execution_device`	Route MLX/wgpu/CUDA packed prefill to CPU when needed

Used by rlx-llama32, rlx-qwen3, rlx-gemma, and rlx-minicpm5.

rlx-models-core 0.2.1

rlx-models-core

See also