rlx-vlm-base 0.2.6

Shared vision-language base types for RLX VLM/Omni runners (PLAN.md M7)

Coverage
29.73%
11 out of 37 items documented0 out of 21 items with examples
Size
Source code size: 7.69 kB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 513.99 kB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 2s Average build duration of successful builds.
all releases: 3s Average build duration of successful builds in releases after 2024-10-23.
Links
MIT-RLX/rlx-models
3 0 0
crates.io
Dependencies
- anyhow ^1 normal
Versions
Owners

Shared base types for vision-language and omni runners (PLAN.md M7).

rlx-qwen3-vl, rlx-lfm-vl, and rlx-nemotron-omni all need the same shape of plumbing: a per-image preprocessor (resize + patchify), a vision-tower trait, an MLP projector trait, and a multimodal turn interleaver that mixes image / text / (audio) into a single LM token stream. This crate hosts those traits so the family crates stay thin.

Status: TYPE SKELETON. The traits and supporting structs are in place; implementations land alongside the per-family crates as M7 progresses.