rlx-vlm-base 0.2.6

Shared vision-language base types for RLX VLM/Omni runners (PLAN.md M7)
Documentation
  • Coverage
  • 29.73%
    11 out of 37 items documented0 out of 21 items with examples
  • Size
  • Source code size: 7.69 kB This is the summed size of all the files inside the crates.io package for this release.
  • Documentation size: 513.99 kB This is the summed size of all files generated by rustdoc for all configured targets
  • Ø build duration
  • this release: 2s Average build duration of successful builds.
  • all releases: 3s Average build duration of successful builds in releases after 2024-10-23.
  • Links
  • MIT-RLX/rlx-models
    3 0 0
  • crates.io
  • Dependencies
  • Versions
  • Owners
  • eugenehp

Shared base types for vision-language and omni runners (PLAN.md M7).

rlx-qwen3-vl, rlx-lfm-vl, and rlx-nemotron-omni all need the same shape of plumbing: a per-image preprocessor (resize + patchify), a vision-tower trait, an MLP projector trait, and a multimodal turn interleaver that mixes image / text / (audio) into a single LM token stream. This crate hosts those traits so the family crates stay thin.

Status: TYPE SKELETON. The traits and supporting structs are in place; implementations land alongside the per-family crates as M7 progresses.