rlx-phi 0.2.0

Phi 3 / Phi 4 runner — STUB (PLAN.md M4)


Phi 3 / Phi 4 runner.

Phi-3 and Phi-4 both ship with general.architecture = phi3 in the GGUF files produced by their converters: Phi-4 reuses the Phi-3 arch tag upstream, and llama.cpp has no separate phi4 enum. This crate is a thin wrapper over [rlx_llama32::Llama32Runner] that adds architecture validation.
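As a rough illustration of the arch validation described above, the sketch below checks a GGUF `general.architecture` value against the `phi3` tag. The names `validate_arch`, `ArchError`, and `EXPECTED_ARCH` are hypothetical and exist only for this example; they are not taken from the rlx-phi or rlx-llama32 API, and the metadata value is assumed to have already been read into an `Option<&str>`.

```rust
use std::fmt;

/// Hypothetical error type for this sketch; not part of rlx-phi.
#[derive(Debug, PartialEq)]
pub enum ArchError {
    /// The `general.architecture` key was absent from the metadata.
    Missing,
    /// The key was present but named a different architecture.
    Unsupported(String),
}

impl fmt::Display for ArchError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            ArchError::Missing => write!(f, "general.architecture is missing"),
            ArchError::Unsupported(a) => {
                write!(f, "unsupported architecture `{a}`, expected `{EXPECTED_ARCH}`")
            }
        }
    }
}

/// Both Phi-3 and Phi-4 GGUF files carry this tag.
pub const EXPECTED_ARCH: &str = "phi3";

/// Accept only metadata whose architecture tag is exactly `phi3`.
/// `arch` stands in for the value read from the GGUF header (assumption).
pub fn validate_arch(arch: Option<&str>) -> Result<(), ArchError> {
    match arch {
        Some(a) if a == EXPECTED_ARCH => Ok(()),
        Some(other) => Err(ArchError::Unsupported(other.to_string())),
        None => Err(ArchError::Missing),
    }
}
```

A caller would presumably run a check like this before handing the model to the wrapped runner, so that a Llama or Qwen file is rejected with a clear error instead of being loaded with the wrong assumptions.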

Caveat: Phi-3's per-layer LayerNorm placement and partial-RoPE split are not yet implemented in rlx-llama32. Until they land, runs will produce tokens, but the output will not match the upstream reference. This is tracked as a PLAN.md M4 follow-up.
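For readers unfamiliar with the term, a partial-RoPE split means that only a leading slice of each attention head is rotated by position, while the remaining dimensions pass through unchanged. The sketch below shows the general technique only. The function name, the `rot_dim` and `theta_base` parameters, and the half-split (NeoX-style) pairing of element `i` with element `i + rot_dim / 2` are assumptions made for illustration; this is not the rlx-llama32 implementation, and it may not match the exact layout the upstream reference uses.

```rust
/// Rotate the first `rot_dim` entries of one head vector by position `pos`,
/// leaving entries `rot_dim..` untouched (partial RoPE).
///
/// Assumption: half-split pairing, where element `i` is rotated together
/// with element `i + rot_dim / 2`.
pub fn apply_partial_rope(head: &mut [f32], rot_dim: usize, pos: usize, theta_base: f32) {
    assert!(rot_dim % 2 == 0, "rot_dim must be even");
    assert!(rot_dim <= head.len(), "rot_dim exceeds head size");

    let half = rot_dim / 2;
    for i in 0..half {
        // Per-pair frequency: theta_base^(-2i / rot_dim).
        let freq = theta_base.powf(-2.0 * i as f32 / rot_dim as f32);
        let (sin, cos) = (pos as f32 * freq).sin_cos();

        // Standard 2-D rotation of the pair (a, b) by the angle.
        let (a, b) = (head[i], head[i + half]);
        head[i] = a * cos - b * sin;
        head[i + half] = a * sin + b * cos;
    }
}
```

With `rot_dim` equal to the full head size this reduces to ordinary RoPE, which is why a runner that ignores the split still produces tokens but diverges from the reference on models that rotate only part of each head.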