whisperforge-core 0.4.0

GPU-accelerated Whisper model inference with streaming audio, quantization, and KV-cached decoding

Coverage
53.07%
164 out of 309 items documented0 out of 100 items with examples
Size
Source code size: 362.67 kB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 6.82 MB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 49s Average build duration of successful builds.
all releases: 1m 11s Average build duration of successful builds in releases after 2024-10-23.
Links
Homepage
bevsxyz/WhisperForge
2 0 3
crates.io
Dependencies
Versions
Owners

whisperforge-core

GPU-accelerated Whisper model inference with streaming audio, quantization, and KV-cached decoding.

Quick Links

Full Documentation: WhisperForge Repository
Installation: See Installation Guide
Examples: Library Usage

Features

All Whisper model sizes (tiny.en through large-v2/v3)
GPU acceleration via WGPU (Vulkan/DX12/Metal)
burn-flex backend: CPU + automatic GPU dispatch
INT8 quantization (~4× compression)
Streaming audio pipeline with resampling
KV-cache O(n) decoder
Per-token timestamps via cross-attention

Usage

use whisperforge_core::{Model, WhisperConfig};
use std::path::Path;

let config = WhisperConfig::tiny_en();
let model = Model::load(Path::new("models/tiny_en_converted"))?;
let transcript = model.transcribe(audio_samples, sample_rate)?;
println!("{}", transcript);

See Also

whisperforge — Command-line binary (wf) including wf convert for HuggingFace model conversion
whisperforge-align — VAD & SRT output
whisperforge-diarize — Speaker diarization

For full documentation, visit the WhisperForge repository.