Skip to main content

Module audio_processor

Module audio_processor 

Source
Expand description

Audio preprocessing for Whisper ASR.

Load audio files → decode → resample to 16kHz mono → f32 PCM samples. Pure-Rust pipeline via symphonia — no ffmpeg runtime dependency. Supports WAV / MP3 / FLAC / M4A (AAC) / OGG (Vorbis).

Constants§

CHUNK_SAMPLES
Whisper processes 30-second chunks. At 16kHz → 480,000 samples.

Functions§

chunk_pcm
Split PCM samples into 30-second chunks for Whisper processing.
load_audio
Load audio file and return 16kHz mono f32 PCM samples.
load_audio_at_rate
Load audio file and return mono f32 PCM samples at a configurable sample rate.
load_audio_bytes
Load audio from raw bytes (used by the HTTP multipart endpoint).