Skip to main content

Module vad

Module vad 

Source
Expand description

Enhanced VAD functionality with segment aggregation

This module provides advanced VAD features beyond the basic whisper.cpp implementation, inspired by faster-whisper’s optimizations. VAD is a preprocessing step that happens BEFORE transcription, not part of the transcription API itself.

Structs§

AudioChunk
Audio chunk with metadata for transcription
ChunkMetadata
EnhancedVadParams
Enhanced VAD parameters with aggregation settings
EnhancedVadParamsBuilder
Builder for enhanced VAD parameters
EnhancedWhisperVadProcessor
Enhanced VAD processor with segment aggregation