Crate silero_vad_rs

Expand description

Silero Voice Activity Detection (VAD) - Rust Implementation

This crate provides a Rust implementation of the Silero Voice Activity Detection (VAD) model. It uses the ort crate for efficient ONNX model inference and provides both streaming and batch processing capabilities.

§Features

Voice Activity Detection using the Silero model
Support for both 8kHz and 16kHz audio
Streaming VAD with iterator interface and state management
Batch processing for efficient handling of multiple audio chunks
GPU acceleration support via ONNX Runtime with CUDA
Audio file I/O utilities
Automatic model downloading from Silero repository
Multiple language support (English, Russian, German, Spanish)
Comprehensive error handling

§Example

use silero_vad::{SileroVAD, VADIterator};
use silero_vad::utils::{read_audio, save_audio};
 
fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Load the model
    let model = SileroVAD::new("path/to/silero_vad.onnx")?;
     
    // Create a VAD iterator
    let mut vad = VADIterator::new(
        model,
        0.5,  // threshold
        16000, // sampling rate
        100,   // min silence duration (ms)
        30,    // speech pad (ms)
    );
 
    // Read audio file
    let audio = read_audio("input.wav", 16000)?;
     
    // Get speech timestamps
    let timestamps = vad.get_speech_timestamps(
        &audio.view(),
        250,    // min speech duration (ms)
        f32::INFINITY, // max speech duration (s)
        100,    // min silence duration (ms)
        30,     // speech pad (ms)
    )?;
 
    // Process timestamps
    for ts in timestamps {
        println!("Speech detected from {:.2}s to {:.2}s", ts.start, ts.end);
    }
 
    Ok(())
}

Re-exports§

pub use model::SileroVAD;
pub use vad::VADIterator;
pub use vad::SpeechTimestamps;

Modules§

model: Silero VAD model implementation
utils: Audio utilities for the Silero VAD library
vad: Voice Activity Detection iterator implementation

Enums§

Error: Error types for the Silero VAD library
Language: Supported languages for VAD

Type Aliases§

Result: Result type for the Silero VAD library