§speech-prep
Speech-focused audio preprocessing for Rust.
- Voice activity detection (dual-metric: energy + spectral flux)
- Audio format detection, plus WAV decoding to 16 kHz mono PCM
- Preprocessing (DC removal, high-pass filter, noise reduction, normalization)
- Speech-aligned chunking with overlap handling
- Quality assessment metrics
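To make the "energy + spectral flux" pairing in the list above concrete, here is a minimal, self-contained sketch of the two per-frame metrics. It does not use this crate and is not its implementation: the function names `frame_energy`, `magnitude_spectrum`, and `spectral_flux` are hypothetical, and the naive DFT stands in for whatever transform a real detector would use. How the two metrics are combined into a speech decision is not stated here, so the sketch stops at computing them.

```rust
use std::f32::consts::PI;

/// Mean squared amplitude of one frame (hypothetical helper, not the crate's API).
fn frame_energy(frame: &[f32]) -> f32 {
    if frame.is_empty() {
        return 0.0;
    }
    frame.iter().map(|s| s * s).sum::<f32>() / frame.len() as f32
}

/// Magnitude spectrum for bins 0..=n/2, using a naive O(n^2) DFT.
/// Fine for short illustration frames; real code would use an FFT.
fn magnitude_spectrum(frame: &[f32]) -> Vec<f32> {
    let n = frame.len();
    (0..=n / 2)
        .map(|k| {
            let (mut re, mut im) = (0.0f32, 0.0f32);
            for (t, &x) in frame.iter().enumerate() {
                let phase = -2.0 * PI * (k as f32) * (t as f32) / n as f32;
                re += x * phase.cos();
                im += x * phase.sin();
            }
            (re * re + im * im).sqrt()
        })
        .collect()
}

/// Spectral flux between two consecutive frames: the sum of squared
/// *increases* in bin magnitude (decreases are ignored).
fn spectral_flux(prev: &[f32], cur: &[f32]) -> f32 {
    prev.iter()
        .zip(cur)
        .map(|(p, c)| {
            let rise = (c - p).max(0.0);
            rise * rise
        })
        .sum()
}
```

Energy responds to loudness, while flux responds to spectral change such as a speech onset. A steady tone therefore has high energy but near-zero flux after its first frame, which is one reason to track both rather than either alone.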
§Usage
use speech_prep::VadConfig;
let config = VadConfig::default();
assert!(config.base_threshold > 0.0);
Re-exports§
pub use buffer::AudioBuffer;
pub use buffer::AudioBufferMetadata;
pub use chunker::ChunkBoundary;
pub use chunker::Chunker;
pub use chunker::ChunkerConfig;
pub use chunker::ProcessedChunk;
pub use error::Error;
pub use error::Result;
pub use pipeline::AudioPipelineCoordinator;
pub use pipeline::ProcessingResult;
pub use pipeline::StageLatencies;
pub use preprocessing::DcHighPassFilter;
pub use preprocessing::HighpassOrder;
pub use preprocessing::NoiseReducer;
pub use preprocessing::NoiseReductionConfig;
pub use preprocessing::PreprocessingConfig;
pub use preprocessing::VadContext;
pub use time::AudioDuration;
pub use time::AudioInstant;
pub use time::AudioTimestamp;
pub use types::AudioChunk;
pub use vad::NoopVadMetricsCollector;
pub use vad::SpeechChunk;
pub use vad::VadConfig;
pub use vad::VadDetector;
pub use vad::VadMetricsCollector;
Modules§
- buffer: Audio buffer types for batch audio processing.
- chunker: Audio chunking aligned to VAD segments.
- converter: High-level audio format conversion pipeline.
- error: Error types for speech-prep.
- format: Audio format detection and metadata extraction for the audio pipeline.
- pipeline: End-to-end audio processing coordinator.
- preprocessing: Audio preprocessing module for clean, analysis-ready audio.
- time: Stream-relative time types for audio processing.
- types: Core types for audio processing.
- vad: Voice Activity Detection (VAD) with dual-metric analysis.
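As an illustration of what "chunking aligned to VAD segments" with overlap can mean, the sketch below splits one detected speech segment into bounded chunks that share a few samples with their neighbor. It is a standalone assumption-laden example, not the `chunker` module's algorithm: the `Span` type, the `split_with_overlap` function, and its parameters are all hypothetical.

```rust
/// Hypothetical half-open sample range `[start, end)`; not a type from this crate.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Span {
    start: usize,
    end: usize,
}

/// Split one speech segment into chunks of at most `max_len` samples.
/// Every chunk after the first begins `overlap` samples before the
/// previous chunk ends, so audio at a boundary appears in both chunks.
fn split_with_overlap(seg: Span, max_len: usize, overlap: usize) -> Vec<Span> {
    // `overlap < max_len` guarantees each chunk starts later than the last.
    assert!(max_len > 0 && overlap < max_len, "overlap must be smaller than max_len");
    let mut chunks = Vec::new();
    let mut start = seg.start;
    while start < seg.end {
        let end = (start + max_len).min(seg.end);
        chunks.push(Span { start, end });
        if end == seg.end {
            break; // the segment is fully covered
        }
        start = end - overlap;
    }
    chunks
}
```

At 16 kHz, for example, `max_len = 16_000` and `overlap = 1_600` would yield chunks of at most one second that share 100 ms with the preceding chunk.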
Structs§
- VADStats: VAD statistics snapshot.