AutoAgents Speech
Speech (TTS/STT) provider abstractions for the AutoAgents framework.
This crate provides trait-based abstraction layers for speech providers, allowing different backends to be used interchangeably within the AutoAgents ecosystem.
Features
TTS (Text-to-Speech)
- Speech Generation: Generate audio from text
- Voice Management: Use predefined voices
- Streaming Support: Optional streaming for real-time audio generation
- Model Management: Support for multiple models and languages
STT (Speech-to-Text) - Coming Soon
Architecture
The crate follows a trait-based design with provider implementations in the providers module:
TTS Traits
TTSProvider: Marker trait combining all TTS capabilitiesTTSSpeechProvider: Speech generation capabilitiesTTSModelsProvider: Model and language support
Providers
Enable providers using feature flags:
pocket-tts: Pocket-TTS model support
Example
use ;
async