Expand description
Audio models for text-to-speech, transcription, and translation
§Audio Models
Data structures for the OpenAI Audio API including text-to-speech, speech-to-text transcription, and translation endpoints.
This module has been restructured for better organization:
types- Core types and enumsrequests- Request structuresresponses- Response structuresbuilders- Builder patternsmodels- Model constants
Modules§
- builders
- Audio API builders for fluent request creation
- models
- Audio model constants
- requests
- Audio API request structures
- responses
- Audio API response structures
- types
- Core audio types and enums
Structs§
- Audio
Models - Common audio models
- Audio
Speech Request - Request for text-to-speech audio generation
- Audio
Speech Response - Response from speech generation endpoint
- Audio
Transcription Request - Request for speech-to-text transcription
- Audio
Transcription Response - Response from transcription endpoint
- Audio
Translation Request - Request for speech-to-text translation
- Audio
Translation Response - Response from translation endpoint
- Speech
Builder - Builder for creating speech requests
- Transcription
Builder - Builder for creating transcription requests
- Transcription
Segment - Segment-level transcription data
- Transcription
Word - Word-level transcription data
- Translation
Builder - Builder for creating translation requests
Enums§
- Audio
Format - Audio output formats
- Timestamp
Granularity - Timestamp granularity for transcriptions
- Transcription
Format - Transcription output formats
- Voice
- Available voices for text-to-speech