Expand description
Module for audio processing, including transcription, translation, and speech synthesis. This module defines the structures and methods for handling audio-based requests and responses. It includes functionality for audio transcription, translation, and speech synthesis. The module facilitates the creation, serialization, and deserialization of requests and responses related to these audio operations.
Features include:
- AudioTranscriptionRequest: Handles requests for audio file transcription using specified models.
- AudioTranscriptionResponse: Returns the transcription text along with optional headers.
- AudioTranslationRequest: Manages requests for translating audio content using specific models.
- AudioTranslationResponse: Delivers translated text and optional response headers.
- AudioSpeechRequest: Manages requests for generating speech from text using designated voice models.
- AudioSpeechResponse: Provides the success status of the speech synthesis operation and optional headers.
Constants for model and voice identifiers are also defined to standardize the references used across requests.
Structs§
- Audio
Speech Request - Represents a request for text-to-speech synthesis.
- Audio
Speech Response - Represents the response from a text-to-speech synthesis request.
- Audio
Transcription Request - Represents a request for audio transcription.
- Audio
Transcription Response - Represents the response from an audio transcription request.
- Audio
Translation Request - Represents a request for audio translation.
- Audio
Translation Response - Represents the response from an audio translation request.
Constants§
- TTS_1
- Constant for the TTS-1 model identifier.
- TTS_
1_ HD - Constant for the TTS-1 HD model identifier.
- VOICE_
ALLOY - Constant for the Alloy voice model.
- VOICE_
ECHO - Constant for the Echo voice model.
- VOICE_
FABLE - Constant for the Fable voice model.
- VOICE_
NOVA - Constant for the Nova voice model.
- VOICE_
ONYX - Constant for the Onyx voice model.
- VOICE_
SHIMMER - Constant for the Shimmer voice model.
- WHISPER_
1