pub struct RealtimeTranscriptionSessionCreateRequest {
pub modalities: Option<Vec<RealtimeTranscriptionSessionCreateRequestModality>>,
pub input_audio_format: Option<RealtimeTranscriptionSessionCreateRequestInputAudioFormat>,
pub input_audio_transcription: Option<RealtimeTranscriptionSessionCreateRequestInputAudioTranscription>,
pub turn_detection: Option<RealtimeTranscriptionSessionCreateRequestTurnDetection>,
pub input_audio_noise_reduction: Option<RealtimeTranscriptionSessionCreateRequestInputAudioNoiseReduction>,
pub include: Option<Vec<String>>,
pub client_secret: Option<RealtimeTranscriptionSessionCreateRequestClientSecret>,
}
Expand description
Realtime transcription session object configuration.
Fields§
§modalities: Option<Vec<RealtimeTranscriptionSessionCreateRequestModality>>
The set of modalities the model can respond with. To disable audio, set this to [“text”].
input_audio_format: Option<RealtimeTranscriptionSessionCreateRequestInputAudioFormat>
The format of input audio. Options are pcm16
, g711_ulaw
, or g711_alaw
.
For pcm16
, input audio must be 16-bit PCM at a 24kHz sample rate,
single channel (mono), and little-endian byte order.
input_audio_transcription: Option<RealtimeTranscriptionSessionCreateRequestInputAudioTranscription>
Configuration for input audio transcription. The client can optionally set the language and prompt for transcription, these offer additional guidance to the transcription service.
turn_detection: Option<RealtimeTranscriptionSessionCreateRequestTurnDetection>
Configuration for turn detection, ether Server VAD or Semantic VAD. This can be set to null
to turn off, in which case the client must manually trigger model response.
Server VAD means that the model will detect the start and end of speech based on audio volume and respond at the end of user speech.
Semantic VAD is more advanced and uses a turn detection model (in conjuction with VAD) to semantically estimate whether the user has finished speaking, then dynamically sets a timeout based on this probability. For example, if user audio trails off with “uhhm”, the model will score a low probability of turn end and wait longer for the user to continue speaking. This can be useful for more natural conversations, but may have a higher latency.
input_audio_noise_reduction: Option<RealtimeTranscriptionSessionCreateRequestInputAudioNoiseReduction>
Configuration for input audio noise reduction. This can be set to null
to turn off.
Noise reduction filters audio added to the input audio buffer before it is sent to VAD and the model.
Filtering the audio can improve VAD and turn detection accuracy (reducing false positives) and model performance by improving perception of the input audio.
include: Option<Vec<String>>
The set of items to include in the transcription. Current available items are:
item.input_audio_transcription.logprobs
client_secret: Option<RealtimeTranscriptionSessionCreateRequestClientSecret>
Configuration options for the generated client secret.
Implementations§
Source§impl RealtimeTranscriptionSessionCreateRequest
impl RealtimeTranscriptionSessionCreateRequest
Sourcepub fn builder() -> RealtimeTranscriptionSessionCreateRequestBuilder<((), (), (), (), (), (), ())>
pub fn builder() -> RealtimeTranscriptionSessionCreateRequestBuilder<((), (), (), (), (), (), ())>
Create a builder for building RealtimeTranscriptionSessionCreateRequest
.
On the builder, call .modalities(...)
(optional), .input_audio_format(...)
(optional), .input_audio_transcription(...)
(optional), .turn_detection(...)
(optional), .input_audio_noise_reduction(...)
(optional), .include(...)
(optional), .client_secret(...)
(optional) to set the values of the fields.
Finally, call .build()
to create the instance of RealtimeTranscriptionSessionCreateRequest
.
Trait Implementations§
Source§impl Clone for RealtimeTranscriptionSessionCreateRequest
impl Clone for RealtimeTranscriptionSessionCreateRequest
Source§fn clone(&self) -> RealtimeTranscriptionSessionCreateRequest
fn clone(&self) -> RealtimeTranscriptionSessionCreateRequest
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
source
. Read moreSource§impl Default for RealtimeTranscriptionSessionCreateRequest
impl Default for RealtimeTranscriptionSessionCreateRequest
Source§fn default() -> RealtimeTranscriptionSessionCreateRequest
fn default() -> RealtimeTranscriptionSessionCreateRequest
Source§impl<'de> Deserialize<'de> for RealtimeTranscriptionSessionCreateRequest
impl<'de> Deserialize<'de> for RealtimeTranscriptionSessionCreateRequest
Source§fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>where
D: Deserializer<'de>,
fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>where
D: Deserializer<'de>,
Source§impl PartialEq for RealtimeTranscriptionSessionCreateRequest
impl PartialEq for RealtimeTranscriptionSessionCreateRequest
Source§fn eq(&self, other: &RealtimeTranscriptionSessionCreateRequest) -> bool
fn eq(&self, other: &RealtimeTranscriptionSessionCreateRequest) -> bool
self
and other
values to be equal, and is used by ==
.