Struct RealtimeTranscriptionSessionCreateRequest

Source

pub struct RealtimeTranscriptionSessionCreateRequest {
    pub modalities: Option<Vec<Item>>,
    pub input_audio_format: Option<InputAudioFormat>,
    pub input_audio_transcription: Option<InputAudioTranscription>,
    pub turn_detection: Option<TurnDetection>,
    pub input_audio_noise_reduction: Option<InputAudioNoiseReduction>,
    pub include: Option<Vec<String>>,
    pub client_secret: Option<ClientSecret>,
}

Expand description

Realtime transcription session object configuration.

Fields§

§modalities: Option<Vec<Item>>

The set of modalities the model can respond with. To disable audio, set this to [“text”].

§input_audio_format: Option<InputAudioFormat>

The format of input audio. Options are pcm16, g711_ulaw, or g711_alaw. For pcm16, input audio must be 16-bit PCM at a 24kHz sample rate, single channel (mono), and little-endian byte order.

§input_audio_transcription: Option<InputAudioTranscription>

Configuration for input audio transcription. The client can optionally set the language and prompt for transcription, these offer additional guidance to the transcription service.

§turn_detection: Option<TurnDetection>

Configuration for turn detection, ether Server VAD or Semantic VAD. This can be set to null to turn off, in which case the client must manually trigger model response. Server VAD means that the model will detect the start and end of speech based on audio volume and respond at the end of user speech. Semantic VAD is more advanced and uses a turn detection model (in conjunction with VAD) to semantically estimate whether the user has finished speaking, then dynamically sets a timeout based on this probability. For example, if user audio trails off with “uhhm”, the model will score a low probability of turn end and wait longer for the user to continue speaking. This can be useful for more natural conversations, but may have a higher latency.

§input_audio_noise_reduction: Option<InputAudioNoiseReduction>

Configuration for input audio noise reduction. This can be set to null to turn off. Noise reduction filters audio added to the input audio buffer before it is sent to VAD and the model. Filtering the audio can improve VAD and turn detection accuracy (reducing false positives) and model performance by improving perception of the input audio.

§include: Option<Vec<String>>

The set of items to include in the transcription. Current available items are:

item.input_audio_transcription.logprobs

§client_secret: Option<ClientSecret>

Configuration options for the generated client secret.

Struct RealtimeTranscriptionSessionCreateRequest Copy item path

Fields§

Implementations§

impl RealtimeTranscriptionSessionCreateRequest

pub fn builder() -> RealtimeTranscriptionSessionCreateRequestBuilder<((), (), (), (), (), (), ())>

Trait Implementations§

impl Clone for RealtimeTranscriptionSessionCreateRequest

fn clone(&self) -> RealtimeTranscriptionSessionCreateRequest

fn clone_from(&mut self, source: &Self)

impl Debug for RealtimeTranscriptionSessionCreateRequest

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for RealtimeTranscriptionSessionCreateRequest

fn default() -> RealtimeTranscriptionSessionCreateRequest

impl<'de> Deserialize<'de> for RealtimeTranscriptionSessionCreateRequest

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl PartialEq for RealtimeTranscriptionSessionCreateRequest

fn eq(&self, other: &RealtimeTranscriptionSessionCreateRequest) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Serialize for RealtimeTranscriptionSessionCreateRequest

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

impl StructuralPartialEq for RealtimeTranscriptionSessionCreateRequest

Auto Trait Implementations§

impl Freeze for RealtimeTranscriptionSessionCreateRequest

impl RefUnwindSafe for RealtimeTranscriptionSessionCreateRequest

impl Send for RealtimeTranscriptionSessionCreateRequest

impl Sync for RealtimeTranscriptionSessionCreateRequest

impl Unpin for RealtimeTranscriptionSessionCreateRequest

impl UnwindSafe for RealtimeTranscriptionSessionCreateRequest

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<T> DeserializeOwned for Twhere T: for<'de> Deserialize<'de>,

Struct RealtimeTranscriptionSessionCreateRequest

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,