Structs
Functions
- vosk_batch_model_free ⚠ - Releases batch model object
- vosk_batch_model_new ⚠ - Creates the batch model object
- vosk_batch_model_wait ⚠ - Wait for the processing
- vosk_batch_recognizer_accept_waveform ⚠ - Accept batch voice data
- vosk_batch_recognizer_finish_stream ⚠ - Closes the stream
- vosk_batch_recognizer_free ⚠ - Releases batch recognizer object
- vosk_batch_recognizer_front_result ⚠ - Return results
- vosk_batch_recognizer_get_pending_chunks ⚠ - Get the number of pending chunks, for more intelligent waiting
- vosk_batch_recognizer_new ⚠ - Creates batch recognizer object. Returns the recognizer object, or NULL if a problem occurred
- vosk_batch_recognizer_pop ⚠ - Release and free first retrieved result
- vosk_batch_recognizer_set_nlsml ⚠ - Set NLSML output. Parameter nlsml: boolean value
- vosk_gpu_init ⚠ - Init, automatically select a CUDA device and allow multithreading. Must be called once from the main thread. Has no effect if HAVE_CUDA flag is not set.
- vosk_gpu_thread_init ⚠ - Init CUDA device in a multi-threaded environment. Must be called for each thread. Has no effect if HAVE_CUDA flag is not set.
- vosk_model_find_word ⚠ - Checks whether a word can be recognized by the model. Parameter word: the word to look up. Returns the word symbol if the word exists in the model, or -1 otherwise. Note that word symbol 0 is used for <epsilon>.
- vosk_model_free ⚠ - Releases the model memory
- vosk_model_new ⚠ - Loads model data from the file and returns the model object
- vosk_recognizer_accept_waveform ⚠ - Accept voice data
- vosk_recognizer_accept_waveform_f ⚠ - Same as vosk_recognizer_accept_waveform, but takes float data, for language bindings where audio is an array of floats
- vosk_recognizer_accept_waveform_s ⚠ - Same as vosk_recognizer_accept_waveform, but takes short data, for language bindings where audio is an array of shorts
- vosk_recognizer_final_result ⚠ - Returns the speech recognition result. Same as vosk_recognizer_result, but doesn’t wait for silence. You usually call it at the end of the stream to get the final bits of audio. It flushes the feature pipeline so that all remaining audio chunks are processed.
- vosk_recognizer_free ⚠ - Releases recognizer object
- vosk_recognizer_new ⚠ - Creates the recognizer object
- vosk_recognizer_new_grm ⚠ - Creates the recognizer object with the phrase list
- vosk_recognizer_new_spk ⚠ - Creates the recognizer object with speaker recognition
- vosk_recognizer_partial_result ⚠ - Returns partial speech recognition
- vosk_recognizer_reset ⚠ - Resets the recognizer
- vosk_recognizer_result ⚠ - Returns speech recognition result
- vosk_recognizer_set_max_alternatives ⚠ - Configures recognizer to output n-best results
- vosk_recognizer_set_nlsml ⚠ - Set NLSML output. Parameter nlsml: boolean value
- vosk_recognizer_set_partial_words ⚠ - Like vosk_recognizer_set_words, but returns words and confidences in partial results
- vosk_recognizer_set_spk_model ⚠ - Adds speaker model to already initialized recognizer
- vosk_recognizer_set_words ⚠ - Enables words with times in the output
- vosk_set_log_level ⚠ - Set log level for Kaldi messages
- vosk_spk_model_free ⚠ - Releases the model memory
- vosk_spk_model_new ⚠ - Loads speaker model data from the file and returns the model object
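
Every function listed above is unsafe, and the result functions hand back raw C strings. The following is a minimal sketch of one way a caller might copy such a string into an owned Rust `String`. It assumes the result functions return a NUL-terminated `*const c_char` that is only valid for a limited time; this page does not state either point. The helper name `owned_result` is hypothetical and is not part of this crate, and the example uses a stand-in `CString` rather than calling into Vosk.

```rust
use std::ffi::{CStr, CString};
use std::os::raw::c_char;

/// Hypothetical helper (not part of this crate): copies a NUL-terminated
/// C string into an owned `String`. Returns `None` for a null pointer.
///
/// # Safety
/// `ptr` must be null or point to a valid NUL-terminated string that
/// remains alive for the duration of this call.
unsafe fn owned_result(ptr: *const c_char) -> Option<String> {
    if ptr.is_null() {
        return None;
    }
    // Copy immediately, on the assumption that the library may reuse or
    // free the buffer on a later call. Invalid UTF-8 is replaced lossily.
    Some(unsafe { CStr::from_ptr(ptr) }.to_string_lossy().into_owned())
}

fn main() {
    // Stand-in for a pointer returned by a result function; no Vosk call
    // is made here, so this compiles and runs without the native library.
    let fake = CString::new(r#"{"text": "hello world"}"#).unwrap();
    let json = unsafe { owned_result(fake.as_ptr()) };
    println!("{:?}", json);

    // A null pointer is reported as `None` instead of being dereferenced.
    let missing = unsafe { owned_result(std::ptr::null()) };
    println!("{:?}", missing);
}
```

Copying the string right away keeps the unsafe surface small: the rest of the program works with an ordinary `String` and does not depend on how long the library keeps its buffer alive.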