1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
//! [Whisper](https://github.com/openai/whisper) is a Speech-to-Text (STT) model developed by [OpenAI](https://openai.com/).
//!
//! This implementation of Whisper is powered by [candle](https://github.com/huggingface/candle).
//! Long-form transcription is fully implemented, as described in [the whisper papper](https://arxiv.org/abs/2212.04356)
//!
//! Whisper comes with multiple wheights/checkpoints
//! that differ by inference speed, accuracy, voabulary and supported languages.
//! Refer to the below table for specifics.
//!
//! | Model | Params (M) | Relative speed | Short-Form WER | Long-Form WER | Vocab | Languages |
//! |-----------------|------------|----------------|----------------|---------------|-------|-----------|
//! | QuantizedTiny | | | | | V1 | All |
//! | Tiny | 39 | 5.75 | | | V1 | All |
//! | Base | 74 | 4.6 | | | V1 | All |
//! | Small | 244 | 2.4 | | | V1 | All |
//! | Medium | 769 | 1.35 | | | V1 | All |
//! | Large | 1550 | 1.0 | | | V1 | All |
//! | QuantizedTinyEn | | | | | EnV1 | English |
//! | TinyEn | 39 | 5.75 | 18.9 | 18.9 | EnV1 | English |
//! | BaseEn | 74 | 4.6 | 14.3 | 15.7 | EnV1 | English |
//! | SmallEn | 244 | 2.4 | 10.8 | 14.7 | EnV1 | English |
//! | MediumEn | 769 | 1.35 | 9.5 | 12.3 | EnV1 | English |
//! | LargeV2 | 1550 | 1.0 | 9.1 | 11.7 | V1 | All |
//! | LargeV3 | 1550 | 1.0 | 8.4 | 11.0 | V2 | All |
//! | DistilLargeEnV3 | 756 | 6.3 | 9.7 | 10.8 | V2 | English |
//! | DistilLargeEnV2 | 756 | 5.8 | 10.1 | 11.6 | V1 | English |
//! | DistilMediumEn | 394 | 6.8 | 11.1 | 12.4 | V1 | English |
//!
//! # Models
//!
//! ## Model Types
//!
//! ### [Distil-Whisper](https://github.com/huggingface/distil-whisper)
//!
//! ### Quantized
pub use Language;
use LanguageState;
pub use Model;
pub use TranscriberError;
use Type;
use Error;
use Tokenizer;