pub struct CoverageAnalysis {
pub char_coverage_rate: f64,
pub word_coverage_rate: f64,
pub compression_ratio: f64,
pub total_chars: usize,
pub total_words: usize,
pub total_tokens: usize,
pub covered_chars: usize,
pub covered_words: usize,
pub length_distribution: HashMap<usize, u32>,
pub oov_tokens: Vec<String>,
pub vocab_size: usize,
}Expand description
Detailed coverage analysis results for tokenizer evaluation.
Fields§
§char_coverage_rate: f64Character-level coverage rate (0.0 to 1.0)
word_coverage_rate: f64Word-level coverage rate (0.0 to 1.0)
compression_ratio: f64Compression ratio (tokens per character)
total_chars: usizeTotal characters in the test corpus
total_words: usizeTotal words in the test corpus
total_tokens: usizeTotal tokens produced by tokenization
covered_chars: usizeNumber of characters covered by the vocabulary
covered_words: usizeNumber of words covered by the vocabulary
length_distribution: HashMap<usize, u32>Distribution of token lengths
oov_tokens: Vec<String>List of out-of-vocabulary tokens encountered
vocab_size: usizeSize of the tokenizer vocabulary
Implementations§
Source§impl CoverageAnalysis
impl CoverageAnalysis
Sourcepub fn summary(&self) -> String
pub fn summary(&self) -> String
Generate a comprehensive summary report of the coverage analysis.
Sourcepub fn average_token_length(&self) -> f64
pub fn average_token_length(&self) -> f64
Calculate average token length based on the length distribution.
Sourcepub fn top_token_lengths(&self, n: usize) -> Vec<(usize, u32)>
pub fn top_token_lengths(&self, n: usize) -> Vec<(usize, u32)>
Get the most common token lengths with their frequencies.
Sourcepub fn efficiency_score(&self) -> f64
pub fn efficiency_score(&self) -> f64
Calculate efficiency score combining coverage and compression.
Trait Implementations§
Source§impl Clone for CoverageAnalysis
impl Clone for CoverageAnalysis
Source§fn clone(&self) -> CoverageAnalysis
fn clone(&self) -> CoverageAnalysis
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for CoverageAnalysis
impl Debug for CoverageAnalysis
Source§impl<'de> Deserialize<'de> for CoverageAnalysis
impl<'de> Deserialize<'de> for CoverageAnalysis
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Auto Trait Implementations§
impl Freeze for CoverageAnalysis
impl RefUnwindSafe for CoverageAnalysis
impl Send for CoverageAnalysis
impl Sync for CoverageAnalysis
impl Unpin for CoverageAnalysis
impl UnsafeUnpin for CoverageAnalysis
impl UnwindSafe for CoverageAnalysis
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> ConfigSerializable for Twhere
T: Serialize + for<'de> Deserialize<'de>,
impl<T> ConfigSerializable for Twhere
T: Serialize + for<'de> Deserialize<'de>,
Source§fn save_to_file(&self, path: &Path) -> Result<(), TrustformersError>
fn save_to_file(&self, path: &Path) -> Result<(), TrustformersError>
Save to file
Source§fn load_from_file(path: &Path) -> Result<Self, TrustformersError>where
Self: Sized,
fn load_from_file(path: &Path) -> Result<Self, TrustformersError>where
Self: Sized,
Load from file
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more