pub struct TreeTokenizer {
pub universe: Universe,
pub tree: HashMap<String, Lapper<u32, u32>>,
}Expand description
A tokenizer that uses an interval tree to find overlaps
§Attributes
universe- the universe of regionstree- the interval tree
§Methods
from- create a new TreeTokenizer from a bed filetokenize_region- tokenize a region into the vocabulary of the tokenizertokenize_region_set- tokenize a region set into the vocabulary of the tokenizertokenize_bed_set- tokenize a bed set into the vocabulary of the tokenizerunknown_token- get the unknown token
Fields§
§universe: Universe§tree: HashMap<String, Lapper<u32, u32>>Implementations§
Source§impl TreeTokenizer
impl TreeTokenizer
pub fn tokenize_region_set_batch( &self, region_sets: &Vec<RegionSet>, ) -> Option<Vec<TokenizedRegionSet<'_>>>
Trait Implementations§
Source§impl From<&Path> for TreeTokenizer
impl From<&Path> for TreeTokenizer
Source§impl Tokenizer for TreeTokenizer
impl Tokenizer for TreeTokenizer
Source§fn tokenize_region(&self, region: &Region) -> Option<TokenizedRegionSet<'_>>
fn tokenize_region(&self, region: &Region) -> Option<TokenizedRegionSet<'_>>
Source§fn tokenize_region_set(
&self,
region_set: &RegionSet,
) -> Option<TokenizedRegionSet<'_>>
fn tokenize_region_set( &self, region_set: &RegionSet, ) -> Option<TokenizedRegionSet<'_>>
Tokenize a region set into the vocabulary of the tokenizer Read more
fn padding_token(&self) -> Region
fn unknown_token(&self) -> Region
Source§fn tokenize_bed_set(
&self,
bed_set: &BedSet,
) -> Option<Vec<TokenizedRegionSet<'_>>>
fn tokenize_bed_set( &self, bed_set: &BedSet, ) -> Option<Vec<TokenizedRegionSet<'_>>>
Tokenize a bed set into the vocabulary of the tokenizer Read more
Auto Trait Implementations§
impl Freeze for TreeTokenizer
impl RefUnwindSafe for TreeTokenizer
impl Send for TreeTokenizer
impl Sync for TreeTokenizer
impl Unpin for TreeTokenizer
impl UnwindSafe for TreeTokenizer
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more