pub struct RankTransform {
    pub ranks: SymbolRanks,
}
Expand description

Tools based on transforming the alphabet symbols to their lexicographical ranks.

Lexicographical rank is computed using u8 representations, i.e. ASCII codes, of the input characters. For example, assuming that the alphabet consists of the symbols A, C, G, and T, this will yield ranks 0, 1, 2, 3 for them, respectively.

RankTransform can be used in to perform bit encoding for texts over a given alphabet via bio::data_structures::bitenc.

Fields§

§ranks: SymbolRanks

Implementations§

source§

impl RankTransform

source

pub fn new(alphabet: &Alphabet) -> Self

Construct a new RankTransform.

Complexity: O(n), where n is the number of symbols in the alphabet.

Example
use bio::alphabets;

let dna_alphabet = alphabets::Alphabet::new(b"acgtACGT");
let dna_ranks = alphabets::RankTransform::new(&dna_alphabet);
source

pub fn get(&self, a: u8) -> u8

Get the rank of symbol a.

This method panics for characters not contained in the alphabet.

Complexity: O(1)

Example
use bio::alphabets;

let dna_alphabet = alphabets::Alphabet::new(b"acgtACGT");
let dna_ranks = alphabets::RankTransform::new(&dna_alphabet);
assert_eq!(dna_ranks.get(65), 0); // "A"
assert_eq!(dna_ranks.get(116), 7); // "t"
source

pub fn transform<C, T>(&self, text: T) -> Vec<u8>where C: Borrow<u8>, T: IntoIterator<Item = C>,

Transform a given text into a vector of rank values.

Complexity: O(n), where n is the length of the text.

Example
use bio::alphabets;

let dna_alphabet = alphabets::Alphabet::new(b"ACGTacgt");
let dna_ranks = alphabets::RankTransform::new(&dna_alphabet);
let text = b"aAcCgGtT";
assert_eq!(dna_ranks.transform(text), vec![4, 0, 5, 1, 6, 2, 7, 3]);
source

pub fn qgrams<C, T>(&self, q: u32, text: T) -> QGrams<'_, C, T::IntoIter> where C: Borrow<u8>, T: IntoIterator<Item = C>,

Iterate over q-grams (substrings of length q) of given text. The q-grams are encoded as usize by storing the symbol ranks in log2(|A|) bits (with |A| being the alphabet size).

If q is larger than usize::BITS / log2(|A|), this method fails with an assertion.

Complexity: O(n), where n is the length of the text.

Example
use bio::alphabets;

let dna_alphabet = alphabets::Alphabet::new(b"ACGTacgt");
let dna_ranks = alphabets::RankTransform::new(&dna_alphabet);

let q_grams: Vec<usize> = dna_ranks.qgrams(2, b"ACGT").collect();
assert_eq!(q_grams, vec![1, 10, 19]);
source

pub fn alphabet(&self) -> Alphabet

Restore alphabet from transform.

Complexity: O(n), where n is the number of symbols in the alphabet.

Example
use bio::alphabets;

let dna_alphabet = alphabets::Alphabet::new(b"acgtACGT");
let dna_ranks = alphabets::RankTransform::new(&dna_alphabet);
assert_eq!(dna_ranks.alphabet().symbols, dna_alphabet.symbols);
source

pub fn get_width(&self) -> usize

Compute the number of bits required to encode the largest rank value.

For example, the alphabet b"ACGT" with 4 symbols has the maximal rank 3, which can be encoded in 2 bits.

This value can be used to create a data_structures::bitenc::BitEnc bit encoding tailored to the given alphabet.

Complexity: O(n), where n is the number of symbols in the alphabet.

Example
use bio::alphabets;

let dna_alphabet = alphabets::Alphabet::new(b"ACGT");
let dna_ranks = alphabets::RankTransform::new(&dna_alphabet);
assert_eq!(dna_ranks.get_width(), 2);
let dna_n_alphabet = alphabets::Alphabet::new(b"ACGTN");
let dna_n_ranks = alphabets::RankTransform::new(&dna_n_alphabet);
assert_eq!(dna_n_ranks.get_width(), 3);

Trait Implementations§

source§

impl Clone for RankTransform

source§

fn clone(&self) -> RankTransform

Returns a copy of the value. Read more
1.0.0 · source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
source§

impl Debug for RankTransform

source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
source§

impl Default for RankTransform

source§

fn default() -> RankTransform

Returns the “default value” for a type. Read more
source§

impl<'de> Deserialize<'de> for RankTransform

source§

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more
source§

impl Hash for RankTransform

source§

fn hash<__H: Hasher>(&self, state: &mut __H)

Feeds this value into the given Hasher. Read more
1.3.0 · source§

fn hash_slice<H>(data: &[Self], state: &mut H)where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more
source§

impl Ord for RankTransform

source§

fn cmp(&self, other: &RankTransform) -> Ordering

This method returns an Ordering between self and other. Read more
1.21.0 · source§

fn max(self, other: Self) -> Selfwhere Self: Sized,

Compares and returns the maximum of two values. Read more
1.21.0 · source§

fn min(self, other: Self) -> Selfwhere Self: Sized,

Compares and returns the minimum of two values. Read more
1.50.0 · source§

fn clamp(self, min: Self, max: Self) -> Selfwhere Self: Sized + PartialOrd<Self>,

Restrict a value to a certain interval. Read more
source§

impl PartialEq<RankTransform> for RankTransform

source§

fn eq(&self, other: &RankTransform) -> bool

This method tests for self and other values to be equal, and is used by ==.
1.0.0 · source§

fn ne(&self, other: &Rhs) -> bool

This method tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
source§

impl PartialOrd<RankTransform> for RankTransform

source§

fn partial_cmp(&self, other: &RankTransform) -> Option<Ordering>

This method returns an ordering between self and other values if one exists. Read more
1.0.0 · source§

fn lt(&self, other: &Rhs) -> bool

This method tests less than (for self and other) and is used by the < operator. Read more
1.0.0 · source§

fn le(&self, other: &Rhs) -> bool

This method tests less than or equal to (for self and other) and is used by the <= operator. Read more
1.0.0 · source§

fn gt(&self, other: &Rhs) -> bool

This method tests greater than (for self and other) and is used by the > operator. Read more
1.0.0 · source§

fn ge(&self, other: &Rhs) -> bool

This method tests greater than or equal to (for self and other) and is used by the >= operator. Read more
source§

impl Serialize for RankTransform

source§

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

Serialize this value into the given Serde serializer. Read more
source§

impl Eq for RankTransform

source§

impl StructuralEq for RankTransform

source§

impl StructuralPartialEq for RankTransform

Auto Trait Implementations§

Blanket Implementations§

source§

impl<T> Any for Twhere T: 'static + ?Sized,

source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
source§

impl<T> Borrow<T> for Twhere T: ?Sized,

source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
source§

impl<T> BorrowMut<T> for Twhere T: ?Sized,

source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
source§

impl<Q, K> Equivalent<K> for Qwhere Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

source§

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.
source§

impl<T> From<T> for T

source§

fn from(t: T) -> T

Returns the argument unchanged.

source§

impl<T, U> Into<U> for Twhere U: From<T>,

source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

source§

impl<T> Same<T> for T

§

type Output = T

Should always be Self
§

impl<SS, SP> SupersetOf<SS> for SPwhere SS: SubsetOf<SP>,

§

fn to_subset(&self) -> Option<SS>

The inverse inclusion map: attempts to construct self from the equivalent element of its superset. Read more
§

fn is_in_subset(&self) -> bool

Checks if self is actually part of its subset T (and can be converted to it).
§

fn to_subset_unchecked(&self) -> SS

Use with care! Same as self.to_subset but without any property checks. Always succeeds.
§

fn from_subset(element: &SS) -> SP

The inclusion map: converts self to the equivalent element of its superset.
source§

impl<T> ToOwned for Twhere T: Clone,

§

type Owned = T

The resulting type after obtaining ownership.
source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
source§

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

§

type Error = Infallible

The type returned in the event of a conversion error.
source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
source§

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<V, T> VZip<V> for Twhere V: MultiLane<T>,

§

fn vzip(self) -> V

source§

impl<T> DeserializeOwned for Twhere T: for<'de> Deserialize<'de>,

source§

impl<T> Scalar for Twhere T: 'static + Clone + PartialEq<T> + Debug,