Struct ssdeep::internal_comparison::BlockHashPositionArray

source ·

pub struct BlockHashPositionArray { /* private fields */ }

Expand description

A position array-based block hash except its length.

Each element of the position array indicates which positions in the corresponding block hash have the given alphabet (note that array index is of the alphabet).

For instance, if representation()[5] == 0x81, it means the block hash contains the alphabet index 5 in the positions 0 and 7 (block hash glob: E??????E*).

This is because bit 0 0x01 at index 5 means that the position 0 is the alphabet index 5 (E). Likewise, bit 7 (0x80) at index 5 corresponds to the fact that position 7 is the alphabet index 5 (E).

This representation makes possible to make certain dynamic programming algorithms bit-parallel. In other words, some table updates of certain top-down dynamic programming algorithms can be represented as logical expressions (with some arithmetic ones to enable e.g. horizontal propagation). This is particularly effective on ssdeep because each block hash has a maximum size of BlockHash::FULL_SIZE (64; many 64-bit machines would handle that efficiently and even 32-bit machines can benefit from).

This is so fast so that the bit-parallel approach is still faster even if we don’t use any batching.

For an example of such algorithms, see Bitap algorithm.

Important Note: Length not included

Note that this struct does not include its length. The length must be given from outside.

Struct ssdeep::internal_comparison::BlockHashPositionArray

Implementations§

impl BlockHashPositionArray

pub fn new() -> Self

pub fn representation(&self) -> [u64; 64]

pub fn clear(&mut self)

pub fn init_from(&mut self, blockhash: &[u8])

pub unsafe fn is_equiv_unchecked(&self, len: u8, other: &[u8]) -> bool

pub fn is_equiv(&self, len: u8, other: &[u8]) -> bool

pub fn is_valid(&self, len: u8) -> bool

pub unsafe fn has_common_substring_unchecked( &self, len: u8, other: &[u8] ) -> bool

pub fn has_common_substring(&self, len: u8, other: &[u8]) -> bool

pub unsafe fn edit_distance_unchecked(&self, len: u8, other: &[u8]) -> u32

pub fn edit_distance(&self, len: u8, other: &[u8]) -> u32

pub unsafe fn score_strings_raw_unchecked(&self, len: u8, other: &[u8]) -> u32

pub fn score_strings_raw(&self, len: u8, other: &[u8]) -> u32

pub unsafe fn score_strings_unchecked( &self, len: u8, other: &[u8], log_block_size: u8 ) -> u32

pub fn score_strings(&self, len: u8, other: &[u8], log_block_size: u8) -> u32

pub const fn element_has_sequences(pa_elem: u64, len: u32) -> bool

pub const fn element_has_sequences_const<const LEN: u32>(pa_elem: u64) -> bool

Trait Implementations§

impl Clone for BlockHashPositionArray

fn clone(&self) -> BlockHashPositionArray

fn clone_from(&mut self, source: &Self)

impl Debug for BlockHashPositionArray

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for BlockHashPositionArray

fn default() -> Self

impl PartialEq<BlockHashPositionArray> for BlockHashPositionArray

fn eq(&self, other: &BlockHashPositionArray) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Copy for BlockHashPositionArray

impl Eq for BlockHashPositionArray

impl StructuralEq for BlockHashPositionArray

impl StructuralPartialEq for BlockHashPositionArray

Auto Trait Implementations§

impl RefUnwindSafe for BlockHashPositionArray

impl Send for BlockHashPositionArray

impl Sync for BlockHashPositionArray

impl Unpin for BlockHashPositionArray

impl UnwindSafe for BlockHashPositionArray

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>