compute_content_hash

Function compute_content_hash 

Source
pub fn compute_content_hash(
    source: &[u8],
    start: usize,
    end: usize,
) -> Option<String>
Expand description

Compute a normalized content hash for duplicate detection. Returns None if the content is too small (< MIN_HASH_SIZE bytes).

Normalization:

  • Strips leading/trailing whitespace
  • Collapses all internal whitespace to single spaces
  • Uses blake3 for fast, high-quality hashing
  • Returns first 16 hex chars (64 bits) - sufficient for grouping