Expand description
Data processing pipeline for chunking, deduplication, and file reconstruction, used in the Hugging Face Xet storage tools.
Provides content-defined chunking via gear hashing, deduplication against metadata shards, and file reconstruction from deduplicated chunk references.