derive_hash_fast 0.2.3

# A faster `#[derive(Hash)]` for Rust

**TL;DR:** `#[derive(Hash)]` hashes your struct fields and slice elements one by one, which is slow. This crate hashes the entire struct at once, which is much faster.
 
**Limitations:** The struct must be safe to view as a slice of bytes. This is enforced by requiring derived traits from either [`bytemuck`](https://crates.io/crates/bytemuck) or [`zerocopy`](https://crates.io/crates/zerocopy), at your option.

## Tell me more

This crate is inspired by the [excellent blog post](https://purplesyringa.moe/blog/thoughts-on-rust-hashing/) by [@purplesyringa](https://github.com/purplesyringa) (who is not affiliated with this crate). Check it out for an in-depth exploration of the issues with `#[derive(Hash)]` and the `Hash` trait in general.

We achieve better performance than `#[derive(Hash)]` by:

1. Hashing the entire struct at once (as opposed to each field individually)
1. Dispatching to a sequence of primitive writes such as `hasher.write_u64` which is determined at compile time, padded where necessary (as opposed to using the slow variable-length codepath in the hashers)
1. Replicating the optimization `std` performs for `u8` and other primitive types in slices, so that e.g. `&[MyType(u8)]` can he hashed as fast as `&[u8]`. This applies to structs with multiple fields as well.

## Alternatives

`bytemuck` and `zerocopy` crates provide their own implementations of this idea as `#[derive(ByteHash)]`. They employ fewer micro-optimizations, but their performance when used in a `HashSet` is mostly identical to this crate, and the difference can go either way based on the chosen hash function.

Therefore, it is recommended to try `#[derive(ByteHash)]` first to avoid additional dependencies, and only switch to this crate if it improves your project's benchmarks.

## Usage

For using the crate with `zerocopy` (recommended), see [the docs on `derive_hash_fast_zerocopy!`](https://docs.rs/derive_hash_fast/latest/derive_hash_fast/macro.derive_hash_fast_zerocopy.html)

For using the crate with `bytemuck` (which puts more restrictions on your type), see [the docs on `derive_hash_fast_bytemuck!`](https://docs.rs/derive_hash_fast/latest/derive_hash_fast/macro.derive_hash_fast_bytemuck.html)

## Benchmarks

Clone the repository and run `cargo bench`.

I've published the raw results from a run [here](https://shnatsel.github.io/derive_hash_benchmark_report/report/), but nothing beats benchmarks on your hardware and on your verstion of Rust compiler.

## FAQ

### Is this a hash function?

**No.** It's a more efficient way to feed data to your chosen hash function. If you care about performance, you should use a fast hash function *in conjunction* with this crate, since `std::hash::DefaultHasher` is DoS-resistant but slow.

### Is this ALWAYS faster?

**Almost.** In my benchmarks this approach is faster than `#[derive(Hash)]` across the board, but there is one exception. If you are hashing a very short slice (64 bits or less) and you're using a function with a fast fixed-size path and slow variable-sized path (pretty much only `rustc_hash::FxHasher`), this approach may be slower. This crate is still dramatically faster for structs and longer slices even with `rustc_hash::FxHasher`. Whether this helps or hinders depends on the abundance of short slices in the data you're hashing.

### Does this work in `#![no_std]`?

Yes. Or it should, anyway. Please open an issue if it doesn't.

### Why not improve the Rust compiler?

Right now the pass that expands the `#[derive(Hash)]` macro happens before the properties of the type required for this optimization are known. So this would require significant architectural changes.

Hopefully that will happen sooner or later, but for now there's this crate.