keyspace
Keyspace partitioning and re-balancing for distributed systems.
Motivation
Implement a keyspace partitioning and re-balancing algorithm that is:
- Memory/space efficient: no virtual nodes, scalable to thousands of nodes.
- Fair: data is uniformly distributed across partitions.
- Compact: to compute the target node of a key, we only need to know the number of nodes
n. - Adaptive: supports node addition and removal, with close to theoretically minimal data movement.
- Robust: supports replication out of the box.
- Heterogeneous: supports weighted nodes with different storage capacities.
The idea is to allow system to grow to thousands of nodes, and to process millions of keys per second efficiently. Additionally, provide a simple API exposing the keyspace data movement details, so that the system can be re-balanced in a distributed fashion.