§ThrottleCrab Server
A high-performance, standalone rate limiting service with multiple protocol support.
§Purpose
ThrottleCrab Server solves the problem of distributed rate limiting by providing a centralized service that multiple applications can use to enforce rate limits. Instead of implementing rate limiting logic in every service, you can:
- Centralize rate limiting logic in one place
- Share rate limit state across multiple services
- Enforce consistent policies across your entire infrastructure
- Scale independently from your application services
§Use Cases
- API Gateway Rate Limiting: Protect backend services from overload
- Multi-Service Rate Limiting: Share rate limits across microservices
- User Action Throttling: Prevent abuse across multiple endpoints
- Resource Protection: Guard expensive operations with global limits
§Installation
cargo install throttlecrab-server
§Quick Start
# Show all available options
throttlecrab-server --help
# Start with http transport on port 8080
throttlecrab-server --http --http-port 8080
# Enable multiple protocols
throttlecrab-server --http --grpc --native
# Custom store configuration
throttlecrab-server --http --http-port 8080 --store adaptive --store-capacity 100000 --store-cleanup-interval 60
§Configuration
Configure via CLI arguments or environment variables (CLI takes precedence):
# Via CLI
throttlecrab-server --http --http-port 9090 --store periodic
# Via environment variables
export THROTTLECRAB_HTTP=true
export THROTTLECRAB_HTTP_PORT=9090
export THROTTLECRAB_STORE=periodic
throttlecrab-server
# List all available environment variables
throttlecrab-server --list-env-vars
§Key Configuration Options
- Transports: Enable with --http, --grpc, --native (at least one required)
- Ports: --http-port 8080, --grpc-port 50051, --native-port 8072
- Store Type: --store periodic|probabilistic|adaptive
- Store Capacity: --store-capacity 100000
- Log Level: --log-level error|warn|info|debug|trace
§How It Works
The server uses GCRA (Generic Cell Rate Algorithm) with a token bucket approach (a simplified sketch follows this list):
- Each key gets a bucket with max_burst capacity
- Tokens refill at count_per_period / period per second
- Requests consume quantity tokens (default: 1)
- Denied requests receive retry_after and reset_after times
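A minimal, self-contained sketch of this GCRA arithmetic (hypothetical and simplified; the server's actual store implementations track more state and differ in detail):
use std::time::Duration;
/// Simplified GCRA state for one key: the "theoretical arrival time"
/// (TAT), stored relative to an arbitrary start instant.
struct Gcra {
    tat: Duration,
}
impl Gcra {
    /// Decide whether `quantity` tokens may be consumed at time `now`.
    fn check(
        &mut self,
        now: Duration,
        max_burst: u32,
        count_per_period: u32,
        period: Duration,
        quantity: u32,
    ) -> bool {
        // Refilling count_per_period tokens per period means one token
        // "costs" this much time on the clock.
        let emission_interval = period / count_per_period;
        // A full bucket lets the TAT run this far ahead of `now`.
        let burst_offset = emission_interval * max_burst;
        let new_tat = self.tat.max(now) + emission_interval * quantity;
        if new_tat > now + burst_offset {
            return false; // denied: would exceed max_burst
        }
        self.tat = new_tat;
        true
    }
}
fn main() {
    // max_burst = 10, 100 tokens per 60 s: the first 10 back-to-back
    // requests pass, the 11th is denied until tokens refill.
    let mut gcra = Gcra { tat: Duration::ZERO };
    let now = Duration::ZERO;
    for i in 1..=11 {
        let allowed = gcra.check(now, 10, 100, Duration::from_secs(60), 1);
        println!("request {i}: allowed = {allowed}");
    }
}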
§Available Protocols
- HTTP/JSON: Easy integration with any programming language (173K req/s)
- gRPC: Service mesh and microservices integration (163K req/s)
- Native Binary Protocol: Maximum performance with minimal overhead (183K req/s)
All protocols share the same underlying rate limiter, ensuring consistent rate limiting across different client types. The requests-per-second figures are for comparison purposes only.
§Architecture
The server uses an actor-based architecture with Tokio for async I/O:
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Native │ │ HTTP │ │ gRPC │
│ Transport │ │ Transport │ │ Transport │
└──────┬──────┘ └──────┬──────┘ └──────┬──────┘
│ │ │
└─────────────────┴─────────────────┘
│
┌─────▼─────┐
│ Actor │
│ (Shared │
│ State) │
└─────┬─────┘
│
┌─────▼─────┐
│RateLimiter│
│ Store │
└───────────┘
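A condensed sketch of this actor pattern with Tokio channels (hypothetical types and names, assuming the tokio crate with its rt, macros, and sync features; the real server's messages carry the full set of rate-limit parameters):
use tokio::sync::{mpsc, oneshot};
// Hypothetical message type: each transport sends a request plus a
// one-shot channel for the actor's reply.
struct ThrottleRequest {
    key: String,
    quantity: i64,
    reply: oneshot::Sender<bool>, // true = allowed
}
#[tokio::main]
async fn main() {
    let (tx, mut rx) = mpsc::channel::<ThrottleRequest>(1024);
    // The actor task exclusively owns the rate limiter state, so no
    // locks are needed; transports only hold a channel sender.
    tokio::spawn(async move {
        // Stand-in for the real store: remaining tokens per key.
        let mut store = std::collections::HashMap::<String, i64>::new();
        while let Some(req) = rx.recv().await {
            let remaining = store.entry(req.key).or_insert(10);
            let allowed = *remaining >= req.quantity;
            if allowed {
                *remaining -= req.quantity;
            }
            let _ = req.reply.send(allowed);
        }
    });
    // A "transport" sends a request and awaits the actor's reply.
    let (reply_tx, reply_rx) = oneshot::channel();
    tx.send(ThrottleRequest {
        key: "user:123".into(),
        quantity: 1,
        reply: reply_tx,
    })
    .await
    .unwrap();
    println!("allowed: {}", reply_rx.await.unwrap());
}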
§Performance
Benchmark results on modern hardware:
| Protocol | Throughput | P99 Latency | P50 Latency |
|---|---|---|---|
| Native | 183K req/s | 263 μs | 170 μs |
| HTTP | 173K req/s | 309 μs | 177 μs |
| gRPC | 163K req/s | 370 μs | 186 μs |
§Usage
§Starting the Server
# HTTP protocol only
throttlecrab-server --http --http-port 8080
# All protocols
throttlecrab-server --native --http --grpc
§Client Examples
§HTTP Protocol (curl)
curl -X POST http://localhost:8080/throttle \
-H "Content-Type: application/json" \
-d '{"key": "user:123", "max_burst": 10, "count_per_period": 100, "period": 60}'§gRPC Protocol
§gRPC Protocol
Use any gRPC client library with the provided protobuf definitions.
§Native Protocol (Rust)
use std::io::{Read, Write};
use std::net::TcpStream;
use std::time::{SystemTime, UNIX_EPOCH};
fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Connect to server
    let mut stream = TcpStream::connect("127.0.0.1:8072")?;
    // Prepare request
    let key = "user:123";
    let timestamp = SystemTime::now()
        .duration_since(UNIX_EPOCH)?
        .as_nanos() as i64;
    // Write request (42 bytes + key length)
    let mut request = Vec::new();
    request.push(1u8); // cmd: 1 for rate_limit
    request.push(key.len() as u8); // key_len
    request.extend_from_slice(&10i64.to_le_bytes()); // max_burst
    request.extend_from_slice(&100i64.to_le_bytes()); // count_per_period
    request.extend_from_slice(&60i64.to_le_bytes()); // period (seconds)
    request.extend_from_slice(&1i64.to_le_bytes()); // quantity
    request.extend_from_slice(&timestamp.to_le_bytes()); // timestamp
    request.extend_from_slice(key.as_bytes()); // key
    stream.write_all(&request)?;
    stream.flush()?;
    // Read response (34 bytes)
    let mut response = [0u8; 34];
    stream.read_exact(&mut response)?;
    let ok = response[0] == 1;
    let allowed = response[1] == 1;
    let limit = i64::from_le_bytes(response[2..10].try_into()?);
    let remaining = i64::from_le_bytes(response[10..18].try_into()?);
    let retry_after = i64::from_le_bytes(response[18..26].try_into()?);
    let reset_after = i64::from_le_bytes(response[26..34].try_into()?);
    println!("Ok: {}, Allowed: {}, Remaining: {}/{}", ok, allowed, remaining, limit);
    println!("Retry after: {}, Reset after: {}", retry_after, reset_after);
    Ok(())
}