LatencyTracker

A high-performance, lock-free latency percentile tracker for Rust.

LatencyTracker is designed for extremely high write throughput in multi-threaded services. It maintains a rolling time window of latency observations and provides fast percentile queries (P50, P90, P95, P99) without storing individual samples.

Features

Lock-free recording path using atomics
Thread-local batching to minimize contention
Sliding time window (10 seconds by default)
Percentile estimation from a fixed-size histogram
Multi-threaded friendly
Service isolation (multiple independent trackers)
Constant memory usage
High write throughput

How It Works

Histogram-based aggregation

Latency values are mapped into histogram bins:

Linear resolution: 10 ms
Range: 0–5000 ms
Values above 5000 ms are clamped into the final bucket

Thread-local batching

Each thread accumulates observations locally and periodically flushes them into shared striped histograms.

Benefits:

Fewer atomic operations
Reduced cache contention
Better scalability under heavy load

Striped architecture

The tracker uses 64 independent stripes. Each thread is assigned a stripe based on its thread ID hash.

This significantly reduces contention during updates.

Sliding Window

Data is stored in per-second buckets across a rolling 10-second window.

Only buckets belonging to the active window are used when computing percentiles.

Installation

Add to your Cargo.toml:

[dependencies]
latency_tracker = "0.1"

Usage

use latency_tracker::LatencyTracker;

let metrics = LatencyTracker::new();

metrics.record_latency_ms(120);
metrics.record_latency_ms(250);
metrics.record_latency_ms(80);

metrics.flush_thread_local();

println!("P50: {} ms", metrics.latency_ms(50));
println!("P90: {} ms", metrics.latency_ms(90));
println!("P99: {} ms", metrics.latency_ms(99));

API

Create tracker

let tracker = LatencyTracker::new();

Record latency

tracker.record_latency_ms(150);

Flush thread-local data

tracker.flush_thread_local();

Call this before reading percentiles if recent observations are still buffered in the current thread.

Read percentiles

tracker.latency_ms(50); // P50
tracker.latency_ms(90); // P90
tracker.latency_ms(95); // P95
tracker.latency_ms(99); // P99

Percentiles above 99 return 0.

Performance Characteristics

Write path

O(1)
Lock-free
Thread-local batching

Read path

Percentile calculation scans the histogram:

O(number_of_bins)
Constant memory
Suitable for frequent monitoring

Default Configuration

Parameter	Value
Window	10 seconds
Stripes	64
Flush threshold	512 samples
Histogram step	10 ms
Maximum tracked latency	5000 ms

Testing

The library includes tests covering:

Percentile ordering
Uniform distributions
Heavy-tail distributions
Multi-threaded recording
Accuracy validation
Stress testing with millions of samples
Service isolation

Run tests:

cargo test

License

MIT

latency_tracker 0.1.0