# tower-http-cache
Tower middleware for HTTP response caching with pluggable storage backends (in-memory, Redis, and more). tower-http-cache brings a production-grade caching layer to Tower/Axum/Hyper stacks, with stampede protection, stale-while-revalidate, header allowlisting, compression, and policy controls out of the box.
## Features at a Glance

- ✅ Drop-in `CacheLayer`: wrap any Tower service; caches GET/HEAD by default.
- 🔒 Stampede protection: deduplicates concurrent misses and serves stale data while recomputing.
- ⏱ Flexible TTLs: positive/negative TTL, refresh-before-expiry window, stale-while-revalidate.
- 🔄 Auto-refresh: proactively refreshes frequently-accessed cache entries before expiration.
- 🏷️ Cache Tags: group and invalidate related cache entries together.
- 🎯 Multi-Tier: hybrid L1/L2 caching for optimal performance and capacity.
- 📊 Admin API: REST endpoints for cache introspection and management.
- 🤖 ML-Ready Logging: structured logs with request correlation for ML training.
- 📦 Pluggable storage: in-memory backend (Moka) and optional Redis backend (async pooled).
- 📏 Policy guards: min/max body size, cache-control respect/override, custom method/status filters.
- 🧰 Custom keys: built-in extractors (path, path+query) plus custom closures.
- 📉 Observability hooks: optional metrics counters and tracing spans.
## Installation

```toml
[dependencies]
tower-http-cache = "0.3"

# Enable Redis support if required
tower-http-cache = { version = "0.3", features = ["redis-backend"] }

# With admin API support
tower-http-cache = { version = "0.3", features = ["admin-api"] }
```
## Quick Start

```rust
use std::time::Duration;
use tower::ServiceBuilder;
use tower_http_cache::*;

// Durations and size bounds below are illustrative; tune them per workload.
let cache_layer = CacheLayer::builder()
    .ttl(Duration::from_secs(60))                    // lifetime for successful responses
    .negative_ttl(Duration::from_secs(5))            // lifetime for cached error responses
    .stale_while_revalidate(Duration::from_secs(30)) // serve stale while refreshing
    .refresh_before(Duration::from_secs(10))         // proactive refresh window before expiry
    .min_body_size(64)                               // skip bodies smaller than 64 bytes
    .max_body_size(1024 * 1024)                      // cap cached bodies at 1 MiB
    .respect_cache_control(true)
    .build();

// `inner_service` is your existing Tower service.
let svc = ServiceBuilder::new()
    .layer(cache_layer)
    .service(inner_service);
```
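The same layer also attaches directly to an Axum `Router`; the handler below is a placeholder, not part of the crate:

```rust
use axum::{routing::get, Router};

// Placeholder handler; any Axum handler works here.
async fn list_users() -> &'static str {
    "[]"
}

// Responses from this route are cached according to the layer's policy.
let app: Router = Router::new()
    .route("/users", get(list_users))
    .layer(cache_layer);
```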
## Using the Redis backend

The sketch below assumes the `redis-backend` feature is enabled; the constructor name is illustrative (see the generated docs for the exact API):

```rust
use std::time::Duration;
use tower_http_cache::*;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Pooled async Redis connection.
    let backend = RedisBackend::connect("redis://127.0.0.1:6379/").await?;

    let cache_layer = CacheLayer::builder()
        .ttl(Duration::from_secs(60))
        .backend(backend)
        .build();

    // Wrap your service with `cache_layer` exactly as in Quick Start.
    Ok(())
}
```
## Enabling Auto-Refresh

Auto-refresh proactively refreshes frequently-accessed cache entries before they expire, reducing cache misses and latency for hot endpoints:

```rust
use std::time::Duration;
use tower_http_cache::*;
use tower_http_cache::AutoRefreshConfig;

// The config value is illustrative.
let cache_layer = CacheLayer::builder()
    .ttl(Duration::from_secs(60))
    .refresh_before(Duration::from_secs(10))
    .auto_refresh(AutoRefreshConfig::default())
    .build();

// Initialize auto-refresh with the service instance.
cache_layer.init_auto_refresh(service.clone()).await?;
```
## Using Cache Tags

Group related cache entries and invalidate them together:

```rust
use tower_http_cache::*;
use tower_http_cache::TagPolicy;

// The tag policy value is illustrative.
let cache_layer = CacheLayer::builder()
    .policy(TagPolicy::default())
    .build();

// Later: invalidate all entries with a tag (`backend` is your storage backend).
backend.invalidate_by_tag("user:123").await?;
backend.invalidate_by_tags(&["user:123", "user:456"]).await?;
```
## Multi-Tier Caching

Combine a fast in-memory cache with larger distributed storage:

```rust
use std::time::Duration;
use tower_http_cache::MultiTierBackend;

// `l1_backend`/`l2_backend` are backends you constructed earlier (e.g. the
// in-memory and Redis backends above); the strategy value is illustrative.
let backend = MultiTierBackend::builder()
    .l1(l1_backend)             // Hot data (fast)
    .l2(l2_backend)             // Cold storage (large)
    .promotion_threshold(3)     // Promote after 3 L2 hits
    .promotion_strategy(PromotionStrategy::default())
    .write_through(true)
    .build();

let cache_layer = CacheLayer::builder()
    .ttl(Duration::from_secs(60))
    .backend(backend)
    .build();
```
## Admin API

Enable cache introspection and management endpoints (the module path and token below are illustrative):

```rust
use tower_http_cache::admin::{admin_router, AdminConfig};

let admin_config = AdminConfig::builder()
    .require_auth(true)
    .auth_token("secret-token")
    .build();

// Mount admin routes (Axum example)
let admin_routes = admin_router(cache_layer.clone(), admin_config);
let app = axum::Router::new()
    .nest("/admin/cache", admin_routes)
    .layer(cache_layer);

// Available endpoints:
//   GET  /admin/cache/health
//   GET  /admin/cache/stats
//   GET  /admin/cache/hot-keys
//   GET  /admin/cache/tags
//   POST /admin/cache/invalidate
```
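As a rough sketch of driving the invalidation endpoint from a client, assuming a JSON body with a `tags` field (the schema and the `reqwest`/`serde_json` dependencies are assumptions, not part of the crate):

```rust
// Hypothetical client call; the request schema is an assumption.
let client = reqwest::Client::new();
let resp = client
    .post("http://localhost:3000/admin/cache/invalidate")
    .bearer_auth("secret-token") // pairs with the auth_token configured above
    .json(&serde_json::json!({ "tags": ["user:123"] }))
    .send()
    .await?;
assert!(resp.status().is_success());
```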
## ML-Ready Structured Logging

Enable structured logging for ML model training:

```rust
use tower_http_cache::MLLoggingConfig;

let cache_layer = CacheLayer::builder()
    .policy(MLLoggingConfig::default())
    .build();
```

Logs are emitted in JSON format:

```json
{
  "timestamp": "2025-11-10T12:00:00Z",
  "request_id": "550e8400-...",
  "operation": "cache_hit",
  "latency_us": 150,
  "tags": ["user:123"],
  "tier": "l1"
}
```
## Configuration Highlights

| Policy | Description |
|---|---|
| `ttl` / `negative_ttl` | Cache lifetime for successful and error responses |
| `stale_while_revalidate` | Serve stale data while a refresh is in progress |
| `refresh_before` | Proactively refresh the cache shortly before expiry |
| `auto_refresh` | Automatically refresh frequently-accessed entries before expiration |
| `tag_policy` | Configure cache tags and invalidation groups |
| `multi_tier` | Enable multi-tier caching with L1/L2 backends |
| `ml_logging` | Enable ML-ready structured logging |
| `allow_streaming_bodies` | Opt into caching streaming responses |
| `min_body_size` / `max_body_size` | Enforce size bounds for cached bodies |
| `header_allowlist` | Restrict which headers are stored alongside cached bodies |
| `method_predicate` / `statuses` | Customize cacheable methods and status codes |
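As a sketch of how several of these policies combine on the builder (the predicate and allowlist signatures here are assumptions, not confirmed API):

```rust
use std::time::Duration;
use http::{Method, StatusCode};
use tower_http_cache::*;

// Signatures and values are illustrative; consult the generated docs for exact types.
let cache_layer = CacheLayer::builder()
    .ttl(Duration::from_secs(30))
    .method_predicate(|m: &Method| *m == Method::GET || *m == Method::HEAD)
    .statuses(&[StatusCode::OK, StatusCode::NOT_FOUND])
    .header_allowlist(&["content-type", "etag"])
    .min_body_size(64)
    .max_body_size(512 * 1024)
    .build();
```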
For the full API surface, see the generated docs: `cargo doc --open`.
## Benchmarks

Benchmarks are powered by Criterion and can be reproduced with:

```bash
cargo bench
```

Latest results (macOS / M3 Pro / Rust 1.85, `redis-backend` disabled unless noted):
| Group | Benchmark | Median | Notes |
|---|---|---|---|
| `layer_throughput` | `baseline_inner` | 1.41 ms | Underlying service without caching |
| | `cache_hit` | 0.67 µs | Cached GET; body already materialized |
| | `cache_miss` | 0.68 µs | Miss with immediate store |
| `key_extractor` | `path` | 23.8 ns | GET/HEAD path only |
| | `path_and_query` | 97.4 ns | Path + query concatenation |
| | `custom_hit` | 84.7 ns | User extractor returning `Some` |
| | `custom_miss` | 1.35 ns | User extractor returning `None` |
| `backend/in_memory` | `get_small_hit` | 309 ns | 1 KiB entry |
| | `get_large_hit` | 327 ns | 128 KiB entry |
| | `set_small` | 676 ns | 1 KiB write |
| | `set_large` | 660 ns | 128 KiB write |
| `stampede` | `cache_layer` | 5.92 ms | 64 concurrent requests with caching |
| | `no_cache` | 5.76 ms | Same workload without layer |
| `stale_while_revalidate` | `stale_hit_latency` | 33.6 ms | Serve-stale branch |
| | `strict_refresh_latency` | 33.7 ms | Force refresh branch |
| `codec/bincode` | `encode_small` | 362 ns | 1 KiB payload |
| | `decode_small` | 381 ns | 1 KiB payload |
| | `encode_large` | 146 µs | 128 KiB payload |
| | `decode_large` | 174 µs | 128 KiB payload |
| `negative_cache` | `initial_miss` | 14.0 µs | First miss populates negative entry |
| | `stored_negative_hit` | 21.9 ms | TTL-expired negative pathways |
| | `after_ttl_churn` | 5.66 µs | Subsequent positive hit |

Full raw output, including outlier analysis, is captured in `initial_benchmark.md`.
## Testing & Tooling

The commands below assume a standard Cargo layout; example names are illustrative.

```bash
# Library unit tests + integration tests
cargo test

# Redis integration tests (requires a running Redis instance)
REDIS_URL=redis://127.0.0.1:6379/ cargo test --features redis-backend

# Redis smoke test (launches example service, verifies cache hit/miss behaviour)
cargo run --example redis_smoke --features redis-backend

# Examples
cargo run --example <example-name>
```
## Feature Flags

| Feature | Description | Default |
|---|---|---|
| `in-memory` | Enables the Moka-powered in-memory backend | ✓ |
| `redis-backend` | Enables the Redis backend, codec, and async utilities | ✗ |
| `admin-api` | Enables admin REST API endpoints (requires `axum`) | ✗ |
| `serde` | Derives serde traits for cached entries/codecs | ✓ |
| `compression` | Adds optional gzip compression for cached payloads | ✗ |
| `metrics` | Emits metrics counters (hit/miss/store/etc.) | ✗ |
| `tracing` | Adds tracing spans around cache operations | ✗ |
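With the `tracing` feature enabled, cache operations emit spans through whatever subscriber you install; a minimal setup using the common `tracing-subscriber` crate (a separate dependency, not part of this library) might look like:

```rust
// Install a subscriber so the cache layer's spans are actually recorded.
tracing_subscriber::fmt()
    .with_max_level(tracing::Level::DEBUG)
    .init();
```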
## Minimum Supported Rust Version

MSRV: 1.75.0 (matching the crate's `rust-version` field).
The MSRV will only increase with a minor version bump and will be documented in release notes.
## Status

tower-http-cache is under active development. Expect API adjustments while we stabilize the 0.x series. Contributions and feedback are welcome; feel free to open an issue or PR!
## License
This project is dual-licensed under either:
- Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT License (LICENSE-MIT or http://opensource.org/licenses/MIT)
You may choose either license to suit your needs. Unless explicitly stated otherwise, any contribution intentionally submitted for inclusion in the crate shall be dual-licensed as above, without additional terms or conditions.
## Contributing

- Fork and clone the repository.
- Install prerequisites (`cargo`, `rustup`, and Docker if you plan to run Redis tests).
- Run the checks (e.g. `cargo fmt`, `cargo clippy`, and `cargo test`).
- Open a pull request with a succinct summary, test evidence, and (when applicable) benchmark output via `cargo bench`.
Bug reports and feature requests are welcome in the issue tracker. For larger design changes, please start a discussion thread to align on API shape before submitting code.