Vulture

Rust implementation of RAPTOR (Delling, Pajor, Werneck): given a public transit network, find all Pareto-optimal journeys between two stops, trading fewer transfers against earlier arrival.

Browser demo: vulture compiled to WASM (~275 KB gzipped) running in the browser: stop-to-stop routing, depart-in-window Pareto profile, walking-footpath augmentation. See vulture-wasm/ for the bindings.

JS / browser bindings on npm: vulture-wasm: npm install vulture-wasm.

Quick start

The gtfs module wraps a parsed GTFS feed and implements the Timetable trait. The repo bundles aux/dmrc_gtfs.zip (Delhi Metro) so you can run the example without downloading anything:

use gtfs_structures::Gtfs;
use jiff::civil::date;
use vulture::{SecondOfDay, Timetable};
use vulture::gtfs::GtfsTimetable;

let gtfs = Gtfs::new("aux/dmrc_gtfs.zip")?;
let tt = GtfsTimetable::new(&gtfs, date(2024, 1, 15))?;

let start = tt.stop_idx("1").expect("Dilshad Garden");
let target = tt.stop_idx("44").expect("Vishwavidyalaya");

let journeys = tt
    .query()
    .from(start)
    .to(target)
    .max_transfers(10)
    .depart_at(SecondOfDay::hms(9, 0, 0))
    .run();

for j in &journeys {
    println!("{} trip(s), arrives {}", j.plan.len(), j.arrival());
}

Runnable as cargo run --example gtfs-timetable -- aux/dmrc_gtfs.zip 2024-01-15 1 44.

A returned Journey has plan: Vec<(RouteIdx, StopIdx)> ("take this route, get off at this stop"), an origin and target (relevant when the query supplies multiple of either), and a label carrying the criterion under optimisation. journey.arrival() reads it as a SecondOfDay. See Journey for the full contract; for trip IDs and per-leg depart/arrive times see Journey::with_timing.

Gtfs::new(path) is the local-file form; the same gtfs-structures API also exposes Gtfs::from_url (reqwest-backed sync fetch), Gtfs::from_url_async, and Gtfs::from_reader (any Read + Seek), all returning the same Gtfs value GtfsTimetable::new consumes:

let gtfs = Gtfs::from_url("https://example.org/feed/gtfs.zip")?;
let tt = GtfsTimetable::new(&gtfs, date(2026, 5, 4))?;

from_url requires the read-url feature on gtfs-structures. Since v0.19, vulture depends on gtfs-structures with default-features = false (so wasm and minimal native builds don't drag in reqwest / tokio). To use the URL-loading paths, add gtfs-structures directly to your Cargo.toml: gtfs-structures = { version = "0.47", features = ["read-url"] }. Gtfs::new(path) and Gtfs::from_reader (any Read + Seek) need no extra features.

Worked examples

Berlin VBB: station-level query

A station like Berlin Hbf has hundreds of platforms; you don't usually care which one. GtfsTimetable::station_stops(parent_id) expands a parent station to its child platforms and the multi-source/multi-target search picks the best combination:

let tt = GtfsTimetable::new(&gtfs, date(2026, 5, 4))?;

let journeys = tt
    .query()
    .from(tt.station_stops("de:11000:900003201"))   // Hbf – ~301 platforms
    .to(tt.station_stops("de:11000:900100003"))     // Alex – ~50 platforms
    .max_transfers(10)
    .depart_at(SecondOfDay::hms(9, 0, 0))
    .run();

Returns a direct S-Bahn boarding at 09:01, arriving Alex at 09:07 – across a feed of ~42k stops, ~71k active trips, in ~385 µs.

Helsinki HSL: augmenting sparse footpaths

HSL ships an empty transfers.txt, so out of the box no walking transfers exist between physical stops. Build coordinate-derived walking edges before querying:

let tt = GtfsTimetable::new(&gtfs, date(2026, 5, 4))?
    .with_walking_footpaths(&gtfs, 500.0, 1.4);   // 500 m at 5 km/h

let journeys = tt
    .query()
    .from(tt.stop_idx("1040601").unwrap())   // Kamppi metro
    .to(tt.stop_idx("1453601").unwrap())     // Itäkeskus metro
    .max_transfers(10)
    .depart_at(SecondOfDay::hms(9, 0, 0))
    .run();

with_walking_footpaths uses an R-tree over an equirectangular projection (~0.5% accurate at city scale) and preserves any pre-existing transfers.txt entries.

Paris IDFM: expensive case

Châtelet → Versailles Rive Droite crosses the IDFM region: the expanded RER feed has ~54k stops, ~146k active trips per weekday, and the optimal journey involves the RER C from Saint-Michel-Notre-Dame:

let tt = GtfsTimetable::new(&gtfs, date(2026, 5, 4))?
    .assert_footpaths_closed();   // IDFM's transfers.txt is publisher-curated

let journeys = tt
    .query()
    .from(tt.stop_idx("IDFM:monomodalStopPlace:45102").unwrap())   // Châtelet
    .to(tt.stop_idx("IDFM:monomodalStopPlace:44602").unwrap())     // Versailles RD
    .max_transfers(10)
    .depart_at(SecondOfDay::hms(9, 0, 0))
    .run();

Returns in ~17 ms. Smaller cross-Paris queries (Châtelet → Gare du Nord, Châtelet → La Défense) come back in under 1 ms. See docs/cross-city-benchmarks.md for the full numbers and methodology.

Range queries: "leave between X and Y"

let profile = tt
    .query()
    .from(start)
    .to(target)
    .max_transfers(10)
    .depart_in_window(SecondOfDay::every(
        SecondOfDay::hms(17, 0, 0),
        SecondOfDay::hms(18, 0, 0),
        300,   // every 5 minutes
    ))
    .run();

Returns Vec<RangeJourney> – each entry is a (depart, journey) pair, Pareto-filtered on (later depart, fewer transfers, earlier arrival). Serial uses rRAPTOR (single reverse-chronological scan reusing labels across departures); .run_par() spreads the per-departure work across a Rayon thread pool.

Server workload: many queries, multi-threaded

Allocate a RaptorCachePool once and have each thread check out a cache:

use rayon::prelude::*;
use vulture::RaptorCachePool;

let pool = RaptorCachePool::for_timetable(&tt);

let results: Vec<_> = queries.par_iter().map(|q| {
    let mut cache = pool.checkout();
    tt.query()
        .from(q.from)
        .to(q.to)
        .max_transfers(10)
        .depart_at(q.depart)
        .run_with_cache(&mut cache)
}).collect();

pool.checkout() returns a PooledCache handle that owns a cache for its scope; the cache goes back to the pool when the handle is dropped. The same pool serves any number of threads with no per-thread bookkeeping.

Runnable examples

The vulture/examples/ directory has four self-contained, synthetic-network examples covering the patterns above:

cargo run --release --example custom_label – defines a custom Label (route-preference scoring) with its own Ctx, builds the lookup table, runs a query, and inspects the Pareto front.
cargo run --release --example fare_aware – fare-aware Pareto routing using the bundled ArrivalAndFare and a FareTable context.
cargo run --release --example range_query – depart_in_window(...) over a window of departure times; runs both serial rRAPTOR and the parallel naïve batch, asserts they agree.
cargo run --release --example cache_reuse – RaptorCache reuse across many queries; asserts cached results match one-shot results.

Features

Timetable trait – describes a transit network. Use the bundled GtfsTimetable for any GTFS feed, or implement directly for non-GTFS data.
Typestate query builder – tt.query().from(...).to(...).max_transfers(...).depart_at(...).run(). Compile-time enforced order; .depart_at(...) and .depart_in_window(...) switch the return type.
Multi-source / multi-target – .from(...) and .to(...) accept a StopIdx, a slice, a Vec, or (stop, walk_offset) pairs. Pair with station_stops(parent_id) for "any platform of this station" queries.
Range queries – .depart_in_window(times) returns a Pareto profile; serial path uses rRAPTOR, .run_par() and .run_with_pool(&pool) use a parallel naïve batch.
Label trait – single-criterion ArrivalTime is the default. vulture::labels ships ArrivalAndWalk (arrival vs. walking) and ArrivalAndFare (arrival vs. accumulated fare from a route → fare table) for trade-off queries. Implement Label directly for custom criteria, threading lookup tables through Query::with_context.
Walking footpaths from coordinates – with_walking_footpaths(&gtfs, max_dist_m, speed_m_per_s) builds bidirectional R-tree walking edges, preserving any existing transfers.txt.
Closed-footpath fast path – if your transfers.txt is the entire intended relation, assert_footpaths_closed() opts into a single-pass O(E) relaxation instead of Dijkstra.
RaptorCache – reusable scratch allocations across queries against one timetable. RaptorCachePool is the Sync variant for thread pools.
Per-leg timing – journey.with_timing(&tt, depart, origin_walk) reconstructs trip IDs and per-leg depart/arrive times. Journey.plan alone is just topology.
Wheelchair-accessibility filter – chain require_wheelchair_accessible() on the query builder to skip trips where trips.wheelchair_accessible = 2 and stops where stops.wheelchair_boarding = 2. Default behaviour is unchanged.
Multi-day search – GtfsTimetable::new_with_overnight_days(&gtfs, date, OvernightDays(n)) loads n additional service days after the base date, shifting day-d trips into a single time axis so a 23:00 query can catch tomorrow morning's first train.
Per-leg shape access – GtfsTimetable::shape_for_leg(&gtfs, trip_id, board, alight) returns the polyline points (lat, lon) for one journey leg, sliced exactly when shape_dist_traveled is present and projected onto the polyline as a fallback. Used by the live demo to draw journeys on a MapLibre map.
Feed introspection – GtfsTimetable::features() returns a FeedFeatures snapshot (stop / trip / route counts, transfers.txt breakdown by transfer_type, parent stations, shaped trips, wheelchair flags, current footpath state) so callers can describe a feed without reaching back to the source Gtfs. FeedFeatures::suggestions() returns a heuristic, advisory list of vulture features that might be useful given the snapshot (e.g. "transfers.txt is empty: call with_walking_footpaths"). The bundled demo shows both under each loaded feed.

When To Use What

Goal	API
Single query, default routing	`tt.query()...run()`
Many queries, one server	allocate a `RaptorCache`, finish each chain with `.run_with_cache(&mut cache)`
Same as above, multi-threaded	`RaptorCachePool::for_timetable(&tt)`, `pool.checkout()` per worker
"Leave between X and Y"	`.depart_in_window(times).run()` (serial rRAPTOR), or `.run_par()` (multi-core wins on wide windows)
"Slower route with less walking"	`tt.query_with_label::<ArrivalAndWalk>()...run()`
"Cheapest vs. fastest journey"	`tt.query_with_label::<ArrivalAndFare>().with_context(fares)...run()`
Wheelchair-only routing	`tt.query()....require_wheelchair_accessible()....run()`
"Last train home" / overnight queries	`GtfsTimetable::new_with_overnight_days(&gtfs, date, OvernightDays(1))?`
Any platform of a station	`tt.station_stops(parent_id)` into `.from(...)` / `.to(...)`
Sparse `transfers.txt`	`.with_walking_footpaths(&gtfs, max_dist_m, speed_m_per_s)` at construction
Trip IDs and per-leg times in output	`journey.with_timing(&tt, depart, origin_walk)`
Polyline geometry per leg (for map drawing)	`tt.shape_for_leg(&gtfs, trip_id, board, alight)`

Perf

Single-query latency, warm RaptorCache, M-series Apple Silicon, single thread. Full numbers (load times, multi-trip queries, methodology) in docs/cross-city-benchmarks.md:

Feed	Stops	Trips/day	Representative query	Latency
Delhi Metro	262	5,400	2-trip cross-network	22 µs
Helsinki HSL	8,400	24,000	Kamppi → Itäkeskus (with walking)	5.9 ms
Berlin VBB	42,000	71,000	Hbf → Alex (station-to-station)	385 µs
Paris IDFM	54,000	146,000	Châtelet → Versailles RD	17 ms

For range-query latencies (serial rRAPTOR vs parallel naïve batch), see the bench source and the linked benchmark page. For a head-to-head against an independent RAPTOR implementation (the TypeScript raptor-journey-planner) on the same four feeds – including the correctness diff and a diagnosed timezone bug in the upstream library – see docs/cross-impl-comparison.md.

Soundness

For an issue-by-issue walk through the algorithmic correctness of the implementation against the paper – including the historical record of bugs found and fixed – see docs/soundness.md. The vulture-proptest harness runs the algorithm against a brute-force reference solver on every test invocation as live validation.

How Vulture is tested

Three overlapping layers, all run by cargo nextest r on every PR:

Unit tests in vulture/src/test.rs cover the algorithm (single-source, multi-source, range queries, footpath relaxation, wheelchair filtering, multi-day overnight) and the GTFS adapter against hand-built SimpleTimetable fixtures and the bundled Delhi Metro feed.
Doctests on every public type with a non-trivial contract (Label, Query, Journey, RaptorCache, GtfsTimetable::station_stops, etc.)
Property-based tests in vulture-proptest generate random transit networks and check the algorithm against a brute-force reference solver. Powered by Hegel.

The proptest harness ships three generator layers – Layer 1 (1–2 routes, 2–4 stops, no footpaths), Layer 2 (adds 1–4 footpaths), Layer 3 (1–4 routes, up to 6 stops, optional footpaths, multi-source/multi-target queries with walk offsets, per-route fares for fare-label tests) – and six properties:

Property	What it checks
`layer{1,2,3}_matches_reference`	Algorithm output equals the brute-force reference solver's Pareto front of `(arrival, trip_count)` for randomly generated networks at each layer.
`parallel_naive_matches_serial_rrap`	The serial rRAPTOR range path and the parallel naïve batch return the same Pareto profile.
`range_query_matches_reference`	Range-query output equals an independent brute-force reference: per-`τ` solve plus the same 3-D Pareto filter (later depart, fewer transfers, earlier arrival) vulture's `filter_range_pareto_front` uses.
`fare_label_matches_per_leg_sum`	The `ArrivalAndFare` label's accumulated fare equals the per-leg sum of fares on every returned journey.

Hegel persists shrunk failing seeds to .hegel/ so failures are deterministically reproducible. See vulture-proptest/README.md for the layer/issue map and instructions for adding a new layer or property.

Cargo features

parallel (default-on) – pulls in rayon, enables Query::run_par / Query::run_with_pool. Opt out with default-features = false for wasm or minimal builds; RaptorCachePool itself stays available.
gtfs-bench – enables the gtfs criterion benchmark.
internal – enables the manual criterion benchmark over manual::SimpleTimetable.

License

Apache-2.0

vulture 0.22.1