foundationdb-simulation

This crate provides the tools to write Rust workloads that can be loaded and executed in the official FoundationDB simulator, allowing for rigorous and deterministic testing of Rust layers.

How It Works

FoundationDB's simulation framework includes an ExternalWorkload that can dynamically load a shared object at runtime. This shared object must expose specific symbols that the simulator calls to drive the workload.

Originally, ExternalWorkload used a C++ interface. This is cumbersome because the C++ ABI is not stable and most languages do not interoperate with it easily. As of FoundationDB 7.4, the ExternalWorkload supports a pure C interface, which this crate targets. For backwards compatibility with FoundationDB 7.1 and 7.3, a C++ shim can be compiled to translate the C interface back to the C++ one.

For FoundationDB 7.4 and Newer (Recommended)

Uses a pure C API (FFI-safe).
No need to build inside the official FoundationDB Docker image.
Requires setting useCAPI=true in the test configuration file.
Results in faster build times and a simpler setup.

For FoundationDB 7.1 and 7.3

Requires a C++ shim to bridge the C and C++ ABIs.
Must be built within the official foundationdb/build Docker image).
The linker must be set to clang.
Involves a more complex build process due to C++ ABI compatibility requirements.

Setup

First, create a new Rust project with a library structure:

.
├── Cargo.toml
└── src/
    └── lib.rs

Next, add foundationdb-simulation to your Cargo.toml and configure your crate as cdylib as the ExternalWorkload expects a shared object.

[lib]
name = "myworkload"
crate-type = ["cdylib"]

[dependencies]
# Make sure to select the feature flag for your target FDB version.
foundationdb-simulation = { version = "...", features = ["fdb-7_4"] } # or "fdb-7_1", "fdb-7_3"

Compilation

If you are targeting FoundationDB 7.1 or 7.3 (which require the C++ shim), you must compile your workload inside the official FoundationDB Docker image. Compiling outside of this environment will almost certainly lead to segmentation faults at runtime due to C++ ABI mismatches.

Compile your workload using the standard Cargo commands. This will create a shared object in ./target/release/ (./target/debug/). The filename will be lib<name>.so, where <name> is the name you set in the [lib] section of your Cargo.toml. For example: libmyworkload.so.

Launching the Simulation

The FoundationDB simulator is launched via the fdbserver binary with a TOML configuration file.

fdbserver -r simulation -f ./test_file.toml

Your test file must specify testName=External to use the ExternalWorkload framework.

testTitle = "MyRustTest"
testName = "External"

# The name passed to your workload factory
workloadName = "MyWorkload" 

# The path to the directory containing your .so file
libraryPath = "./target/release/" 
# The name of your library from Cargo.toml (without lib/.so)
libraryName = "myworkload" 

# Required for FDB 7.4+ when using the C API
useCAPI = true

# Custom options for your workload
myCustomOption = 42

Workload Definition

FoundationDB workloads are defined by implementing the RustWorkload trait:

pub trait RustWorkload {
    async fn setup(&mut self, db: SimDatabase);
    async fn start(&mut self, db: SimDatabase);
    async fn check(&mut self, db: SimDatabase);
    fn get_metrics(&self, out: Metrics);
    fn get_check_timeout(&self) -> f64;
}

The simulator expects the shared object to expose a factory that can create workload instances. This crate provides two traits to define these factories.

For simple use cases where a single workload implementation is defined, you can implement the SingleRustWorkload trait directly on your RustWorkload.

pub trait SingleRustWorkload: RustWorkload {
    const FDB_API_VERSION: u32;
    fn new(name: String, context: WorkloadContext) -> Self;
}

Then, register your workload with the corresponding register_workload macro.

For more complex scenarios, where multiple workload implementations exist in a single shared object, you can define a separate factory that implements the RustWorkloadFactory trait. This allows selecting the implementation based on the workload name provided in the test configuration.

pub trait RustWorkloadFactory {
    const FDB_API_VERSION: u32;
    fn create(name: String, context: WorkloadContext) -> WrappedWorkload;
}

The factory must be registered with the corresponding register_factory macro.

Warning: Do not use more than one "register macro" per project.

See the atomic and noop implementations in the examples/ directory for complete working examples.

The WorkloadContext

The WorkloadContext passed at instantiation is the primary way to interact with the simulator. It provides several useful methods:

trace(severity, name, details): Add a log entry to the FDB trace files.
now(): Get the current simulated time as a f64.
rnd(): Get a deterministic 32-bit random number.
shared_random_number(): Get a deterministic 64-bit random number (the same for all clients).
client_id(): Get the ID of the current client.
client_count(): Get the total number of clients in the simulation.
get_option<T>(name): Get a custom configuration option from the test file.

Get options

In the simulation configuration file you can add custom parameters to your workload. These parameters can be read with WorkloadContext::get_option. This method will first try to get the parameter value as a raw string and then convert it in a the type of your choice. If the parameter doesn't exist, its value is invalid or set to null, the function returns None.

Example:

impl SingleRustWorkload for MyWorkload {
    fn new(name: String, context: WorkloadContext) -> Self {
        let my_custom_option: usize = context
            .get_option("myCustomOption")
            .unwrap();
        Self { context, name, my_custom_option }
    }
}

Tracing

Use WorkloadContext::trace to log messages with a given severity and a map of string "details". A severity of Severity::Error will automatically stop the fdbserver process.

impl RustWorkload for MyWorkload {
    fn setup(&mut self, db: SimDatabase) {
        self.context.trace(
            Severity::Info,
            "SuccessfullySetupWorkload",
            details![
                "Layer" => "Rust",
                "Phase" => "setup",
                "Name" => self.name,
                "CustomOption" => self.my_custom_option,
            ],
        );
    }
    ...
}

Randomness

To maintain determinism, all random numbers must be sourced from the simulator. Use WorkloadContext::rnd() or WorkloadContext::shared_random_number() for a shared seed. Do not use external entropy sources like rand::thread_rng().

Workload Lifecycle

Instantiation

Once the fdbserver process is ready, it loads your shared object and calls your workload factory to create instances. The simulator creates a random number of "clients," and your factory will be called once for each client.

Note: Contrary to the C++ ExternalWorkload which has separate create and init methods, the RustWorkload is not created until the simulator's init phase.

The WorkloadContext passed to your factory should be stored in your RustWorkload, as it is safe to use across phases and will not be provided again.

The setup, start, and check Phases

These three phases run sequentially. The simulation will not begin the start phase until all clients have completed the setup phase, and so on.

Each function is async and runs cooperatively alonside the fdbserver. As long as your function is executing, the fdbserver is paused to ensure determinism. You must yield control back to the simulator for any database operations to occur. Do not "busy wait" for a foundationdb function to return as it will deadlock.

FoundationDB Simulation runs your async code on a custom future executor that integrates with the fdbserver event loop. It effectively turns your async fn into a sequence of cooperative steps, allowing the fdbserver to drive simulation events between awaits.

All foundationdb-rs async operations are compatible with this model.
Avoid using arbitrary async primitives from other crates (e.g., tokio::sleep).

Reporting Metrics

At the end of the simulation, the get_metrics method is called for each client. Implement this method to report results.

fn get_metrics(&self, out: Metrics) {
    // val metrics are summed across all clients
    out.push(Metric::val("total_ops", self.ops as f64));
    // avg metrics are averaged across all clients
    out.push(Metric::avg("avg_latency", self.latency / self.ops.max(1) as f64));
}

Common Mistakes

Compiling C++ shim outside of the Docker. C++ ABI is extremely environment-dependent, not compiling in the exact same environment as the other half of the interface will probably result in segmentation faults or mangled strings that will trigger unrecoverable errors.
Forgetting to read all options. Any custom option in the configuration that is not read will trigger a Workload had invalid options. error.
Forgetting useCAPI=true. If the fdbserver used supports the C API, you still need to add explicitely this option, otherwise the ExternalWorkload will try to load the C++ symbol.
Using pointers or references after a phase ends. After each phase, the simulator may move or deallocate memory. Any pointers or references to FoundationDB objects become invalid. You must use the fresh &mut self and db references passed to the next phase. Storing SimDatabase, Transaction, or Future objects across phases will lead to segmentation faults or other undefined behavior.

foundationdb-simulation 0.2.3