trine-kv 0.2.0 - Docs.rs

# Trine KV Usage Guide

This guide shows the shortest path from an empty Rust program to a working
Trine KV database. The examples use the public v1 API and avoid engine internals.

Run the checked quickstart first:

```text
cargo run --example quickstart
```

Run the sync-adapter quickstart when you need the explicit synchronous API:

```text
cargo run --example sync_quickstart
```

Then look at the integration examples when you want to embed the database
behind an application boundary:

```text
cargo run --example user_store
cargo run --example event_index
```

`user_store` wraps Trine KV behind a small repository-style API. `event_index`
uses two buckets and one write batch to keep event payloads and an account
index in sync.

## Add The Crate

Published releases use Semantic Versioning. For the `0.1` release line:

```toml
[dependencies]
trine-kv = "0.1"
```

For local development from this repository:

```toml
[dependencies]
trine-kv = { path = "../trine-kv" }
```

If you consume the crate through git, replace the path dependency with your
repository URL.

Enable native platform I/O explicitly when you want Trine's feature-gated
native file driver:

```toml
[dependencies]
trine-kv = { version = "0.1", features = ["platform-io"] }
```

## Open A Database

The primary database API is async-first. Passing a path opens a persistent
database:

```rust
use trine_kv::Db;

let db = Db::open("./trine-data").await?;
```

Use an in-memory database only when the data is intentionally short-lived:

```rust
use trine_kv::{Db, DbOptions};

let db = Db::open(DbOptions::memory()).await?;
```

Persistent mode creates the directory when `create_if_missing` is true and the
database is not opened read-only. On native targets, this default prioritizes
database-style durability for confirmed writes.

On native targets, async persistent open, point reads, scans, and lazy value
reads enter Trine's storage boundary through async helpers. Native async writes
and maintenance use runtime task boundaries where the current engine internals
are still synchronous. Run `cargo run --example quickstart` for a
complete checked path.

Synchronous callers can use explicit `*_sync` adapters:

```rust
use trine_kv::Db;

let db = Db::open_sync("./trine-data")?;
db.put_sync(b"user:001", b"Ada")?;
db.flush_sync()?;
```

Use an explicit lower durability mode only when the data can tolerate losing
recent confirmed writes after a crash or power loss:

```rust
use trine_kv::{Db, DbOptions, DurabilityMode};

let db = Db::open(
    DbOptions::new("./trine-data").with_durability(DurabilityMode::Buffered),
)
.await?;
```

With the `platform-io` feature enabled on a target that has true Trine-level
platform async storage operations, select the platform I/O runtime for
native-file data reads/writes, WAL append-object opening/append/persist/rewrite,
manifest publish/read, object delete, directory create/sync, and writer lease
acquisition:

```rust
use trine_kv::{Db, DbOptions, RuntimeOptions};

let mut options = DbOptions::new("./trine-data");
options.runtime = RuntimeOptions::platform_io();
let db = Db::open(options).await?;
```

Directory and object listing are also submitted through the platform driver.
The selected platform backend does not expose a directory enumeration primitive,
so those listing calls are reported as
`storage_platform_sync_fallback_tasks`. `DbStats` also separates true
platform async file work (`storage_platform_async_io_tasks`) from platform
backend fallback work (`storage_platform_backend_fallback_tasks`) and from
Trine's bounded sync adapter task count. It also reports sync-adapter
queue capacity, queued/submitted/completed/rejected task counts, total adapter
runtime, and per-storage-operation request/latency counters. On targets where
every current Trine storage operation is fallback-classified,
`RuntimeOptions::platform_io()` uses the bounded sync adapter and does not
advertise `PlatformAsyncIo`.

WASI and browser persistence have explicit option constructors so callers can
select those host boundaries without accidentally falling back to native files.
On WASI targets, `wasi_persistent(path)` uses the host-preopened filesystem at
that path and defaults to inline runtime execution with no background worker
threads:

```rust
let wasi = Db::open(DbOptions::wasi_persistent("./trine-data")).await?;
let wasi_sync = Db::open_sync(DbOptions::wasi_persistent("./trine-data"))?;
```

Strict sync durability is not claimed for WASI yet; `SyncData` and `SyncAll`
return `UnsupportedDurability`. On non-WASI targets, the same option returns
`UnsupportedBackend`. Current WASI file work completes inline through the host
filesystem boundary, so it does not report `PlatformAsyncIo`. WASI persistent
options default to `Flush`, the strongest currently accepted mode for that
backend.

Browser persistence is async-only on `wasm32-unknown-unknown`. Use `Db::open`
and the async mutation and maintenance APIs:

```rust
let db = Db::open(DbOptions::browser_persistent()).await?;
db.put(b"user:001".to_vec(), b"Ada".to_vec()).await?;
db.flush().await?;
```

The browser persistent backend uses browser storage APIs behind Trine's storage
traits. Writable open acquires a Web Locks writer lease, replays WAL, and uses
WAL-backed async writes. Browser storage accepts `Buffered` and `Flush`;
`SyncData` and `SyncAll` return `UnsupportedDurability`. Synchronous browser
persistent open, synchronous mutation, synchronous bucket creation, and
synchronous maintenance return typed unsupported errors. On non-browser targets,
browser persistent async open returns `UnsupportedBackend`. Browser persistent
options default to `Flush`, the strongest currently accepted mode for that
backend.

Read-only browser open is also async:

```rust
let db = Db::open(DbOptions::browser_persistent_read_only()).await?;
```

`Db`, `Bucket`, and `Snapshot` are cheap handles. `Db` writes to the built-in
default bucket. A named `Bucket` keeps its database open, so release bucket
handles before reopening the same directory in the same process.

## Use The Default Bucket

The short examples below use explicit sync adapters so they fit ordinary
functions. For the primary async API, drop the `_sync` suffix and `await` the
returned future.

The default bucket is created automatically and is the right path for simple
embedded storage:

```rust
db.put_sync(b"user:001", b"Ada")?;
assert_eq!(db.get_sync(b"user:001")?, Some(b"Ada".to_vec()));

let rows = db.range_sync(&trine_kv::KeyRange::all())?;
```

Configure the default bucket through `DbOptions`; do not open it by name:

```rust
use trine_kv::{BucketOptions, Db, DbOptions, PrefixExtractor};

let options = DbOptions::memory().with_default_bucket_options(
    BucketOptions::default().with_prefix_extractor(PrefixExtractor::Separator(b':')),
);
let db = Db::open_sync(options)?;
```

## Create A Bucket

A bucket is a named collection of keys with fixed options. `bucket` returns an
existing bucket or creates it with default `BucketOptions`.

```rust
let users = db.bucket_sync("users")?;
```

Use `bucket_with_options` when the bucket needs prefix filters or custom
storage tuning. If the bucket already exists, the options must match because
they are part of the on-disk contract:

```rust
use trine_kv::{BucketOptions, PrefixExtractor};

let users = db.bucket_with_options_sync(
    "users",
    BucketOptions::default().with_prefix_extractor(PrefixExtractor::Separator(b':')),
)?;
```

The name `"default"` is reserved for the built-in default bucket and cannot be
used as a named bucket.

## Write And Read Keys

The bucket helpers write one key at a time when you need a named bucket:

```rust
users.put_sync(b"user:001", b"Ada")?;
users.put_sync(b"user:002", b"Lin")?;

assert_eq!(users.get_sync(b"user:001")?, Some(b"Ada".to_vec()));
```

Deletes use the same bucket handle:

```rust
users.delete_sync(b"user:002")?;
assert_eq!(users.get_sync(b"user:002")?, None);
```

The matching `delete_with_options` and `delete_range_with_options` helpers are
available when deletes need explicit write options.

Keys and values are byte vectors. String keys are fine, but the database does
not require UTF-8.

## Write A Batch

Use `WriteBatch` when several changes must commit at the same sequence:

```rust
use trine_kv::{WriteBatch, WriteOptions};

let mut batch = WriteBatch::new();
batch.put(b"audit:001", b"created");
batch.delete(b"audit:000");
batch.put_bucket("users", b"user:003", b"Grace")?;
batch.delete_bucket("users", b"user:001")?;

let commit = db.write_sync(
    batch,
    WriteOptions::default(),
)?;

println!("committed sequence {}", commit.sequence().get());
```

Batch writes can span buckets. Named-bucket staging methods return `Result`
because empty names and the reserved `"default"` name are rejected before the
batch is submitted. If validation fails during commit, the batch is rejected
before it changes memtables.

## Range And Prefix Scans

Range scans return keys in sorted order:

```rust
use trine_kv::KeyRange;

let range = KeyRange::half_open(b"user:000", b"user:999");
for item in users.range_sync(&range)? {
    let key_value = item?;
    println!("{:?} = {:?}", key_value.key, key_value.value);
}
```

Reverse scans use the same range:

```rust
for item in users.range_reverse_sync(&range)? {
    let key_value = item?;
    println!("{:?}", key_value.key);
}
```

Prefix scans are most useful when the bucket has a prefix extractor:

```rust
for item in users.prefix_sync(b"user:")? {
    let key_value = item?;
    println!("{:?}", key_value.key);
}
```

Prefix filters are advisory: they can skip table work, but they do not replace
MVCC or range-delete checks.

## Snapshots

A snapshot keeps reads pinned to the database sequence that was current when
the snapshot was created:

```rust
let snapshot = db.snapshot();

users.put_sync(b"user:004", b"Barbara")?;

assert_eq!(snapshot.get_sync(&users, b"user:004")?, None);
assert_eq!(users.get_sync(b"user:004")?, Some(b"Barbara".to_vec()));
```

Snapshots can read points, ranges, reverse ranges, and prefixes:

```rust
for item in snapshot.prefix_sync(&users, b"user:")? {
    let key_value = item?;
    println!("{:?}", key_value.key);
}
```

Keep snapshots short-lived when possible. Long-lived snapshots can delay
cleanup of old versions and blob files.

## Repeated Point Reads

You do not need a reader for a normal point read. Use `db.get_sync(key)` for the
default bucket or `bucket.get_sync(key)` for a named bucket.

Use a `BucketReader` only when a read-heavy workload performs many point reads
under one snapshot. `reader.get_sync` returns a `PointValue`, which can be inspected
through `as_bytes()` without forcing an owned `Vec<u8>` for inline table values:

```rust
let snapshot = db.snapshot();
let reader = users.reader(&snapshot)?;

if let Some(value) = reader.get_sync(b"user:001")? {
    assert_eq!(value.as_bytes(), b"Ada");
}
```

Use `reader.get_owned_sync(key)` when the caller needs the same owned-value
shape as the regular sync get API.

## Optimistic Transactions

Transactions read at a fixed sequence and validate their read set at commit:

```rust
use trine_kv::{Error, TransactionOptions};

let mut txn = db.transaction(TransactionOptions::default());
let previous_default = txn.get_sync(b"settings:theme")?;
txn.put(b"settings:theme", b"dark");

let previous_user = txn.get_bucket_sync("users", b"user:001")?;
txn.put_bucket("users", b"user:005", b"Margaret")?;

match txn.commit_sync() {
    Ok(info) => println!("committed sequence {}", info.sequence().get()),
    Err(Error::Conflict { message }) => println!("retry transaction: {message}"),
    Err(error) => return Err(error),
}
```

Point reads conflict with later point writes, point deletes, or covering range
deletes. Range reads conflict with later point changes inside the range or later
overlapping range deletes. Named-bucket write methods return `Result` for the
same bucket-name validation used by `WriteBatch`.

## Durability

For persistent databases, committed writes append to the WAL before becoming
visible in memtables. Native persistent opens default to safety-first
durability, so ordinary writes can use the default write options:

```rust
use trine_kv::{WriteBatch, WriteOptions};

let mut batch = WriteBatch::new();
batch.put_bucket("users", b"user:006", b"Edsger")?;

db.write_sync(
    batch,
    WriteOptions::default(),
)?;
```

`DbOptions::durability` is a database-level floor. Per-write options can ask for
a stronger mode, but they cannot weaken the mode chosen at open time. If a
database was explicitly opened with lower durability for rebuildable data, use
`Db::persist_sync` as an explicit WAL sync point:

```rust
use trine_kv::DurabilityMode;

db.persist_sync(DurabilityMode::SyncAll)?;
```

Read [durability.md](durability.md) before lowering durability for production
data.

## Large Values And Blob GC

Small values stay inline in SSTables. In persistent mode, values at or above a
bucket's `blob_threshold_bytes` are written into Titan-like blob files when
memtables flush or compaction writes new SSTables. WAL records and memtables
still keep the complete value, so ordinary writes do not need a blob file on
the foreground path.

Configure the default bucket threshold and Level Merge policy through
`DbOptions`:

```rust
use trine_kv::{BlobLevelMergePolicy, BucketOptions, Db, DbOptions};

let db = Db::open_sync(
    DbOptions::new("./trine-data").with_default_bucket_options(
        BucketOptions {
            blob_threshold_bytes: 64 * 1024,
            blob_level_merge_policy: BlobLevelMergePolicy::Auto,
            ..BucketOptions::default()
        },
    ),
)?;
```

`Auto` is the default. It rewrites retained blob values during compaction when
the output would otherwise keep references to multiple blob files or leave
stale bytes behind in input blob files. Use `Disabled` only when you want GC to
handle old blob files without compaction-time rewriting, and `Always` for
benchmarking or workloads that prefer aggressive blob locality.

When range or prefix scans mostly need keys first, use the value-lazy iterator
variants. They keep the same MVCC and range-delete semantics, but blob bytes
are read only when you call `LazyValue::read_sync` or convert the row into a full
`KeyValue`:

```rust
for row in db.range_lazy_sync(&trine_kv::KeyRange::all())? {
    let row = row?;
    println!("key={:?}", row.key);
    let value = row.value.read_sync()?;
    println!("value bytes={}", value.len());
}
```

Blob GC is enabled for persistent databases by default. It runs from the
compaction path, batches all blob files that pass the discard threshold,
rewrites still-live records out of stale blob files, and keeps old blob files
until no snapshot or range iterator can still reach them.

Use database-level options to tune when GC is considered:

```rust
use trine_kv::{BlobGcRatio, DbOptions};

let mut options = DbOptions::new("./trine-data");
options.blob_gc_min_file_bytes = 64 * 1024 * 1024;
options.blob_gc_discardable_ratio = BlobGcRatio::from_millionths(500_000);
```

In-memory databases keep values inline and do not create disk blob files.

## Flush, Compaction, And Stats

Flush writes memtable contents into SSTables and advances the manifest replay
floor:

```rust
db.flush_sync()?;
```

For persistent writable databases, `flush_sync()` is a barrier for writes committed
before the call. It returns after those writes have left active and immutable
memtables and have been published as SSTables. Writes committed concurrently
after the call starts may remain in memory for a later flush.

Manual compaction rewrites overlapping tables while preserving snapshot
visibility:

```rust
db.compact_range_sync(KeyRange::all())?;
```

If another compaction already owns an overlapping key range, `compact_range_sync()`
waits and retries instead of reporting success while the guard is busy.

Cooperative hosts can advance maintenance in bounded steps:

```rust
use trine_kv::MaintenanceBudget;

let outcome = db.run_maintenance_with_budget_sync(MaintenanceBudget::default())?;
if outcome.budget_exhausted() {
    // Call again from the host scheduler when it is ready to do more work.
}
```

`MaintenanceBudget::default()` allows one flush input and one compaction unit.
Each unit is still published atomically; when the budget is exhausted, the next
call resumes by planning from the current manifest and in-memory versions.
Use `compact_range_with_budget_sync(range, budget)` when a host wants only
compaction work for a key range.

Browser persistent databases use the async maintenance variants:

```rust
let outcome = db.run_maintenance_with_budget(MaintenanceBudget::default()).await?;
```

If browser write pressure cannot be relieved inside the write preflight, the
write returns `RuntimeBusy`; call async maintenance from the host scheduler and
retry the write.

Persistent writable databases start one background maintenance worker by
default. Set `DbOptions::background_worker_count = 0` when a test or embedding
needs fully manual maintenance. In-memory and read-only databases never start
background workers.

The database can also compact automatically after flush when L0 file pressure
exceeds `DbOptions::max_l0_files`. Automatic L0 compaction chooses a local
overlapping key span, so unrelated L0 files may remain for later passes instead
of every pressure event rewriting the whole level.

Writes apply pressure handling when immutable memtables or L0 files exceed
configured limits. The write may wait briefly for the background worker or help
with one foreground maintenance pass before accepting more work.

Persistent reads keep one cached file handle per table and reuse it for block
reads. L0/L1 tables keep their table filters and index partitions resident;
deeper levels load partition metadata lazily and let the global block cache
protect metadata entries ahead of data blocks under cache pressure.

Inspect live state with `Db::stats`:

```rust
let stats = db.stats();
println!(
    "buckets={} tables={} cache_hits={} blob_reads={} storage_reads={}",
    stats.live_buckets,
    stats.total_tables,
    stats.block_cache_hits,
    stats.blob_read_count,
    stats.storage_operations.read_object_bytes.requests,
);
```

## Read-Only Open

Use read-only open for inspecting a stable persistent directory:

```rust
use trine_kv::{Db, DbOptions};

let db = Db::open_sync(DbOptions::new("./trine-data").read_only())?;
```

Read-only open does not take the writer lock and does not create a WAL writer.
It still validates files and replays WAL records into memory. V1 does not define
live multi-process reads against a concurrent writer.

## Recovery Boundaries

Startup is conservative. It fails closed on missing referenced files, corrupt
WAL records before the final tail, corrupt tables, corrupt blobs, unsupported
formats, and unexpected formal storage files.

Safe temporary files can be repaired only when explicitly requested:

```rust
use trine_kv::{DbOptions, FailOnCorruptionPolicy};

let mut options = DbOptions::new("./trine-data");
options.fail_on_corruption = FailOnCorruptionPolicy::RepairSafeTemporaryFiles;

let db = Db::open_sync(options)?;
```

This policy is intentionally narrow. It does not repair WAL corruption,
manifest corruption, table corruption, missing referenced files, or blob
corruption.

## Verification Path

Use these commands before trusting a change to documentation or examples:

```text
cargo run --example quickstart
cargo run --example sync_quickstart
cargo run --example user_store
cargo run --example event_index
cargo fmt --check
cargo clippy
cargo test
```

For performance-sensitive changes, also run:

```text
cargo bench --bench v1_bench
```