codetether-agent 4.0.0

# Left Off - Complete Success!

## Summary
We successfully implemented the foundation for the `LocalCudaProvider` using Candle bindings for local GPU inference in the CodeTether agent hybrid swarm architecture. Along the way, we encountered and fixed 124 compilation errors across the entire codebase.

## What Was Accomplished

1. **Hybrid Swarm Architecture Designed & Documented**: 
   - Created `docs/architecture/hybrid_swarm.md` outlining the OKR, PRD, and System Prompt.

2. **LocalCudaProvider Implemented**:
   - Created `src/provider/local_cuda.rs` with the `LocalCudaProvider` struct.
   - Integrated it into the `ProviderRegistry` in `src/provider/mod.rs`.
   - Added environment variable fallbacks for other providers in `from_vault()`.

3. **Fixed 124 Compilation Errors**:
   - **Telemetry Module**: Expanded to add the missing types (`ContextLimit`, `TokenTotals`, `ProviderSnapshot`, `PersistentStats`). Fixed method signatures for tool executions.
   - **Worktree Module**: Added missing `abort_merge` and `complete_merge` methods. Updated `MergeResult` to include `conflict_diffs`. Updated `ralph_loop.rs` to use asynchronous calls properly.
   - **RLM Oracle Module**: Added `PartialEq` and `Default` traits to validation structs. Fixed `tree_sitter` cursor iteration using `streaming-iterator`. Resolved borrow checker issues in `schema.rs` and `validator.rs`.
   - **Main CLI**: Fixed type annotations and async method calls for `WorktreeManager`.
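
As a rough illustration of the environment variable fallback mentioned in item 2, the sketch below tries a vault entry first and then an environment variable. The helper name `resolve_api_key`, the `HashMap` standing in for vault data, and the `<PROVIDER>_API_KEY` naming scheme are all assumptions made for this example; the actual `from_vault()` code may be structured differently.

```rust
use std::collections::HashMap;

/// Hypothetical helper (not the project's real API): look up a provider's
/// API key in vault data first, then fall back to an environment variable.
/// `env_lookup` is injected so the logic can be exercised without touching
/// the real process environment.
fn resolve_api_key<F>(
    vault: &HashMap<String, String>,
    provider: &str,
    env_lookup: F,
) -> Option<String>
where
    F: Fn(&str) -> Option<String>,
{
    // Prefer a non-empty vault entry.
    if let Some(key) = vault.get(provider).filter(|k| !k.is_empty()) {
        return Some(key.clone());
    }
    // Assumed naming scheme: "local-cuda" -> "LOCAL_CUDA_API_KEY".
    let var = format!("{}_API_KEY", provider.to_uppercase().replace('-', "_"));
    env_lookup(var.as_str()).filter(|k| !k.is_empty())
}
```

In real use the lookup closure would typically be `|name| std::env::var(name).ok()`. Passing it in keeps the fallback order easy to unit test.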

## Current State
- The codebase compiles with **0 errors**.
- `cargo check` passes cleanly.
- `cargo install --path .` has been triggered to install the updated binary.

## Next Steps
Now that the project compiles cleanly, the next steps are to finish and test local inference:

1. **Finish Candle Inference Logic**: The `LocalCudaProvider::complete()` method currently returns a placeholder error. It needs to be updated to load the actual GGUF model weights, tokenize the prompt, and run the generation loop using `candle-core` and `candle-transformers`.
2. **Test the Swarm**: Run the CodeTether worker node on the RTX 2070 PC and verify that the Cloud Opus orchestrator can successfully route tasks to it and receive generated code back.
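
The generation loop from step 1 can be sketched independently of Candle. The example below shows only the control flow of greedy decoding over token IDs. The `NextTokenLogits` trait, `generate_greedy`, and the toy `CountingModel` are hypothetical stand-ins invented for this sketch. GGUF loading, tokenization, and the real forward pass through `candle-core` and `candle-transformers` are deliberately left out.

```rust
/// Stand-in for a loaded model: maps the tokens so far to next-token logits.
/// In the real provider this would presumably wrap the model's forward pass.
trait NextTokenLogits {
    fn logits(&mut self, tokens: &[u32]) -> Vec<f32>;
}

/// Index of the largest logit (greedy sampling). Returns None for empty input.
fn argmax(logits: &[f32]) -> Option<u32> {
    logits
        .iter()
        .enumerate()
        .max_by(|a, b| a.1.total_cmp(b.1))
        .map(|(i, _)| i as u32)
}

/// Greedy decoding: repeatedly append the most likely token until the
/// end-of-sequence token appears or `max_new_tokens` is reached.
/// Returns only the newly generated tokens, without the prompt.
fn generate_greedy<M: NextTokenLogits>(
    model: &mut M,
    prompt: &[u32],
    eos: u32,
    max_new_tokens: usize,
) -> Vec<u32> {
    let mut tokens = prompt.to_vec();
    let mut generated = Vec::new();
    for _ in 0..max_new_tokens {
        let logits = model.logits(&tokens);
        // Stop on EOS or if the model produced no logits at all.
        let next = match argmax(&logits) {
            Some(t) if t != eos => t,
            _ => break,
        };
        tokens.push(next);
        generated.push(next);
    }
    generated
}

/// Toy model for demonstration: always predicts (last token + 1) mod vocab.
struct CountingModel {
    vocab: usize,
}

impl NextTokenLogits for CountingModel {
    fn logits(&mut self, tokens: &[u32]) -> Vec<f32> {
        let mut out = vec![0.0; self.vocab];
        let last = tokens.last().copied().unwrap_or(0) as usize;
        out[(last + 1) % self.vocab] = 1.0;
        out
    }
}
```

Keeping the loop behind a small trait like this would let `LocalCudaProvider::complete()` be tested with a fake model before the GPU path is wired in. Temperature or top-p sampling could later replace `argmax` without changing the loop's structure.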