rsclaw 2026.5.1

AI Agent Engine Compatible with OpenClaw
Documentation
# RsClaw Roadmap

## v1.0 — 2026.4.20 (Current)

AI Agent Engine with long-term memory, self-learning, and multi-agent orchestration.

- 4 agent types (Main/Named/Sub/Task) with bidirectional team communication
- 4 execution backends (Native Rust/Claude Code/OpenCode/ACP)
- 13 messaging channels + custom webhook
- 15+ LLM providers with failover
- 50+ browser automation actions (agent-browser parity)
- Three-layer memory (redb KV + tantivy FTS + hnsw_rs vector)
- AnyCLI structured web data extraction
- A2A v0.3 cross-machine orchestration
- WASM plugin system (wasmtime v29)
- Skill auto-crystallization from usage patterns
- KV cache optimization (API key isolation, TTL, incremental messages)

## v2.0 — Planned

### 1. Full-Platform Perception & Control

**Screen Awareness**
- Real-time screen capture with vision model understanding
- OCR for on-screen text recognition
- UI element detection (buttons, inputs, menus, dialogs)
- Context awareness: understand what app is active, what task the user is doing

**Desktop Takeover**
- Extend computer_use beyond screenshot+click to full desktop control
- Application-level operations: launch apps, switch windows, drag files
- System-level: notifications, clipboard, global hotkeys, file system monitoring
- Native OS integration (macOS Accessibility API, Windows UI Automation)

**Mobile Native App (iOS / Android)**
- Tauri v2 mobile build: one codebase → desktop + iOS + Android
- Android: native binary (Termux) or Tauri app
- iOS: Tauri v2 app (lib crate already exposed)
- Mobile-optimized UI (touch, compact layout)
- Background agent service with push notifications

**Mobile Device Interaction**
- iOS/Android screen mirroring + touch control
- Cross-device task delegation: phone captures → desktop processes → results pushed back
- Mobile notification monitoring and response

### 2. Cross-Application Intelligent Collaboration

**Application Data Flow**
- Seamless data transfer between apps without manual copy-paste
- Example: Extract data from Excel → compose email → send via WeChat — fully automated
- Clipboard-aware pipeline: detect what user copied, offer relevant actions

**Workflow Orchestration**
- Multi-app automation flows (RPA-level but AI-driven)
- Visual workflow builder in UI
- Conditional branching based on screen state / app response
- Error recovery: detect when an app shows an error, adapt the workflow

**Context-Aware Intelligence**
- Understand which application is in focus and what the user is working on
- Proactive suggestions based on screen content
- Cross-app memory: remember what was done in App A when working in App B

### 3. A2A Protocol Enhancement — Production-Ready

**Agent Discovery**
- Automatic capability-based agent discovery (not just manual config)
- Agent registry / directory service
- Capability advertisement: each agent publishes what it can do
- Matching: find the best remote agent for a given task

**Task Negotiation**
- Multi-agent discussion for task decomposition
- Bidding system: multiple agents can offer to handle a task
- SLA negotiation: deadline, quality, cost constraints
- Conflict resolution when agents disagree

**State Synchronization**
- Cross-machine session sharing (not just message passing)
- Distributed memory: agents share relevant memories
- Consistency model: eventual consistency with conflict resolution
- Secure state transfer (encrypted, authenticated)

**Production Hardening**
- Authentication and authorization between agents
- Rate limiting, quota management
- Monitoring, tracing, audit logs
- Graceful degradation when remote agents are unavailable

### 4. Agent Ecosystem

**AnyCLI Adapter Marketplace**
- anycli.org community hub
- Adapter rating, reviews, usage stats
- One-click install: `rsclaw anycli install <name>`
- Auto-generated adapters via AI (point at a website, get an adapter)

**Skill Marketplace**
- Public skill registry with categories
- Skill versioning and dependency management
- Quality gates: automated testing before publish
- Revenue sharing for premium skills

**WASM Plugin SDK**
- Developer documentation and tutorials
- Plugin template generator
- Local testing framework
- Hot-reload during development
- Plugin marketplace with review process

**Developer Tools**
- `rsclaw dev` CLI for plugin/skill development
- Playground: test agents, tools, skills interactively
- Debugging tools: trace tool calls, inspect memory, replay sessions
- Performance profiling for custom plugins

### 5. Infrastructure (Commercial)

**Distributed Inference (rsclaw-server)**
- GPU node scheduling for 10K+ nodes (internet cafe scenario)
- KV cache P2P migration on node drain
- Incremental message transmission (cache_id + delta)
- Failover with partial generation continuation

**Digital Human Pipeline**
- MuseTalk + CosyVoice2 integration
- Real-time voice conversation (STT → Agent → TTS → lip-sync)
- Video understanding + generation closed loop

---

## Contributing

We welcome contributions in all areas. See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.

Priority areas for community contribution:
- AnyCLI adapters for new websites
- Skills for common workflows
- WASM plugins for service integrations
- Translations (currently 10 languages)
- Documentation and tutorials