car-browser 0.6.0

car-browser

Browser automation and perception pipeline for the Common Agent Runtime.

What it does

Provides a backend-agnostic browser automation layer. The BrowserBackend trait abstracts over different browser engines (Tauri WebView, Chromium via chromiumoxide, or test mocks). The perception pipeline converts raw browser state (screenshots + accessibility trees) into structured UiMap data for agent consumption.
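As a rough illustration of the backend-agnostic idea, the sketch below defines a small stand-in trait, a mock backend, and a perception step that turns accessibility nodes into a structured map. Every name here (`SketchBackend`, `RawState`, `AxNode`, `SketchUiMap`, `perceive`, `observe`) is hypothetical and chosen for illustration; the actual `BrowserBackend` trait and `UiMap` type in car-browser may look quite different.

```rust
/// One accessibility-tree node as a backend might report it (hypothetical shape).
#[derive(Debug, Clone)]
pub struct AxNode {
    pub role: String,
    pub name: String,
    /// x, y, width, height in pixels.
    pub bounds: (i32, i32, u32, u32),
}

/// Raw browser state: a screenshot plus accessibility nodes (hypothetical shape).
#[derive(Debug, Clone)]
pub struct RawState {
    pub screenshot_png: Vec<u8>,
    pub ax_nodes: Vec<AxNode>,
}

/// Stand-in for a backend trait. NOT car-browser's real `BrowserBackend`.
pub trait SketchBackend {
    fn navigate(&mut self, url: &str) -> Result<(), String>;
    fn capture(&self) -> Result<RawState, String>;
}

/// One actionable element in the structured output.
#[derive(Debug, Clone, PartialEq)]
pub struct SketchElement {
    pub id: usize,
    pub role: String,
    pub label: String,
    pub center: (i32, i32),
}

/// Structured view of the page, loosely modeled on the idea of a UI map.
#[derive(Debug, Clone, PartialEq)]
pub struct SketchUiMap {
    pub elements: Vec<SketchElement>,
}

/// Keep only interactive roles and compute a click point for each.
pub fn perceive(state: &RawState) -> SketchUiMap {
    let elements = state
        .ax_nodes
        .iter()
        .filter(|n| matches!(n.role.as_str(), "button" | "link" | "textbox"))
        .enumerate()
        .map(|(id, n)| {
            let (x, y, w, h) = n.bounds;
            SketchElement {
                id,
                role: n.role.clone(),
                label: n.name.clone(),
                center: (x + (w / 2) as i32, y + (h / 2) as i32),
            }
        })
        .collect();
    SketchUiMap { elements }
}

/// Test double that returns a fixed page, standing in for a real engine.
pub struct MockBackend {
    pub url: String,
}

impl SketchBackend for MockBackend {
    fn navigate(&mut self, url: &str) -> Result<(), String> {
        self.url = url.to_string();
        Ok(())
    }

    fn capture(&self) -> Result<RawState, String> {
        let node = |role: &str, name: &str, bounds| AxNode {
            role: role.to_string(),
            name: name.to_string(),
            bounds,
        };
        Ok(RawState {
            screenshot_png: Vec::new(),
            ax_nodes: vec![
                node("button", "Submit", (10, 20, 100, 20)),
                node("generic", "wrapper", (0, 0, 800, 600)),
                node("link", "Docs", (0, 100, 40, 10)),
            ],
        })
    }
}

/// Works with any backend: navigate, capture raw state, then run perception.
pub fn observe<B: SketchBackend>(backend: &mut B, url: &str) -> Result<SketchUiMap, String> {
    backend.navigate(url)?;
    Ok(perceive(&backend.capture()?))
}
```

Because `observe` is generic over the trait, the same calling code could run against a mock in tests and a real engine in production, which is the point of abstracting the backend.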

Usage

use car_browser::{BrowserBackend, BrowserToolExecutor};

// Implement BrowserBackend for your engine (`backend` below is your implementation), then register tools
let executor = BrowserToolExecutor::new(backend);
// Tools: browse_navigate, browse_click, browse_type, browse_scroll, browse_screenshot
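To make the tool list more concrete, here is a minimal sketch of how tool names like these could be mapped to typed actions before being handed to a backend. Only the five tool names come from the list above; the `SketchAction` enum, the `parse_tool_call` function, and the argument keys (`url`, `element_id`, `text`, `dy`) are assumptions for illustration and do not describe how `BrowserToolExecutor` actually parses its input.

```rust
use std::collections::HashMap;

/// Typed form of a tool call. Hypothetical; not part of car-browser.
#[derive(Debug, PartialEq)]
pub enum SketchAction {
    Navigate { url: String },
    Click { element_id: usize },
    Type { element_id: usize, text: String },
    Scroll { dy: i32 },
    Screenshot,
}

/// Parse a numeric argument, reporting which key was malformed.
fn parse_num<T: std::str::FromStr>(raw: &str, key: &str) -> Result<T, String> {
    raw.parse::<T>()
        .map_err(|_| format!("argument `{}` is not a valid number", key))
}

/// Map a tool name plus string arguments to a typed action.
/// The argument keys are assumed for this sketch.
pub fn parse_tool_call(
    name: &str,
    args: &HashMap<String, String>,
) -> Result<SketchAction, String> {
    // Fetch a required argument or explain which one is missing.
    let get = |key: &str| -> Result<String, String> {
        args.get(key)
            .cloned()
            .ok_or_else(|| format!("missing argument `{}`", key))
    };

    match name {
        "browse_navigate" => Ok(SketchAction::Navigate { url: get("url")? }),
        "browse_click" => Ok(SketchAction::Click {
            element_id: parse_num(&get("element_id")?, "element_id")?,
        }),
        "browse_type" => Ok(SketchAction::Type {
            element_id: parse_num(&get("element_id")?, "element_id")?,
            text: get("text")?,
        }),
        "browse_scroll" => Ok(SketchAction::Scroll {
            dy: parse_num(&get("dy")?, "dy")?,
        }),
        "browse_screenshot" => Ok(SketchAction::Screenshot),
        other => Err(format!("unknown tool `{}`", other)),
    }
}
```

Validating the name and arguments up front keeps malformed agent output from reaching the browser engine; an unknown tool or a missing argument comes back as an error string the agent can read.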

Crate features

  • chromium -- enables the chromiumoxide-based headless backend
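Assuming the `chromium` feature is not enabled by default (the list above does not say either way), it could be turned on from a dependent crate's Cargo.toml roughly like this. The version shown is the one in the header of this page.

```toml
[dependencies]
# Opt in to the chromiumoxide-based headless backend.
car-browser = { version = "0.6.0", features = ["chromium"] }
```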

Part of CAR -- see the main repo for full documentation.