Layered HTTP-based acquisition engine.
Replaces browser-dependent mapping with structured data extraction from raw HTML. The browser becomes a last-resort fallback for mapping and remains required only for ACT and live PERCEIVE.
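The layering described above can be sketched roughly as follows. This is a minimal illustration, not the crate's actual API: the `Layer` enum and the `between`, `structured_layer`, `pattern_layer`, and `acquire` functions are hypothetical names, and naive substring search stands in for real HTML parsing.

```rust
/// Which layer produced the result. Illustrative names only (assumption).
#[derive(Debug, PartialEq)]
enum Layer {
    Structured,
    Pattern,
    BrowserFallback,
}

/// Returns the trimmed text between the first `open` marker and the next `close` marker.
fn between<'a>(html: &'a str, open: &str, close: &str) -> Option<&'a str> {
    let start = html.find(open)? + open.len();
    let end = html[start..].find(close)? + start;
    Some(html[start..end].trim())
}

/// Layer 1: embedded JSON-LD, read straight from the raw HTML.
fn structured_layer(html: &str) -> Option<&str> {
    between(html, "<script type=\"application/ld+json\">", "</script>")
}

/// Layer 2: a simple pattern, here the page title.
fn pattern_layer(html: &str) -> Option<&str> {
    between(html, "<title>", "</title>")
}

/// Tries the HTTP-only layers in order and reports `BrowserFallback`
/// only when both come back empty.
fn acquire(html: &str) -> (Layer, Option<&str>) {
    if let Some(data) = structured_layer(html).filter(|s| !s.is_empty()) {
        return (Layer::Structured, Some(data));
    }
    if let Some(data) = pattern_layer(html).filter(|s| !s.is_empty()) {
        return (Layer::Pattern, Some(data));
    }
    (Layer::BrowserFallback, None)
}
```

Calling `acquire` on a page with an embedded JSON-LD script returns the `Structured` layer with the JSON text, while a page that exposes nothing in its raw HTML (for example a bare canvas) falls through to `BrowserFallback`.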
Modules
- action_discovery - Discovers executable HTTP actions from HTML forms, JavaScript patterns, and known e-commerce platforms.
- api_discovery - Known public API discovery.
- auth - Authentication handlers – login via password, OAuth, API key.
- canvas_extractor - Three-tier extraction for canvas and WebGL applications.
- drag_discovery - Discovers drag-and-drop interactions and maps them to their underlying API calls.
- feed_parser - RSS/Atom feed discovery and parsing.
- head_scanner - Parallel HEAD request scanner for URL metadata.
- http_client - Async HTTP client wrapping reqwest.
- http_session - HTTP session management for authenticated requests.
- js_analyzer - JavaScript bundle analyzer for API endpoint discovery.
- pattern_engine - CSS-selector and regex based data extractor for raw HTML.
- structured - Parse structured data from raw HTML without DOM rendering.
- webmcp - WebMCP tool discovery and execution.
- ws_discovery - Discovers WebSocket endpoints from HTML and JavaScript source code.
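
As one example of what source-level discovery might look like, the sketch below scans HTML or JavaScript text for `ws://` and `wss://` URLs. It is an assumption-laden illustration: `find_ws_endpoints` is a hypothetical helper, not the `ws_discovery` module's real interface, and it only catches endpoints written as literal URLs.

```rust
/// Scans raw HTML or JavaScript source for literal `ws://` and `wss://` URLs.
/// Hypothetical helper for illustration; not the crate's `ws_discovery` API.
/// Results are grouped by scheme (`wss://` first), without duplicates.
fn find_ws_endpoints(source: &str) -> Vec<String> {
    let mut found: Vec<String> = Vec::new();
    for scheme in ["wss://", "ws://"] {
        let mut offset = 0;
        while let Some(pos) = source[offset..].find(scheme) {
            let start = offset + pos;
            // Reject matches inside longer schemes such as `news://`.
            let inside_word = source[..start]
                .chars()
                .next_back()
                .map_or(false, |c| c.is_ascii_alphanumeric());
            // Treat the first quote, whitespace, bracket, or angle bracket as the URL end.
            let len = source[start..]
                .find(|c: char| {
                    c.is_whitespace() || matches!(c, '"' | '\'' | '`' | ')' | '<' | '>')
                })
                .unwrap_or(source.len() - start);
            let url = &source[start..start + len];
            // Skip bare schemes and URLs already recorded.
            if !inside_word && url.len() > scheme.len() && !found.iter().any(|u| u == url) {
                found.push(url.to_string());
            }
            offset = start + scheme.len();
        }
    }
    found
}
```

Endpoints that are assembled at runtime (for instance from `location.host`) would not be found by a literal scan like this and would need deeper analysis of the JavaScript.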