Skip to main content

Module parser

Module parser 

Source
Expand description

Parser registry — extensible format detection and parsing.

Invention 2: Format Omniscience. Give AgenticData ANY file and it identifies the format, selects the right parser, and produces Universal Schema records.

Modules§

calendar_parser
iCalendar (.ics) parser — extracts events, todos, and calendar data.
csv_parser
CSV/TSV parser — handles comma and tab-delimited data with schema inference.
detect
Format auto-detection — identifies data format from content and extension.
email_parser
Email (.eml) parser — extracts headers, body, and attachments.
geo_parser
Geospatial format parsers — GeoJSON, KML, GPX.
html_parser
HTML and Markdown parser — extracts structure from documents.
json_parser
JSON / JSON Lines parser — handles objects, arrays, and NDJSON.
log_parser
Log format parser — handles timestamped log entries.
media_parser
Media metadata parser — extracts properties from image, audio, video files.
sql_parser
SQL dump parser — extracts schema and records from SQL statements.
xml_parser
XML parser — extracts structure from XML documents.
yaml_parser
YAML and TOML parser — structured configuration and data formats.

Structs§

FormatDetection
Detected format with confidence.
ParseResult
Result of parsing a data source.

Enums§

DataFormat
All supported data formats.

Functions§

parse_as
Parse data as a specific format.
parse_auto
Parse data from a string, auto-detecting format.
supported_formats
List all supported formats.