spider_transformations
A high-performance transformation library for Rust, used by Spider Cloud for AI-powered content cleaning across multiple locales.
This project depends on the spider
crate.
Usage
[]
= "2"
use content;
Transform types
- Markdown
- Commonmark
- Text
- Markdown (Text Map) or HTML2Text
- WIP: HTML2XML
Enhancements
- Readability
- Encoding
Chunking
There are several chunking utils in the transformation mod.
This project has rewrites and forks of html2md, and html2text for performance and bug fixes.
License
MIT