kreuzberg 4.9.7

High-performance document intelligence library for Rust. Extracts text, metadata, and structured data from PDFs, Office documents, images, and 91+ formats, and covers 248 programming languages via tree-sitter code intelligence. Provides both async and sync APIs.

Very little structured metadata is currently available to build this page from. Check the main library docs, the README, or Cargo.toml in case the author documented the features there.

This version has 49 feature flags, 2 of them enabled by default.

default

simd-utf8 (default)

tokio-runtime (default)
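
As a minimal sketch of how these flags are selected from a downstream crate's Cargo.toml: the default set (simd-utf8 and tokio-runtime) is enabled unless it is explicitly turned off, and any other flag must be named. The flags "pdf" and "ocr" below are taken from the list on this page purely as examples. This page does not document what each flag pulls in, or whether a given combination builds without the defaults, so treat the choice as an assumption to verify against the crate's own Cargo.toml.

```toml
[dependencies]
# Keep the defaults (simd-utf8, tokio-runtime) and add two optional flags.
# "pdf" and "ocr" are illustrative picks from the flag list, not a recommendation.
kreuzberg = { version = "4.9.7", features = ["pdf", "ocr"] }

# Alternative (use instead of the line above, not alongside it):
# opt out of the default set and list every wanted flag by name.
# kreuzberg = { version = "4.9.7", default-features = false, features = ["pdf", "simd-utf8"] }
```

Listing features explicitly keeps the dependency's build surface visible in one place. With `default-features = false`, any default flag that is still wanted has to be re-added by name, as simd-utf8 is in the commented line.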

api

archives

auto-rotate

bundled-pdfium

chunking

chunking-tokenizers

cli

email

embeddings

excel

excel-wasm

formats

full

html

hwp

iwork

keywords

keywords-rake

keywords-yake

language-detection

layout-detection

liter-llm

mcp

mcp-http

mdx

This feature flag does not enable additional features.

ocr

ocr-wasm

office

ort-bundled

ort-dynamic

otel

paddle-ocr

pdf

pdf-oxide

pool-metrics

This feature flag does not enable additional features.

profiling

quality

server

static-pdfium

stopwords

This feature flag does not enable additional features.

system-pdfium

tower-service

tree-sitter

tree-sitter-wasm

wasm-target

wasm-threads

xml