Backend-independent data types and algorithms for pdfplumber-rs.
This crate provides the foundational types ([BBox], [Char], [Word],
[Line], [Rect], [Table], etc.) and algorithms (text grouping, table
detection) used by pdfplumber-rs. It has no required external dependencies —
all functionality is pure Rust.
Modules
- [
geometry] — Geometric primitives: [Point], [BBox], [Ctm], [Orientation] - [
text] — Character data: [Char], [TextDirection], CJK detection - [
words] — Word extraction: [Word], [WordExtractor], [WordOptions] - [
layout] — Text layout: [TextLine], [TextBlock], [TextOptions] - [
shapes] — Shapes from painted paths: [Line], [Rect], [Curve] - [
edges] — Edge derivation for table detection: [Edge], [EdgeSource] - [
table] — Table detection: [Table], [TableFinder], [TableSettings] - [
images] — Image extraction: [Image], [ImageMetadata] - [
painting] — Graphics state: [Color], [GraphicsState], [PaintedPath] - [
path] — Path construction: [Path], [PathBuilder], [PathSegment] - [
encoding] — Font encoding: [FontEncoding], [EncodingResolver] - [
error] — Errors and warnings: [PdfError], [ExtractWarning], [ExtractOptions] - [
search] — Text search: [SearchMatch], [SearchOptions], [search_chars] - [
unicode_norm] — Unicode normalization: [UnicodeNorm], [normalize_chars]