Module table

Table detection types and pipeline, covering the lattice, stream, and explicit strategies.

This module provides the configuration types, data structures, and orchestration for detecting tables in PDF pages using Lattice, Stream, or Explicit strategies.
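As a rough illustration of the geometry behind a lattice-style pipeline (snap edges, join collinear segments, find intersections, build cells), here is a minimal sketch. It does not use this module's API: `Segment`, `snap`, `join`, `intersections`, and `cells` are hypothetical names invented for the example, and the cell rule (all four corners present) is a simplifying assumption.

```rust
/// Hypothetical axis-aligned segment (not this crate's type).
/// `pos` is y for a horizontal edge or x for a vertical one;
/// `start..end` is the extent along the edge's own axis.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Segment {
    pub pos: f64,
    pub start: f64,
    pub end: f64,
}

/// Snap segments of ONE orientation: positions chained within `tol`
/// of each other are replaced by their cluster mean.
pub fn snap(edges: &[Segment], tol: f64) -> Vec<Segment> {
    let mut sorted = edges.to_vec();
    sorted.sort_by(|a, b| a.pos.total_cmp(&b.pos));
    let mut out = Vec::with_capacity(sorted.len());
    let mut i = 0;
    while i < sorted.len() {
        let mut j = i + 1;
        while j < sorted.len() && sorted[j].pos - sorted[j - 1].pos <= tol {
            j += 1;
        }
        let mean = sorted[i..j].iter().map(|e| e.pos).sum::<f64>() / (j - i) as f64;
        out.extend(sorted[i..j].iter().map(|e| Segment { pos: mean, ..*e }));
        i = j;
    }
    out
}

/// Merge collinear segments (same `pos`) whose extents overlap
/// or lie within `tol` of each other.
pub fn join(edges: &[Segment], tol: f64) -> Vec<Segment> {
    let mut sorted = edges.to_vec();
    sorted.sort_by(|a, b| a.pos.total_cmp(&b.pos).then(a.start.total_cmp(&b.start)));
    let mut out: Vec<Segment> = Vec::new();
    for e in sorted {
        if let Some(last) = out.last_mut() {
            if last.pos == e.pos && e.start <= last.end + tol {
                last.end = last.end.max(e.end);
                continue;
            }
        }
        out.push(e);
    }
    out
}

/// Every (x, y) where a vertical segment crosses a horizontal one.
pub fn intersections(h: &[Segment], v: &[Segment], tol: f64) -> Vec<(f64, f64)> {
    let mut pts = Vec::new();
    for he in h {
        for ve in v {
            let x_ok = ve.pos >= he.start - tol && ve.pos <= he.end + tol;
            let y_ok = he.pos >= ve.start - tol && he.pos <= ve.end + tol;
            if x_ok && y_ok {
                pts.push((ve.pos, he.pos));
            }
        }
    }
    pts
}

/// Build `[x0, top, x1, bottom]` cells between neighbouring grid lines.
/// Assumption: a cell exists when all four corners are intersections;
/// exact float equality is used, which relies on snapping beforehand.
pub fn cells(points: &[(f64, f64)]) -> Vec<[f64; 4]> {
    let mut xs: Vec<f64> = points.iter().map(|p| p.0).collect();
    let mut ys: Vec<f64> = points.iter().map(|p| p.1).collect();
    xs.sort_by(|a, b| a.total_cmp(b));
    xs.dedup();
    ys.sort_by(|a, b| a.total_cmp(b));
    ys.dedup();
    let has = |x: f64, y: f64| points.contains(&(x, y));
    let mut out = Vec::new();
    for yw in ys.windows(2) {
        for xw in xs.windows(2) {
            if has(xw[0], yw[0]) && has(xw[1], yw[0]) && has(xw[0], yw[1]) && has(xw[1], yw[1]) {
                out.push([xw[0], yw[0], xw[1], yw[1]]);
            }
        }
    }
    out
}
```

Snapping before joining matters in this sketch: `join` and `cells` compare positions exactly, so nearly aligned edges must first be moved onto a shared coordinate.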

Structs

Cell
A detected table cell.
ExplicitLines
User-provided line coordinates for the Explicit strategy.
Intersection
An intersection point between horizontal and vertical edges.
Table
A detected table.
TableFinder
Orchestrator for the table detection pipeline.
TableFinderDebug
Intermediate results from the table detection pipeline.
TableQuality
Quality metrics for a detected table.
TableSettings
Configuration for table detection.

Enums

Strategy
Strategy for table detection.
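
To make the difference between strategies more concrete, the sketch below shows one way a stream-style approach can infer a column boundary from text alone: words whose left edges line up across enough rows yield a synthetic vertical edge. The `Word` and `VEdge` types, the `left_aligned_edges` function, and the `min_words` threshold are all hypothetical and are not taken from this module.

```rust
/// Hypothetical word bounding box (not this crate's type).
#[derive(Debug, Clone, Copy)]
pub struct Word {
    pub x0: f64,
    pub top: f64,
    pub x1: f64,
    pub bottom: f64,
}

/// Hypothetical synthetic vertical edge at `x`, spanning `top..bottom`.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct VEdge {
    pub x: f64,
    pub top: f64,
    pub bottom: f64,
}

/// Emit one vertical edge per group of at least `min_words` words whose
/// left edges lie within `tol` of the group's smallest `x0`.
pub fn left_aligned_edges(words: &[Word], tol: f64, min_words: usize) -> Vec<VEdge> {
    let mut sorted = words.to_vec();
    sorted.sort_by(|a, b| a.x0.total_cmp(&b.x0));
    let mut edges = Vec::new();
    let mut i = 0;
    while i < sorted.len() {
        // Extend the group while words stay within `tol` of its first word.
        let mut j = i + 1;
        while j < sorted.len() && sorted[j].x0 - sorted[i].x0 <= tol {
            j += 1;
        }
        let group = &sorted[i..j];
        if group.len() >= min_words {
            let top = group.iter().map(|w| w.top).fold(f64::INFINITY, f64::min);
            let bottom = group.iter().map(|w| w.bottom).fold(f64::NEG_INFINITY, f64::max);
            edges.push(VEdge { x: group[0].x0, top, bottom });
        }
        i = j;
    }
    edges
}
```

A lattice-style approach would instead start from ruling lines drawn on the page, and an explicit approach from coordinates supplied by the caller; only the origin of the edges differs in this simplified picture.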

Functions

cells_to_tables
Group adjacent cells into distinct tables.
edges_to_intersections
Find all intersection points between horizontal and vertical edges.
explicit_lines_to_edges
Convert user-provided explicit line coordinates into edges.
extract_text_for_cells
Extract text content for each cell by finding characters within the cell bbox.
intersections_to_cells
Construct rectangular cells from a grid of intersection points.
join_edge_group
Merge overlapping or adjacent collinear edge segments.
snap_edges
Snap nearby parallel edges to aligned positions.
words_to_edges_stream
Generate synthetic edges from text alignment patterns for the Stream strategy.
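
Grouping adjacent cells into distinct tables can be pictured as a connected-components problem. The sketch below is an assumption about one workable approach, not this module's implementation: `BBox` and `group_cells` are hypothetical names, and two cells are treated as adjacent when they share at least one corner.

```rust
/// Hypothetical cell bounding box: (x0, top, x1, bottom).
pub type BBox = (f64, f64, f64, f64);

/// The four corner points of a cell.
fn corners(c: &BBox) -> [(f64, f64); 4] {
    [(c.0, c.1), (c.2, c.1), (c.0, c.3), (c.2, c.3)]
}

/// Group cells into tables with a small union-find: cells that share
/// a corner (directly or through a chain of neighbours) end up together.
pub fn group_cells(cells: &[BBox]) -> Vec<Vec<BBox>> {
    // Find the representative of `i`, halving paths along the way.
    fn find(parent: &mut [usize], mut i: usize) -> usize {
        while parent[i] != i {
            parent[i] = parent[parent[i]];
            i = parent[i];
        }
        i
    }

    let mut parent: Vec<usize> = (0..cells.len()).collect();
    for i in 0..cells.len() {
        for j in (i + 1)..cells.len() {
            let shares = corners(&cells[i])
                .iter()
                .any(|p| corners(&cells[j]).contains(p));
            if shares {
                let a = find(&mut parent, i);
                let b = find(&mut parent, j);
                if a != b {
                    parent[b] = a;
                }
            }
        }
    }

    // Collect cells per representative, preserving input order.
    let mut tables: Vec<(usize, Vec<BBox>)> = Vec::new();
    for i in 0..cells.len() {
        let root = find(&mut parent, i);
        match tables.iter().position(|(r, _)| *r == root) {
            Some(k) => tables[k].1.push(cells[i]),
            None => tables.push((root, vec![cells[i]])),
        }
    }
    tables.into_iter().map(|(_, group)| group).collect()
}
```

The pairwise comparison is quadratic in the number of cells, which is usually acceptable for a single page; a hash map keyed on corner points would scale better but needs a policy for comparing floating-point coordinates.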