Skip to main content

StructElement

Struct StructElement 

Source
pub struct StructElement {
    pub element_type: String,
    pub mcids: Vec<u32>,
    pub alt_text: Option<String>,
    pub actual_text: Option<String>,
    pub lang: Option<String>,
    pub bbox: Option<BBox>,
    pub children: Vec<StructElement>,
    pub page_index: Option<usize>,
}
Expand description

A node in the PDF structure tree.

Represents a logical structure element from a tagged PDF’s /StructTreeRoot. Each element has a type (e.g., “H1”, “P”, “Table”), optional marked content identifiers (MCIDs) linking it to page content, and optional child elements forming a tree structure.

§Tagged PDF Support

Tagged PDFs (ISO 32000-1, Section 14.8) embed semantic structure that is critical for accessibility and increasingly important for AI/LLM document understanding. The structure tree maps logical elements (headings, paragraphs, tables) to their visual representation on the page via MCID references.

§Example

use pdfplumber_core::StructElement;

let heading = StructElement {
    element_type: "H1".to_string(),
    mcids: vec![0],
    alt_text: None,
    actual_text: Some("Chapter 1".to_string()),
    lang: Some("en".to_string()),
    bbox: None,
    children: vec![],
    page_index: Some(0),
};
assert_eq!(heading.element_type, "H1");
assert_eq!(heading.mcids, vec![0]);

Fields§

§element_type: String

The structure type name (e.g., “Document”, “H1”, “P”, “Table”, “TR”, “TD”, “L”, “LI”, “Span”, “Figure”).

§mcids: Vec<u32>

Marked content identifiers linking this element to page content. Each MCID corresponds to a marked-content sequence in a page’s content stream.

§alt_text: Option<String>

Alternative text for accessibility (from /Alt entry).

§actual_text: Option<String>

Replacement text for the element’s content (from /ActualText entry).

§lang: Option<String>

Language of the element’s content (from /Lang entry, e.g., “en-US”).

§bbox: Option<BBox>

Bounding box of the element, if available.

§children: Vec<StructElement>

Child structure elements forming the tree hierarchy.

§page_index: Option<usize>

Page index (0-based) this element belongs to, if determinable.

Trait Implementations§

Source§

impl Clone for StructElement

Source§

fn clone(&self) -> StructElement

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for StructElement

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl PartialEq for StructElement

Source§

fn eq(&self, other: &StructElement) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl StructuralPartialEq for StructElement

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.