Module extractor


Main media extractor API for halldyll-media

Provides unified extraction across all media types:

  • Images (with srcset, lazy loading)
  • Videos (HTML5 + embedded platforms)
  • Audio (HTML5 + streaming platforms)
  • Documents (PDF, Office)
  • Embedded content (maps, social, widgets)
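For readers unfamiliar with the `srcset` handling mentioned above, here is a minimal sketch of what reading that attribute involves. It is not halldyll-media's implementation: `SrcsetCandidate` and `parse_srcset` are hypothetical names invented for this illustration, and the parser is deliberately simplified (it splits on commas, so it would mis-handle URLs that themselves contain commas).

```rust
/// One candidate from a `srcset` attribute.
/// Hypothetical type for this sketch, not part of the crate.
#[derive(Debug, PartialEq)]
pub struct SrcsetCandidate {
    pub url: String,
    /// Descriptor such as "2x" or "480w"; `None` when the entry has none.
    pub descriptor: Option<String>,
}

/// Simplified `srcset` parser (assumption: no commas inside URLs).
/// Splits the attribute on commas, then each entry on whitespace.
pub fn parse_srcset(srcset: &str) -> Vec<SrcsetCandidate> {
    srcset
        .split(',')
        .filter_map(|entry| {
            let mut parts = entry.split_whitespace();
            // Entries that are empty or whitespace-only are skipped.
            let url = parts.next()?;
            Some(SrcsetCandidate {
                url: url.to_string(),
                descriptor: parts.next().map(str::to_string),
            })
        })
        .collect()
}
```

Calling `parse_srcset("small.jpg 480w, large.jpg 2x")` would yield two candidates, one per comma-separated entry, each carrying its URL and descriptor.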

Structs

MediaCounts
Media count summary
MediaExtractor
Main media extractor with configurable options
MediaExtractorBuilder
Builder for MediaExtractor

Functions

count_all_media
Quick media count
extract_media
Quick extraction of all media
get_all_media_urls
Get all media URLs
has_any_media
Quick check for any media
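
The signatures of these items are not shown on this page, so the following is only a conceptual sketch of how a per-type count summary could back a "quick count" and a "has any media" check. `CountsSketch`, its fields, and its methods are all assumptions made for illustration; they are not the actual definitions of `MediaCounts`, `count_all_media`, or `has_any_media`.

```rust
/// Hypothetical per-type tally mirroring the media categories listed above.
/// Field names are assumptions, not the crate's real `MediaCounts` fields.
#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct CountsSketch {
    pub images: usize,
    pub videos: usize,
    pub audio: usize,
    pub documents: usize,
    pub embedded: usize,
}

impl CountsSketch {
    /// Sum of all categories.
    pub fn total(&self) -> usize {
        self.images + self.videos + self.audio + self.documents + self.embedded
    }

    /// True when at least one item of any category was counted.
    pub fn has_any(&self) -> bool {
        self.total() > 0
    }
}
```

With this shape, a "has any media" check reduces to testing whether the total is non-zero. Consult the individual item pages for the real signatures and return types.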