Skip to main content

Module formats

Module formats 

Source

Modules§

csv
excel
html
json_xml
opendocument
pdf
powerpoint
word

Functions§

parse_csv_content
Parse CSV content from string
parse_csv_file
Parse CSV/TSV files and extract numbers
parse_excel_file
Parse Excel files (.xlsx, .xls) and extract numbers
parse_html_content
Parse HTML content from string
parse_html_file
Parse HTML files and extract numbers from text content
parse_html_from_url_response
Parse HTML content from a URL response (for web scraping)
parse_json_content
Parse JSON content from string
parse_json_file
Parse JSON files and extract numbers
parse_opendocument_file
Parse OpenDocument files (.odt, .ods) and extract numbers from content
parse_pdf_file
Parse PDF files and extract numbers from text content
parse_powerpoint_file
Parse PowerPoint files (.pptx, .ppt) and extract numbers from slide content
parse_toml_content
Parse TOML content from string
parse_toml_file
Parse TOML files and extract numbers
parse_word_file
Parse Word files (.docx, .doc) and extract numbers from text content
parse_xml_content
Parse XML content from string
parse_xml_file
Parse XML files and extract numbers
parse_yaml_content
Parse YAML content from string
parse_yaml_file
Parse YAML files and extract numbers