Docs.rs
  • doc_loader-0.3.0
    • doc_loader 0.3.0
    • Docs.rs crate page
    • MIT
    • Links
    • Homepage
    • Documentation
    • Repository
    • crates.io
    • Source
    • Owners
    • WillIsback
    • Dependencies
      • anyhow ^1.0 normal
      • async-std ^1.12 normal
      • chrono ^0.4 normal
      • clap ^4.4 normal
      • csv ^1.3 normal
      • dialoguer ^0.11 normal
      • docx-rs ^0.4 normal
      • encoding_rs ^0.8 normal
      • env_logger ^0.10 normal
      • indicatif ^0.17 normal
      • log ^0.4 normal
      • lopdf ^0.32 normal
      • pyo3 ^0.22 normal optional
      • regex ^1.10 normal
      • serde ^1.0 normal
      • serde_json ^1.0 normal
      • thiserror ^1.0 normal
      • tokio ^1.0 normal optional
      • unicode-segmentation ^1.10 normal
      • tempfile ^3.8 dev
    • Versions
    • 51.64% of the crate is documented
  • Go to latest version
  • Platform
    • i686-pc-windows-msvc
    • i686-unknown-linux-gnu
    • x86_64-apple-darwin
    • x86_64-pc-windows-msvc
    • x86_64-unknown-linux-gnu
  • Feature flags
  • docs.rs
    • About docs.rs
    • Badges
    • Builds
    • Metadata
    • Shorthand URLs
    • Download
    • Rustdoc JSON
    • Build queue
    • Privacy policy
  • Rust
    • Rust website
    • The Book
    • Standard Library API Reference
    • Rust by Example
    • The Cargo Guide
    • Clippy Documentation

doc_loader0.3.0

Crate Items

  • Structs
  • Enums
  • Traits
  • Functions
  • Type Aliases

List of all items

Structs

  • core::ChunkMetadata
  • core::ChunkPosition
  • core::DocumentChunk
  • core::DocumentMetadata
  • core::ProcessingInfo
  • core::ProcessingParams
  • core::UniversalOutput
  • processors::UniversalProcessor
  • processors::csv::CsvProcessor
  • processors::docx::DocxMetadata
  • processors::docx::DocxProcessor
  • processors::json::JsonProcessor
  • processors::pdf::PdfProcessor
  • processors::txt::TxtProcessor
  • utils::TextMetadata

Enums

  • core::DocumentType
  • error::DocLoaderError

Traits

  • processors::DocumentProcessor

Functions

  • utils::chunk_text
  • utils::clean_text
  • utils::count_words
  • utils::detect_language
  • utils::estimate_tokens
  • utils::extract_text_metadata
  • utils::normalize_line_breaks
  • utils::remove_empty_lines

Type Aliases

  • error::Result