pub struct ScraperConfig {Show 15 fields
pub registry_url: Option<String>,
pub auth_token: Option<String>,
pub namespace: Option<String>,
pub chunking_strategy: ChunkingStrategy,
pub max_chunk_size: usize,
pub include_overlap: bool,
pub overlap_size: usize,
pub language_options: HashMap<String, LanguageConfig>,
pub exclude_patterns: Vec<String>,
pub create_hierarchy: bool,
pub extract_api_surface: bool,
pub detect_licenses: bool,
pub request_timeout: u64,
pub rate_limit: f64,
pub cache_dir: Option<PathBuf>,
}Expand description
Configuration for the scraper
Fields§
§registry_url: Option<String>Registry URL for publishing chunks
auth_token: Option<String>Authentication token for registry
namespace: Option<String>Namespace for published chunks
chunking_strategy: ChunkingStrategyChunking strategy to use
max_chunk_size: usizeMaximum chunk size in bytes
include_overlap: boolInclude overlapping context between chunks
overlap_size: usizeOverlap size in characters
language_options: HashMap<String, LanguageConfig>Language-specific parsing options
exclude_patterns: Vec<String>Patterns to exclude from scraping
create_hierarchy: boolWhether to create hierarchical chunk relationships
extract_api_surface: boolAuto-extract API surfaces and functions
detect_licenses: boolWhether to auto-detect licenses
request_timeout: u64HTTP request timeout in seconds
rate_limit: f64Rate limit: requests per second
cache_dir: Option<PathBuf>Local cache directory
Trait Implementations§
Source§impl Clone for ScraperConfig
impl Clone for ScraperConfig
Source§fn clone(&self) -> ScraperConfig
fn clone(&self) -> ScraperConfig
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for ScraperConfig
impl Debug for ScraperConfig
Source§impl Default for ScraperConfig
impl Default for ScraperConfig
Source§impl<'de> Deserialize<'de> for ScraperConfig
impl<'de> Deserialize<'de> for ScraperConfig
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Auto Trait Implementations§
impl Freeze for ScraperConfig
impl RefUnwindSafe for ScraperConfig
impl Send for ScraperConfig
impl Sync for ScraperConfig
impl Unpin for ScraperConfig
impl UnwindSafe for ScraperConfig
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more