pub struct GenerateOrchestrator<S: Scraper> { /* private fields */ }Expand description
Orchestrates parallel scraping with adaptive concurrency.
Uses a semaphore to limit concurrent scrape operations and supports progress reporting through callbacks. Can adaptively reduce concurrency on rate limit errors (429 responses).
§Example
ⓘ
use blz_cli::generate::{GenerateOrchestrator, UrlWithLastmod};
use std::sync::Arc;
// With mock scraper for testing
let scraper = MockScraper::new();
let orchestrator = GenerateOrchestrator::new(scraper, 5)
.with_progress(|completed, total| {
println!("Progress: {}/{}", completed, total);
});
let urls = vec![
UrlWithLastmod::new("https://example.com/a".to_string()),
UrlWithLastmod::new("https://example.com/b".to_string()),
];
let results = orchestrator.scrape_all(&urls).await;Implementations§
Source§impl<S: Scraper> GenerateOrchestrator<S>
impl<S: Scraper> GenerateOrchestrator<S>
Sourcepub fn new(scraper: S, concurrency: usize) -> Self
pub fn new(scraper: S, concurrency: usize) -> Self
Create with specified concurrency.
§Arguments
scraper- The scraping backend to useconcurrency- Maximum concurrent scrape operations (clamped to 1-50)
Sourcepub fn with_default_concurrency(scraper: S) -> Self
pub fn with_default_concurrency(scraper: S) -> Self
Create with default concurrency (5).
Sourcepub fn with_progress<F>(self, callback: F) -> Self
pub fn with_progress<F>(self, callback: F) -> Self
Set progress callback.
The callback receives (completed, total) after each URL is processed.
Sourcepub async fn scrape_all(&self, urls: &[UrlWithLastmod]) -> ScrapeResults
pub async fn scrape_all(&self, urls: &[UrlWithLastmod]) -> ScrapeResults
Scrape all URLs in parallel.
Uses a semaphore to limit concurrent operations to the configured concurrency level. Reports progress through the callback if set.
§Returns
Aggregated results with successful pages and failed pages.
Sourcepub const fn concurrency(&self) -> usize
pub const fn concurrency(&self) -> usize
Get current concurrency level.
Sourcepub const fn min_concurrency(&self) -> usize
pub const fn min_concurrency(&self) -> usize
Get minimum concurrency level.
Auto Trait Implementations§
impl<S> Freeze for GenerateOrchestrator<S>where
S: Freeze,
impl<S> !RefUnwindSafe for GenerateOrchestrator<S>
impl<S> Send for GenerateOrchestrator<S>
impl<S> Sync for GenerateOrchestrator<S>
impl<S> Unpin for GenerateOrchestrator<S>where
S: Unpin,
impl<S> !UnwindSafe for GenerateOrchestrator<S>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> Downcast for Twhere
T: Any,
impl<T> Downcast for Twhere
T: Any,
Source§fn into_any(self: Box<T>) -> Box<dyn Any>
fn into_any(self: Box<T>) -> Box<dyn Any>
Convert
Box<dyn Trait> (where Trait: Downcast) to Box<dyn Any>. Box<dyn Any> can
then be further downcast into Box<ConcreteType> where ConcreteType implements Trait.Source§fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>
fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>
Convert
Rc<Trait> (where Trait: Downcast) to Rc<Any>. Rc<Any> can then be
further downcast into Rc<ConcreteType> where ConcreteType implements Trait.Source§fn as_any(&self) -> &(dyn Any + 'static)
fn as_any(&self) -> &(dyn Any + 'static)
Convert
&Trait (where Trait: Downcast) to &Any. This is needed since Rust cannot
generate &Any’s vtable from &Trait’s.Source§fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)
fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)
Convert
&mut Trait (where Trait: Downcast) to &Any. This is needed since Rust cannot
generate &mut Any’s vtable from &mut Trait’s.Source§impl<T> DowncastSync for T
impl<T> DowncastSync for T
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more