pub struct CheckpointData {
    pub request_urls: Vec<String>,
    pub seen_fingerprints: Vec<Vec<u8>>,
}
Serializable snapshot of crawler state for pause/resume support.
This struct is serialized to JSON and written to the crawl directory. On resume, the engine reads it back, re-enqueues the pending URLs, and repopulates the “seen” fingerprint set so that already-fetched pages are not fetched again.
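The resume step described above can be sketched with the standard library alone. This is a minimal illustration, not the engine's actual code: the CheckpointData below is a stand-in with the same two public fields (serde derives omitted so the snippet has no dependencies), and SchedulerState, restore, and already_seen are hypothetical names introduced only for this example.

```rust
use std::collections::{HashSet, VecDeque};

/// Stand-in with the same public fields as `CheckpointData`
/// (serde derives omitted so the sketch needs no external crates).
#[derive(Clone, Debug, Default)]
pub struct CheckpointData {
    pub request_urls: Vec<String>,
    pub seen_fingerprints: Vec<Vec<u8>>,
}

/// Hypothetical scheduler state; the real scheduler type is not shown on this page.
#[derive(Debug, Default)]
pub struct SchedulerState {
    /// Pending URLs, in the order they will be fetched.
    pub queue: VecDeque<String>,
    /// Fingerprints of requests that must not be fetched again.
    pub seen: HashSet<Vec<u8>>,
}

/// Rebuilds scheduler state from a checkpoint: pending URLs are re-enqueued
/// in their stored order and the fingerprints repopulate the "seen" set.
pub fn restore(checkpoint: CheckpointData) -> SchedulerState {
    SchedulerState {
        queue: checkpoint.request_urls.into_iter().collect(),
        seen: checkpoint.seen_fingerprints.into_iter().collect(),
    }
}

/// Returns true if a request with this fingerprint was already seen
/// and should therefore be skipped.
pub fn already_seen(state: &SchedulerState, fingerprint: &[u8]) -> bool {
    state.seen.contains(fingerprint)
}
```

Collecting the fingerprints into a HashSet makes the "already fetched?" check a constant-time lookup on average, and duplicate entries in the stored list collapse to a single entry.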
Fields§
request_urls: Vec<String>
URLs of requests still pending in the scheduler’s queue when the checkpoint was taken. These are re-enqueued on resume.
seen_fingerprints: Vec<Vec<u8>>
SHA-1 fingerprints of requests that have already been seen. Restoring these into the scheduler prevents re-fetching pages that were completed before the pause.
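Going the other way, a checkpoint could be filled from live scheduler state roughly as follows. This is a sketch under assumptions: the CheckpointData here is a dependency-free stand-in with the same fields, snapshot is a hypothetical helper, and the VecDeque and HashSet arguments are assumed representations of the queue and "seen" set, which this page does not specify.

```rust
use std::collections::{HashSet, VecDeque};

/// Stand-in with the same public fields as `CheckpointData`
/// (serde derives omitted so the sketch needs no external crates).
#[derive(Clone, Debug, Default, PartialEq)]
pub struct CheckpointData {
    pub request_urls: Vec<String>,
    pub seen_fingerprints: Vec<Vec<u8>>,
}

/// Hypothetical helper: copies the pending queue and the "seen" set
/// into a checkpoint without modifying either.
pub fn snapshot(queue: &VecDeque<String>, seen: &HashSet<Vec<u8>>) -> CheckpointData {
    let mut seen_fingerprints: Vec<Vec<u8>> = seen.iter().cloned().collect();
    // HashSet iteration order is unspecified; sorting keeps repeated
    // snapshots of the same state identical.
    seen_fingerprints.sort();
    CheckpointData {
        // Queue order is preserved so that resume continues in the same order.
        request_urls: queue.iter().cloned().collect(),
        seen_fingerprints,
    }
}
```

Sorting is a design choice of this sketch, not a requirement of the format: it only makes successive checkpoint files easier to compare.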
Trait Implementations§
impl Clone for CheckpointData
fn clone(&self) -> CheckpointData
Returns a duplicate of the value.
1.0.0 · fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for CheckpointData
impl Default for CheckpointData
fn default() -> CheckpointData
Returns the “default value” for a type.
impl<'de> Deserialize<'de> for CheckpointData
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
Auto Trait Implementations§
impl Freeze for CheckpointData
impl RefUnwindSafe for CheckpointData
impl Send for CheckpointData
impl Sync for CheckpointData
impl Unpin for CheckpointData
impl UnsafeUnpin for CheckpointData
impl UnwindSafe for CheckpointData
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.