Collector

Struct Collector 

Source
pub struct Collector { /* private fields */ }
Expand description

Collector keeps the state of link collection It drives the link extraction from inputs

Implementations§

Source§

impl Collector

Source

pub fn new(root_dir: Option<PathBuf>, base: Option<Base>) -> Result<Self>

Create a new collector with an empty cache

§Errors

Returns an Err if the root_dir is not an absolute path or if the reqwest Client fails to build

Source

pub const fn skip_missing_inputs(self, yes: bool) -> Self

Skip missing input files (default is to error if they don’t exist)

Source

pub const fn skip_hidden(self, yes: bool) -> Self

Skip files that are hidden

Source

pub const fn skip_ignored(self, yes: bool) -> Self

Skip files that are ignored

Source

pub fn headers(self, headers: HeaderMap) -> Self

Set headers to use when resolving input URLs

Source

pub fn client(self, client: Client) -> Self

Set client to use for checking input URLs

Source

pub const fn use_html5ever(self, yes: bool) -> Self

Use html5ever to parse HTML instead of html5gum.

Source

pub const fn include_verbatim(self, yes: bool) -> Self

Skip over links in verbatim sections (like Markdown code blocks)

Check WikiLinks in Markdown files

Source

pub fn basic_auth_extractor(self, extractor: BasicAuthExtractor) -> Self

Pass a BasicAuthExtractor which is capable to match found URIs to basic auth credentials. These credentials get passed to the request in question.

Source

pub fn excluded_paths(self, excluded_paths: PathExcludes) -> Self

Configure which paths to exclude

Source

pub fn collect_sources( self, inputs: HashSet<Input>, ) -> impl Stream<Item = Result<String>>

Collect all sources from a list of Inputs. For further details, see also Input::get_sources.

Source

pub fn collect_sources_with_file_types( self, inputs: HashSet<Input>, file_extensions: FileExtensions, ) -> impl Stream<Item = Result<String>> + 'static

Collect all sources from a list of Inputs with specific file extensions.

Convenience method to fetch all unique links from inputs with the default extensions.

Fetch all unique links from inputs All relative URLs get prefixed with base (if given). (This can be a directory or a base URL)

§Errors

Will return Err if links cannot be extracted from an input

Trait Implementations§

Source§

impl Clone for Collector

Source§

fn clone(&self) -> Collector

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for Collector

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Default for Collector

Source§

fn default() -> Self

§Panics

We call Client::new() which can panic in certain scenarios. Use Collector::new() to handle ClientBuilder errors gracefully.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> PolicyExt for T
where T: ?Sized,

Source§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow. Read more
Source§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

impl<T> ErasedDestructor for T
where T: 'static,