
robots.txt parser for Rust

This crate implements the Robots Exclusion Protocol (robots.txt) as specified in https://www.robotstxt.org/norobots-rfc.txt

Installation

Add it to your Cargo.toml:

[dependencies]
robotparser = "0.11"

Examples

use robotparser::http::RobotsTxtClient;
use robotparser::service::RobotsTxtService;
use reqwest::blocking::Client;
use url::Url;

// Fetch and parse the robots.txt file for the site's origin.
let client = Client::new();
let robots_txt_url = Url::parse("https://www.python.org/robots.txt").unwrap();
let robots_txt = client.fetch_robots_txt(robots_txt_url.origin()).unwrap().get_result();

// Check whether any user agent ("*") is allowed to fetch the given URL.
let fetch_url = Url::parse("https://www.python.org/robots.txt").unwrap();
assert!(robots_txt.can_fetch("*", &fetch_url));
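
To illustrate the kind of decision a can_fetch check makes, here is a minimal, standard-library-only sketch of the first-match rule evaluation described in the norobots draft. It is not this crate's implementation: the Group struct and the parse_groups and can_fetch functions below are hypothetical names used for illustration only, matching is plain prefix matching on the path, and details such as percent-encoding are ignored.

```rust
/// One record of a robots.txt file: the agents it names and its rules.
/// (Hypothetical type for this sketch, not part of the robotparser API.)
struct Group {
    agents: Vec<String>,
    rules: Vec<(bool, String)>, // (is_allow, path prefix)
}

/// Splits the file into groups of `User-agent` lines followed by rules.
fn parse_groups(text: &str) -> Vec<Group> {
    let mut groups: Vec<Group> = Vec::new();
    let mut reading_agents = false;
    for raw in text.lines() {
        // Strip comments and surrounding whitespace, skip lines without a field.
        let line = raw.split('#').next().unwrap_or("").trim();
        let Some((key, value)) = line.split_once(':') else { continue };
        let key = key.trim().to_ascii_lowercase();
        let value = value.trim();
        match key.as_str() {
            "user-agent" => {
                // Consecutive User-agent lines belong to the same group.
                if !reading_agents {
                    groups.push(Group { agents: Vec::new(), rules: Vec::new() });
                    reading_agents = true;
                }
                if let Some(group) = groups.last_mut() {
                    group.agents.push(value.to_ascii_lowercase());
                }
            }
            "allow" | "disallow" => {
                reading_agents = false;
                if let Some(group) = groups.last_mut() {
                    group.rules.push((key == "allow", value.to_string()));
                }
            }
            _ => {} // Other fields are ignored in this sketch.
        }
    }
    groups
}

/// Returns true if `agent` may fetch `path` according to `text`.
fn can_fetch(text: &str, agent: &str, path: &str) -> bool {
    let groups = parse_groups(text);
    let agent = agent.to_ascii_lowercase();
    // Prefer the first group naming this agent, else the first "*" group.
    let group = groups
        .iter()
        .find(|g| {
            g.agents
                .iter()
                .any(|a| a.as_str() != "*" && !a.is_empty() && agent.contains(a.as_str()))
        })
        .or_else(|| groups.iter().find(|g| g.agents.iter().any(|a| a.as_str() == "*")));
    let Some(group) = group else { return true };
    // Rules are checked in file order; the first matching prefix decides.
    for (allow, prefix) in &group.rules {
        if !prefix.is_empty() && path.starts_with(prefix.as_str()) {
            return *allow;
        }
    }
    true // No rule matched: fetching is allowed by default.
}

fn main() {
    let robots = "\
User-agent: *
Disallow: /private/
Allow: /

User-agent: BadBot
Disallow: /
";
    assert!(can_fetch(robots, "MyCrawler", "/index.html"));
    assert!(!can_fetch(robots, "MyCrawler", "/private/data.html"));
    assert!(!can_fetch(robots, "BadBot/1.0", "/index.html"));
}
```

The sketch works on a path string rather than a full URL to stay dependency-free; the crate's own can_fetch takes a Url, as in the example above.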

Modules

Request builders and response parsers for other HTTP libraries.

Contains models of the robots.txt file.

Contains robots.txt parsers.

Contains robots.txt services.