Expand description
robots.txt parser for Rust
The robots.txt Exclusion Protocol is implemented as specified in https://www.robotstxt.org/norobots-rfc.txt
§Installation
Add it to your Cargo.toml
:
[dependencies]
robotparser = "0.11"
§Examples
use robotparser::http::RobotsTxtClient;
use robotparser::service::RobotsTxtService;
use reqwest::blocking::Client;
use url::Url;
let client = Client::new();
let robots_txt_url = Url::parse("https://www.python.org/robots.txt").unwrap();
let robots_txt = client.fetch_robots_txt(robots_txt_url.origin()).unwrap().get_result();
let fetch_url = Url::parse("https://www.python.org/robots.txt").unwrap();
assert!(robots_txt.can_fetch("*", &fetch_url));
Modules§
- Request builder & response parsers for other http libraries.
- Contains models of robots.txt file.
- Contains robots.txt parsers.
- Contains robots.txt services.