robotxt-0.1.0 has been yanked.
xwde: robotxt
The implementation of the robots.txt protocol (or URL exclusion protocol) in
Rust programming language with the support of crawl-delay, sitemap and
universal * match extensions (according to the RFC specification).
Examples
- parse the
user-agentin the providedrobots.txtfile:
use Robots;
- build the new
robots.txtfile from provided directives:
Note : the builder is not yet implemented.
Links
- Request for Comments: 9309 on RFC-Editor.com
- Introduction to Robots.txt on Google.com
- How Google interprets Robots.txt on Google.com
- What is Robots.txt file on Moz.com
Notes
The parser is based on Smerity/texting_robots.