urlharvest 0.5.0

IRC URL harvest for irssi, leech document title + metadata, generate html pages, implement search
docs.rs failed to build urlharvest-0.5.0
Please check the build logs for more information.
See Builds for ideas on how to fix a failed build, or Metadata for how to configure docs.rs builds.
If you believe this is docs.rs' fault, open an issue.

urlharvest-rs

  • Harvest URLs from irssi IRC client logs, insert into SQLite db
  • Fetch metadata, e.g. title and update db.
  • Generate html pages.
  • Implement a search page

Please note: this harvester is tailing your IRC client (irssi) logs on disk and does not need or include an ircbot of any kind. Consider running your irssi on a cloud vm to stay "always connected" :-)

Any other kind of chat log source would be trivial to implement. Basically, the chat text is just scanned with regex match and detected URLs are saved & indexed.