Expand description
This crate provides a library for extracting the publication date from an article or a blog post. It was heavily influenced by both the original article-date-extractor written in Python, as well as its Haskell port.
§Usage
This crate is on crates.io and can be
used by adding article_date_extractor
to your dependencies in your project’s Cargo.toml
.
[dependencies]
article-date-extractor = "0.1.0"
and this to your crate root:
extern crate article_date_extractor;
§Example: extracting a date from a news article
extract_article_published_date
expects both a link as well as an html body. In the following example the body is provided as part of the library’s test fixtures.
extern crate article_date_extractor;
extern crate chrono;
fn main() {
use article_date_extractor::extract_date::extract_article_published_date;
use chrono::NaiveDate;
use std::env;
let link = "http://edition.cnn.com/2015/11/28/opinions/sutter-cop21-paris-preview-two-degrees/index.html";
let body = include_str!("./tests/fixtures/cnn.html");
assert_eq!(NaiveDate::from_ymd(2015, 11, 28), extract_article_published_date(&link, body).unwrap());
}