Skip to main content

Crate use_warc

Crate use_warc 

Source
Expand description

§use-warc

WARC and ARC web archive labels, record kinds, and extension helpers.

This crate is part of the use-archive facade workspace. It exposes web archive primitive labels only. It does not parse WARC records, read ARC files, fetch web content, or extract payloads.

§Example

use use_warc::{WarcFormat, WarcRecordKind, is_warc_filename};

assert!(is_warc_filename("crawl.warc.gz"));
assert_eq!(WarcFormat::Warc.as_str(), "warc");
assert_eq!(WarcRecordKind::Response.as_str(), "response");

WARC and ARC web archive labels for RustUse.

Enums§

WarcFormat
Web archive format labels.
WarcRecordKind
WARC record kind labels.

Constants§

ARC_EXTENSION
Common ARC extension.
ARC_GZIP_EXTENSION
Common gzip-compressed ARC extension.
WARC_EXTENSION
Common WARC extension.
WARC_EXTENSIONS
Common WARC/ARC-related extensions.
WARC_GZIP_EXTENSION
Common gzip-compressed WARC extension.

Functions§

is_warc_extension
Returns whether extension is a known WARC/ARC extension label.
is_warc_filename
Returns whether name has a known WARC/ARC filename encoding.