Crate serbzip_core
source · [−]Expand description
A lightweight framework for transcoding text from one lexical form to another.
The apparatus that performs this trancoding is called a codec. It is described by the
codecs::Codec
trait. A prominent example of such a codec is Balkanoid —
a quasi-lossless Balkanoidal meta-lingual compressor.
This crate provides mechanisms for creating codecs, as well as one or more useful codec implementations.
Examples
Compression and expansion of text using the codecs::balkanoid::Balkanoid
codec:
use std::fs::File;
use std::io;
use std::io::BufReader;
use serbzip_core::codecs::balkanoid::{Balkanoid, Dict};
use serbzip_core::codecs::Codec;
// this codec needs a dictionary to work from
let mut dict_reader = BufReader::new(File::open("../dict.blk")?);
let dict = Dict::read_from_binary_image(&mut dict_reader)?;
let codec = Balkanoid::new(&dict);
// compress a line and check the output
let input_line = "Ah, distinctly I remember it was in the bleak December";
let compressed_line = codec.compress_line(input_line);
assert_eq!(compressed_line, "H, dstnctly I rmmbr t ws n th blk Dcmbr");
// expand the line; check that it matches the original
let expanded_line = codec.expand_line(&compressed_line)?;
assert_eq!(input_line, expanded_line);
// codecs also have helper methods for parsing I/O streams
let mut input_reader = BufReader::new(File::open("../test_data/antigonish.txt")?);
let mut output_writer = io::Cursor::new(Vec::new());
codec.compress(&mut input_reader, &mut output_writer)?;
let compressed_document = String::from_utf8(output_writer.into_inner())?;
assert_ne!("", compressed_document);
Modules
Machinery common to all codec implementations.
Various types and helpers that reduce boilerplate code.
Houses reusable machinery for transcoding streams, line by line.