Struct constriction::backends::Cursor


pub struct Cursor<Word, Buf> { /* private fields */ }


Adapter that turns an in-memory buffer into an impl ReadWords and/or an impl WriteWords.

A Cursor<Word, Buf> allows you to use an in-memory buffer Buf of a slice of Words as a source and/or sink of compressed data in an entropy coder. The type Buf must implement AsRef<[Word]> to be used as a data source (i.e., an implementation of ReadWords) and it must implement AsMut<[Word]> to be used as a data sink (i.e., an implementation of WriteWords). In the most typical use cases, Buf is either a Vec<Word> (if the entropy coder should own the compressed data) or a reference to a slice of Words, i.e., &[Word] (if the entropy coder should only have shared access to the compressed data, e.g., because you want to keep the compressed data alive even after the entropy coder gets dropped).
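The two trait bounds can be illustrated without the crate itself. The following is a minimal sketch that uses only the standard library; `peek_first`, `overwrite_first`, and `demo_buffers` are hypothetical names chosen for this illustration and are not part of constriction. It shows why both a Vec<Word> and a &[Word] qualify as a data source, while only a buffer with mutable access qualifies as a data sink.

```rust
/// Hypothetical helper: reads from any buffer that can be viewed as a slice of words.
fn peek_first<Word: Clone, Buf: AsRef<[Word]>>(buf: &Buf) -> Option<Word> {
    buf.as_ref().first().cloned()
}

/// Hypothetical helper: writes into any buffer that can be viewed as a mutable slice.
/// Returns `false` if the buffer is empty.
fn overwrite_first<Word, Buf: AsMut<[Word]>>(buf: &mut Buf, word: Word) -> bool {
    match buf.as_mut().first_mut() {
        Some(slot) => {
            *slot = word;
            true
        }
        None => false,
    }
}

fn demo_buffers() -> (Option<u32>, Option<u32>, Vec<u32>) {
    let mut owned: Vec<u32> = vec![1, 2, 3];

    // An owned `Vec<u32>` can act as a source ...
    let from_owned = peek_first::<u32, _>(&owned);

    // ... and so can a shared slice `&[u32]`.
    let shared: &[u32] = &owned[..];
    let from_shared = peek_first::<u32, _>(&shared);

    // Only the `Vec` can act as a sink here: `&[u32]` does not implement
    // `AsMut<[u32]>`, so `overwrite_first(&mut shared, 9)` would not compile.
    let _ = overwrite_first::<u32, _>(&mut owned, 9);

    (from_owned, from_shared, owned)
}
```

The same reasoning explains why a Cursor over a shared slice can back a decoder but not an encoder.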

A Cursor<Word, Buf> implements ReadWords for both Queue and Stack semantics. By convention, reading with Queue semantics increments the Cursor’s index into the slice returned by .as_ref() whereas reading with Stack semantics decrements the index. Whether Queue or Stack semantics will be used is usually decided by the implementation of the entropy coder that uses the Cursor as its backend. If you want to read in the direction opposite to the convention for your use case (e.g., because you’ve already manually reversed the order of the Words in the buffer), then wrap the Cursor in a Reverse. The implementation of WriteWords<Word> (if Buf implements AsMut<[Word]>) always writes in the same direction in which ReadWords<Word, Queue> reads.
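The index convention described above can be sketched with a toy type. This is not constriction's implementation: `ToyCursor`, `read_queue`, `read_stack`, and `demo_semantics` are hypothetical names, and the sketch assumes only what the paragraph states (Queue reads increment the index, Stack reads decrement it).

```rust
/// Toy stand-in for a cursor: a buffer plus an index into it.
/// (Hypothetical type for illustration; not constriction's implementation.)
struct ToyCursor<'a> {
    buf: &'a [u32],
    pos: usize,
}

impl<'a> ToyCursor<'a> {
    /// Queue semantics: return the word at `pos`, then move the index forward.
    fn read_queue(&mut self) -> Option<u32> {
        let word = self.buf.get(self.pos).copied()?;
        self.pos += 1;
        Some(word)
    }

    /// Stack semantics: move the index backward, then return the word it now points at.
    fn read_stack(&mut self) -> Option<u32> {
        self.pos = self.pos.checked_sub(1)?;
        Some(self.buf[self.pos])
    }
}

fn demo_semantics() -> (Vec<u32>, Vec<u32>) {
    let data = [10, 20, 30];

    // A queue reader starts at the beginning of the buffer ...
    let mut queue = ToyCursor { buf: &data, pos: 0 };
    let forward: Vec<u32> = std::iter::from_fn(|| queue.read_queue()).collect();

    // ... whereas a stack reader starts at the end (the top of the stack).
    let mut stack = ToyCursor { buf: &data, pos: data.len() };
    let backward: Vec<u32> = std::iter::from_fn(|| stack.read_stack()).collect();

    (forward, backward)
}
```

Here `demo_semantics()` yields the words in buffer order for the queue reader and in reverse order for the stack reader, which is why a stack-based coder positions its cursor at the end of the compressed data.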

Examples

The following example shows how a Cursor can be used to decode both shared and owned compressed data with a RangeDecoder:

use constriction::{
    stream::{
        model::DefaultLeakyQuantizer, queue::{DefaultRangeEncoder, DefaultRangeDecoder},
        Encode, Decode
    },
    UnwrapInfallible,
};

// Some simple entropy model, just for demonstration purpose.
let quantizer = DefaultLeakyQuantizer::new(-100..=100);
let model = quantizer.quantize(probability::distribution::Gaussian::new(25.0, 10.0));

// Encode the symbols `0..100` using a `RangeEncoder` (uses the default `Vec` backend because
// we don't know the size of the compressed data upfront).
let mut encoder = DefaultRangeEncoder::new();
encoder.encode_iid_symbols(0..100, &model).unwrap();
let compressed = encoder.into_compressed().unwrap_infallible(); // `compressed` is a `Vec<u32>`.
dbg!(compressed.len()); // Prints "compressed.len() = 40".

// Create a `RangeDecoder` with shared access to the compressed data. This constructs a
// `Cursor<u32, &[u32]>` that points to the beginning of the data and loads it in the decoder.
let mut sharing_decoder
    = DefaultRangeDecoder::from_compressed(&compressed[..]).unwrap_infallible();
// `sharing_decoder` has type `RangeDecoder<u32, u64, Cursor<u32, &'a [u32]>>`.

// Decode the data and verify correctness.
assert!(sharing_decoder.decode_iid_symbols(100, &model).map(Result::unwrap).eq(0..100));
assert!(sharing_decoder.maybe_exhausted());

// We can still use `compressed` because we gave the decoder only shared access to it. Thus,
// `sharing_decoder` contains a reference into `compressed`, so we couldn't return it from the
// current function. If we want to return a decoder, we have to give it ownership of the data:
let mut owning_decoder = DefaultRangeDecoder::from_compressed(compressed).unwrap_infallible();
// `owning_decoder` has type `RangeDecoder<u32, u64, Cursor<u32, Vec<u32>>>`.

// Verify that we can decode the data again.
assert!(owning_decoder.decode_iid_symbols(100, &model).map(Result::unwrap).eq(0..100));
assert!(owning_decoder.maybe_exhausted());

`Cursor`s automatically use the correct `Semantics`

You can also use a Cursor as a stack, e.g., for an AnsCoder. The Cursor will automatically read data in the correct (i.e., reverse) direction when it is invoked with Stack semantics. Note that a Cursor is not always necessary when you decode with an AnsCoder because the AnsCoder can also decode directly from a Vec (see last example below). However, you’ll need a Cursor if you don’t own the compressed data:

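use constriction::{
    stream::{model::DefaultLeakyQuantizer, stack::DefaultAnsCoder, Decode},
    UnwrapInfallible,
};

fn decode_shared_data(amt: usize, compressed: &[u32]) -> Vec<i32> {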
    // Some simple entropy model, just for demonstration purpose.
    let quantizer = DefaultLeakyQuantizer::new(-100..=100);
    let model = quantizer.quantize(probability::distribution::Gaussian::new(25.0, 10.0));

    // `AnsCoder::from_compressed_slice` wraps the provided compressed data in a `Cursor` and
    // initializes the cursor position at the end (= top of the stack; see documentation of
    // `Reverse` if you want to read the data from the beginning instead).
    let mut decoder = DefaultAnsCoder::from_compressed_slice(compressed).unwrap();
    decoder.decode_iid_symbols(amt, &model).collect::<Result<Vec<_>, _>>().unwrap_infallible()
}

Owning `Cursor`s vs `Vec`s

If you have ownership of the compressed data, then decoding it with an AnsCoder doesn’t always require a Cursor. An AnsCoder can also directly decode from a Vec<Word> backend. The difference between Vec<Word> and an owning cursor Cursor<Word, Vec<Word>> is that decoding from a Vec consumes the compressed data (so you can interleave multiple encoding/decoding steps arbitrarily) whereas a Cursor (whether it be sharing or owning) does not consume the compressed data that is read from it. You can still interleave multiple encoding/decoding steps with an AnsCoder that uses a Cursor instead of a Vec backend, but since a Cursor doesn’t grow or shrink the wrapped buffer you will typically either run out of buffer space at some point or the final buffer will be padded to its original size with some partially overwritten left-over compressed data (for older readers like myself: think of a Cursor as a cassette recorder).

use constriction::{
    backends::Cursor, stream::{model::DefaultLeakyQuantizer, stack::DefaultAnsCoder, Decode},
    CoderError, UnwrapInfallible,
};

// Some simple entropy model, just for demonstration purpose.
let quantizer = DefaultLeakyQuantizer::new(-100..=100);
let model = quantizer.quantize(probability::distribution::Gaussian::new(25.0, 10.0));

// Encode the symbols `0..50` using a stack entropy coder and get the compressed data.
let mut coder = DefaultAnsCoder::new();
coder.encode_iid_symbols_reverse(0..50, &model).unwrap();
let compressed = coder.into_compressed().unwrap_infallible(); // `compressed` is a `Vec<u32>`.
dbg!(compressed.len()); // Prints "compressed.len() = 11".

// We can either reconstruct (a clone of) the original `coder` with `Vec` backend and decode
// data and/or encode some more data, or even do both in any order.
let mut vec_coder = DefaultAnsCoder::from_compressed(compressed.clone()).unwrap();
// Decode the top half of the symbols off the stack and verify correctness.
assert!(
    vec_coder.decode_iid_symbols(25, &model)
        .map(UnwrapInfallible::unwrap_infallible)
        .eq(0..25)
);
// Then encode some more symbols onto it.
vec_coder.encode_iid_symbols_reverse(50..75, &model).unwrap();
let compressed2 = vec_coder.into_compressed().unwrap_infallible();
dbg!(compressed2.len()); // Prints "compressed2.len() = 17"
// `compressed2` is longer than `compressed` because the symbols we popped off had lower
// information content under the `model` than the symbols we replaced them with.

// In principle, we could have done the same with an `AnsCoder` that uses a `Cursor` backend.
let cursor = Cursor::new_at_write_end(compressed); // Could also use `&mut compressed[..]`.
let mut cursor_coder = DefaultAnsCoder::from_compressed(cursor).unwrap();
// Decode the top half of the symbols off the stack and verify correctness.
assert!(
    cursor_coder.decode_iid_symbols(25, &model)
        .map(UnwrapInfallible::unwrap_infallible)
        .eq(0..25)
);
// Encoding *a few* more symbols works ...
cursor_coder.encode_iid_symbols_reverse(65..75, &model).unwrap();
// ... but at some point we'll run out of buffer space.
assert_eq!(
    cursor_coder.encode_iid_symbols_reverse(50..65, &model),
    Err(CoderError::Backend(constriction::backends::BoundedWriteError::OutOfSpace))
);
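The cassette-recorder picture can also be made concrete without the crate. The following is a minimal sketch under stated assumptions: `ToyCursorStack` and `demo_vec_vs_cursor` are hypothetical names, the error is a plain string rather than the library's BoundedWriteError, and the toy operates on raw words rather than on entropy-coded symbols. It contrasts a Vec, which shrinks and grows as words are popped and pushed, with a cursor-like view that only moves an index over a fixed-size buffer.

```rust
/// Toy fixed-size stack on a borrowed buffer (hypothetical; for illustration only).
/// `pos` marks the top of the stack; the buffer itself never grows or shrinks.
struct ToyCursorStack<'a> {
    buf: &'a mut [u32],
    pos: usize,
}

impl<'a> ToyCursorStack<'a> {
    /// Reads the top word by moving `pos` down; the word stays in the buffer.
    fn pop(&mut self) -> Option<u32> {
        self.pos = self.pos.checked_sub(1)?;
        Some(self.buf[self.pos])
    }

    /// Overwrites the slot at `pos`; fails once the fixed-size buffer is full.
    fn push(&mut self, word: u32) -> Result<(), &'static str> {
        let slot = self.buf.get_mut(self.pos).ok_or("out of space")?;
        *slot = word;
        self.pos += 1;
        Ok(())
    }
}

fn demo_vec_vs_cursor() -> (Vec<u32>, Vec<u32>, Result<(), &'static str>) {
    // `Vec` backend: popping consumes data, pushing grows the buffer as needed.
    let mut vec_stack: Vec<u32> = vec![1, 2, 3];
    let _ = vec_stack.pop();
    vec_stack.extend([7, 8, 9]);

    // Cursor-like backend: popping only moves `pos`, pushing overwrites in place.
    let mut storage: Vec<u32> = vec![1, 2, 3];
    let mut cursor = ToyCursorStack { buf: &mut storage, pos: 3 };
    let _ = cursor.pop();
    cursor.push(7).unwrap();
    let overflow = cursor.push(8); // No room left in the 3-word buffer.

    (vec_stack, storage, overflow)
}
```

In this sketch the Vec ends up longer than it started, whereas the cursor-backed buffer keeps its original length, has its top slot overwritten, and rejects the second push.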


Implementations

impl<Word, Buf> Cursor<Word, Buf>

pub fn new_at_write_beginning(buf: Buf) -> Self

pub fn new_at_write_end(buf: Buf) -> Self
where Buf: AsRef<[Word]>,

pub fn new_at_write_end_mut(buf: Buf) -> Self
where Buf: AsMut<[Word]>,

pub fn new_at_pos(buf: Buf, pos: usize) -> Result<Self, ()>
where Buf: AsRef<[Word]>,

pub fn new_at_pos_mut(buf: Buf, pos: usize) -> Result<Self, ()>
where Buf: AsMut<[Word]>,

pub fn as_view(&self) -> Cursor<Word, &[Word]>
where Buf: AsRef<[Word]>,

pub fn as_mut_view(&mut self) -> Cursor<Word, &mut [Word]>
where Buf: AsMut<[Word]>,

pub fn cloned(&self) -> Cursor<Word, Vec<Word>>
where Word: Clone, Buf: AsRef<[Word]>,

pub fn buf(&self) -> &Buf

pub fn buf_mut(&mut self) -> &mut Buf

pub fn into_buf_and_pos(self) -> (Buf, usize)

pub fn into_reversed(self) -> Reverse<Self>
where Buf: AsMut<[Word]>,

Trait Implementations

impl<Word: Clone, Buf: AsRef<[Word]>> BoundedReadWords<Word, Queue> for Cursor<Word, Buf>

fn remaining(&self) -> usize

fn is_exhausted(&self) -> bool

impl<Word: Clone, Buf: SafeBuf<Word>> BoundedReadWords<Word, Stack> for Cursor<Word, Buf>

fn remaining(&self) -> usize

fn is_exhausted(&self) -> bool

impl<Word, Buf: AsMut<[Word]> + AsRef<[Word]>> BoundedWriteWords<Word> for Cursor<Word, Buf>

fn space_left(&self) -> usize

fn is_full(&self) -> bool

impl<Word: Clone, Buf: Clone> Clone for Cursor<Word, Buf>

fn clone(&self) -> Cursor<Word, Buf>

fn clone_from(&mut self, source: &Self)

impl<Word: Debug, Buf: Debug> Debug for Cursor<Word, Buf>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl<Word, Buf: AsRef<[Word]>> Pos for Cursor<Word, Buf>

fn pos(&self) -> usize

impl<Word, Buf> PosSeek for Cursor<Word, Buf>

type Position = usize

impl<Word: Clone, Buf: AsRef<[Word]>> ReadWords<Word, Queue> for Cursor<Word, Buf>

type ReadError = Infallible

fn read(&mut self) -> Result<Option<Word>, Self::ReadError>

fn maybe_exhausted(&self) -> bool

impl<Word: Clone, Buf: SafeBuf<Word>> ReadWords<Word, Stack> for Cursor<Word, Buf>

type ReadError = Infallible

fn read(&mut self) -> Result<Option<Word>, Self::ReadError>

fn maybe_exhausted(&self) -> bool

impl<Word, Buf: AsRef<[Word]>> Seek for Cursor<Word, Buf>

fn seek(&mut self, pos: usize) -> Result<(), ()>

impl<Word, Buf: AsMut<[Word]>> WriteWords<Word> for Cursor<Word, Buf>

type WriteError = BoundedWriteError

fn write(&mut self, word: Word) -> Result<(), Self::WriteError>

fn extend_from_iter( &mut self, iter: impl Iterator<Item = Word> ) -> Result<(), Self::WriteError>

fn maybe_full(&self) -> bool

Auto Trait Implementations

impl<Word, Buf> RefUnwindSafe for Cursor<Word, Buf>
where Buf: RefUnwindSafe, Word: RefUnwindSafe,

impl<Word, Buf> Send for Cursor<Word, Buf>
where Buf: Send, Word: Send,

impl<Word, Buf> Sync for Cursor<Word, Buf>
where Buf: Sync, Word: Sync,

impl<Word, Buf> Unpin for Cursor<Word, Buf>
where Buf: Unpin, Word: Unpin,

impl<Word, Buf> UnwindSafe for Cursor<Word, Buf>
where Buf: UnwindSafe, Word: UnwindSafe,

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

impl<T> ToOwned for T
where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>
