//! # `corncobs`: Corny COBS encoding/decoding in Rust
//! 
//! This crate provides [Consistent Overhead Byte Stuffing][cobs] (COBS) support
//! for Rust programs, with a particular focus on resource-limited embedded
//! `no_std` targets:
//! 
//! - Provides both fast (buffer-to-buffer) and small (in-place or
//!   iterator-based) versions of both encode and decode routines.
//! 
//! - Provides a `const fn` for computing the maximum encoded size for a given
//!   input size, so you can define fixed-size buffers precisely without magic
//!   numbers.
//! 
//! - Has pretty good test coverage, [Criterion] benchmarks, and a [honggfuzz]
//!   fuzz testing suite to try to ensure code quality.
//! 
//! ## When to use this crate
//! 
//! COBS lets us take an arbitrary blob of bytes and turn it into a slightly
//! longer blob that doesn't contain a certain byte, except as a terminator at
//! the very end. `corncobs` implements the version of this where the byte is
//! zero.  That is, `corncobs` can take a sequence of arbitrary bytes, and turn
//! it into a slightly longer sequence that doesn't contain zero except at the
//! end.
//! 
//! The main reason you'd want to do this is _framing._ If you're transmitting a
//! series of messages over a stream, you need some way to tell where the
//! messages begin and end. There are many ways to do this -- such as by
//! transmitting a length before every message -- but most of them don't support
//! _sync recovery._ Sync recovery lets a receiver tune in anywhere in a stream
//! and figure out (correctly) where the next message boundary is. The easiest
//! way to provide sync recovery is to use a marker at the beginning/end of each
//! message that you can reliably tell apart from the data in the messages. To
//! find message boundaries in an arbitrary data stream, you only need to hunt
//! for the end of the current message and start parsing from there. COBS can do
//! this by ensuring that the message terminator character (0) only appears
//! between messages.
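//!
//! As a rough illustration of sync recovery, the sketch below splits a raw
//! byte stream into zero-terminated frames after joining it mid-message.
//! `frames_after_resync` is a hypothetical helper written for this example; it
//! is not part of `corncobs`, and it assumes the received bytes are already
//! sitting in one slice.
//!
//! ```rust
//! /// Hypothetical helper: yields each complete, zero-terminated frame that
//! /// follows the first terminator in `stream`.
//! fn frames_after_resync<'a>(
//!     stream: &'a [u8],
//! ) -> impl Iterator<Item = &'a [u8]> + 'a {
//!     // Bytes before the first zero may be the tail of a message we joined
//!     // partway through, so skip past that terminator.
//!     let start = stream
//!         .iter()
//!         .position(|&b| b == 0)
//!         .map_or(stream.len(), |i| i + 1);
//!     let rest = &stream[start..];
//!     // Bytes after the last zero belong to a frame that is still arriving.
//!     let end = rest.iter().rposition(|&b| b == 0).map_or(0, |i| i + 1);
//!     // Each remaining chunk ends with its terminating zero.
//!     rest[..end].split_inclusive(|&b| b == 0)
//! }
//! ```
//!
//! Given `[9, 9, 0, 2, 1, 0, 3, 5, 5, 0, 7]`, this yields `[2, 1, 0]` and
//! `[3, 5, 5, 0]`; the leading `9, 9` and the unterminated trailing `7` are
//! skipped. Each yielded frame could then be handed to [`decode_buf`].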
//! 
//! Unlike a lot of framing methods (particularly [SLIP]), COBS guarantees an
//! upper bound to the size of the encoded output: the original length, plus two
//! bytes, plus one byte per 254 input bytes. `corncobs` provides the
//! [`max_encoded_len`] function for sizing buffers to allow for worst-case
//! encoding overhead, at compile time.
//! 
//! `corncobs` can be used in several different ways, each with different costs
//! and benefits.
//! 
//! - Encoding
//!   - [`encode_buf`]: from one slice to another; efficient, but needs separate
//!     input and output buffers, so roughly twice the RAM.
//!   - [`encode_iter`]: incremental, using an iterator; somewhat slower, but
//!     requires no additional memory. (This can be useful in a serial interrupt
//!     handler.)
//! - Decoding
//!   - [`decode_buf`]: from one slice to another; efficient, but needs separate
//!     input and output buffers, so roughly twice the RAM.
//!   - [`decode_in_place`]: in-place in a slice; nearly as efficient, but
//!     overwrites incoming data.
//!
//! ## Design decisions / tradeoffs
//!
//! `corncobs` is optimized for a fast and simple implementation. To get best
//! performance on normal data, it leaves something out: **validation**.
//!
//! Specifically: `corncobs` will decode invalid COBS data that contains zeroes
//! in unexpected places mid-message. It could reject such data by scanning for
//! zeroes. We chose not to do this for performance reasons, and justify it with
//! the following points.
//!
//! First: we don't have to do this to maintain memory safety. Several C
//! implementations of COBS do data validation in an attempt to avoid buffer
//! overruns or out-of-bounds accesses. We're not writing in C and don't have
//! this problem to worry about.
//!
//! Second: it really does improve performance, by about 5x in benchmarks. This
//! is because, by lifting the requirement to inspect every byte hunting for
//! zeroes, we can use `copy_from_slice` to move data around, which calls
//! optimized memory-move routines for the target architecture that are
//! _basically always_ much faster than moving bytes one at a time.
//!
//! Third: COBS does not guarantee integrity. Spurious zeroes in the middle of a
//! message are only one way your input data could be corrupted. Your application
//! needs to handle _all_ possible corruption, which means having an integrity
//! check on the COBS-decoded data, such as a CRC.
//!
//! If you feed `corncobs` random invalid data, it will either return
//! unexpectedly short decoded results (which will fail your next-level
//! integrity check), or it will return an `Err`. It will not crash, corrupt
//! memory, or `panic!`, and we have tests to demonstrate this.
//!
//! ## Cargo `features`
//! 
//! No features are enabled by default. Embedded programmers do not need to
//! specify `default-features = false` when using `corncobs` because who said
//! `std` should be the default anyhow? People with lots of RAM, that's who.
//! 
//! Features:
//! 
//! - `std`: if you're on one of them "big computers" with "infinite memory" and
//!   can afford the inherent nondeterminism of dynamic memory allocation, this
//!   feature enables routines for encoding into and decoding into a `Vec`, and
//!   an `Error` impl for `CobsError`.
//! 
//! ## Tips for using COBS
//! 
//! If you're designing a protocol or message format and considering using COBS,
//! you have some options.
//! 
//! **Optimizing for size:** COBS encoding has the least overhead when the data
//! being encoded contains `0x00` bytes, at least one for every 254 bytes sent.
//! In practice, most data formats achieve this. However...
//! 
//! **Optimizing for speed:** COBS encode/decode, and particularly the
//! `corncobs` implementation, goes fastest when data contains as _few_ `0x00`
//! bytes as possible -- ideally none. If you can adjust the data you're
//! encoding to avoid zero, you can achieve higher encode/decode rates. For
//! instance, in one of my projects that sends RGB video data, I just declared
//! that red/green/blue value 1 is the same as 0, and made all the 0s into 1s,
//! for a large performance improvement.
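//!
//! A minimal sketch of that remapping, assuming 8-bit channel values stored in
//! a flat slice (`avoid_zeroes` is a hypothetical name, not a `corncobs`
//! function):
//!
//! ```rust
//! /// Hypothetical helper: bumps every `0x00` channel value up to `0x01`, so
//! /// the payload contains no zero bytes before it is COBS-encoded.
//! fn avoid_zeroes(pixels: &mut [u8]) {
//!     for p in pixels.iter_mut() {
//!         if *p == 0 {
//!             *p = 1;
//!         }
//!     }
//! }
//! ```
//!
//! This is lossy by design: the receiver can no longer tell an original 0
//! from an original 1, so it only suits data where that difference is
//! acceptable.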
//!
//! [cobs]: https://en.wikipedia.org/wiki/Consistent_Overhead_Byte_Stuffing
//! [Criterion]: https://docs.rs/criterion/latest/criterion/
//! [honggfuzz]: https://docs.rs/honggfuzz/latest/honggfuzz/
//! [SLIP]: https://en.wikipedia.org/wiki/Serial_Line_Internet_Protocol

#![cfg_attr(not(feature = "std"), no_std)]

// So far, the implementation is performant without the use of `unsafe`. To
// ensure that I think before breaking this property down the road, I'm
// currently configuring the compiler to reject `unsafe`. This is not a promise
// or a religious decision and might get changed in the future; merely scanning
// for the presence of `unsafe` is neither necessary nor sufficient for auditing
// crates you depend on, including this one.
#![forbid(unsafe_code)]

/// The termination byte used by `corncobs`. Yes, it's a bit silly to have this
/// as a constant -- but the implementation is careful to use this named
/// constant whenever it is talking about the termination byte, for clarity.
///
/// The value of this (`0`) is assumed by the implementation and can't easily be
/// changed.
pub const ZERO: u8 = 0;

/// Longest run of unchanged bytes that can be encoded using COBS.
///
/// Changing this will decrease encoding efficiency and break compatibility with
/// other COBS implementations, so, don't do that.
const MAX_RUN: usize = 254;

/// Returns the largest possible encoded size for an input message of `raw_len`
/// bytes, considering overhead.
///
/// This is a `const fn` so that you can use it to size arrays:
///
/// ```
/// const MSG_SIZE: usize = 254;
/// // Worst-case input message: no zeroes to exploit.
/// let msg = [0xFF; MSG_SIZE];
/// // This will still be enough space!
/// let mut encoded = [0; corncobs::max_encoded_len(MSG_SIZE)];
///
/// let len = corncobs::encode_buf(&msg, &mut encoded);
/// assert_eq!(len, encoded.len());
/// ```
pub const fn max_encoded_len(raw_len: usize) -> usize {
    let overhead = if raw_len == 0 {
        // In the special case of an empty message, we wind up generating one
        // byte of overhead.
        1
    } else {
        (raw_len + 253) / 254
    };
    // +1 for terminator byte.
    raw_len + overhead + 1
}

/// Encodes the message `bytes` into the buffer `output`. Returns the number of
/// bytes used in `output`, including the terminating zero byte, so the
/// terminator ends up at index `len - 1`.
///
/// Bytes in `output` after the part that gets used are left unchanged.
///
/// `output` must be large enough to receive the encoded form, which is
/// `max_encoded_len(bytes.len())` worst-case.
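///
/// A small usage sketch, with the expected bytes worked out by hand from the
/// run-splitting that the function body performs:
///
/// ```rust
/// let msg = [1u8, 0, 2];
/// // Pre-fill with a non-zero pattern so the written bytes are easy to see.
/// let mut encoded = [0xAAu8; corncobs::max_encoded_len(3)];
///
/// let len = corncobs::encode_buf(&msg, &mut encoded);
///
/// // Two runs of one byte each, followed by the terminator.
/// assert_eq!(len, 5);
/// assert_eq!(encoded, [2, 1, 2, 2, 0]);
/// ```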
///
/// # Panics
///
/// If `output` is too small to contain the encoded form of `bytes`.
pub fn encode_buf(bytes: &[u8], mut output: &mut [u8]) -> usize {
    // We'll panic if the precondition is violated regardless, but this makes
    // the error a bit easier to spot in tests:
    debug_assert!(output.len() >= max_encoded_len(bytes.len()));

    // Capture the original size of the output, because we're going to shorten
    // it as we write bytes.
    let orig_size = output.len();

    let mut prev_run_was_maximal = false;

    // The encoding process can be described in terms of "runs" of non-zero
    // bytes in the input data. We process each run individually.
    //
    // Currently, the scanning-for-zeros loop here is the hottest part of the
    // encode profile.
    for mut run in bytes.split(|&b| b == ZERO) {
        // If the last run we encoded was maximal length, we need to encode an
        // explicit zero between it and our current `run`.
        if prev_run_was_maximal {
            let (chunk, new_output) = output.split_at_mut(1);
            chunk[0] = encode_len(0);
            output = new_output;
        }

        // We can only encode a run of up to `MAX_RUN` bytes in COBS. This may
        // require us to split `run` into multiple output chunks -- in the
        // extreme case, if the input contains no zeroes, we'll process all of
        // it here.
        loop {
            let chunk_len = usize::min(run.len(), MAX_RUN);
            let (chunk, new_output) = output.split_at_mut(chunk_len + 1);
            let (run_prefix, new_run) = run.split_at(chunk_len);
            chunk[1..].copy_from_slice(run_prefix);
            chunk[0] = encode_len(chunk_len);

            output = new_output;
            run = new_run;
            prev_run_was_maximal = chunk_len == MAX_RUN;

            // We test this condition here, rather than as a `while` loop,
            // because we want to process empty runs once.
            if run.is_empty() {
                break;
            }
        }
    }
    // We've been shortening the output as we go by lopping off prefixes, so our
    // terminating byte goes at the new start:
    output[0] = ZERO;
    orig_size - (output.len() - 1)
}

/// Encodes `bytes` into the vector `output`. This is a convenience for cases
/// where you have `std` available.
#[cfg(feature = "std")]
pub fn encode(bytes: &[u8], output: &mut Vec<u8>) {
    // Big computers with `std` have effectively unlimited memory, so, go ahead
    // and resize that vector to the maximum we might need:
    let offset = output.len();
    output.resize(offset + max_encoded_len(bytes.len()), 0);
    // Now just treat it as a slice.
    let actual_len = encode_buf(bytes, &mut output[offset..]);
    output.truncate(offset + actual_len);
}

/// Encodes a length (between `0` and `MAX_RUN` inclusive) into a byte such that
/// we avoid `ZERO`.
#[inline(always)]
fn encode_len(len: usize) -> u8 {
    // This assert is intended to catch mistakes while hacking on the internals
    // of corncobs.
    debug_assert!(len <= MAX_RUN);
    // This function is private and all paths through the code are reasonably
    // well tested, so we're pretty sure the assert above holds even in release
    // builds. As a result, explicitly opt out of overflow checks on this
    // addition.
    //
    // We're doing the addition on `usize` to ensure we don't generate
    // additional zero extend instructions.
    len.wrapping_add(1) as u8
}

/// Encodes `bytes` into COBS form, yielding individual encoded bytes through an
/// iterator.
///
/// This is quite a bit slower than memory-to-memory encoding (e.g.
/// `encode_buf`) because it can't move whole blocks of non-zero bytes at a
/// time -- about 35-40x slower in benchmarks. However, if your throughput is
/// restricted by the speed of a link that gets fed one byte at a time, such as
/// a serial peripheral, this can encode messages with no additional memory.
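///
/// For illustration, a sketch that drains the iterator one byte at a time.
/// The `wire` array is only a stand-in for a transmit register or queue, and
/// is an assumption of this example:
///
/// ```rust
/// let msg = [1u8, 0, 2];
/// let mut wire = [0u8; 5];
/// let mut sent = 0;
///
/// for b in corncobs::encode_iter(&msg) {
///     // Stand-in for writing `b` to a transmit register.
///     wire[sent] = b;
///     sent += 1;
/// }
///
/// assert_eq!(sent, 5);
/// assert_eq!(wire, [2, 1, 2, 2, 0]);
/// ```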
pub fn encode_iter(bytes: &[u8]) -> impl Iterator<Item = u8> + '_ {
    let mut state = Some(EncodeState::Begin(bytes));
    core::iter::from_fn(move || {
        let s = state?;
        let (b, s2) = s.next();
        state = s2;
        Some(b)
    })
}

/// State for incremental encoding.
#[derive(Copy, Clone, Debug)]
enum EncodeState<'a> {
    /// We are at a run boundary and need to determine the size of the next run
    /// and emit an overhead byte.
    ///
    /// From this state we will always emit at least two bytes: an overhead byte
    /// and a terminator.
    ///
    /// If the next run is empty (the next input byte is `ZERO`), we'll consume
    /// the zero and transition back to `Begin`.
    ///
    /// Otherwise, we'll transition to `Run` to send the bytes.
    ///
    /// If the data is empty we'll transition to `End`.
    Begin(&'a [u8]),
    /// We are in a non-empty run. We need to emit a literal byte, and then determine
    /// our next state based on whether the first slice is empty.
    ///
    /// If the first slice is empty, and the second slice is `None`, we'll
    /// transition to `End`.
    ///
    /// If the first slice is empty, and the second slice is `Some`, we'll
    /// transition to `Begin`.
    ///
    /// Otherwise we'll remain in `Run`, moving the first byte out of the first
    /// slice.
    Run(u8, &'a [u8], Option<&'a [u8]>),
    /// We have used all the data bytes and just need to emit a terminating
    /// zero.
    ///
    /// This state will always emit exactly one byte.
    End,
}

impl<'a> EncodeState<'a> {
    pub fn next(self) -> (u8, Option<Self>) {
        match self {
            Self::Begin(bytes) => {
                let (run, rest) = take_run(bytes);
                let b = encode_len(run.len());
                (b, Some(Self::next_run_state(run, rest)))
            }
            Self::Run(b, run, rest) => {
                (b, Some(Self::next_run_state(run, rest)))
            }
            Self::End => (ZERO, None),
        }
    }

    fn next_run_state(run: &'a [u8], rest: Option<&'a [u8]>) -> Self {
        if let Some((&b, run)) = run.split_first() {
            // There's data in the run, we must drain it before starting
            // a new one.
            Self::Run(b, run, rest)
        } else {
            Self::new_run_state(rest)
        }
    }

    fn new_run_state(rest: Option<&'a [u8]>) -> Self {
        if let Some(rest) = rest {
            Self::Begin(rest)
        } else {
            Self::End
        }
    }
}

/// Takes a run off the front of `bytes`. The run will be between 0 and
/// `MAX_RUN` bytes, inclusive, and will not include any `ZERO` bytes.
///
/// If the run is empty, it means the next byte in `bytes` was `ZERO`.
///
/// Returns `(run, rest)`, where `rest` is...
///
/// - `None`, if this run consumed the entire slice.
/// - `Some(stuff)`, if after this run there is still data to process.
///
/// Note that `stuff` may be empty, if `bytes` ends in a `ZERO`. It is still
/// important to process `stuff` in that case.
fn take_run(bytes: &[u8]) -> (&[u8], Option<&[u8]>) {
    // The run will be no longer than
    // - All the bytes, or
    // - The fixed MAX_RUN constant.
    let max_len = usize::min(bytes.len(), MAX_RUN);
    // It may be shorter than that if there's a zero. Scan the prefix for a zero
    // and truncate if found.
    let run_len = bytes.iter()
        .take(max_len)
        .position(|&b| b == ZERO)
        .unwrap_or(max_len);

    let (run, rest) = bytes.split_at(run_len);
    let rest = if rest.is_empty() {
        None
    } else if run_len == MAX_RUN {
        // Run does not imply a zero, don't omit one from the output if present.
        Some(rest)
    } else {
        debug_assert_eq!(rest[0], ZERO);
        // Drop the zero.
        Some(&rest[1..])
    };
    (run, rest)
}

/// Decodes `bytes` into a vector.
///
/// This is a convenience for cases where you have `std` available. Its behavior
/// is otherwise identical to `decode_buf`.
#[cfg(feature = "std")]
pub fn decode(bytes: &[u8], output: &mut Vec<u8>) -> Result<(), CobsError> {
    let offset = output.len();
    output.resize(offset + bytes.len(), 0);
    let actual_len = decode_buf(bytes, &mut output[offset..])?;
    output.truncate(offset + actual_len);
    Ok(())
}

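/// Decodes input from `bytes` into `output` starting at index 0. Returns the
/// number of bytes used in `output`.
///
/// Decoding stops at the first terminating zero. If `bytes` ends before one
/// is found, or a run claims more bytes than remain, this returns
/// [`CobsError::Truncated`].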
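///
/// A short decoding sketch (the byte values are arbitrary):
///
/// ```rust
/// let encoded = [2u8, 1, 2, 2, 0];
/// // Decoded output is never longer than the encoded input.
/// let mut decoded = [0xAAu8; 5];
///
/// let len = corncobs::decode_buf(&encoded, &mut decoded).unwrap();
/// assert_eq!(&decoded[..len], &[1u8, 0, 2]);
///
/// // Input that is missing its terminator is reported as an error.
/// assert!(corncobs::decode_buf(&encoded[..4], &mut decoded).is_err());
/// ```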
///
/// # Panics
///
/// If `output` is not long enough to receive the decoded output. An `output`
/// of at least `bytes.len()` bytes is always sufficient, because decoding
/// never produces more bytes than it consumes.
pub fn decode_buf(mut bytes: &[u8], mut output: &mut [u8]) -> Result<usize, CobsError> {
    let orig_len = output.len();

    let mut trailing_zero = false;
    // This while-loop is equivalent to `for b in bytes` except that it lets us
    // _also_ consume bytes inside the body, which we totally do.
    while let Some((&head, rest)) = bytes.split_first() {
        bytes = rest;
        // Detect message terminator.
        let n = if let Some(n) = decode_len(head) {
            n
        } else {
            let decoded_len = orig_len - output.len();
            return Ok(decoded_len);
        };
        // If we're not at the end of the message, and our last run was less
        // than MAX_RUN bytes, we need to insert a zero.
        if trailing_zero {
            let (z, new_output) = output.split_at_mut(1);
            z[0] = ZERO;
            output = new_output;
        }

        // Skip a bunch of work if our run length is zero. This cuts the
        // worst-case time in benchmarks (on Intel) by about 50% while only
        // slightly impacting the less pathological cases.
        if n != 0 {
            // Refuse to proceed if the run claims to contain more bytes than
            // the slice does. (This check prevents a panic in decoding
            // truncated data.)
            if bytes.len() < n {
                break;
            }

            // Split the remaining data into the block belonging to this run and
            // allll the rest.
            let (block, rest) = bytes.split_at(n);
            bytes = rest;

            // Blit that block!
            let (block_out, new_output) = output.split_at_mut(block.len());
            block_out.copy_from_slice(block);
            output = new_output;
        }

        // Record whether this run was shorter than the max. Runs shorter than
        // the max in the middle of a message are always ended by zero, which we
        // need to insert in the output. However, a shorter-than-max run at the
        // very _end_ is not terminated by zero, and we handle it above.
        trailing_zero = n != MAX_RUN;
    }

    // If we got here, it's because we ran all the way through `bytes` without
    // finding the terminating ZERO.
    Err(CobsError::Truncated)
}

/// Errors that can occur while decoding.
#[derive(Copy, Clone, Debug)]
pub enum CobsError {
    /// The input ended without completing the last run or without the trailing
    /// zero byte, suggesting that part of it is missing. (This can also occur
    /// spuriously if you pick up in the middle of a stream without finding the
    /// first zero.)
    Truncated,
    /// The input contained an unexpected zero byte. Not all decode methods
    /// promise to detect this case.
    Corrupt,
}

impl core::fmt::Display for CobsError {
    fn fmt(&self, f: &mut core::fmt::Formatter) -> core::fmt::Result {
        match self {
            Self::Truncated => f.write_str("input truncated"),
            Self::Corrupt => f.write_str("input corrupt"),
        }
    }
}

#[cfg(feature = "std")]
impl std::error::Error for CobsError {}

/// Decodes a length-or-terminator byte. If the byte is `ZERO`, returns `None`.
/// Otherwise returns the length of the run encoded by the byte.
#[inline(always)]
fn decode_len(code: u8) -> Option<usize> {
    usize::from(code).checked_sub(1)
}

/// Decodes an encoded message, in-place. This is useful when you're short on
/// memory. Since the decoded form of a COBS frame is always shorter than the
/// encoded form, `bytes` is guaranteed to be long enough.
///
/// The decoded message is deposited into `bytes` starting at index 0, and
/// `decode_in_place` returns the number of decoded bytes.
///
/// If you've got memory to spare, `decode_buf` is often somewhat faster --
/// `decode_in_place` takes between 1x and 3x the time in benchmarks. You may
/// also prefer to use `decode_buf` if you can't overwrite the incoming data,
/// for whatever reason.
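///
/// A short sketch of in-place use:
///
/// ```rust
/// let mut buf = [2u8, 1, 2, 2, 0];
///
/// let len = corncobs::decode_in_place(&mut buf).unwrap();
///
/// // The decoded message now occupies the front of the same buffer.
/// assert_eq!(&buf[..len], &[1u8, 0, 2]);
/// ```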
pub fn decode_in_place(bytes: &mut [u8]) -> Result<usize, CobsError> {
    let mut inpos = 0;
    let mut outpos = 0;
    let mut extra_zero = false;
    while inpos < bytes.len() {
        let head = bytes[inpos];
        let n = if let Some(n) = decode_len(head) {
            n
        } else {
            break;
        };
        if bytes.len() < inpos + 1 + n {
            return Err(CobsError::Truncated);
        }
        bytes.copy_within(inpos + 1..inpos + 1 + n, outpos);
        inpos += 1 + n;
        outpos += n;
        extra_zero = n != MAX_RUN;
        if extra_zero {
            bytes[outpos] = ZERO;
            outpos += 1;
        }
    }
    Ok(if extra_zero {
        outpos - 1
    } else {
        outpos
    })
}

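/// Incremental COBS decoder that consumes one encoded byte at a time.
///
/// Unlike the slice-based decoders, this one checks for zero bytes inside runs
/// and reports them as [`CobsError::Corrupt`].
#[derive(Clone, Debug, PartialEq, Eq, Default)]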
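///
/// A usage sketch, feeding a complete frame one byte at a time as a receive
/// handler might. The fixed-size `msg` buffer is an assumption of this
/// example:
///
/// ```rust
/// use corncobs::{DecodeStatus, Decoder};
///
/// let mut decoder = Decoder::default();
/// let mut msg = [0u8; 8];
/// let mut len = 0;
///
/// for &byte in &[2u8, 1, 2, 2, 0] {
///     match decoder.advance(byte).unwrap() {
///         // A length byte was consumed; nothing to store yet.
///         DecodeStatus::Pending => {}
///         // One decoded byte is ready.
///         DecodeStatus::Append(b) => {
///             msg[len] = b;
///             len += 1;
///         }
///         // The terminator arrived; the message is complete.
///         DecodeStatus::Done => break,
///     }
/// }
///
/// assert!(decoder.is_done());
/// assert_eq!(&msg[..len], &[1u8, 0, 2]);
/// ```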
pub struct Decoder {
    state: DState,
}

impl Decoder {
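    /// Feeds the next encoded `byte` to the decoder.
    ///
    /// Returns [`DecodeStatus::Append`] when a decoded byte is available,
    /// [`DecodeStatus::Pending`] when `byte` produced no output, and
    /// [`DecodeStatus::Done`] once the terminating zero is seen. A zero inside
    /// a run yields [`CobsError::Corrupt`].
    ///
    /// # Panics
    ///
    /// If called again after the decoder has reported `Done`.
    pub fn advance(&mut self, byte: u8) -> Result<DecodeStatus, CobsError> {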
        match self.state {
            DState::Start | DState::Tween(true) => {
                if let Some(count) = byte.checked_sub(1) {
                    if let Some(count2) = count.checked_sub(1) {
                        self.state = DState::Literal(count2, byte == 0xFF);
                    } else {
                        self.state = DState::Tween(false);
                    }
                    Ok(DecodeStatus::Pending)
                } else {
                    self.state = DState::Done;
                    Ok(DecodeStatus::Done)
                }
            }
            DState::Tween(false) => {
                if let Some(count) = byte.checked_sub(1) {
                    if let Some(count2) = count.checked_sub(1) {
                        self.state = DState::Literal(count2, byte == 0xFF);
                    } else {
                        self.state = DState::Tween(false);
                    }
                    Ok(DecodeStatus::Append(ZERO))
                } else {
                    self.state = DState::Done;
                    Ok(DecodeStatus::Done)
                }
            }
            DState::Literal(n, omit_zero) => {
                if byte == ZERO {
                    Err(CobsError::Corrupt)
                } else {
                    if let Some(next_n) = n.checked_sub(1) {
                        self.state = DState::Literal(next_n, omit_zero);
                    } else {
                        self.state = DState::Tween(omit_zero);
                    }

                    Ok(DecodeStatus::Append(byte))
                }
            }
            DState::Done => panic!("Decoder::advance called after Done"),
        }
    }

    /// Returns `true` once the terminating zero has been consumed.
    pub fn is_done(&self) -> bool {
        self.state == DState::Done
    }
}

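/// Internal state of [`Decoder`].
#[derive(Copy, Clone, Debug, PartialEq, Eq)]
enum DState {
    /// No bytes consumed yet; the next byte is a run-length code.
    Start,
    /// Between runs; the next byte is a run-length code or the terminator.
    /// The flag is `true` if the previous run was maximal (code `0xFF`), in
    /// which case no zero is emitted before the next run.
    Tween(bool),
    /// Inside a run, with this many literal bytes still expected after the
    /// next one. The flag records whether the run's code was `0xFF`.
    Literal(u8, bool),
    /// The terminating zero has been seen.
    Done,
}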

impl Default for DState {
    fn default() -> Self {
        Self::Start
    }
}

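/// Result of feeding one byte to [`Decoder::advance`].
#[derive(Copy, Clone, Debug, PartialEq, Eq)]
pub enum DecodeStatus {
    /// The byte was consumed but produced no decoded output.
    Pending,
    /// The byte produced this decoded output byte.
    Append(u8),
    /// The terminating zero was consumed; the message is complete.
    Done,
}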

// Tests for private bits; test fixtures require std, unfortunately, so you have
// to run these explicitly with `cargo test --features std`. Most of the API
// tests are broken out into an integration test.
#[cfg(all(test, feature = "std"))]
mod tests {
    use super::*;

    #[test]
    fn take_run_empty() {
        assert_eq!(take_run(&[]), (&[][..], None));
    }

    #[test]
    fn take_run_zero() {
        assert_eq!(take_run(&[0]), (&[][..], Some(&[][..])));
    }

    #[test]
    fn take_run_one() {
        assert_eq!(take_run(&[1]), (&[1][..], None));
    }
}