Skip to main content

Utf16ByteCodec

Struct Utf16ByteCodec 

Source
pub struct Utf16ByteCodec { /* private fields */ }
Expand description

Combined byte-serialized UTF-16 codec.

The codec uses one configured byte order for both decoding and encoding. It does not detect, consume, or emit a BOM automatically; callers should use crate::UnicodeBom when a byte stream may carry an explicit BOM.

§Examples

use qubit_text_codec::{
    ByteOrder,
    CharsetCodec,
    DecodeStatus,
    Charset,
    Utf16,
    Utf16ByteCodec,
};

let codec = Utf16ByteCodec::new(ByteOrder::LittleEndian);
assert_eq!(Charset::UTF_16LE, codec.charset());
assert_eq!(Utf16::MAX_BYTES_PER_CHAR, codec.max_units_per_char());

let mut output = [0_u8; Utf16::MAX_BYTES_PER_CHAR];
let written = codec.encode_one('😀', &mut output, 0).expect("buffer fits");
assert_eq!(
    DecodeStatus::Complete {
        value: '😀',
        consumed: written,
    },
    codec.decode_one(&output[..written], 0).expect("valid UTF-16LE"),
);

Implementations§

Source§

impl Utf16ByteCodec

Source

pub const fn new(byte_order: ByteOrder) -> Self

Creates a byte-serialized UTF-16 codec.

§Parameters
  • byte_order: The byte order used by the byte buffer.
§Returns

Returns a UTF-16 byte codec.

Source

pub const fn byte_order(self) -> ByteOrder

Returns the configured byte order.

§Returns

Returns the byte order used by this codec.

Source

pub const fn charset(self) -> Charset

Returns the fixed-endian UTF-16 charset descriptor.

§Returns

Returns Charset::UTF_16LE or Charset::UTF_16BE according to this codec’s configured byte order.

Source

pub const fn max_units_per_char(self) -> usize

Returns the maximum number of serialized UTF-16 bytes for one character.

§Returns

Returns Utf16::MAX_BYTES_PER_CHAR.

Trait Implementations§

Source§

impl CharsetCodec for Utf16ByteCodec

Source§

fn charset(&self) -> Charset

Returns the fixed-endian UTF-16 charset for the configured byte order.

§Returns

Returns Charset::UTF_16BE when configured with ByteOrder::BigEndian, otherwise Charset::UTF_16LE.

Source§

fn max_units_per_char(&self) -> usize

Returns the maximum number of UTF-16 bytes for a single encoded character.

§Returns

Returns Utf16::MAX_BYTES_PER_CHAR.

Source§

fn decode_one( &self, input: &[u8], index: usize, ) -> CharsetDecodeResult<DecodeStatus>

Decodes one UTF-16 scalar value from a byte-prefixed UTF-16 stream.

§Arguments
  • input - Byte-prefixed UTF-16 buffer.
  • index - Start offset for parsing; must satisfy index <= input.len().
§Returns
  • Ok(DecodeStatus::NeedMore { required, available }) when the current slice has only a partial unit/pair.
  • Ok(DecodeStatus::Complete { value, consumed }) when one Unicode scalar value is decoded.
§Errors
  • CharsetDecodeError when UTF-16 structure is malformed.
Source§

fn encode_one( &self, ch: char, output: &mut [u8], index: usize, ) -> CharsetEncodeResult<usize>

Encodes one Unicode scalar value into UTF-16 bytes at index.

§Arguments
  • ch - The Unicode scalar value to encode.
  • output - Destination byte buffer.
  • index - Start offset where bytes are written; must satisfy index <= output.len().
§Returns

Ok(usize) with the number of written bytes (2 for BMP and 4 for supplementary).

§Errors
Source§

type Unit = u8

Storage unit used by the encoded representation. Read more
Source§

impl Clone for Utf16ByteCodec

Source§

fn clone(&self) -> Utf16ByteCodec

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for Utf16ByteCodec

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Hash for Utf16ByteCodec

Source§

fn hash<H>(&self, state: &mut H)
where H: Hasher,

Hashes the fixed-endian UTF-16 charset identity.

§Parameters
  • state: The hasher receiving this codec’s identity.
1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more
Source§

impl PartialEq for Utf16ByteCodec

Source§

fn eq(&self, other: &Utf16ByteCodec) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 (const: unstable) · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Copy for Utf16ByteCodec

Source§

impl Eq for Utf16ByteCodec

Source§

impl StructuralPartialEq for Utf16ByteCodec

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.