Struct Wtf8

pub struct Wtf8 { /* private fields */ }

A borrowed slice of well-formed WTF-8 data.

Similar to &str, but can additionally contain surrogate code points if they’re not in a surrogate pair.
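As an illustration of what "can additionally contain surrogate code points" means at the byte level, here is a minimal sketch of generalized UTF-8 encoding using only the standard library. The helper `encode_code_point` is hypothetical and not part of this crate; it only shows that a lone surrogate such as U+D800 gets a three-byte encoding that strict UTF-8 rejects.

```rust
/// Encodes a code point (U+0000..=U+10FFFF, surrogates allowed) as
/// generalized UTF-8, the byte layout WTF-8 uses for a lone surrogate.
/// Hypothetical helper for illustration; not part of the `Wtf8` API.
fn encode_code_point(cp: u32) -> Vec<u8> {
    assert!(cp <= 0x10FFFF, "not a code point");
    match cp {
        // One byte: plain ASCII.
        0x0000..=0x007F => vec![cp as u8],
        // Two bytes: 110xxxxx 10xxxxxx.
        0x0080..=0x07FF => vec![0xC0 | (cp >> 6) as u8, 0x80 | (cp & 0x3F) as u8],
        // Three bytes: 1110xxxx 10xxxxxx 10xxxxxx (surrogates land here).
        0x0800..=0xFFFF => vec![
            0xE0 | (cp >> 12) as u8,
            0x80 | ((cp >> 6) & 0x3F) as u8,
            0x80 | (cp & 0x3F) as u8,
        ],
        // Four bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx.
        _ => vec![
            0xF0 | (cp >> 18) as u8,
            0x80 | ((cp >> 12) & 0x3F) as u8,
            0x80 | ((cp >> 6) & 0x3F) as u8,
            0x80 | (cp & 0x3F) as u8,
        ],
    }
}
```

For ordinary scalar values the output matches `str::as_bytes`; for U+D800 it yields the bytes ED A0 80, which `std::str::from_utf8` refuses.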

Implementations

impl Wtf8

pub fn new<S: AsRef<Wtf8> + ?Sized>(value: &S) -> &Wtf8

Creates a WTF-8 slice from any value that implements AsRef<Wtf8>, such as a UTF-8 &str slice.

Since WTF-8 is a superset of UTF-8, this always succeeds.

pub const unsafe fn from_bytes_unchecked(value: &[u8]) -> &Wtf8

Creates a WTF-8 slice from a WTF-8 byte slice.

Safety

value must contain valid WTF-8.

pub fn from_bytes(b: &[u8]) -> Option<&Self>

Creates a WTF-8 slice from a byte slice, returning None if the bytes are not well-formed WTF-8.
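To make "well-formed WTF-8" concrete, the following is a minimal validation sketch under the usual definition: generalized UTF-8 in which a high surrogate is never immediately followed by a low surrogate (such a pair must be stored as a single four-byte sequence). Both `decode_one` and `is_well_formed_wtf8` are hypothetical helpers written against the standard library only; this is not the crate's actual implementation.

```rust
/// Decodes one generalized UTF-8 sequence at the start of `bytes`.
/// Returns the code point and its byte length, or `None` if malformed.
/// Hypothetical helper; not part of the `Wtf8` API.
fn decode_one(bytes: &[u8]) -> Option<(u32, usize)> {
    let first = *bytes.first()?;
    let (len, min, init): (usize, u32, u32) = match first {
        0x00..=0x7F => return Some((first as u32, 1)),
        0xC2..=0xDF => (2, 0x80, (first & 0x1F) as u32),
        0xE0..=0xEF => (3, 0x800, (first & 0x0F) as u32),
        0xF0..=0xF4 => (4, 0x10000, (first & 0x07) as u32),
        _ => return None, // stray continuation byte or invalid lead byte
    };
    let tail = bytes.get(1..len)?; // `None` if the sequence is truncated
    let mut cp = init;
    for &b in tail {
        if (b & 0xC0) != 0x80 {
            return None; // not a continuation byte
        }
        cp = (cp << 6) | (b & 0x3F) as u32;
    }
    // Reject overlong encodings and values past U+10FFFF.
    if cp < min || cp > 0x10FFFF {
        return None;
    }
    Some((cp, len))
}

/// Returns true if `bytes` is generalized UTF-8 with no encoded
/// surrogate pair (high surrogate directly followed by low surrogate).
fn is_well_formed_wtf8(mut bytes: &[u8]) -> bool {
    let mut prev_was_high = false;
    while !bytes.is_empty() {
        let (cp, len) = match decode_one(bytes) {
            Some(decoded) => decoded,
            None => return false,
        };
        if prev_was_high && (0xDC00..=0xDFFF).contains(&cp) {
            return false; // a pair must be a single 4-byte sequence instead
        }
        prev_was_high = (0xD800..=0xDBFF).contains(&cp);
        bytes = &bytes[len..];
    }
    true
}
```

Under this sketch a lone surrogate passes, a low surrogate followed by a high surrogate passes, and a high surrogate followed by a low surrogate fails.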

pub const fn len(&self) -> usize

Returns the length, in WTF-8 bytes.

pub const fn is_empty(&self) -> bool

pub const fn ascii_byte_at(&self, position: usize) -> u8

Returns the code point at position if it is in the ASCII range, or b'\xFF' otherwise.

Panics

Panics if position is beyond the end of the string.
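The behaviour described for ascii_byte_at can be sketched on a plain byte slice as follows. This free function is a hypothetical analogue that assumes the method simply inspects the byte at `position`; it is not taken from the crate's source.

```rust
/// Hypothetical free-function analogue of `Wtf8::ascii_byte_at`,
/// operating directly on the underlying bytes.
fn ascii_byte_at(bytes: &[u8], position: usize) -> u8 {
    // Slice indexing panics when `position` is out of bounds.
    match bytes[position] {
        ascii @ 0x00..=0x7F => ascii,
        // Lead or continuation byte of a multi-byte sequence.
        _ => 0xFF,
    }
}
```

Because 0xFF never appears in well-formed WTF-8, a caller can compare the result against an ASCII byte without first checking whether the position holds an ASCII character.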

pub fn code_points(&self) -> Wtf8CodePoints<'_>

Returns an iterator for the string’s code points.

pub fn code_point_indices(&self) -> Wtf8CodePointIndices<'_>

Returns an iterator for the string’s code points and their indices.

pub const fn as_bytes(&self) -> &[u8]

Returns the raw bytes of the WTF-8 data.

pub const fn as_str(&self) -> Result<&str, Utf8Error>

Tries to convert the string to UTF-8 and return a &str slice.

Returns an error (Utf8Error) if the string contains surrogates.

This does not copy the data.
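The failure case can be reproduced with the standard library alone, since `std::str::from_utf8` rejects the three-byte encoding of a surrogate. The function `bytes_as_str` below is a hypothetical stand-in that works on raw bytes; it assumes, but does not confirm, that as_str behaves like a UTF-8 validity check over the same bytes.

```rust
use std::str::{self, Utf8Error};

/// Hypothetical stand-in for `Wtf8::as_str` that works on raw bytes.
/// On success the returned `&str` borrows from `bytes`; nothing is copied.
fn bytes_as_str(bytes: &[u8]) -> Result<&str, Utf8Error> {
    str::from_utf8(bytes)
}
```

For input containing a lone surrogate, `Utf8Error::valid_up_to` reports how many leading bytes were valid UTF-8, which can be used to salvage the prefix.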

pub fn to_wtf8_buf(&self) -> Wtf8Buf

Creates an owned Wtf8Buf from a borrowed Wtf8.

pub fn to_string_lossy(&self) -> Cow<'_, str>

Lossily converts the string to UTF-8. Returns a UTF-8 &str slice if the contents are well-formed in UTF-8.

Surrogates are replaced with "\u{FFFD}" (the replacement character “�”).

This only copies the data if necessary (if it contains any surrogate).
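The borrow-or-copy behaviour can be sketched with `Cow` over raw bytes. The function below is a hypothetical illustration that assumes its input is well-formed WTF-8; it relies on the fact that an encoded surrogate is always the lead byte ED followed by a byte in A0..=BF, and that U+FFFD also occupies three bytes, so the replacement can be done in place. It is not the crate's implementation.

```rust
use std::borrow::Cow;

/// Hypothetical sketch of a lossy WTF-8 to UTF-8 conversion on raw bytes.
/// Assumes `wtf8` is well-formed WTF-8.
fn to_string_lossy(wtf8: &[u8]) -> Cow<'_, str> {
    // Fast path: no surrogates, so the bytes are already UTF-8.
    if let Ok(s) = std::str::from_utf8(wtf8) {
        return Cow::Borrowed(s);
    }
    let mut out = wtf8.to_vec();
    let mut i = 0;
    while i + 2 < out.len() {
        // ED A0..=BF xx encodes a surrogate (U+D800..=U+DFFF).
        if out[i] == 0xED && out[i + 1] >= 0xA0 {
            out[i..i + 3].copy_from_slice("\u{FFFD}".as_bytes());
            i += 3;
        } else {
            i += 1;
        }
    }
    // `from_utf8_lossy` keeps this panic-free even if the input
    // violated the well-formedness assumption.
    Cow::Owned(String::from_utf8_lossy(&out).into_owned())
}
```

The fast path returns `Cow::Borrowed`, so callers that mostly handle surrogate-free strings pay only for the validity check.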

pub fn encode_wide(&self) -> EncodeWide<'_>

Converts the WTF-8 string to potentially ill-formed UTF-16 and returns an iterator of 16-bit code units.

This is lossless: calling Wtf8Buf::from_ill_formed_utf16 on the resulting code units would always return the original WTF-8 string.
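A minimal sketch of the WTF-8 to UTF-16 direction, using only the standard library, may help show why no information is lost: every code point below U+10000, lone surrogates included, maps to exactly one code unit, and everything else becomes a surrogate pair. The eager, `Vec`-returning function below is hypothetical, assumes well-formed input, and is not how the crate's lazy `EncodeWide` iterator is implemented.

```rust
/// Hypothetical sketch: converts well-formed WTF-8 bytes to UTF-16 code
/// units. Malformed input may panic or produce meaningless units.
fn encode_wide(wtf8: &[u8]) -> Vec<u16> {
    let mut units = Vec::new();
    let mut i = 0;
    while i < wtf8.len() {
        let first = wtf8[i];
        // Sequence length and payload bits of the lead byte.
        let (len, mut cp): (usize, u32) = match first {
            0x00..=0x7F => (1, first as u32),
            0xC0..=0xDF => (2, (first & 0x1F) as u32),
            0xE0..=0xEF => (3, (first & 0x0F) as u32),
            _ => (4, (first & 0x07) as u32),
        };
        // Fold in six payload bits from each continuation byte.
        for &b in &wtf8[i + 1..i + len] {
            cp = (cp << 6) | (b & 0x3F) as u32;
        }
        i += len;
        if cp < 0x10000 {
            // BMP code points, including lone surrogates, are one unit.
            units.push(cp as u16);
        } else {
            // Supplementary code points become a surrogate pair.
            let v = cp - 0x10000;
            units.push(0xD800 | (v >> 10) as u16);
            units.push(0xDC00 | (v & 0x3FF) as u16);
        }
    }
    units
}
```

For valid UTF-8 input the result agrees with `str::encode_utf16`; for the bytes ED A0 80 it yields the single unit 0xD800, which `String::from_utf16` rejects as ill-formed.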

pub const fn chunks(&self) -> Wtf8Chunks<'_>

pub fn map_utf8<'a, I>(&'a self, f: impl Fn(&'a str) -> I) -> impl Iterator<Item = CodePoint>
where
    I: Iterator<Item = char>,

pub fn is_code_point_boundary(&self, index: usize) -> bool

pub fn into_box(&self) -> Box<Wtf8>

Boxes this Wtf8.

pub fn empty_box() -> Box<Wtf8>

Creates a boxed, empty Wtf8.

pub fn replacen(&self, from: &Wtf8, to: &Wtf8, n: usize) -> Wtf8Buf

Trait Implementations

impl AsRef<[u8]> for Wtf8

fn as_ref(&self) -> &[u8]

Converts this type into a shared reference of the (usually inferred) input type.

impl AsRef<Wtf8> for Wtf8

fn as_ref(&self) -> &Wtf8

Converts this type into a shared reference of the (usually inferred) input type.

impl AsRef<Wtf8> for Wtf8Buf

fn as_ref(&self) -> &Wtf8

Converts this type into a shared reference of the (usually inferred) input type.

impl AsRef<Wtf8> for str

fn as_ref(&self) -> &Wtf8

Converts this type into a shared reference of the (usually inferred) input type.

impl Borrow<Wtf8> for Wtf8Buf

fn borrow(&self) -> &Wtf8

Immutably borrows from an owned value.

impl Clone for Box<Wtf8>

fn clone(&self) -> Self

Returns a duplicate of the value.

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for Wtf8

Formats the string in double quotes, with characters escaped according to char::escape_debug and unpaired surrogates represented as \u{xxxx}, where each x is a hexadecimal digit.

fn fmt(&self, formatter: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for &Wtf8

fn default() -> Self

Returns the “default value” for a type.

impl Default for Box<Wtf8>

fn default() -> Self

Returns the “default value” for a type.

impl Display for Wtf8

Formats the string with unpaired surrogates substituted with the replacement character, U+FFFD.

fn fmt(&self, formatter: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.