Crate encoding_c_mem

Expand description

FFI bindings for encoding_rs::mem.

Note: “Latin1” in this module refers to the Unicode range from U+0000 to U+00FF, inclusive, and does not refer to the windows-1252 range. This in-memory encoding is sometimes used as a storage optimization of text when UTF-16 indexing and length semantics are exposed.

Functions§

encoding_mem_check_str_for_latin1_and_bidi^⚠: Checks whether a valid UTF-8 buffer contains code points that trigger right-to-left processing or is all-Latin1.
encoding_mem_check_utf8_for_latin1_and_bidi^⚠: Checks whether a potentially invalid UTF-8 buffer contains code points that trigger right-to-left processing or is all-Latin1.
encoding_mem_check_utf16_for_latin1_and_bidi^⚠: Checks whether a potentially invalid UTF-16 buffer contains code points that trigger right-to-left processing or is all-Latin1.
encoding_mem_convert_latin1_to_utf8^⚠: Converts bytes whose unsigned value is interpreted as Unicode code point (i.e. U+0000 to U+00FF, inclusive) to UTF-8.
encoding_mem_convert_latin1_to_utf8_partial^⚠: Converts bytes whose unsigned value is interpreted as Unicode code point (i.e. U+0000 to U+00FF, inclusive) to UTF-8 with potentially insufficient output space.
encoding_mem_convert_latin1_to_utf16^⚠: Converts bytes whose unsigned value is interpreted as Unicode code point (i.e. U+0000 to U+00FF, inclusive) to UTF-16.
encoding_mem_convert_str_to_utf16^⚠: Converts valid UTF-8 to valid UTF-16.
encoding_mem_convert_utf8_to_latin1_lossy^⚠: If the input is valid UTF-8 representing only Unicode code points from U+0000 to U+00FF, inclusive, converts the input into output that represents the value of each code point as the unsigned byte value of each output byte.
encoding_mem_convert_utf8_to_utf16^⚠: Converts potentially-invalid UTF-8 to valid UTF-16 with errors replaced with the REPLACEMENT CHARACTER.
encoding_mem_convert_utf8_to_utf16_without_replacement^⚠: Converts potentially-invalid UTF-8 to valid UTF-16 signaling on error.
encoding_mem_convert_utf16_to_latin1_lossy^⚠: If the input is valid UTF-16 representing only Unicode code points from U+0000 to U+00FF, inclusive, converts the input into output that represents the value of each code point as the unsigned byte value of each output byte.
encoding_mem_convert_utf16_to_utf8^⚠: Converts potentially-invalid UTF-16 to valid UTF-8 with errors replaced with the REPLACEMENT CHARACTER.
encoding_mem_convert_utf16_to_utf8_partial^⚠: Converts potentially-invalid UTF-16 to valid UTF-8 with errors replaced with the REPLACEMENT CHARACTER with potentially insufficient output space.
encoding_mem_copy_ascii_to_ascii^⚠: Copies ASCII from source to destination up to the first non-ASCII byte (or the end of the input if it is ASCII in its entirety).
encoding_mem_copy_ascii_to_basic_latin^⚠: Copies ASCII from source to destination zero-extending it to UTF-16 up to the first non-ASCII byte (or the end of the input if it is ASCII in its entirety).
encoding_mem_copy_basic_latin_to_ascii^⚠: Copies Basic Latin from source to destination narrowing it to ASCII up to the first non-Basic Latin code unit (or the end of the input if it is Basic Latin in its entirety).
encoding_mem_ensure_utf16_validity^⚠: Replaces unpaired surrogates in the input with the REPLACEMENT CHARACTER.
encoding_mem_is_ascii^⚠: Checks whether the buffer is all-ASCII.
encoding_mem_is_basic_latin^⚠: Checks whether the buffer is all-Basic Latin (i.e. UTF-16 representing only ASCII characters).
encoding_mem_is_char_bidi^⚠: Checks whether a scalar value triggers right-to-left processing.
encoding_mem_is_str_bidi^⚠: Checks whether a valid UTF-8 buffer contains code points that trigger right-to-left processing.
encoding_mem_is_str_latin1^⚠: Checks whether the buffer represents only code points less than or equal to U+00FF.
encoding_mem_is_utf8_bidi^⚠: Checks whether a potentially-invalid UTF-8 buffer contains code points that trigger right-to-left processing.
encoding_mem_is_utf8_latin1^⚠: Checks whether the buffer is valid UTF-8 representing only code points less than or equal to U+00FF.
encoding_mem_is_utf16_bidi^⚠: Checks whether a UTF-16 buffer contains code points that trigger right-to-left processing.
encoding_mem_is_utf16_code_unit_bidi^⚠: Checks whether a UTF-16 code unit triggers right-to-left processing.
encoding_mem_is_utf16_latin1^⚠: Checks whether the buffer represents only code point less than or equal to U+00FF.
encoding_mem_str_latin1_up_to^⚠: Returns the index of first byte that starts a non-Latin1 byte sequence, or the length of the string if there are none.
encoding_mem_utf8_latin1_up_to^⚠: Returns the index of first byte that starts an invalid byte sequence or a non-Latin1 byte sequence, or the length of the string if there are neither.
encoding_mem_utf16_valid_up_to^⚠: Returns the index of the first unpaired surrogate or, if the input is valid UTF-16 in its entirety, the length of the input.

Crate encoding_c_mem

Crate encoding_c_mem Copy item path

Functions§

Crate encoding_c_mem