flexstr
A flexible, simple to use, immutable, clone-efficient String replacement for
Rust. It unifies literals, inlined, and heap allocated strings into a single
type.
Overview
Rust is great, but it's String type is optimized as a mutable string
buffer, not for typical string use cases. Most string use cases don't
modify their contents, often need to copy strings around as if
they were cheap like integers, typically concatenate instead of modify, and
often end up being cloned with identical contents. Additionally, String
isn't able to wrap a string literal without additional allocation and copying.
Rust needs a new string type to unify usage of both literals and
allocated strings in these typical use cases. This crate creates a new string
type
that is optimized for those use cases, while retaining the usage simplicity of
String.
This type is not inherently "better" than String, but different. It
works best in 'typical' string use cases (immutability, concatenation, cheap
multi ownership) whereas String works better in "string buffer" use cases
(mutability, string building, single ownership).
Installation
NOTE: The serde feature is optional and only included when specified.
[]
= { = "0.5", = ["serde"] }
Examples
use ;
How Does It Work?
Internally, FlexStr uses an enum with these variants:
Static- A simple wrapper around a static string literal (&'static str)Inlined- An inlined string (no heap allocation for small strings)Heap- A heap allocated (reference counted) string
The type automatically chooses the best storage and allows you to use them interchangeably as a single string type.
Features
- Optimized for immutability and cheap cloning
- Allows for multiple ownership of the same string memory contents
- Serves as a universal string type (unifying literals and allocated strings)
- Doesn't allocate for literals and short strings (64-bit: up to 22 bytes)
- The same size as a
String(64-bit: 24 bytes) - Optional
serdeserialization support (feature = "serde") - Compatible with embedded systems (doesn't use
std) - Efficient conditional ownership (borrows can take ownership without allocation/copying)
- Both single threaded compatible (
FlexStr) and multi-thread safe (AFlexStr) options - It is simple to use!
Types
FlexStr- regular usageHeapstorage based onRc
AFlexStr- providesSend/Syncfor multi-threaded useHeapstorage based onArc
Usage
Hello World
use IntoFlexStr;
Conversions
use ;
Passing FlexStr to Conditional Ownership Functions
This has always been a confusing situation in Rust, but it is easy with
FlexStr since multi ownership is cheap.
use ;
Performance Characteristics
- Clones are cheap and never allocate
- At minimum, they are just a copy of the enum and at max an additional reference count increment
- Literals are just wrapped when used with
into()and never copied - Calling
into()on aStringwill result in an inline string (if short) otherwise copied into astrwrapped inRc/Arc(which will allocate, copy, and then release originalStringstorage) into_flex_str()andinto_a_flex_str()are equivalent to callinginto()on both literals andString(they are present primarily forletbindings so there is no need to declare a type)to_flex_str()andto_a_flex_str()are meant for taking ownership of borrowed strings and always copy into either an inline string (for short strings) or anRc/Arcwrappedstr(which will allocate)to_stringalways copies into a newString- Conversions back and forth between
AFlexStrandFlexStrusinginto()are cheap when using wrapped literals or inlined strings- Inlined strings and wrapped literals just create a new enum wrapper
- Reference counted wrapped strings will always require an allocation
and copy for the new
RcorArc
Benchmarks
Create
Heap creates are fairly expensive still compared to String (apparently due
to the overhead of creating the enum?), Rc<str> andArc<str>, but
inline/static creation is very fast as expected.
FlexStr
create_static_normal time: [3.7062 ns 3.7213 ns 3.7422 ns]
create_inline_small time: [3.8932 ns 3.9004 ns 3.9084 ns]
create_heap_normal time: [13.533 ns 13.557 ns 13.587 ns]
create_heap_large time: [18.605 ns 18.635 ns 18.664 ns]
create_heap_arc_normal time: [18.535 ns 18.551 ns 18.568 ns]
create_heap_arc_large time: [26.794 ns 26.861 ns 26.937 ns]
Comparables
create_string_small time: [7.4377 ns 7.4572 ns 7.4794 ns]
create_string_normal time: [8.0550 ns 8.0605 ns 8.0667 ns]
create_string_large time: [12.940 ns 12.955 ns 12.973 ns]
create_rc_small time: [8.0525 ns 8.0577 ns 8.0639 ns]
create_rc_normal time: [8.2438 ns 8.2512 ns 8.2604 ns]
create_rc_large time: [13.139 ns 13.153 ns 13.168 ns]
create_arc_small time: [8.7128 ns 8.7231 ns 8.7341 ns]
create_arc_normal time: [8.7454 ns 8.7851 ns 8.8446 ns]
create_arc_large time: [13.827 ns 13.855 ns 13.886 ns]
Clone
Clones are MUCH cheaper than String (except when using Arc). Interested
to find out why the enum wrapper and single branch op causes such a large
differential between the wrapped Rc<str>/Arc<str> and the raw version.
FlexStr
clone_static_normal time: [3.9540 ns 3.9572 ns 3.9610 ns]
clone_inline_small time: [4.4717 ns 4.4763 ns 4.4819 ns]
clone_heap_normal time: [4.4738 ns 4.4839 ns 4.4965 ns]
clone_heap_arc_normal time: [10.596 ns 10.607 ns 10.618 ns]
Comparables
clone_string_small time: [11.774 ns 11.789 ns 11.807 ns]
clone_string_normal time: [12.289 ns 12.422 ns 12.540 ns]
clone_string_large time: [14.931 ns 15.013 ns 15.116 ns]
clone_rc_normal time: [652.97 ps 653.58 ps 654.30 ps]
clone_arc_normal time: [3.2948 ns 3.2986 ns 3.3021 ns]
Conversions
Thanks (mostly) to itoa and ryu our conversions are much faster than
String.
FlexStr
convert_bool time: [3.7080 ns 3.7094 ns 3.7109 ns]
convert_char time: [3.8104 ns 3.8159 ns 3.8222 ns]
convert_i8 time: [3.2817 ns 3.2827 ns 3.2838 ns]
convert_i16 time: [3.5285 ns 3.5379 ns 3.5511 ns]
convert_i32 time: [10.568 ns 10.575 ns 10.582 ns]
convert_i64 time: [7.6351 ns 7.6390 ns 7.6430 ns]
convert_i128 time: [38.756 ns 38.787 ns 38.821 ns]
convert_f32 time: [24.669 ns 24.692 ns 24.721 ns]
convert_f64 time: [33.105 ns 33.145 ns 33.184 ns]
Comparables
convert_string_bool time: [18.466 ns 18.505 ns 18.538 ns]
convert_string_char time: [7.2933 ns 7.2966 ns 7.3003 ns]
convert_string_i8 time: [7.3838 ns 7.4546 ns 7.5457 ns]
convert_string_i16 time: [23.087 ns 23.477 ns 24.025 ns]
convert_string_i32 time: [38.577 ns 38.624 ns 38.683 ns]
convert_string_i64 time: [43.348 ns 43.396 ns 43.446 ns]
convert_string_i128 time: [71.120 ns 71.174 ns 71.225 ns]
convert_string_f32 time: [100.24 ns 100.50 ns 100.78 ns]
convert_string_f64 time: [179.86 ns 180.00 ns 180.14 ns]
Negatives
There is no free lunch:
- Due to usage of
Rc(orArc), when on-boardingStringit will need to reallocate and copy - Due to the enum wrapper, every string operation has the overhead of an extra branching operation
- Since
FlexStris notSendorSync, there is a need to consider single-threaded (FlexStr) and multi-threaded (AFlexStr) use cases and convert accordingly
Status
This is currently beta quality and still needs testing. The API may very possibly change but semantic versioning will be followed.
License
This project is licensed optionally under either:
- Apache License, Version 2.0, (LICENSE-APACHE or https://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or https://opensource.org/licenses/MIT)