naive_opt
The optimized naive string-search algorithm.
Features
- The naive string-searching algorithm
- Enhanced with a 1-byte search, like string::find in libc++ and libstdc++
- Specialized for UTF-8 strings, the native string encoding of Rust
- The ASCII Stochastics search
- Supports zero-overhead traits.
- Supports ASCII case-insensitive matching.
- Minimum supported rustc: 1.65.0 (897e37553 2022-11-02)
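
The first two features can be pictured as follows: scan for a single needle byte, then verify the whole needle at each candidate position. This is a minimal std-only sketch, not the crate's implementation. The function name `naive_search_1byte` is hypothetical, and the sketch always scans for the needle's first byte.

```rust
/// Hypothetical sketch: naive search with a 1-byte pre-scan.
/// Returns the byte offset of the first match, like `str::find`.
fn naive_search_1byte(haystack: &str, needle: &str) -> Option<usize> {
    let hay = haystack.as_bytes();
    let nee = needle.as_bytes();
    if nee.is_empty() {
        // Same convention as `str::find`: an empty needle matches at 0.
        return Some(0);
    }
    if nee.len() > hay.len() {
        return None;
    }
    let first = nee[0];
    // Last offset at which the needle still fits into the haystack.
    let last_start = hay.len() - nee.len();
    let mut pos = 0;
    while pos <= last_start {
        // 1-byte search: jump to the next occurrence of the first needle byte.
        let off = hay[pos..=last_start].iter().position(|&b| b == first)?;
        let start = pos + off;
        // Naive verification of the full needle at the candidate offset.
        if &hay[start..start + nee.len()] == nee {
            return Some(start);
        }
        pos = start + 1;
    }
    None
}
```

Because both inputs are valid UTF-8 and a match begins with the needle's first byte, the returned offset always falls on a character boundary of the haystack.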
Compatibility
This crate is intended as a replacement for the string-search methods of the Rust std library. However, the method names are different, so you will need to rewrite the calling code. It shouldn't be too difficult.
compatibility:
| rust std::str | this crate |
|---|---|
| std::str::find() | naive_opt::Search::search() |
| std::str::rfind() | naive_opt::Search::rsearch() |
| std::str::contains() | naive_opt::Search::includes() |
| std::str::match_indices() | naive_opt::Search::search_indices() |
| std::str::rmatch_indices() | naive_opt::Search::rsearch_indices() |
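
As a reference point when porting, the sketch below collects the results of the five std calls from the left column of the table. It uses only the standard library. The `StdResults` struct and `std_results` function are hypothetical helpers, and the comments name the counterpart listed in the right column.

```rust
/// Hypothetical container for the results of the five std calls in the table.
#[derive(Debug, PartialEq)]
struct StdResults<'a> {
    find: Option<usize>,
    rfind: Option<usize>,
    contains: bool,
    match_indices: Vec<(usize, &'a str)>,
    rmatch_indices: Vec<(usize, &'a str)>,
}

/// Runs each std method once on the same haystack and needle.
fn std_results<'a>(haystack: &'a str, needle: &str) -> StdResults<'a> {
    StdResults {
        find: haystack.find(needle),         // counterpart: Search::search()
        rfind: haystack.rfind(needle),       // counterpart: Search::rsearch()
        contains: haystack.contains(needle), // counterpart: Search::includes()
        // counterpart: Search::search_indices()
        match_indices: haystack.match_indices(needle).collect(),
        // counterpart: Search::rsearch_indices()
        rmatch_indices: haystack.rmatch_indices(needle).collect(),
    }
}
```

Running the same inputs through both the std calls and their counterparts is a simple way to check a port.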
Ignore ascii case match
This crate supports an ASCII case-insensitive match with each function.
| this crate |
|---|
| naive_opt::Search::search_ignore_ascii_case() |
| naive_opt::Search::rsearch_ignore_ascii_case() |
| naive_opt::Search::includes_ignore_ascii_case() |
| naive_opt::Search::search_indices_ignore_ascii_case() |
| naive_opt::Search::rsearch_indices_ignore_ascii_case() |
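
To show what an ASCII case-insensitive match means at the byte level, here is a minimal std-only sketch. It is not the crate's implementation. The function name `find_ignore_ascii_case_sketch` is hypothetical, and returning `Some(0)` for an empty needle is an assumption borrowed from `str::find`.

```rust
/// Hypothetical sketch: first byte offset at which `needle` matches
/// `haystack`, treating ASCII letters as equal regardless of case.
fn find_ignore_ascii_case_sketch(haystack: &str, needle: &str) -> Option<usize> {
    let hay = haystack.as_bytes();
    let nee = needle.as_bytes();
    if nee.is_empty() {
        // Assumption: follow `str::find`, where an empty needle matches at 0.
        // This also avoids `windows(0)`, which panics.
        return Some(0);
    }
    // Slide a needle-sized window over the haystack. Non-ASCII bytes must
    // match exactly; only ASCII letters are compared case-insensitively.
    hay.windows(nee.len())
        .position(|window| window.eq_ignore_ascii_case(nee))
}
```

Only the letters `A`-`Z` and `a`-`z` are folded, so no Unicode case mapping is involved and multi-byte characters are compared byte for byte.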
Examples
Example function:
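use naive_opt::{string_search, string_rsearch};
use naive_opt::{string_search_indices, string_rsearch_indices};
let haystack = "111 a 111b";
let needle = "a";
// Results are byte offsets into the haystack.
let r = string_search(haystack, needle);
assert_eq!(r, Some(4));
assert_eq!(string_search(haystack, "1"), Some(0));
assert_eq!(string_rsearch(haystack, "1"), Some(8));
let v: Vec<_> = string_search_indices("abc345abc901abc", "abc").collect();
assert_eq!(v, [(0, "abc"), (6, "abc"), (12, "abc")]);
let v: Vec<_> = string_rsearch_indices("abc345abc901abc", "abc").collect();
assert_eq!(v, [(12, "abc"), (6, "abc"), (0, "abc")]);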
Example trait: Search
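use naive_opt::Search;
let haystack = "111 a 111b";
let needle = "a";
// The trait methods are called on the haystack.
let r = haystack.search(needle);
assert_eq!(r, Some(4));
assert_eq!(haystack.search("1"), Some(0));
assert_eq!(haystack.rsearch("1"), Some(8));
let v: Vec<_> = "abc345abc901abc".search_indices("abc").collect();
assert_eq!(v, [(0, "abc"), (6, "abc"), (12, "abc")]);
let v: Vec<_> = "abc345abc901abc".rsearch_indices("abc").collect();
assert_eq!(v, [(12, "abc"), (6, "abc"), (0, "abc")]);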
Example trait: SearchIn
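use naive_opt::SearchIn;
let haystack = "111 a 111b";
let needle = "a";
// The trait methods are called on the needle.
let r = needle.search_in(haystack);
assert_eq!(r, Some(4));
assert_eq!("1".search_in(haystack), Some(0));
assert_eq!("1".rsearch_in(haystack), Some(8));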
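
The two traits mirror each other: Search is called on the haystack, SearchIn on the needle. The sketch below shows one way such a pair of extension traits can be laid out so that the haystack-side method simply forwards to the needle-side one. It is an illustration under stated assumptions, not the crate's source. The names `FindNeedle` and `FindIn` are hypothetical, and `str::find` stands in for the actual search routine.

```rust
/// Hypothetical haystack-side trait, in the spirit of `Search`.
trait FindNeedle {
    fn find_needle(&self, needle: &str) -> Option<usize>;
}

/// Hypothetical needle-side trait, in the spirit of `SearchIn`.
trait FindIn {
    fn find_in(&self, haystack: &str) -> Option<usize>;
}

impl FindIn for str {
    #[inline]
    fn find_in(&self, haystack: &str) -> Option<usize> {
        // Stand-in for the real search routine.
        haystack.find(self)
    }
}

impl FindNeedle for str {
    #[inline]
    fn find_needle(&self, needle: &str) -> Option<usize> {
        // Forward to the needle-side implementation; with static dispatch
        // and inlining, this wrapper adds no indirection of its own.
        needle.find_in(self)
    }
}
```

With both traits in scope, `"111 a 111b".find_needle("a")` and `"a".find_in("111 a 111b")` resolve to the same code path.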
Example Ignore ascii case match
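use naive_opt::Search;
let haystack = "111 a 111b";
let needle = "A";
let r = haystack.search_ignore_ascii_case(needle);
assert_eq!(r, Some(4));
assert_eq!(haystack.rsearch_ignore_ascii_case(needle), Some(4));
let v: Vec<_> = "abc345aBc901abc".search_indices_ignore_ascii_case("abc").collect();
assert_eq!(v, [(0, "abc"), (6, "aBc"), (12, "abc")]);
let v: Vec<_> = "abc345aBc901abc".rsearch_indices_ignore_ascii_case("abc").collect();
assert_eq!(v, [(12, "abc"), (6, "aBc"), (0, "abc")]);
assert_eq!(haystack.includes_ignore_ascii_case(needle), true);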
Benchmark results
- compiled with rustc 1.66.0 (69f9c33d7 2022-12-12)
| name | bench:en | bench:ja | musl:en | musl:ja |
|---|---|---|---|---|
| std_str_str | 757.800 μs | 521.920 μs | 728.020 μs | 520.580 μs |
| std_string_string | 760.410 μs | 525.830 μs | 733.070 μs | 536.770 μs |
| func_str_str | 102.100 μs | 121.790 μs | 122.460 μs | 122.300 μs |
| func_string_string | 101.720 μs | 123.290 μs | 102.760 μs | 121.960 μs |
| trait_str_str | 98.238 μs | 120.290 μs | 116.560 μs | 117.960 μs |
| trait_string_string | 98.459 μs | 120.490 μs | 98.106 μs | 118.940 μs |
| std_indices | 470.700 μs | 370.070 μs | 468.480 μs | 392.680 μs |
| func_indices | 100.840 μs | 118.750 μs | 101.620 μs | 146.220 μs |
| trait_indices | 100.920 μs | 118.810 μs | 101.070 μs | 145.120 μs |
- std is std::str::find()
- Times are in microseconds.
- :en is an English haystack.
- :ja is a Japanese haystack.
- musl is x86_64-unknown-linux-musl
- bench on Intel Q6600 @ 2.40GHz
Changelogs
References
- my research: string searching algorithm
- my research: string find
- wikipedia: string searching algorithm
- memx: rust crate for the fast mem lib
License
This project is licensed under either of
- Apache License, Version 2.0, (LICENSE-APACHE or https://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or https://opensource.org/licenses/MIT)
at your option.