# icu_segmenter [](https://crates.io/crates/icu_segmenter)
🚧 \[Experimental\] Segment strings by lines, graphemes, word, and sentences.
This module is published as its own crate ([`icu_segmenter`](https://docs.rs/icu_segmenter/latest/icu_segmenter/))
and as part of the [`icu`](https://docs.rs/icu/latest/icu/) crate. See the latter for more details on the ICU4X project.
This module contains segmenter implementation for the following rules.
- Line breaker that is compatible with [Unicode Standard Annex #14][UAX14] and CSS properties.
- Grapheme cluster breaker, word breaker, and sentence breaker that are compatible with
[Unicode Standard Annex #29][UAX29].
<div class="stab unstable">
🚧 This code is experimental; it may change at any time, in breaking or non-breaking ways,
including in SemVer minor releases. It can be enabled with the "experimental" feature
of the icu meta-crate. Use with caution.
<a href="https://github.com/unicode-org/icu4x/issues/2259">#2259</a>
</div>
[UAX14]: https://www.unicode.org/reports/tr14/
[UAX29]: https://www.unicode.org/reports/tr29/
## Examples
### Line Break
Segment a string with default options:
```rust
use icu::segmenter::LineBreakSegmenter;
let segmenter =
LineBreakSegmenter::try_new_unstable(&icu_testdata::unstable())
.expect("Data exists");
let breakpoints: Vec<usize> =
segmenter.segment_str("Hello World").collect();
assert_eq!(&breakpoints, &[6, 11]);
```
See [`LineBreakSegmenter`] for more examples.
### Grapheme Cluster Break
See [`GraphemeClusterBreakSegmenter`] for examples.
### Word Break
Segment a string:
```rust
use icu::segmenter::WordBreakSegmenter;
let segmenter =
WordBreakSegmenter::try_new_unstable(&icu_testdata::unstable())
.expect("Data exists");
let breakpoints: Vec<usize> =
segmenter.segment_str("Hello World").collect();
assert_eq!(&breakpoints, &[0, 5, 6, 11]);
```
See [`WordBreakSegmenter`] for more examples.
### Sentence Break
See [`SentenceBreakSegmenter`] for examples.
## More Information
For more information on development, authorship, contributing etc. please visit [`ICU4X home page`](https://github.com/unicode-org/icu4x).