detone 1.0.1

Decompose Vietnamese tone marks

detone

crates.io docs.rs Apache 2 / MIT dual-licensed

An iterator adapter that wraps an iterator over char. The input must be a sequence of chars in Normalization Form C (this precondition is not checked!). The adapter yields chars in one of two forms: either only the tone marks that wouldn't otherwise fit into windows-1258 are decomposed, or the text is decomposed into orthographic units.
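To make the idea of a decomposing iterator adapter concrete, here is a minimal, self-contained sketch using only the standard library. It does not use this crate's API or its data: the names `split_tone`, `SplitTones`, and `split_tones` are hypothetical, and the four-entry mapping table is an illustrative assumption covering only a handful of characters that have no precomposed form in windows-1258.

```rust
/// Hypothetical, deliberately tiny mapping table (NOT the crate's data).
/// Maps a precomposed character to (base letter, combining tone mark).
fn split_tone(c: char) -> Option<(char, char)> {
    match c {
        '\u{1EBF}' => Some(('\u{00EA}', '\u{0301}')), // ế -> ê + combining acute
        '\u{1EC7}' => Some(('\u{00EA}', '\u{0323}')), // ệ -> ê + combining dot below
        '\u{1EA3}' => Some(('a', '\u{0309}')),        // ả -> a + combining hook above
        '\u{00E3}' => Some(('a', '\u{0303}')),        // ã -> a + combining tilde
        _ => None,
    }
}

/// Illustrative adapter: wraps any iterator over `char` and yields
/// the base letter first, then the buffered combining mark.
struct SplitTones<I: Iterator<Item = char>> {
    inner: I,
    pending: Option<char>,
}

impl<I: Iterator<Item = char>> Iterator for SplitTones<I> {
    type Item = char;

    fn next(&mut self) -> Option<char> {
        // Emit a buffered tone mark before pulling more input.
        if let Some(mark) = self.pending.take() {
            return Some(mark);
        }
        let c = self.inner.next()?;
        match split_tone(c) {
            Some((base, mark)) => {
                self.pending = Some(mark);
                Some(base)
            }
            // Characters outside the table pass through unchanged.
            None => Some(c),
        }
    }
}

fn split_tones<I: Iterator<Item = char>>(inner: I) -> SplitTones<I> {
    SplitTones { inner, pending: None }
}

fn main() {
    // "Việt" in Normalization Form C; as in the crate, the sketch
    // does not check that the input really is NFC.
    let input = "Vi\u{1EC7}t";
    let out: String = split_tones(input.chars()).collect();
    assert_eq!(out, "Vi\u{00EA}\u{0323}t");
    println!("{}", out.escape_unicode());
}
```

The single-slot `pending` buffer is enough here because each table entry expands to exactly two chars. For the real adapter and its actual behavior in both modes, consult the generated API documentation.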

Use cases include preprocessing before encoding Vietnamese text into windows-1258 or converting precomposed Vietnamese text into a form that looks like it was written with the (non-IME) Vietnamese keyboard layout (e.g. for machine learning training or benchmarking purposes).

Licensing

Please see the file named COPYRIGHT.

Documentation

Generated API documentation is available online.

MSRV

Rust 1.60 to use the crate, Rust 1.67 to run its tests. If you need an even lower MSRV, pin version 1.0.0 of this crate; version 1.0.1 contains no non-test code changes.

Release Notes

1.0.1

  • Updated metadata, internal documentation, and the dev dependency.
  • No non-test code changes.

1.0.0

  • Initial release.