hebrew_accents 0.0.0

A library for working with Hebrew accents, addressing there complexities.
Documentation
  • Coverage
  • 50%
    1 out of 2 items documented0 out of 1 items with examples
  • Size
  • Source code size: 18.06 kB This is the summed size of all the files inside the crates.io package for this release.
  • Documentation size: 1.03 MB This is the summed size of all files generated by rustdoc for all configured targets
  • Ø build duration
  • this release: 10s Average build duration of successful builds.
  • all releases: 24s Average build duration of successful builds in releases after 2024-10-23.
  • Links
  • Roestdev/hebrew_accents
    1 0 0
  • crates.io
  • Dependencies
  • Versions
  • Owners
  • Roestdev

hebrew_accents

Handling Hebrew accents

Project Status

This project is currently in the design mode.

Current Version: v0.0.0.

This README file provides a general overview of what to expect from this library.

Description

This crate is going to be a library to find, filter, show Hebrew accents.

Brief overview in the accents in the Tanach

The accents used in the Tanach have been the subject of extensive scholarly research and discussion, with numerous books and articles written on the topic over the years. The system of accents employed in the Tanach is complex and nuanced, presenting a challenging area of study for scholars and researchers.

Several factors contribute to the complexity of Hebrew accents in the Tanach, including:

  • In the Tanach, there are two main systems of accents used: one for the majority of the books, known as the "Twenty-One Books" (which includes all the books except for the three poetic books), and another for the three poetic books of Psalms, Proverbs, and Job, known as "Three Books".

  • There are two major categories of Masoretic accents:

    • the disjunctive accents and
    • the conjunctive accents.
  • All disjunctive accents are ordered and are divided in groups

  • Accents can be disjunctive in the "Twenty-One Books" and conjunctive in the "Three Books"

  • Accents have three role at the same time:

    • To indicate of stressed syllables, but not always.
    • To indicate the syntactic relation in a sentence.
    • Intonation of the words, used for singing the sentence.
  • It is possible that one Unicode code-point can be mapped to different Hebrew accents.

UTF-8 and Hebrew Accents: Challenges and Inconsistencies

  • The representation of Hebrew accents in UTF-8 is not without its challenges. One major issue is the inconsistent definition of two accent marks in the Tanach, which has led to confusion and potential errors. The affected marks are:

    • The Yetiv mark
    • The Tevir mark

    In various sources, the Yetiv mark is classified as a conjunctive accent, while the Tevir mark is considered a disjunctive accent. However, the Unicode tables seem to reverse this definition, treating the Yetiv mark as a disjunctive accent and the Tevir mark as a conjunctive accent.

  • Furthermore, the encoding of Hebrew accents in UTF-8 is complex, as a single accent can be represented by either one or two Unicode code-points.

  • Additionally, the mapping of Unicode code-points to Hebrew accents is not always one-to-one, meaning that a single code-point can potentially correspond to multiple Hebrew accents,pendingding of the context.

For more information see Unicode Problems.

Goals

  • Identify Hebrew accents within the provided text.
  • Offer the ability to filter specific accents.
  • Provide an option to display the identified accents.
  • Supply statistical information regarding the usage of the accents.

Non-Goals

  • Detect errors in the text related to accentuation rules.
  • Determine the type of text based on the identified accents.
  • Achieve absolute accuracy (100% correctness).

Examples

TODO

Releases

For an overview of released versions see releases.

How to install hebrew_accents

For installation see the hebrew_accents page at crates.io.

Safety

TODO

Panics

TODO

Errors

TODO

License

The hebrew_accents library is distributed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in this crate by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

References

Notes

  • Accents are sometimes called Hebrew Cantilationmarks