unimorph-rs
A Rust toolkit for working with UniMorph morphological data.
What is UniMorph?
UniMorph provides morphological paradigm data for 169+ languages in a unified annotation format. Each entry is a triple of lemma, inflected form, and morphological features:
lemma     form      features
parlare   parlo     V;IND;PRS;1;SG
parlare   parlato   V.PTCP;PST
essere    sono      V;IND;PRS;1;SG
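To make the triple format concrete, here is a minimal, standalone sketch of parsing such entries and doing a reverse lookup by surface form. It does not use the unimorph-rs API: the `Entry` struct and the `parse_line` and `index_by_form` functions are hypothetical names invented for this illustration. It assumes the raw UniMorph layout of one entry per line with tab-separated fields and semicolon-separated features.

```rust
use std::collections::HashMap;

/// One entry: lemma, inflected form, and feature bundle.
/// (Hypothetical type for illustration; not the unimorph-rs API.)
#[derive(Debug, Clone, PartialEq)]
struct Entry {
    lemma: String,
    form: String,
    features: Vec<String>,
}

/// Parse one tab-separated line.
/// Returns None unless the line has exactly three non-empty fields.
fn parse_line(line: &str) -> Option<Entry> {
    let mut fields = line.split('\t');
    let lemma = fields.next()?;
    let form = fields.next()?;
    let features = fields.next()?;
    if fields.next().is_some() || lemma.is_empty() || form.is_empty() || features.is_empty() {
        return None;
    }
    Some(Entry {
        lemma: lemma.to_string(),
        form: form.to_string(),
        // Features are split on ';', so "V.PTCP" stays a single feature.
        features: features.split(';').map(str::to_string).collect(),
    })
}

/// Index entries by surface form so a form can be mapped back to its analyses.
fn index_by_form(entries: &[Entry]) -> HashMap<&str, Vec<&Entry>> {
    let mut index: HashMap<&str, Vec<&Entry>> = HashMap::new();
    for entry in entries {
        index.entry(entry.form.as_str()).or_default().push(entry);
    }
    index
}

fn main() {
    let data = "parlare\tparlo\tV;IND;PRS;1;SG\n\
                parlare\tparlato\tV.PTCP;PST\n\
                essere\tsono\tV;IND;PRS;1;SG";
    // Lines that do not parse are skipped silently in this sketch.
    let entries: Vec<Entry> = data.lines().filter_map(parse_line).collect();
    let index = index_by_form(&entries);
    if let Some(analyses) = index.get("sono") {
        for entry in analyses {
            println!("{} <- {} [{}]", entry.form, entry.lemma, entry.features.join(";"));
        }
    }
}
```

An in-memory map is only for demonstration; the project structure below indicates that the core library keeps its data in a SQLite store.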
Installation
Homebrew (macOS/Linux)
Cargo
Docker
# Run with persistent data cache
From source
Quick Start
# Download Italian dataset
# Look up all forms of a verb
# Analyze a surface form (reverse lookup)
# Search with filters
# Dataset statistics
# Export to JSON Lines
Library Usage
use ;
async
Documentation
Full documentation is available at joshrotenberg.github.io/unimorph-rs.
Project Structure
unimorph-rs/
├── crates/
│ ├── unimorph-core/ # Core library: types, SQLite store, repository
│ └── unimorph-cli/ # Command-line interface
└── docs/ # mdBook documentation
License
Apache-2.0