conll
conll is a Rust Crate for efficiently parsing Treebanks in the CoNLL(-U) format.
Usage
You can use the parse program bundled with the crate, or you can use the
library programmatically with the following usage:
let lines: ;
let treebank = parse.unwrap;
Performance
The ConLL-U parser is quite fast. Here is the output of executing the binary
using time, on a 14MB file.
$ time ./target/release/parse nl_alpino-ud-dev.conllu -s
real 0m0.074s
user 0m0.054s
sys 0m0.019s
For comparison, here it is on a 195MB file.
time ./target/release/parse de_hdt-ud-train.conllu -s
real 0m5.006s
user 0m3.866s
sys 0m1.116s