oscar-tools 0.3.0

Tools for processing OSCAR Corpora

OSCAR-tools

This is a new set of tools for performing common tasks on the OSCAR corpus.

The program has a different set of tools for each corpus version:

  • v1: OSCAR 2019-like, text only (.txt files)
  • v2: OSCAR 22.01-like, JSONLines, document-oriented with annotations and line-level identifications
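To make the difference between the two layouts concrete, here is a minimal Python sketch that reads one sample of each. It is not part of oscar-tools and does not use its API. The sample data, the blank-line document separator for v1, and the v2 field names (`content`, `metadata`, `identification`, `annotation`, `sentence_identifications`) are assumptions chosen for illustration; check them against the corpus files you actually have.

```python
import json

# v1-like sample: plain text only.
# Assumption: documents are separated by a blank line.
V1_SAMPLE = (
    "First line of document one.\n"
    "Second line of document one.\n"
    "\n"
    "Only line of document two.\n"
)

# v2-like sample: one JSON object per line (JSON Lines).
# Assumption: the field names below are illustrative, not a guaranteed schema.
V2_SAMPLE = json.dumps({
    "content": "Hello world.\nBonjour le monde.",
    "metadata": {
        "identification": {"label": "en", "prob": 0.9},
        "annotation": ["short_sentences"],
        "sentence_identifications": [
            {"label": "en", "prob": 0.95},
            {"label": "fr", "prob": 0.9},
        ],
    },
}) + "\n"


def read_v1_documents(text):
    """Split plain text into documents, each a list of non-empty lines."""
    docs, current = [], []
    for line in text.splitlines():
        if line.strip():
            current.append(line)
        elif current:
            # A blank line closes the current document.
            docs.append(current)
            current = []
    if current:
        docs.append(current)
    return docs


def read_v2_documents(text):
    """Parse JSON Lines: one document object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def line_labels(doc):
    """Pair each content line with its line-level label (None if absent)."""
    lines = doc["content"].split("\n")
    idents = doc["metadata"]["sentence_identifications"]
    return [
        (line, ident["label"] if ident else None)
        for line, ident in zip(lines, idents)
    ]


v1_docs = read_v1_documents(V1_SAMPLE)
v2_docs = read_v2_documents(V2_SAMPLE)
```

In the v1-like layout a document is just its text, so any metadata has to be derived by the tool. In the v2-like layout each document carries its own annotations and per-line identifications, which is why a separate set of tools for each version makes sense.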