Crate gtars

Crate gtars 

Source
Expand description

§gtars: Performance-critical tools to manipulate, analyze, and process genomic interval data.

gtars is a rust crate that provides a set of tools for working with genomic interval data. Its primary goal is to provide processors for our python package, geniml, a library for machine learning on genomic intervals. However, it can be used as a standalone library for working with genomic intervals as well.

There are several modules in this crate. The most comprehensive is the tokenizers modules which houses genomic region tokenizers for use as pre-processors to machine learning pipelines.

Modules§

bbcache
core
Common, core utilities for gtars
fragsplit
igd
io
Input/Output utilities for genomic data.
overlaprs
refget
Rust implementation of GA4GH Refget sequence collection functions
scoring
tokenizers
Genomic data tokenizers and pre-processors to prepare interval data for machine learning pipelines.
uniwig