Expand description
gfatk
is a tool for Graphical Fragment Assembly (GFA) manipulation.
GFA’s are at their heart, simple, directed graphs. As such, all internal
representations of GFA’s are petgraph::Graph
’s.
gfatk
is designed mainly for reasonably small GFA files, and was designed for
de-tangling and linearising plant mitochondrial genomes on the command line.
Almost all of the core functionality of gfatk
resides in the gfatk::gfa
module. The other modules are entry points for the command line application.
Modules
- Make a DOT language representation of a GFA.
- Extract a subgraph from a GFA.
- Extract the putative plastid subgraph in a GFA.
- Extract the putative mitochondrial subgraph in a GFA.
- Print all the sequences in a GFA to fasta format.
- A module with all the methods to manipulate GFA’s in.
- Coerce a GFA into a fasta, finding the longest path through the graph.
- Helper functions to load a GFA from a file, or read from STDIN. Modified from https://github.com/chfi/rs-gfa-utils/blob/2065b001d107ee9f5d7abe04d65ab82193fc5904/src/commands.rs
- Generate overlapping sequences between segments in a GFA.
- Extract a fasta given a path.
- Rename a GFA’s segment ID’s to a 1-indexed
usize
. - Generate statistics about the input GFA file.
- Utility to trim a GFA of isolated nodes.
- Utility functions used throughout.