fxsplit 0.0.1

split FASTX into N chunks/files/headers
Documentation

Split modes

  • --chunks N: write N records per output file.
  • --files K: write exactly K output files with records as evenly distributed as possible.
  • --headers: FASTA-only mode, one output file per FASTA header using sanitized FASTA IDs as filenames.

--chunks, --files, and --headers are mutually exclusive.

Install

cargo install fxsplit

Usage

fxsplit --help
fxsplit --file input.fasta --chunks 1000 --outdir chunks
fxsplit --file input.fastq.gz --files 8 --outdir parts
fxsplit --file input.fasta.gz --headers --outdir by_header

Docker

Build:

docker build -t fxsplit:local .

Run:

docker run --rm fxsplit:local fxsplit --help
docker run --rm -v "$PWD:/data" fxsplit:local fxsplit --file /data/input.fasta --chunks 1000 --outdir /data/out

Pull:

docker pull alejandrogzi/fxsplit:latest