fxsplit 0.0.3

Split FASTX (FASTA/FASTQ) files by records per chunk, by number of output files, or by header.

Split modes

  • --chunks N: write N records per output file.
  • --files K: write exactly K output files with records as evenly distributed as possible.
  • --headers: FASTA-only mode; write one output file per FASTA header, using the sanitized FASTA ID as the filename.

--chunks, --files, and --headers are mutually exclusive.
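
The arithmetic behind the three modes can be sketched in a few lines of Rust. This is an illustration under stated assumptions, not fxsplit's actual source: the function names are hypothetical, the choice to give the surplus records to the first files in --files mode is one common convention, and the sanitization rule (keep ASCII letters, digits, '.', '_' and '-', replace everything else with '_') is assumed, since the rules fxsplit applies are not documented here.

```rust
/// Number of output files when writing `chunk_size` records per file
/// (the --chunks N idea). The last file may hold fewer records.
fn chunk_file_count(total: usize, chunk_size: usize) -> usize {
    assert!(chunk_size > 0, "chunk size must be positive");
    (total + chunk_size - 1) / chunk_size
}

/// Records per file when spreading `total` records over exactly `files`
/// outputs as evenly as possible (the --files K idea).
/// Assumption: the first `total % files` files receive one extra record.
fn even_split(total: usize, files: usize) -> Vec<usize> {
    assert!(files > 0, "file count must be positive");
    let base = total / files;
    let extra = total % files;
    (0..files)
        .map(|i| if i < extra { base + 1 } else { base })
        .collect()
}

/// Turn a FASTA header into a filesystem-friendly ID (the --headers idea).
/// Assumption: the ID is the first whitespace-delimited token after '>',
/// and any character outside [A-Za-z0-9._-] becomes '_'.
fn sanitize_id(header: &str) -> String {
    let id = header
        .trim_start_matches('>')
        .split_whitespace()
        .next()
        .unwrap_or("");
    id.chars()
        .map(|c| {
            if c.is_ascii_alphanumeric() || matches!(c, '.' | '_' | '-') {
                c
            } else {
                '_'
            }
        })
        .collect()
}
```

Under these assumptions, 2500 records with a chunk size of 1000 give three files, and ten records spread over three files give sizes of 4, 3 and 3. The sizes in an even split never differ by more than one record.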

Install

cargo install fxsplit

Usage

fxsplit --help
fxsplit --file input.fasta --chunks 1000 --outdir chunks
fxsplit --file input.fastq.gz --files 8 --outdir parts
fxsplit --file input.fasta.gz --headers --outdir by_header

Docker

Build:

docker build -t fxsplit:local .

Run:

docker run --rm fxsplit:local fxsplit --help
docker run --rm -v "$PWD:/data" fxsplit:local fxsplit --file /data/input.fasta --chunks 1000 --outdir /data/out

Pull:

docker pull alejandrogzi/fxsplit:latest

Nextflow

Borrow the fxsplit module from fxsplit/main.nf and use it in your pipeline.

Conda

conda install -c bioconda fxsplit