barkit-1.0.0-rc.1 has been yanked.
BarKit
[!WARNING]
This tool is under development. Please use the first release version when it becomes available.
BarKit (Barcodes Toolkit) is a toolkit designed for manipulating FASTQ barcodes.
Installation
Extract Command
The extract command is designed to parse barcode sequences from FASTQ reads using approximate regex matching based on a provided pattern.
All parsed barcode sequences are moved to the read header with base quality separated by colons:
@SEQ_ID UMI:ATGC:???? CB:ATGC:???? SB:ATGC:????
- UMI: Unique Molecular Identifier (Molecular Barcode)
- CB: Cell Barcode
- SB: Sample Barcode
Examples
Parse the first twelve nucleotides as a UMI from each forward read:
Parse the first sixteen nucleotides as a cell barcode from each reverse read before the atgccat sequence:
[!NOTE] Use lowercase letters for fuzzy match patterns.