Module jaspar16

Source
Expand description

Parser implementation for matrices in JASPAR (2016) format.

The JASPAR database stores manually curated DNA-binding sites as count matrices.

The JASPAR files contains a FASTA-like header line for each record, followed by one line per symbol storing tab-separated counts at each position. The 2016 version introduces bracketed matrix columns for each symbol, allowing for non-standard alphabets to be used:

>MA0001.3	AGL3
A  [     0      0     82     40     56     35     65     25     64      0 ]
C  [    92     79      1      4      0      0      1      4      0      0 ]
G  [     0      0      2      3      1      0      4      3     28     92 ]
T  [     3     16     10     48     38     60     25     63      3      3 ]

Structs§

Reader
An iterative reader for the JASPAR (2016) format.
Record
A JASPAR (2016) record.

Functions§

read
Read the records from a file in JASPAR (2016) format.