Module node

Module node 

Source
Expand description

Node management and scoring for gene prediction

This module contains all functionality related to creating, scoring, and managing gene prediction nodes. Nodes represent potential start and stop codons in the sequence and are used in dynamic programming to find optimal gene predictions.

Functionsยง

add_nodes
Add nodes for start and stop codons in both directions
calc_orf_gc
calculate_dicodon_gene
Calculate dicodon (6-mer) frequencies in genes and store log likelihood scores relative to background frequencies in the training structure.
find_best_upstream_motif
Find the highest scoring motif/spacer combination for a node.
intergenic_mod
Calculate intergenic modification score for operons and overlaps
raw_coding_score
Calculate raw coding scores
rbs_score
RBS Scoring Function: Calculate the RBS motif and then multiply it by the appropriate weight for that motif (determined in the start training function).
record_gc_bias
Record GC bias for training
record_overlapping_starts
reset_node_scores
Reset all scoring data for nodes to default values
score_nodes
Score nodes for gene finding
sort_nodes_by_position
Sort nodes by genomic position with strand ordering