Crate drain_rs[−][src]
Expand description
Categorized semi-structured text utilizing the drain algorithm: https://arxiv.org/pdf/1806.04356.pdf The main implementation is a fixed-sized prefix tree. Consequently, this assumes that splits that give us more information come earlier in the text.
This might prove to not be optimal given some text formats.
Examples:
Given log values:
Node 2 is online Node 4 going offline
With a fixed tree depth of 3 we would get the following splits
4 // initial root is the number of tokens
|
“Node” // first prefix node of value “Node”
|
“<*>” // Numbers are assumed to be variable and are replaced with wildcard
/
“is” “going” // last two splits of is and going
/
[Node * is online] [Node * going offline] //the individual text templates for this simple case
Structs
DrainTree | Main drain algorithm implementation Contains the structure of the drain prefix tree along with configuration options |
LogCluster | Represents a cluster of logs |