pub fn predict_tree_x4(nodes: &[PackedNode], features: [&[f32]; 4]) -> [f32; 4]
Predict 4 samples through one tree simultaneously.
Exploits CPU out-of-order execution by keeping 4 independent traversal states. Each sample follows its own path through the tree, so no traversal depends on another's data and the CPU can overlap their memory loads instead of stalling on one node fetch at a time.
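The interleaving idea can be illustrated with a minimal sketch. It is not the crate's implementation: `SketchNode` and `predict_x4_sketch` are hypothetical names, and the node layout (a leaf flag, the leaf value stored in `threshold`, the right child at `left + 1`) is an assumption made for illustration; the real `PackedNode` layout may differ.

```rust
/// Hypothetical node layout, for illustration only (not the real `PackedNode`).
#[derive(Clone, Copy)]
struct SketchNode {
    feature: u32,   // index of the feature to test (unused for leaves)
    threshold: f32, // split threshold, or the leaf value when `is_leaf` is true
    left: u32,      // index of the left child; right child assumed at `left + 1`
    is_leaf: bool,
}

/// Walks 4 samples through one tree, advancing every lane by one level per pass.
fn predict_x4_sketch(nodes: &[SketchNode], features: [&[f32]; 4]) -> [f32; 4] {
    // One traversal state (current node index) per sample, all starting at the root.
    let mut idx = [0usize; 4];
    loop {
        let mut any_active = false;
        // The four lane updates do not depend on each other, so an
        // out-of-order core is free to issue their loads back to back.
        for lane in 0..4 {
            let node = nodes[idx[lane]];
            if !node.is_leaf {
                let go_right = features[lane][node.feature as usize] > node.threshold;
                idx[lane] = node.left as usize + go_right as usize;
                any_active = true;
            }
        }
        // Stop once every lane is parked on a leaf.
        if !any_active {
            break;
        }
    }
    [
        nodes[idx[0]].threshold,
        nodes[idx[1]].threshold,
        nodes[idx[2]].threshold,
        nodes[idx[3]].threshold,
    ]
}
```

Lanes that reach a leaf early simply stay there while the deeper lanes finish, so the loop runs for as many passes as the longest of the four paths.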