pub fn atan2f(a: f32, b: f32) -> f32
A simple port of the atan2 implementation described at https://mazzo.li/posts/vectorized-atan2.html. The port is fast and mostly branch-free.
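
For illustration, here is a minimal sketch of the general technique: approximate atan with an odd polynomial on [-1, 1], then fix up the quadrant with selects rather than data-dependent branches. It is not this crate's actual source. The names `atan_poly` and `atan2_sketch` are hypothetical, the coefficients are the ones I believe the linked post uses and should be checked against it, and the `(y, x)` argument order is an assumption, since the signature above only names its parameters `a` and `b`.

```rust
use std::f32::consts::{FRAC_PI_2, PI};

/// Odd polynomial approximation of atan(x), intended for x in [-1, 1].
/// Coefficients are assumed to match the linked post; verify before relying on them.
fn atan_poly(x: f32) -> f32 {
    const A1: f32 = 0.99997726;
    const A3: f32 = -0.33262347;
    const A5: f32 = 0.19354346;
    const A7: f32 = -0.11643287;
    const A9: f32 = 0.05265332;
    const A11: f32 = -0.01172120;
    let x2 = x * x;
    // Horner evaluation of x * (A1 + A3 x^2 + A5 x^4 + ... + A11 x^10).
    x * (A1 + x2 * (A3 + x2 * (A5 + x2 * (A7 + x2 * (A9 + x2 * A11)))))
}

/// Hypothetical sketch of an approximate atan2(y, x).
pub fn atan2_sketch(y: f32, x: f32) -> f32 {
    // Keep the polynomial input inside [-1, 1] by dividing the smaller
    // magnitude by the larger one.
    let swap = x.abs() < y.abs();
    let ratio = if swap { x / y } else { y / x };
    let mut res = atan_poly(ratio);
    // If the operands were swapped, use atan(1/t) = sign(t) * pi/2 - atan(t).
    if swap {
        res = FRAC_PI_2.copysign(ratio) - res;
    }
    // Shift results for the left half-plane into the correct quadrant.
    if x < 0.0 {
        res += PI.copysign(y);
    }
    res
}
```

The `if` expressions here only choose between two already-computed values, so a compiler can usually lower them to selects, though that is not guaranteed. The sketch does not handle `(0, 0)` or NaN inputs, and it can differ from `f32::atan2` when `x` is negative zero.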