pub unsafe fn classify_bytes(input: &[u8]) -> Vec<u8> โ
Classify each byte using NEON โ 16 bytes at a time.
Caller must ensure NEON support (always true on aarch64).