Optimizer kinds. F4 ships the SGD and AdamW configs; the actual parameter-update kernels land in F4.x, once the gradient buffers are flowing through NCCL.
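
As a rough illustration of what a config-only optimizer layer might look like, here is a minimal Python sketch. Everything in it is an assumption rather than the project's actual API: the names `SGDConfig`, `AdamWConfig`, `OptimizerConfig`, `validate`, and `apply_update` are hypothetical, and the default hyperparameters simply mirror commonly used values. The update step is deliberately left as a stub, since the kernels are out of scope at this stage.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class SGDConfig:
    """Hypothetical SGD hyperparameters (field names are illustrative)."""
    lr: float
    momentum: float = 0.0
    weight_decay: float = 0.0


@dataclass(frozen=True)
class AdamWConfig:
    """Hypothetical AdamW hyperparameters (defaults are assumed, not sourced)."""
    lr: float
    beta1: float = 0.9
    beta2: float = 0.999
    eps: float = 1e-8
    weight_decay: float = 0.01


# The set of optimizer kinds this sketch knows about.
OptimizerConfig = Union[SGDConfig, AdamWConfig]


def validate(cfg: OptimizerConfig) -> OptimizerConfig:
    """Reject obviously invalid hyperparameters; return cfg unchanged otherwise."""
    if cfg.lr <= 0.0:
        raise ValueError("lr must be positive")
    if cfg.weight_decay < 0.0:
        raise ValueError("weight_decay must be non-negative")
    if isinstance(cfg, SGDConfig):
        if not 0.0 <= cfg.momentum < 1.0:
            raise ValueError("momentum must be in [0, 1)")
    elif isinstance(cfg, AdamWConfig):
        if not (0.0 <= cfg.beta1 < 1.0 and 0.0 <= cfg.beta2 < 1.0):
            raise ValueError("beta1 and beta2 must be in [0, 1)")
        if cfg.eps <= 0.0:
            raise ValueError("eps must be positive")
    else:
        raise TypeError(f"unknown optimizer kind: {type(cfg).__name__}")
    return cfg


def apply_update(cfg: OptimizerConfig, params, grads):
    """Placeholder: no update kernel is implemented in this sketch."""
    raise NotImplementedError("parameter-update kernels are not part of this sketch")
```

Keeping the configs as frozen, validated value types means they can be constructed, serialized, and checked before any gradient plumbing exists, and the update path can be filled in later without changing how callers select an optimizer kind.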