pub fn estimate_vram_gb(model_name: &str) -> f64
Estimate VRAM usage for a model based on parameter count and quantization.