Module models

Module models 

Source
Expand description

Pre-configured model definitions

This module contains model structs for various LLM providers. Each model implements the Model trait and provider-specific traits like BedrockModel or AnthropicModel.

Models are organized by vendor:

  • claude - Anthropic Claude models
  • llama - Meta Llama models
  • nova - Amazon Nova models
  • mistral - Mistral AI models
  • cohere - Cohere models
  • qwen - Alibaba Qwen models
  • google - Google models
  • deepseek - DeepSeek models
  • kimi - Moonshot Kimi models

Structsยง

Claude3_7Sonnet
Claude 3.7 Sonnet - Latest Claude 3.x with improved reasoning
ClaudeHaiku4_5
Claude Haiku 4.5 - Fast, efficient model for high-throughput tasks
ClaudeOpus4
Claude Opus 4 - High capability reasoning model
ClaudeOpus4_5
Claude Opus 4.5 - Most capable Claude model
ClaudeSonnet4
Claude Sonnet 4 - Balanced performance and cost
ClaudeSonnet4_5
Claude Sonnet 4.5 - Latest Sonnet with improved capabilities
CohereCommandRPlus
Command R+ - Enterprise RAG and multi-step tool use model
DeepSeekR1
DeepSeek R1 - Reasoning-focused model
DeepSeekV3
DeepSeek V3.1 - General purpose model
Gemma3_27B
Gemma 3 27B - Open multimodal model from Google
KimiK2Thinking
Kimi K2 Thinking - Reasoning-enhanced model from Moonshot AI
Llama3_1_8B
Llama 3.1 8B Instruct - Efficient general purpose model
Llama3_1_70B
Llama 3.1 70B Instruct - High capability model
Llama3_1_405B
Llama 3.1 405B Instruct - Largest open-weights model
Llama3_2_1B
Llama 3.2 1B Instruct - Lightweight model for edge deployment
Llama3_2_3B
Llama 3.2 3B Instruct - Efficient small model
Llama3_2_11B
Llama 3.2 11B Instruct - Medium multimodal model
Llama3_2_90B
Llama 3.2 90B Instruct - Large multimodal model
Llama3_3_70B
Llama 3.3 70B Instruct - Latest Llama 3.x flagship
Llama4Maverick17B
Llama 4 Maverick 17B - Larger MoE model with 1M context
Llama4Scout17B
Llama 4 Scout 17B - Efficient MoE model with 10M context
MagistralSmall
Magistral Small - Efficient 24B reasoning model with vision
MistralLarge3
Mistral Large 3 - Flagship 675B MoE model with 41B active parameters
Nova2Lite
Nova 2 Lite - Fast reasoning model with extended thinking support
NovaLite
Nova Lite - Multimodal model for image, video, and text
NovaMicro
Nova Micro - Lightweight, text-only model for simple tasks
NovaPremier
Nova Premier - Highest capability Nova model with 1M context
NovaPro
Nova Pro - Balanced multimodal model
Qwen3Coder480B
Qwen3 Coder 480B - Large coding-focused MoE model
Qwen3_235B
Qwen3 235B - Large MoE model with 22B active parameters