Module models

Expand description

Pre-configured model definitions

This module contains model structs for various LLM providers. Each model implements the Model trait and provider-specific traits like BedrockModel or AnthropicModel.

Models are organized by vendor:

claude - Anthropic Claude models
llama - Meta Llama models
nova - Amazon Nova models
mistral - Mistral AI models
cohere - Cohere models
qwen - Alibaba Qwen models
google - Google models
deepseek - DeepSeek models
kimi - Moonshot Kimi models

Structs§

Claude3_7Sonnet: Claude 3.7 Sonnet - Latest Claude 3.x with improved reasoning
ClaudeHaiku4_5: Claude Haiku 4.5 - Fast, efficient model for high-throughput tasks
ClaudeOpus4: Claude Opus 4 - High capability reasoning model
ClaudeOpus4_5: Claude Opus 4.5 - Most capable Claude model
ClaudeSonnet4: Claude Sonnet 4 - Balanced performance and cost
ClaudeSonnet4_5: Claude Sonnet 4.5 - Latest Sonnet with improved capabilities
CohereCommandRPlus: Command R+ - Enterprise RAG and multi-step tool use model
DeepSeekR1: DeepSeek R1 - Reasoning-focused model
DeepSeekV3: DeepSeek V3.1 - General purpose model
Gemma3_27B: Gemma 3 27B - Open multimodal model from Google
KimiK2Thinking: Kimi K2 Thinking - Reasoning-enhanced model from Moonshot AI
Llama3_1_8B: Llama 3.1 8B Instruct - Efficient general purpose model
Llama3_1_70B: Llama 3.1 70B Instruct - High capability model
Llama3_1_405B: Llama 3.1 405B Instruct - Largest open-weights model
Llama3_2_1B: Llama 3.2 1B Instruct - Lightweight model for edge deployment
Llama3_2_3B: Llama 3.2 3B Instruct - Efficient small model
Llama3_2_11B: Llama 3.2 11B Instruct - Medium multimodal model
Llama3_2_90B: Llama 3.2 90B Instruct - Large multimodal model
Llama3_3_70B: Llama 3.3 70B Instruct - Latest Llama 3.x flagship
Llama4Maverick17B: Llama 4 Maverick 17B - Larger MoE model with 1M context
Llama4Scout17B: Llama 4 Scout 17B - Efficient MoE model with 10M context
MagistralSmall: Magistral Small - Efficient 24B reasoning model with vision
MistralLarge3: Mistral Large 3 - Flagship 675B MoE model with 41B active parameters
Nova2Lite: Nova 2 Lite - Fast reasoning model with extended thinking support
NovaLite: Nova Lite - Multimodal model for image, video, and text
NovaMicro: Nova Micro - Lightweight, text-only model for simple tasks
NovaPremier: Nova Premier - Highest capability Nova model with 1M context
NovaPro: Nova Pro - Balanced multimodal model
Qwen3Coder480B: Qwen3 Coder 480B - Large coding-focused MoE model
Qwen3_235B: Qwen3 235B - Large MoE model with 22B active parameters

Module models

Module models Copy item path

Structs§

Module models