sapient-models — pre-built LLM architecture graph builders.
Each architecture module builds a SAPIENT Graph from a ModelInfo,
matching the exact HuggingFace architecture for that model family.
Supported architectures
| HuggingFace class | Module | Models |
|---|---|---|
LlamaForCausalLM |
llama |
Llama 1/2/3, Mistral, CodeLlama, Vicuna, WizardLM |
PhiForCausalLM |
phi |
Phi-1/2/3/3.5 |
GemmaForCausalLM |
gemma |
Gemma, Gemma 2 |
GPT2LMHeadModel |
gpt2 |
GPT-2, CodeGen, GPT-J |
BertForMaskedLM |
bert |
BERT, RoBERTa, DistilBERT |
Qwen2ForCausalLM |
qwen |
Qwen, Qwen2, Qwen2.5 |
MixtralForCausalLM |
mixtral |
Mixtral-8x7B, Mixtral-8x22B |