sapient-models 0.1.11

Pre-built LLM architecture graph builders for SAPIENT — Llama, Mistral, Phi, Gemma, GPT-2, BERT, Qwen
Documentation

sapient-models — pre-built LLM architecture graph builders.

Each architecture module builds a SAPIENT Graph from a ModelInfo, matching the exact HuggingFace architecture for that model family.

Supported architectures

HuggingFace class Module Models
LlamaForCausalLM llama Llama 1/2/3, Mistral, CodeLlama, Vicuna, WizardLM
PhiForCausalLM phi Phi-1/2/3/3.5
GemmaForCausalLM gemma Gemma, Gemma 2
GPT2LMHeadModel gpt2 GPT-2, CodeGen, GPT-J
BertForMaskedLM bert BERT, RoBERTa, DistilBERT
Qwen2ForCausalLM qwen Qwen, Qwen2, Qwen2.5
MixtralForCausalLM mixtral Mixtral-8x7B, Mixtral-8x22B