Skip to main content

Module starcoder2

Module starcoder2 

Source
Expand description

StarCoder model implementation with quantization support.

StarCoder is a large language model optimized for code generation. This implementation provides quantization for reduced memory and compute.

Key characteristics:

  • Causal self-attention mechanism
  • Multi-query attention (MQA)
  • LayerNorm for normalization
  • Absolute positional embeddings
  • Support for 8-bit quantization

References:

Structsยง

Config
Model