StarCoder model implementation with quantization support.
StarCoder is a large language model optimized for code generation. This implementation supports quantized weights, which reduce memory use and compute cost.
Key characteristics:
- Causal self-attention mechanism
- Multi-query attention (MQA)
- LayerNorm for normalization
- Absolute positional embeddings
- Support for 8-bit quantization
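To make the 8-bit quantization item concrete, here is a minimal sketch of symmetric per-tensor quantization in plain Rust. It is an illustration only, not necessarily the scheme this implementation uses: the function names `quantize_i8` and `dequantize_i8` are hypothetical, and a real implementation may use per-block scales or a different rounding rule.

```rust
/// Hypothetical helper: symmetric per-tensor 8-bit quantization.
/// Returns the quantized values together with the scale needed to undo it.
fn quantize_i8(weights: &[f32]) -> (Vec<i8>, f32) {
    // The largest magnitude is mapped to 127, which fixes the scale.
    let max_abs = weights.iter().fold(0.0f32, |m, w| m.max(w.abs()));
    // Fall back to a scale of 1.0 for an all-zero tensor to avoid dividing by zero.
    let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
    let quantized = weights
        .iter()
        .map(|w| (w / scale).round().clamp(-127.0, 127.0) as i8)
        .collect();
    (quantized, scale)
}

/// Hypothetical helper: approximate reconstruction of the original weights.
fn dequantize_i8(quantized: &[i8], scale: f32) -> Vec<f32> {
    quantized.iter().map(|&v| v as f32 * scale).collect()
}
```

Each weight is stored in one byte instead of the four bytes of an `f32`, at the cost of a rounding error of at most half the scale per element.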
References:
- StarCoder Paper
- Model Card