//! Attention configuration trait.
//!
//! This trait abstracts the configuration differences between code predictor
//! and talker attention, allowing unified attention implementation.
/// Configuration trait for attention modules.
///
/// Implemented by `CodePredictorConfig` and `TalkerConfig` to provide
/// common configuration values needed by the unified attention implementation.