Code generation evaluation metrics
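Provides pass@k, the unbiased estimator of functional correctness for code generation models (Chen et al., 2021, “Evaluating Large Language Models Trained on Code”). Given n generated samples for a problem, of which c pass the unit tests, pass@k estimates the probability that at least one of k randomly drawn samples is correct.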
Functions
- pass_at_k - Compute pass@k: unbiased estimator of functional correctness.
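
To make the estimator concrete, here is a minimal sketch of the formula from Chen et al. (2021), pass@k = 1 - C(n-c, k) / C(n, k), evaluated as a running product so that no large binomial coefficients are formed. The function name `pass_at_k_sketch`, its parameter types, and its panic-on-bad-input behavior are assumptions made for illustration; this page does not show the actual signature of this module's function, which may differ.

```rust
/// Illustrative sketch only (hypothetical name and signature).
/// n: samples generated for one problem
/// c: samples that passed the unit tests
/// k: number of samples drawn (1 <= k <= n)
fn pass_at_k_sketch(n: u64, c: u64, k: u64) -> f64 {
    assert!(c <= n, "c must not exceed n");
    assert!(k >= 1 && k <= n, "k must be in 1..=n");

    // With fewer than k failing samples, every k-subset contains a pass.
    if n - c < k {
        return 1.0;
    }

    // C(n-c, k) / C(n, k) equals the product of (1 - k/i) for i in n-c+1..=n.
    // When c == 0 the range is empty and the product stays at 1.0.
    let mut all_fail = 1.0_f64;
    for i in (n - c + 1)..=n {
        all_fail *= 1.0 - (k as f64) / (i as f64);
    }
    1.0 - all_fail
}
```

For example, with n = 10, c = 3, and k = 1 the product is (7/8)(8/9)(9/10) = 0.7, so the estimate is 0.3, which matches the plain pass rate c/n. A benchmark-level score is usually the mean of this per-problem value across all problems.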