←Backopenai/human-eval0Copy as MarkdownView on GitHub↗3,263 stars·444 forks·Python·MIT·0 viewsHuman EvalFeaturesModel Evaluation and Benchmarking - Benchmark for functional correctness in code generation models.Pre-training Research - Benchmark dataset and evaluation code for assessing functional correctness.