HumanEval-ET

Dataset Information
Introduced
2023
License
Unknown
Homepage

Overview

Extension test cases of HumanEval, as well as generated code.

Variants: HumanEval-ET

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Code Generation EG-CFG (DeepSeek-V3-0324) Execution Guided Line-by-Line Code Generation 2025-06-12
Code Generation LPW (GPT-4o) Planning-Driven Programming: A Large Language … 2024-11-21

Research Papers

Recent papers with results on this dataset: