HumanEvalPack

Program Repair Benchmark

Showing 1 result | Metric: Pass@1

Top Performing Models

| Rank | Model | Paper | Pass@1 | Date | Code |
| --- | --- | --- | --- | --- | --- |
| 1 | MGDebugger (DeepSeek-Coder-V2-Lite) | From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging | 97.60 | 2024-10-02 | YerbaPage/MGDebugger |

All Papers (1)