Mostly Basic Python Programming
The benchmark consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry-level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases.
Variants: MBPP
This dataset is used in 1 benchmark:
Recent papers with results on this dataset: