TACO-BAAI

Topics in Algorithmic Code generation dataset

Dataset Information
Modalities
Texts
Languages
English
Introduced
2023
License
Apache-2.0 license
Homepage

Overview

TACO (Topics in Algorithmic Code generation dataset) is a dataset focused on algorithmic code generation, designed to provide a more challenging training dataset and evaluation benchmark for the code generation model field. The dataset consists of programming competition problems that are more difficult and closer to real programming scenarios. It emphasizes improving or evaluating the model's understanding and reasoning abilities in practical application scenarios, rather than just implementing predefined function functionalities.

Variants: TACO-BAAI

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Code Generation GPT-4 TACO: Topics in Algorithmic COde … 2023-12-22
Code Generation Starcoder-15.5B TACO: Topics in Algorithmic COde … 2023-12-22
Code Generation CodeLlama-7B-Python TACO: Topics in Algorithmic COde … 2023-12-22

Research Papers

Recent papers with results on this dataset: