CONCODE

Dataset Information
License
Unknown
Homepage

Overview

A new large dataset with over 100,000 examples consisting of Java classes from online code repositories, and develop a new encoder-decoder architecture that models the interaction between the method documentation and the class environment.

Source: Mapping Language to Code in Programmatic Context

Variants: CONCODE

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Code Generation CodeT5 CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder … 2021-09-02
Code Generation Redcoder-ext Retrieval Augmented Code Generation and … 2021-08-26

Research Papers

Recent papers with results on this dataset: