Django

Dataset Information
Modalities
Texts
Introduced
2015
License
Unknown
Homepage

Overview

The Django dataset is a dataset for code generation comprising of 16000 training, 1000 development and 1805 test annotations. Each data point consists of a line of Python code together with a manually created natural language description.

Source: Latent Predictor Networks for Code Generation
Image Source: https://github.com/microsoft/vscode-docs/issues/2696

Variants: Django

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Code Generation TranX + BERT w/mined The impact of lexical and … 2022-02-28
Code Generation BERT + TAE Code Generation from Natural Language … 2021-01-01
Code Generation Tranx TRANX: A Transition-based Neural Abstract … 2018-10-05
Code Generation lpn (Ling et al., 2016) Latent Predictor Networks for Code … 2016-03-22
Code Generation Phrasal Statistical MT (Ling et al., 2016) Latent Predictor Networks for Code … 2016-03-22

Research Papers

Recent papers with results on this dataset: