The Django dataset is a dataset for code generation comprising of 16000 training, 1000 development and 1805 test annotations. Each data point consists of a line of Python code together with a manually created natural language description.
Source: Latent Predictor Networks for Code Generation
Image Source: https://github.com/microsoft/vscode-docs/issues/2696
Variants: Django
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Code Generation | TranX + BERT w/mined | The impact of lexical and … | 2022-02-28 |
Code Generation | BERT + TAE | Code Generation from Natural Language … | 2021-01-01 |
Code Generation | Tranx | TRANX: A Transition-based Neural Abstract … | 2018-10-05 |
Code Generation | lpn (Ling et al., 2016) | Latent Predictor Networks for Code … | 2016-03-22 |
Code Generation | Phrasal Statistical MT (Ling et al., 2016) | Latent Predictor Networks for Code … | 2016-03-22 |
Recent papers with results on this dataset: