The Python dataset introduced in the Parallel Corpus paper (A Parallel Corpus of Python Functions and Documentation Strings for Automated Code Documentation and Code Generation) is commonly used for evaluating automated code summarization.
Variants: ParallelCorpus-Python
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Source Code Summarization | AdaMo-noise | Assemble Foundation Models for Automatic … | 2022-01-13 |
Source Code Summarization | AdaMo-basic | Assemble Foundation Models for Automatic … | 2022-01-13 |
Recent papers with results on this dataset: