ParallelCorpus-Python

Dataset Information
License: Unknown

Overview

ParallelCorpus-Python is the Python dataset introduced in the Parallel Corpus paper, "A Parallel Corpus of Python Functions and Documentation Strings for Automated Code Documentation and Code Generation". It pairs Python functions with their documentation strings and is commonly used to evaluate automated code summarization.
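
To make the idea of a function/docstring pair concrete, the sketch below collects such pairs from a snippet of Python source using the standard library ast module. It is only an illustration under stated assumptions: the helper name extract_pairs and the SAMPLE snippet are hypothetical, and this is not the extraction pipeline used by the paper's authors.

```python
import ast


def extract_pairs(source):
    """Return (function source, docstring) pairs for documented functions.

    Hypothetical helper for illustration; not the dataset's own tooling.
    """
    pairs = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            doc = ast.get_docstring(node)
            if doc:  # skip functions that have no docstring
                code = ast.get_source_segment(source, node)
                pairs.append((code, doc))
    return pairs


# Made-up sample input: one documented and one undocumented function.
SAMPLE = '''
def add(a, b):
    """Return the sum of a and b."""
    return a + b

def helper(x):
    return x * 2
'''

pairs = extract_pairs(SAMPLE)
for code, doc in pairs:
    print(doc)
```

In a code summarization setting, the function source would serve as model input and the docstring as the reference summary. Only the documented function in SAMPLE produces a pair here.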

Variants: ParallelCorpus-Python

Associated Benchmarks

This dataset is used in 1 benchmark, for the Source Code Summarization task.

Recent Benchmark Submissions

Task                       Model        Paper                                       Date
Source Code Summarization  AdaMo-noise  Assemble Foundation Models for Automatic …  2022-01-13
Source Code Summarization  AdaMo-basic  Assemble Foundation Models for Automatic …  2022-01-13
