PhoMT

Dataset Information
Modalities: Texts
Languages: Vietnamese
Introduced: 2021
License
Homepage

Overview

PhoMT is a high-quality and large-scale Vietnamese-English parallel dataset of 3.02M sentence pairs for machine translation.
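A parallel dataset of this kind is typically consumed as aligned (Vietnamese, English) sentence pairs. The following is a minimal sketch of that pairing step, assuming the two sides are available as line-aligned text, with line i on one side translating line i on the other. That layout is an assumption and is not stated on this page. The function name `iter_sentence_pairs` and the sample sentences are hypothetical and serve only as illustration.

```python
from itertools import zip_longest

# Sentinel used to detect when one side runs out of lines before the other.
_MISSING = object()


def iter_sentence_pairs(vi_lines, en_lines):
    """Yield (vietnamese, english) tuples from two line-aligned iterables.

    Assumes line i of vi_lines is the translation of line i of en_lines.
    Raises ValueError if the two sides have different numbers of lines.
    Pairs where either side is empty after stripping are skipped.
    """
    for vi, en in zip_longest(vi_lines, en_lines, fillvalue=_MISSING):
        if vi is _MISSING or en is _MISSING:
            raise ValueError("the two sides have different numbers of lines")
        vi, en = vi.strip(), en.strip()
        if vi and en:
            yield vi, en


# Tiny in-memory stand-in for the two sides (illustrative sentences only).
sample_vi = ["Xin chào.\n", "Cảm ơn bạn.\n"]
sample_en = ["Hello.\n", "Thank you.\n"]

pairs = list(iter_sentence_pairs(sample_vi, sample_en))
for vi, en in pairs:
    print(f"{vi}\t{en}")
```

With real data, the same function could be given two open file handles instead of lists, since it only iterates over lines. The file names and layout would depend on how the dataset is actually distributed.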

Variants: PhoMT

Associated Benchmarks

This dataset is used in 1 benchmark.

Recent Benchmark Submissions

No recent benchmark submissions available for this dataset.

Research Papers

No papers with results on this dataset found.