DaLAJ

Dataset Information
Modalities
Texts
Languages
Swedish
Introduced
2021
License
CC BY 4.0
Homepage

Overview

DaLAJ 1.0, a dataset for Linguistic Acceptability Judgments for Swedish, comprising 9,596 sentences in its first version; and the initial experiment using it for the binary classification task. DaLAJ is based on the SweLL second language learner data, consisting of essays at different levels of proficiency.

Variants: DaLAJ

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Linguistic Acceptability Sw-BERT + H0M Acceptability Judgements via Examining the … 2022-05-19

Research Papers

Recent papers with results on this dataset: