DaLAJ 1.0, a dataset for Linguistic Acceptability Judgments for Swedish, comprising 9,596 sentences in its first version; and the initial experiment using it for the binary classification task. DaLAJ is based on the SweLL second language learner data, consisting of essays at different levels of proficiency.
Variants: DaLAJ
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Linguistic Acceptability | Sw-BERT + H0M | Acceptability Judgements via Examining the … | 2022-05-19 |
Recent papers with results on this dataset: