AVeriTeC

AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web

Dataset Information
Modalities
Texts
Languages
English
License
Unknown
Homepage

Overview

AVeriTeC (Automated Verification of Textual Claims) is a dataset of 4568 real-world claims covering fact-checks by 50 different organizations. Each claim is annotated with question-answer pairs supported by evidence available online, as well as textual justifications explaining how the evidence combines to produce a verdict. The Claims in AVeriTeC are classified into four labels: "Supported", "Refuted", "Not Enough Evidence", "Conflicting Evidence/Cherry-picking". The dataset also contains several fields of metadata such as the speaker of the claim, the publisher of the claim, the date the claim was published, and the location most relevant to the claim. These can be used to support questions, answers, and justifications.

Variants: AVeriTeC

Associated Benchmarks

This dataset is used in 1 benchmark:

  • Fact Checking -

Recent Benchmark Submissions

Task Model Paper Date
Fact Checking HerO HerO at AVeriTeC: The Herd … 2024-10-16
Fact Checking CTU AIC AIC CTU system at AVeriTeC: … 2024-10-15

Research Papers

Recent papers with results on this dataset: