CaseHOLD

Case Holdings On Legal Decisions

Dataset Information
Modalities
Texts
Languages
English
Introduced
2021
License
Unknown
Homepage

Overview

CaseHOLD (Case Holdings On Legal Decisions) is a law dataset comprised of over 53,000+ multiple choice questions to identify the relevant holding of a cited case. This dataset presents a fundamental task to lawyers and is both legally meaningful and difficult from an NLP perspective (F1 of 0.4 with a BiLSTM baseline). The citing context from the judicial decision serves as the prompt for the question. The answer choices are holding statements derived from citations following text in a legal decision. There are five answer choices for each citing text. The correct answer is the holding statement that corresponds to the citing text. The four incorrect answers are other holding statements.

To read more about the dataset, please see our paper or our blogpost.

Variants: CaseHOLD

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Few-Shot Learning CoT-T5-11B (1024 Shot) The CoT Collection: Improving Zero-shot … 2023-05-23
Question Answering Custom Legal-BERT When Does Pretraining Help? Assessing … 2021-04-18
Question Answering Legal-BERT When Does Pretraining Help? Assessing … 2021-04-18
Question Answering BERT When Does Pretraining Help? Assessing … 2021-04-18

Research Papers

Recent papers with results on this dataset: