CaseHOLD

Name: CaseHOLD
Published: 2021-04-18
License: Unknown

Case Holdings On Legal Decisions

Dataset Information

Modalities

Texts

Languages

English

Introduced

2021

License

Unknown

Homepage

Official Website

Overview

CaseHOLD (Case Holdings On Legal Decisions) is a law dataset comprising more than 53,000 multiple-choice questions, each asking for the relevant holding of a cited case. The task is fundamental for lawyers, legally meaningful, and difficult from an NLP perspective (F1 of 0.4 with a BiLSTM baseline). The citing context from the judicial decision serves as the prompt for the question. The answer choices are holding statements derived from citations following text in a legal decision. There are five answer choices for each citing text. The correct answer is the holding statement that corresponds to the citing text. The four incorrect answers are other holding statements.
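
The question format described above can be sketched as a small Python record. This is only an illustrative sketch: the class name `HoldingQuestion`, its field names, and the placeholder text are assumptions made for this example and do not reflect the dataset's actual file schema. The `accuracy` helper simply counts how often a predicted choice index matches the correct one.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class HoldingQuestion:
    """One multiple-choice item (hypothetical field names, not the real schema)."""
    citing_prompt: str    # citing context taken from the judicial decision
    holdings: List[str]   # the five candidate holding statements
    label: int            # index (0-4) of the correct holding statement

    def __post_init__(self) -> None:
        # The overview states that every citing text has five answer choices.
        if len(self.holdings) != 5:
            raise ValueError("expected exactly five answer choices")
        if not 0 <= self.label < 5:
            raise ValueError("label must index one of the five choices")


def accuracy(questions: Sequence[HoldingQuestion],
             predictions: Sequence[int]) -> float:
    """Fraction of questions whose predicted index equals the correct index."""
    if len(questions) != len(predictions):
        raise ValueError("need one prediction per question")
    if not questions:
        return 0.0
    correct = sum(q.label == p for q, p in zip(questions, predictions))
    return correct / len(questions)


# Placeholder item: the text below is invented for illustration only.
example = HoldingQuestion(
    citing_prompt="(placeholder citing context ending in a citation)",
    holdings=[f"(placeholder holding statement {i})" for i in range(5)],
    label=2,
)

# One correct and one incorrect prediction over two copies of the item.
score = accuracy([example, example], [2, 0])
```

Keeping the correct answer as an index into the list of choices, rather than as a separate string, makes it straightforward to compare against a model that outputs one of five class indices.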

To read more about the dataset, please see our paper or our blog post.

Variants: CaseHOLD

Associated Benchmarks

This dataset is used in 2 benchmarks:

Few-Shot Learning - Metrics: Accuracy
Question Answering - Metrics: Macro F1 (10-fold)
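
For readers unfamiliar with the macro F1 metric listed above, the following is a minimal sketch of how it could be computed for a five-way multiple-choice task. It is not the benchmark's official evaluation code: the function name `macro_f1` is hypothetical, the 10-fold procedure is not reproduced here, and treating a class with no true or predicted instances as F1 = 0 is a convention assumed for this example.

```python
from typing import Sequence


def macro_f1(y_true: Sequence[int], y_pred: Sequence[int],
             num_classes: int = 5) -> float:
    """Unweighted mean of per-class F1 scores over class indices 0..num_classes-1."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    f1_scores = []
    for c in range(num_classes):
        tp = sum(t == c and p == c for t, p in pairs)  # true positives
        fp = sum(t != c and p == c for t, p in pairs)  # false positives
        fn = sum(t == c and p != c for t, p in pairs)  # false negatives
        denom = 2 * tp + fp + fn
        # Assumed convention: a class that never appears scores 0.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / num_classes


# Perfect predictions over all five answer positions.
perfect = macro_f1([0, 1, 2, 3, 4], [0, 1, 2, 3, 4])

# Small two-class illustration with one mistake.
partial = macro_f1([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

Because macro averaging weights each class equally, a model that favors one answer position is penalized even if its plain accuracy looks reasonable.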

Recent Benchmark Submissions

Task	Model	Paper	Date
Few-Shot Learning	CoT-T5-11B (1024 Shot)	The CoT Collection: Improving Zero-shot …	2023-05-23
Question Answering	Custom Legal-BERT	When Does Pretraining Help? Assessing …	2021-04-18
Question Answering	Legal-BERT	When Does Pretraining Help? Assessing …	2021-04-18
Question Answering	BERT	When Does Pretraining Help? Assessing …	2021-04-18

Research Papers

Recent papers with results on this dataset:
