Red Teaming Innodata Bias
The Innodata Red Teaming Prompts dataset aims to rigorously assess models' factuality and safety. Because it was manually created and covers a broad range of scenarios, it enables a comprehensive examination of LLM performance across diverse conditions.
Variants: rt-inod-bias
This dataset is used in 1 benchmark:
| Task | Model | Paper | Date |
|---|---|---|---|
| Bias Detection | GPT-4 | Benchmarking Llama2, Mistral, Gemma and … | 2024-04-15 |
| Bias Detection | Gemma | Benchmarking Llama2, Mistral, Gemma and … | 2024-04-15 |
| Bias Detection | Baseline | Benchmarking Llama2, Mistral, Gemma and … | 2024-04-15 |
| Bias Detection | Mistral | Benchmarking Llama2, Mistral, Gemma and … | 2024-04-15 |
| Bias Detection | Llama2 | Benchmarking Llama2, Mistral, Gemma and … | 2024-04-15 |