rt-inod-bias

Red Teaming Innodata Bias

Dataset Information
Introduced
2024
License
Homepage

Overview

The Innodata Red Teaming Prompts aims to rigorously assess models’ factuality and safety. This dataset, due to its manual creation and breadth of coverage, facilitates a comprehensive examination of LLM performance across diverse scenarios.

Variants: rt-inod-bias

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Bias Detection GPT-4 Benchmarking Llama2, Mistral, Gemma and … 2024-04-15
Bias Detection Gemma Benchmarking Llama2, Mistral, Gemma and … 2024-04-15
Bias Detection Baseline Benchmarking Llama2, Mistral, Gemma and … 2024-04-15
Bias Detection Mistral Benchmarking Llama2, Mistral, Gemma and … 2024-04-15
Bias Detection Llama2 Benchmarking Llama2, Mistral, Gemma and … 2024-04-15

Research Papers

Recent papers with results on this dataset: