CausalGym

Dataset Information
Modalities
Texts
Languages
English
Introduced
2024
License
MIT
Homepage

Overview

SyntaxGym, adapted for interventional interpretability.

Variants: CausalGym

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Interpretability Techniques for Deep Learning DAS CausalGym: Benchmarking causal interpretability methods … 2024-02-19
Interpretability Techniques for Deep Learning Linear probe CausalGym: Benchmarking causal interpretability methods … 2024-02-19
Interpretability Techniques for Deep Learning Difference-in-means CausalGym: Benchmarking causal interpretability methods … 2024-02-19
Interpretability Techniques for Deep Learning k-means CausalGym: Benchmarking causal interpretability methods … 2024-02-19
Interpretability Techniques for Deep Learning PCA CausalGym: Benchmarking causal interpretability methods … 2024-02-19
Interpretability Techniques for Deep Learning LDA CausalGym: Benchmarking causal interpretability methods … 2024-02-19
Interpretability Techniques for Deep Learning Random CausalGym: Benchmarking causal interpretability methods … 2024-02-19

Research Papers

Recent papers with results on this dataset: