| A Critical Review of Causal Reasoning Benchmarks for Large Language Models | Jul 10, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Jul 10, 2024 | counterfactualFact Checking | CodeCode Available | 1 |
| Rigorous Probabilistic Guarantees for Robust Counterfactual Explanations | Jul 10, 2024 | Binary Classificationcounterfactual | CodeCode Available | 0 |
| Consistent Document-Level Relation Extraction via Counterfactuals | Jul 9, 2024 | counterfactualDocument-level Relation Extraction | CodeCode Available | 0 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 2 |
| New User Event Prediction Through the Lens of Causal Inference | Jul 8, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| FairPFN: Transformers Can do Counterfactual Fairness | Jul 8, 2024 | Causal Discoverycounterfactual | —Unverified | 0 |
| A Convexified Matching Approach to Imputation and Individualized Inference | Jul 7, 2024 | counterfactualImputation | —Unverified | 0 |
| CLIMB: A Benchmark of Clinical Bias in Large Language Models | Jul 7, 2024 | counterfactualDecision Making | CodeCode Available | 0 |
| Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks | Jul 5, 2024 | counterfactual | —Unverified | 0 |