| Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators | Apr 6, 2024 | Chatbotcounterfactual | CodeCode Available | 5 |
| OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | May 2, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 4 |
| On the limits of agency in agent-based models | Sep 14, 2024 | Computational Efficiencycounterfactual | CodeCode Available | 4 |
| Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Jan 16, 2025 | Causal Inferencecounterfactual | CodeCode Available | 4 |
| An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases | Jul 15, 2024 | Attributecounterfactual | CodeCode Available | 3 |
| Locating and Editing Factual Associations in GPT | Feb 10, 2022 | counterfactualModel Editing | CodeCode Available | 3 |
| Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Feb 7, 2024 | counterfactualImage Generation | CodeCode Available | 3 |
| Difference-in-Differences Estimation with Spatial Spillovers | May 8, 2021 | counterfactual | CodeCode Available | 3 |
| Sparse Autoencoders Find Highly Interpretable Features in Language Models | Sep 15, 2023 | counterfactualLanguage Modelling | CodeCode Available | 3 |
| Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs | Aug 23, 2023 | counterfactualQuestion Answering | CodeCode Available | 3 |
| OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Jan 14, 2025 | Code Generationcounterfactual | CodeCode Available | 2 |
| Towards Unifying Feature Attribution and Counterfactual Explanations: Different Means to the Same End | Nov 10, 2020 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Vision Language Models are Biased | May 29, 2025 | Board Gamescounterfactual | CodeCode Available | 2 |
| auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data | Apr 15, 2022 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers | Dec 6, 2019 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models | Aug 12, 2020 | counterfactualSentiment Analysis | CodeCode Available | 2 |
| Unbiased Scene Graph Generation from Biased Training | Feb 27, 2020 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning | Jun 4, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 2 |
| HourVideo: 1-Hour Video-Language Understanding | Nov 7, 2024 | Benchmarkingcounterfactual | CodeCode Available | 2 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Sep 4, 2023 | Benchmarkingcounterfactual | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Oct 7, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactualHallucination | CodeCode Available | 2 |
| SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Sep 23, 2024 | counterfactualPedestrian Trajectory Prediction | CodeCode Available | 2 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Jun 23, 2025 | counterfactualSentence | CodeCode Available | 2 |
| Interpretable Counterfactual Explanations Guided by Prototypes | Jul 3, 2019 | counterfactualDiagnostic | CodeCode Available | 2 |
| Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations | May 19, 2019 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI | Nov 22, 2024 | counterfactualCounterfactual Explanation | CodeCode Available | 2 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 2 |
| MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation | May 31, 2022 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR | Nov 1, 2017 | counterfactualDecision Making | CodeCode Available | 2 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Jun 11, 2025 | counterfactualDescriptive | CodeCode Available | 2 |
| Counterfactual Learning on Graphs: A Survey | Apr 3, 2023 | counterfactualFairness | CodeCode Available | 2 |
| Causal Reasoning and Large Language Models: Opening a New Frontier for Causality | Apr 28, 2023 | Causal DiscoveryCommon Sense Reasoning | CodeCode Available | 2 |
| CausalVAE: Structured Causal Disentanglement in Variational Autoencoder | Apr 18, 2020 | counterfactualDisentanglement | CodeCode Available | 2 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 |
| Counterfactual Phenotyping with Censored Time-to-Events | Feb 22, 2022 | counterfactualCounterfactual Reasoning | CodeCode Available | 2 |
| Extended Mind Transformers | Jun 4, 2024 | Common Sense Reasoningcounterfactual | CodeCode Available | 2 |
| OmniXAI: A Library for Explainable AI | Jun 1, 2022 | counterfactualCounterfactual Explanation | CodeCode Available | 2 |
| Fairness Evaluation for Uplift Modeling in the Absence of Ground Truth | Feb 12, 2024 | counterfactualDecision Making | CodeCode Available | 2 |
| Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Apr 9, 2025 | Contrastive Learningcounterfactual | CodeCode Available | 2 |
| Generative Enhancement for 3D Medical Images | Mar 19, 2024 | counterfactualImage Generation | CodeCode Available | 2 |
| CaRTS: Causality-driven Robot Tool Segmentation from Vision and Kinematics Data | Mar 15, 2022 | counterfactualSegmentation | CodeCode Available | 1 |
| CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms | Aug 2, 2021 | Benchmarkingcounterfactual | CodeCode Available | 1 |
| CA-SpaceNet: Counterfactual Analysis for 6D Pose Estimation in Space | Jul 16, 2022 | 6D Pose EstimationCausal Inference | CodeCode Available | 1 |
| Are self-explanations from Large Language Models faithful? | Jan 15, 2024 | counterfactualFaithfulness Critic | CodeCode Available | 1 |
| On Robustness and Bias Analysis of BERT-based Relation Extraction | Sep 14, 2020 | counterfactualRelation | CodeCode Available | 1 |
| Capabilities of GPT-4 on Medical Challenge Problems | Mar 20, 2023 | counterfactualMemorization | CodeCode Available | 1 |
| Causal Action Influence Aware Counterfactual Data Augmentation | May 29, 2024 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 |
| Calibrated Explanations: with Uncertainty Information and Counterfactuals | May 3, 2023 | counterfactualExplainable artificial intelligence | CodeCode Available | 1 |