| Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators | Apr 6, 2024 | Chatbotcounterfactual | CodeCode Available | 5 |
| Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Jan 16, 2025 | Causal Inferencecounterfactual | CodeCode Available | 4 |
| On the limits of agency in agent-based models | Sep 14, 2024 | Computational Efficiencycounterfactual | CodeCode Available | 4 |
| OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | May 2, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 4 |
| An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases | Jul 15, 2024 | Attributecounterfactual | CodeCode Available | 3 |
| Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Feb 7, 2024 | counterfactualImage Generation | CodeCode Available | 3 |
| Sparse Autoencoders Find Highly Interpretable Features in Language Models | Sep 15, 2023 | counterfactualLanguage Modelling | CodeCode Available | 3 |
| Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs | Aug 23, 2023 | counterfactualQuestion Answering | CodeCode Available | 3 |
| Locating and Editing Factual Associations in GPT | Feb 10, 2022 | counterfactualModel Editing | CodeCode Available | 3 |
| Difference-in-Differences Estimation with Spatial Spillovers | May 8, 2021 | counterfactual | CodeCode Available | 3 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Jun 23, 2025 | counterfactualSentence | CodeCode Available | 2 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Jun 11, 2025 | counterfactualDescriptive | CodeCode Available | 2 |
| Vision Language Models are Biased | May 29, 2025 | Board Gamescounterfactual | CodeCode Available | 2 |
| Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Apr 9, 2025 | Contrastive Learningcounterfactual | CodeCode Available | 2 |
| OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Jan 14, 2025 | Code Generationcounterfactual | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI | Nov 22, 2024 | counterfactualCounterfactual Explanation | CodeCode Available | 2 |
| HourVideo: 1-Hour Video-Language Understanding | Nov 7, 2024 | Benchmarkingcounterfactual | CodeCode Available | 2 |
| Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Oct 7, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactualHallucination | CodeCode Available | 2 |
| SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Sep 23, 2024 | counterfactualPedestrian Trajectory Prediction | CodeCode Available | 2 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 2 |
| Extended Mind Transformers | Jun 4, 2024 | Common Sense Reasoningcounterfactual | CodeCode Available | 2 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 |
| Generative Enhancement for 3D Medical Images | Mar 19, 2024 | counterfactualImage Generation | CodeCode Available | 2 |
| Fairness Evaluation for Uplift Modeling in the Absence of Ground Truth | Feb 12, 2024 | counterfactualDecision Making | CodeCode Available | 2 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Sep 4, 2023 | Benchmarkingcounterfactual | CodeCode Available | 2 |
| Causal Reasoning and Large Language Models: Opening a New Frontier for Causality | Apr 28, 2023 | Causal DiscoveryCommon Sense Reasoning | CodeCode Available | 2 |
| Counterfactual Learning on Graphs: A Survey | Apr 3, 2023 | counterfactualFairness | CodeCode Available | 2 |
| OmniXAI: A Library for Explainable AI | Jun 1, 2022 | counterfactualCounterfactual Explanation | CodeCode Available | 2 |
| MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation | May 31, 2022 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data | Apr 15, 2022 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| Counterfactual Phenotyping with Censored Time-to-Events | Feb 22, 2022 | counterfactualCounterfactual Reasoning | CodeCode Available | 2 |
| Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning | Jun 4, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 2 |
| Towards Unifying Feature Attribution and Counterfactual Explanations: Different Means to the Same End | Nov 10, 2020 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models | Aug 12, 2020 | counterfactualSentiment Analysis | CodeCode Available | 2 |
| CausalVAE: Structured Causal Disentanglement in Variational Autoencoder | Apr 18, 2020 | counterfactualDisentanglement | CodeCode Available | 2 |
| Unbiased Scene Graph Generation from Biased Training | Feb 27, 2020 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers | Dec 6, 2019 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| Interpretable Counterfactual Explanations Guided by Prototypes | Jul 3, 2019 | counterfactualDiagnostic | CodeCode Available | 2 |
| Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations | May 19, 2019 | BIG-bench Machine Learningcounterfactual | CodeCode Available | 2 |
| Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR | Nov 1, 2017 | counterfactualDecision Making | CodeCode Available | 2 |
| Diffusion-based Counterfactual Augmentation: Towards Robust and Interpretable Knee Osteoarthritis Grading | Jun 18, 2025 | Clinical Knowledgecounterfactual | CodeCode Available | 1 |
| Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts | Jun 12, 2025 | Causal Inferencecounterfactual | CodeCode Available | 1 |
| Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Jun 9, 2025 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs | May 14, 2025 | counterfactual | CodeCode Available | 1 |
| Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception | Apr 29, 2025 | counterfactualHallucination | CodeCode Available | 1 |
| Demand Estimation with Text and Image Data | Mar 26, 2025 | Attributecounterfactual | CodeCode Available | 1 |
| Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models | Mar 20, 2025 | counterfactualRAG | CodeCode Available | 1 |
| DeCaFlow: A Deconfounding Causal Generative Model | Mar 19, 2025 | Causal Inferencecounterfactual | CodeCode Available | 1 |