| Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators | Apr 6, 2024 | Chatbotcounterfactual | CodeCode Available | 5 |
| Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Jan 16, 2025 | Causal Inferencecounterfactual | CodeCode Available | 4 |
| On the limits of agency in agent-based models | Sep 14, 2024 | Computational Efficiencycounterfactual | CodeCode Available | 4 |
| OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | May 2, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 4 |
| An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases | Jul 15, 2024 | Attributecounterfactual | CodeCode Available | 3 |
| Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Feb 7, 2024 | counterfactualImage Generation | CodeCode Available | 3 |
| Sparse Autoencoders Find Highly Interpretable Features in Language Models | Sep 15, 2023 | counterfactualLanguage Modelling | CodeCode Available | 3 |
| Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs | Aug 23, 2023 | counterfactualQuestion Answering | CodeCode Available | 3 |
| Locating and Editing Factual Associations in GPT | Feb 10, 2022 | counterfactualModel Editing | CodeCode Available | 3 |
| Difference-in-Differences Estimation with Spatial Spillovers | May 8, 2021 | counterfactual | CodeCode Available | 3 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Jun 23, 2025 | counterfactualSentence | CodeCode Available | 2 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Jun 11, 2025 | counterfactualDescriptive | CodeCode Available | 2 |
| Vision Language Models are Biased | May 29, 2025 | Board Gamescounterfactual | CodeCode Available | 2 |
| Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Apr 9, 2025 | Contrastive Learningcounterfactual | CodeCode Available | 2 |
| OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Jan 14, 2025 | Code Generationcounterfactual | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI | Nov 22, 2024 | counterfactualCounterfactual Explanation | CodeCode Available | 2 |
| HourVideo: 1-Hour Video-Language Understanding | Nov 7, 2024 | Benchmarkingcounterfactual | CodeCode Available | 2 |
| Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Oct 7, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactualHallucination | CodeCode Available | 2 |
| SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Sep 23, 2024 | counterfactualPedestrian Trajectory Prediction | CodeCode Available | 2 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 2 |
| Extended Mind Transformers | Jun 4, 2024 | Common Sense Reasoningcounterfactual | CodeCode Available | 2 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 |
| Generative Enhancement for 3D Medical Images | Mar 19, 2024 | counterfactualImage Generation | CodeCode Available | 2 |