| Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models | Feb 27, 2025 | counterfactualLanguage Modeling | —Unverified | 0 |
| Can LLMs Explain Themselves Counterfactually? | Feb 25, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Iterative Counterfactual Data Augmentation | Feb 25, 2025 | counterfactualData Augmentation | CodeCode Available | 0 |
| Flexible Counterfactual Explanations with Generative Models | Feb 24, 2025 | counterfactual | CodeCode Available | 0 |
| All You Need for Counterfactual Explainability Is Principled and Reliable Estimate of Aleatoric and Epistemic Uncertainty | Feb 24, 2025 | Allcounterfactual | —Unverified | 0 |
| Toward a Flexible Framework for Linear Representation Hypothesis Using Maximum Likelihood Estimation | Feb 22, 2025 | counterfactualSand | —Unverified | 0 |
| Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations | Feb 22, 2025 | counterfactual | CodeCode Available | 0 |
| A novel approach to the relationships between data features -- based on comprehensive examination of mathematical, technological, and causal methodology | Feb 20, 2025 | Causal Inferencecounterfactual | —Unverified | 0 |
| SegSub: Evaluating Robustness to Knowledge Conflicts and Hallucinations in Vision-Language Models | Feb 19, 2025 | counterfactualHallucination | CodeCode Available | 0 |
| Population Dynamics Control with Partial Observations | Feb 19, 2025 | counterfactual | —Unverified | 0 |
| Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Feb 19, 2025 | counterfactualDecision Making | CodeCode Available | 0 |
| Robust Counterfactual Inference in Markov Decision Processes | Feb 19, 2025 | counterfactualCounterfactual Inference | —Unverified | 0 |
| RobustX: Robust Counterfactual Explanations Made Easy | Feb 19, 2025 | counterfactualDecision Making | CodeCode Available | 1 |
| Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images | Feb 19, 2025 | counterfactual | —Unverified | 0 |
| Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Feb 18, 2025 | counterfactualDeception Detection | —Unverified | 0 |
| Robust Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning | Feb 18, 2025 | counterfactual | CodeCode Available | 0 |
| Unsupervised Structural-Counterfactual Generation under Domain Shift | Feb 17, 2025 | counterfactual | —Unverified | 0 |
| Counterfactual-Consistency Prompting for Relative Temporal Understanding in Large Language Models | Feb 17, 2025 | counterfactual | —Unverified | 0 |
| DifCluE: Generating Counterfactual Explanations with Diffusion Autoencoders and modal clustering | Feb 17, 2025 | Clusteringcounterfactual | —Unverified | 0 |
| CounterBench: A Benchmark for Counterfactuals Reasoning in Large Language Models | Feb 16, 2025 | Commonsense Causal Reasoningcounterfactual | —Unverified | 0 |
| Causal Information Prioritization for Efficient Reinforcement Learning | Feb 14, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Revisiting the Berkeley Admissions data: Statistical Tests for Causal Hypotheses | Feb 14, 2025 | counterfactualFairness | —Unverified | 0 |
| Generating Causally Compliant Counterfactual Explanations using ASP | Feb 13, 2025 | Attributecounterfactual | —Unverified | 0 |
| Reevaluating Policy Gradient Methods for Imperfect-Information Games | Feb 13, 2025 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| Generalizability through Explainability: Countering Overfitting with Counterfactual Examples | Feb 13, 2025 | counterfactualData Augmentation | —Unverified | 0 |