SOTAVerified

Counterfactual Reasoning

Papers

Showing 101150 of 219 papers

TitleStatusHype
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation0
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions0
IITK-RSA at SemEval-2020 Task 5: Detecting Counterfactuals0
Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs0
Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation0
Intervention-based Recurrent Casual Model for Non-stationary Video Causal Discovery0
Intrinsic Social Motivation via Causal Influence in Multi-Agent RL0
Kindness in Multi-Agent Reinforcement Learning0
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games0
Learning to Communicate Using Counterfactual Reasoning0
Less is More: Attention Supervision with Counterfactuals for Text Classification0
Leveraging Contextual Counterfactuals Toward Belief Calibration0
Leveraging counterfactual concepts for debugging and improving CNN model performance0
MASCOTS: Model-Agnostic Symbolic COunterfactual explanations for Time Series0
Medical idioms for clinical Bayesian network development0
Mining Causality: AI-Assisted Search for Instrumental Variables0
Neural Causal Models for Counterfactual Identification and Estimation0
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning0
On the Arrow of Inference0
On Preemption and Overdetermination in Formal Theories of Causality0
On the Complexity of Counterfactual Reasoning0
Orca 2: Teaching Small Language Models How to Reason0
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows0
Popular News Always Compete for the User's Attention! POPK: Mitigating Popularity Bias via a Temporal-Counterfactual0
Probabilistic and Causal Satisfiability: Constraining the Model0
From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy0
Prompt Engineering a Prompt Engineer0
Prompting Large Language Models With the Socratic Method0
Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation0
Reducing Selection Bias in Counterfactual Reasoning for Individual Treatment Effects Estimation0
RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge0
Score-Based Explanations in Data Management and Machine Learning: An Answer-Set Programming Approach to Counterfactual Analysis0
Silico-centric Theory of Mind0
SocialLight: Distributed Cooperation Learning towards Network-Wide Traffic Signal Control0
Text-Driven Fashion Image Editing with Compositional Concept Learning and Counterfactual Abduction0
To do or not to do: finding causal relations in smart homes0
Towards Causal Model-Based Policy Optimization0
Causal Temporal Reasoning for Markov Decision Processes0
Transfer learning with causal counterfactual reasoning in Decision Transformers0
Treatment-Response Models for Counterfactual Reasoning with Continuous-time, Continuous-valued Interventions0
Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning0
Use-Case-Grounded Simulations for Explanation Evaluation0
Using Deep Image Priors to Generate Counterfactual Explanations0
Viewing the process of generating counterfactuals as a source of knowledge: a new approach for explaining classifiers0
Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning0
Watch Out for the Safety-Threatening Actors: Proactively Mitigating Safety Hazards0
WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models0
What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning0
When Counterfactual Reasoning Fails: Chaos and Real-World Complexity0
Who is Responsible? Explaining Safety Violations in Multi-Agent Cyber-Physical Systems0
Show:102550
← PrevPage 3 of 5Next →

No leaderboard results yet.