SOTAVerified

counterfactual

Papers

Showing 150 of 2765 papers

TitleStatusHype
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic EvaluatorsCode5
On the limits of agency in agent-based modelsCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual ReasoningCode4
Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMsCode3
Difference-in-Differences Estimation with Spatial SpilloversCode3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion ModelsCode3
An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Locating and Editing Factual Associations in GPTCode3
Sparse Autoencoders Find Highly Interpretable Features in Language ModelsCode3
OptiChat: Bridging Optimization Models and Practitioners with Large Language ModelsCode2
Unbiased Scene Graph Generation from Biased TrainingCode2
Preserving Causal Constraints in Counterfactual Explanations for Machine Learning ClassifiersCode2
MACE: An Efficient Model-Agnostic Framework for Counterfactual ExplanationCode2
SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory PredictionCode2
Thought Anchors: Which LLM Reasoning Steps Matter?Code2
Vision Language Models are BiasedCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
Interpretable Counterfactual Explanations Guided by PrototypesCode2
auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event DataCode2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object DetectionCode2
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement LearningCode2
OmniXAI: A Library for Explainable AICode2
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP ModelsCode2
Extended Mind TransformersCode2
Towards Unifying Feature Attribution and Counterfactual Explanations: Different Means to the Same EndCode2
Decomposing and Editing Predictions by Modeling Model ComputationCode2
Fairness Evaluation for Uplift Modeling in the Absence of Ground TruthCode2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityCode2
CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video ModelsCode2
Counterfactual Learning on Graphs: A SurveyCode2
Causal Reasoning and Large Language Models: Opening a New Frontier for CausalityCode2
CausalVAE: Structured Causal Disentanglement in Variational AutoencoderCode2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPRCode2
Counterfactual Phenotyping with Censored Time-to-EventsCode2
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAICode2
Explaining Machine Learning Classifiers through Diverse Counterfactual ExplanationsCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"Code2
Benchmarking Large Language Models in Retrieval-Augmented GenerationCode2
Generative Enhancement for 3D Medical ImagesCode2
HourVideo: 1-Hour Video-Language UnderstandingCode2
CaRTS: Causality-driven Robot Tool Segmentation from Vision and Kinematics DataCode1
CA-SpaceNet: Counterfactual Analysis for 6D Pose Estimation in SpaceCode1
Capabilities of GPT-4 on Medical Challenge ProblemsCode1
Are self-explanations from Large Language Models faithful?Code1
CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation AlgorithmsCode1
Causal Action Influence Aware Counterfactual Data AugmentationCode1
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room ScenesCode1
Algorithmic Recourse: from Counterfactual Explanations to InterventionsCode1
Show:102550
← PrevPage 1 of 56Next →

No leaderboard results yet.