SOTAVerified

counterfactual

Papers

Showing 150 of 2765 papers

TitleStatusHype
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic EvaluatorsCode5
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual ReasoningCode4
On the limits of agency in agent-based modelsCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Locating and Editing Factual Associations in GPTCode3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion ModelsCode3
Difference-in-Differences Estimation with Spatial SpilloversCode3
Sparse Autoencoders Find Highly Interpretable Features in Language ModelsCode3
Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMsCode3
OptiChat: Bridging Optimization Models and Practitioners with Large Language ModelsCode2
Towards Unifying Feature Attribution and Counterfactual Explanations: Different Means to the Same EndCode2
Vision Language Models are BiasedCode2
auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event DataCode2
Preserving Causal Constraints in Counterfactual Explanations for Machine Learning ClassifiersCode2
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP ModelsCode2
Unbiased Scene Graph Generation from Biased TrainingCode2
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement LearningCode2
HourVideo: 1-Hour Video-Language UnderstandingCode2
Benchmarking Large Language Models in Retrieval-Augmented GenerationCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityCode2
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"Code2
SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory PredictionCode2
Thought Anchors: Which LLM Reasoning Steps Matter?Code2
Interpretable Counterfactual Explanations Guided by PrototypesCode2
Explaining Machine Learning Classifiers through Diverse Counterfactual ExplanationsCode2
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAICode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
MACE: An Efficient Model-Agnostic Framework for Counterfactual ExplanationCode2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPRCode2
CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video ModelsCode2
Counterfactual Learning on Graphs: A SurveyCode2
Causal Reasoning and Large Language Models: Opening a New Frontier for CausalityCode2
CausalVAE: Structured Causal Disentanglement in Variational AutoencoderCode2
Decomposing and Editing Predictions by Modeling Model ComputationCode2
Counterfactual Phenotyping with Censored Time-to-EventsCode2
Extended Mind TransformersCode2
OmniXAI: A Library for Explainable AICode2
Fairness Evaluation for Uplift Modeling in the Absence of Ground TruthCode2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object DetectionCode2
Generative Enhancement for 3D Medical ImagesCode2
CaRTS: Causality-driven Robot Tool Segmentation from Vision and Kinematics DataCode1
CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation AlgorithmsCode1
CA-SpaceNet: Counterfactual Analysis for 6D Pose Estimation in SpaceCode1
Are self-explanations from Large Language Models faithful?Code1
On Robustness and Bias Analysis of BERT-based Relation ExtractionCode1
Capabilities of GPT-4 on Medical Challenge ProblemsCode1
Causal Action Influence Aware Counterfactual Data AugmentationCode1
Calibrated Explanations: with Uncertainty Information and CounterfactualsCode1
Show:102550
← PrevPage 1 of 56Next →

No leaderboard results yet.