SOTAVerified

counterfactual

Papers

Showing 76100 of 2765 papers

TitleStatusHype
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering0
Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals0
Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through CounterfactualsCode0
Characterization of Efficient Influence Function for Off-Policy Evaluation Under Optimal Policies0
Causal Cartographer: From Mapping to Reasoning Over Counterfactual WorldsCode0
Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image0
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability0
When Bias Backfires: The Modulatory Role of Counterfactual Explanations on the Adoption of Algorithmic Bias in XAI-Supported Human Decision-MakingCode0
Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models0
Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning0
Counterfactual Explanations for Continuous Action Reinforcement LearningCode0
From What Ifs to Insights: Counterfactuals in Causal Inference vs. Explainable AI0
The Stablecoin Discount: Evidence of Tether's U.S. Treasury Bill Market Share in Lowering Yields0
A New Bayesian Bootstrap for Quantitative Trade and Spatial Models0
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution BehaviorsCode0
EarthSynth: Generating Informative Earth Observation with Diffusion Models0
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals0
Finding Counterfactual Evidences for Node ClassificationCode0
Analysis of Customer Journeys Using Prototype Detection and Counterfactual Explanations for Sequential Data0
Beyond the Known: Decision Making with Counterfactual Reasoning Decision TransformerCode0
Counterfactual Strategies for Markov Decision Processes0
Sequential Treatment Effect Estimation with Unmeasured Confounders0
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMsCode1
Adaptively-weighted Nearest Neighbors for Matrix CompletionCode0
On the interplay of Explainability, Privacy and Predictive Performance with Explanation-assisted Model Extraction0
Show:102550
← PrevPage 4 of 111Next →

No leaderboard results yet.