| Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos | Oct 3, 2024 | counterfactual | Code Available | 1 |
| Reasoning Elicitation in Language Models via Counterfactual Feedback | Oct 2, 2024 | counterfactual, Question Answering | Unverified | 0 |
| Explainable Earth Surface Forecasting under Extreme Events | Oct 2, 2024 | counterfactual, Earth Observation | Code Available | 1 |
| Learning Personalized Treatment Decisions in Precision Medicine: Disentangling Treatment Assignment Bias in Counterfactual Outcome Prediction and Biomarker Identification | Oct 1, 2024 | counterfactual | Code Available | 0 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactual, Hallucination | Code Available | 2 |
| What If We Had Used a Different App? Reliable Counterfactual KPI Analysis in Wireless Systems | Sep 30, 2024 | Conformal Prediction, counterfactual | Code Available | 0 |
| Mitigating Propensity Bias of Large Language Models for Recommender Systems | Sep 30, 2024 | counterfactual, Counterfactual Inference | Unverified | 0 |
| Ads Supply Personalization via Doubly Robust Learning | Sep 29, 2024 | counterfactual | Unverified | 0 |
| Counterfactual Evaluation of Ads Ranking Models through Domain Adaptation | Sep 29, 2024 | counterfactual, Domain Adaptation | Unverified | 0 |
| Good Data Is All Imitation Learning Needs | Sep 26, 2024 | All, counterfactual | Unverified | 0 |