| WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models | Jun 12, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts | Jun 12, 2025 | Causal Inferencecounterfactual | CodeCode Available | 1 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Jun 11, 2025 | counterfactualDescriptive | CodeCode Available | 2 |
| ORIDa: Object-centric Real-world Image Composition Dataset | Jun 10, 2025 | counterfactualObject | —Unverified | 0 |
| Diffusion Counterfactual Generation with Semantic Abduction | Jun 9, 2025 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 |
| Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Jun 9, 2025 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Cross-Entropy Games for Language Models: From Implicit Knowledge to General Capability Measures | Jun 7, 2025 | Anomaly Detectioncounterfactual | —Unverified | 0 |
| CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents | Jun 6, 2025 | counterfactual | —Unverified | 0 |
| Enhancing the Merger Simulation Toolkit with ML/AI | Jun 5, 2025 | counterfactual | —Unverified | 0 |
| Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models | Jun 5, 2025 | counterfactualData Augmentation | CodeCode Available | 0 |
| Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation | Jun 5, 2025 | counterfactualRAG | CodeCode Available | 0 |
| Counterfactual reasoning: an analysis of in-context emergence | Jun 5, 2025 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactualEconometrics | —Unverified | 0 |
| WANDER: An Explainable Decision-Support Framework for HPC | Jun 4, 2025 | counterfactual | —Unverified | 0 |
| WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning | Jun 4, 2025 | counterfactualMotion Planning | —Unverified | 0 |
| A meaningful prediction of functional decline in amyotrophic lateral sclerosis based on multi-event survival analysis | Jun 2, 2025 | counterfactualSurvival Analysis | —Unverified | 0 |
| Pricing the Right to Renege in Search Markets: Evidence from Trucking | Jun 2, 2025 | counterfactual | —Unverified | 0 |
| Life Sequence Transformer: Generative Modelling for Counterfactual Simulation | Jun 2, 2025 | counterfactual | —Unverified | 0 |
| Counterfactual Activation Editing for Post-hoc Prosody and Mispronunciation Correction in TTS Models | Jun 1, 2025 | counterfactualSpeech Synthesis | —Unverified | 0 |
| Recover Experimental Data with Selection Bias using Counterfactual Logic | May 31, 2025 | Causal Inferencecounterfactual | —Unverified | 0 |
| Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes | May 30, 2025 | counterfactualVideo Generation | —Unverified | 0 |
| Data Fusion for Partial Identification of Causal Effects | May 30, 2025 | Causal Inferencecounterfactual | —Unverified | 0 |
| From Invariant Representations to Invariant Data: Provable Robustness to Spurious Correlations via Noisy Counterfactual Matching | May 30, 2025 | counterfactualDiversity | CodeCode Available | 0 |
| FOLIAGE: Towards Physical Intelligence World Models Via Unbounded Surface Evolution | May 29, 2025 | counterfactualCross-Modal Retrieval | —Unverified | 0 |
| DiCoFlex: Model-agnostic diverse counterfactuals with flexible control | May 29, 2025 | counterfactualDecision Making | —Unverified | 0 |