| Diffusion Counterfactual Generation with Semantic Abduction | Jun 9, 2025 | counterfactual, Counterfactual Reasoning | Code Available | 0 |
| Cross-Entropy Games for Language Models: From Implicit Knowledge to General Capability Measures | Jun 7, 2025 | Anomaly Detection, counterfactual | Unverified | 0 |
| CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents | Jun 6, 2025 | counterfactual | Unverified | 0 |
| Enhancing the Merger Simulation Toolkit with ML/AI | Jun 5, 2025 | counterfactual | Unverified | 0 |
| Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation | Jun 5, 2025 | counterfactual, RAG | Code Available | 0 |
| Counterfactual reasoning: an analysis of in-context emergence | Jun 5, 2025 | counterfactual, Counterfactual Reasoning | Code Available | 0 |
| Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models | Jun 5, 2025 | counterfactual, Data Augmentation | Code Available | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactual, Econometrics | Unverified | 0 |
| WANDER: An Explainable Decision-Support Framework for HPC | Jun 4, 2025 | counterfactual | Unverified | 0 |
| WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning | Jun 4, 2025 | counterfactual, Motion Planning | Unverified | 0 |