| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation | Feb 20, 2025 | Decision MakingEfficient Exploration | —Unverified | 0 |
| How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation | Feb 20, 2025 | Decision Making | CodeCode Available | 1 |
| Online detection of forecast model inadequacies using forecast errors | Feb 20, 2025 | Decision Making | —Unverified | 0 |
| STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Feb 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Human Misperception of Generative-AI Alignment: A Laboratory Experiment | Feb 20, 2025 | Decision Making | —Unverified | 0 |
| MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Feb 20, 2025 | Decision MakingHallucination | —Unverified | 0 |
| The Impact and Feasibility of Self-Confidence Shaping for AI-Assisted Decision-Making | Feb 20, 2025 | Decision Making | —Unverified | 0 |
| Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Feb 20, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition | Feb 19, 2025 | AttributeDecision Making | —Unverified | 0 |