| Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents | Jun 10, 2025 | Decision Making | —Unverified | 0 |
| Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study | Jun 10, 2025 | Code GenerationDecision Making | —Unverified | 0 |
| HGFormer: A Hierarchical Graph Transformer Framework for Two-Stage Colonel Blotto Games via Reinforcement Learning | Jun 10, 2025 | Decision MakingGraph Neural Network | —Unverified | 0 |
| Diffusion of Responsibility in Collective Decision Making | Jun 9, 2025 | Decision Making | —Unverified | 0 |
| A Unified Anti-Jamming Design in Complex Environments Based on Cross-Modal Fusion and Intelligent Decision-Making | Jun 9, 2025 | Decision Making | —Unverified | 0 |
| LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | Jun 9, 2025 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | Jun 9, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis | Jun 9, 2025 | Action ClassificationBenchmarking | —Unverified | 0 |
| Improving Fairness of Large Language Models in Multi-document Summarization | Jun 9, 2025 | AttributeDecision Making | CodeCode Available | 0 |
| Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation | Jun 9, 2025 | Decision MakingMuJoCo | —Unverified | 0 |