| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Jun 24, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Jun 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Jun 21, 2024 | Decision MakingManagement | CodeCode Available | 0 |
| MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading | Jun 20, 2024 | Algorithmic TradingDecision Making | CodeCode Available | 2 |
| Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond | Jun 19, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | Jun 19, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Efficient Sequential Decision Making with Large Language Models | Jun 17, 2024 | Decision MakingModel Selection | —Unverified | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Model Adaptation for Time Constrained Embodied Control | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |