| Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | Mar 19, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Fast Value Tracking for Deep Reinforcement Learning | Mar 19, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Supervised Fine-Tuning as Inverse Reinforcement Learning | Mar 18, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision MakingQ-Learning | —Unverified | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision MakingEdge-computing | —Unverified | 0 |
| Regret Minimization via Saddle Point Optimization | Mar 15, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision MakingIn-Context Learning | —Unverified | 0 |
| Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer | Mar 12, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 |
| LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem | Mar 10, 2024 | Computational EfficiencyDecision Making | —Unverified | 0 |