| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision Making, Q-Learning | Unverified | 0 |
| Supervised Fine-Tuning as Inverse Reinforcement Learning | Mar 18, 2024 | Decision Making, Imitation Learning | Unverified | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision Making, Edge-computing | Unverified | 0 |
| Regret Minimization via Saddle Point Optimization | Mar 15, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision Making, In-Context Learning | Unverified | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation Systems, Reinforcement Learning (RL) | Unverified | 0 |
| LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem | Mar 10, 2024 | Computational Efficiency, Decision Making | Unverified | 0 |
| Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Mar 8, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Cooperative Bayesian Optimization for Imperfect Agents | Mar 7, 2024 | Bayesian Optimization, Decision Making | Unverified | 0 |
| A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation | Mar 6, 2024 | Decision Making, reinforcement-learning | Unverified | 0 |