| Fast Value Tracking for Deep Reinforcement Learning | Mar 19, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | Mar 19, 2024 | Decision Making, Imitation Learning | Code Available | 0 |
| Supervised Fine-Tuning as Inverse Reinforcement Learning | Mar 18, 2024 | Decision Making, Imitation Learning | Unverified | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision Making, Q-Learning | Unverified | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision Making, Edge-computing | Unverified | 0 |
| Regret Minimization via Saddle Point Optimization | Mar 15, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision Making, In-Context Learning | Unverified | 0 |
| Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer | Mar 12, 2024 | Decision Making, Sequential Decision Making | Code Available | 1 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation Systems, Reinforcement Learning (RL) | Unverified | 0 |
| LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem | Mar 10, 2024 | Computational Efficiency, Decision Making | Unverified | 0 |
| TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision | Mar 10, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Mar 8, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Cooperative Bayesian Optimization for Imperfect Agents | Mar 7, 2024 | Bayesian Optimization, Decision Making | Unverified | 0 |
| A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation | Mar 6, 2024 | Decision Making, reinforcement-learning | Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Mar 1, 2024 | Decision Making, reinforcement-learning | Unverified | 0 |
| Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Feb 26, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| How Can LLM Guide RL? A Value-Based Approach | Feb 25, 2024 | Decision Making, Reinforcement Learning (RL) | Code Available | 1 |
| Reward Design for Justifiable Sequential Decision-Making | Feb 24, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian Optimization, Decision Making | Unverified | 0 |
| On the Performance of Empirical Risk Minimization with Smoothed Data | Feb 22, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Feb 22, 2024 | Autonomous Racing, Decision Making | Unverified | 0 |
| Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Feb 21, 2024 | Decision Making, Imitation Learning | Code Available | 2 |
| Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers | Feb 20, 2024 | Decision Making, Decoder | Unverified | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RL, Decision Making | Unverified | 0 |
| Self-evolving Autoencoder Embedded Q-Network | Feb 18, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Probability Tools for Sequential Random Projection | Feb 16, 2024 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Feb 16, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Feb 15, 2024 | All, Decision Making | Code Available | 2 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Online Sequential Decision-Making with Unknown Delays | Feb 12, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Feb 12, 2024 | Bayesian Optimization, Decision Making | Code Available | 0 |
| Auxiliary Reward Generation with Transition Distance Representation Learning | Feb 12, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Feb 9, 2024 | Computational Efficiency, continuous-control | Code Available | 1 |
| Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | Feb 8, 2024 | Decision Making, Physiological Computing | Unverified | 0 |
| Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Feb 7, 2024 | Decision Making, regression | Code Available | 1 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System | Feb 5, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Feb 5, 2024 | Decision Making, Language Modeling | Code Available | 1 |
| Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs | Feb 5, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Vertical Symbolic Regression via Deep Policy Gradient | Feb 1, 2024 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Zero-Shot Reinforcement Learning via Function Encoders | Jan 30, 2024 | Decision Making, reinforcement-learning | Code Available | 0 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision Making, Efficient Exploration | Code Available | 1 |
| Regularized Q-Learning with Linear Function Approximation | Jan 26, 2024 | Decision Making Under Uncertainty, Q-Learning | Unverified | 0 |
| Long-Term Fair Decision Making through Deep Generative Models | Jan 20, 2024 | Decision Making, Fairness | Code Available | 0 |
| Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning | Jan 19, 2024 | Decision Making, reinforcement-learning | Unverified | 0 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Jan 18, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| LLMs for Relational Reasoning: How Far are We? | Jan 17, 2024 | Common Sense Reasoning, Decision Making | Unverified | 0 |
| Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback | Jan 17, 2024 | Decision Making, Learning-To-Rank | Unverified | 0 |