| Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Dec 5, 2024 | Collision Avoidance, Deep Reinforcement Learning | —Unverified | 0 |
| Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | Dec 3, 2024 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Dec 3, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Selective Reviews of Bandit Problems in AI via a Statistical View | Dec 3, 2024 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals | Dec 3, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games | Dec 1, 2024 | Atari Games, Decision Making | Code Available | 0 |
| STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft | Dec 1, 2024 | Decision Making, Minecraft | —Unverified | 0 |
| Market Making without Regret | Nov 21, 2024 | Sequential Decision Making | —Unverified | 0 |
| On adaptivity and minimax optimality of two-sided nearest neighbors | Nov 20, 2024 | Decision Making, Matrix Completion | Code Available | 0 |
| Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet | Nov 18, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Nov 15, 2024 | Reinforcement Learning (RL), Sequential Decision Making | —Unverified | 0 |
| Fair Resource Allocation in Weakly Coupled Markov Decision Processes | Nov 14, 2024 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision Making, GPU | Code Available | 0 |
| Optimal Control of Mechanical Ventilators with Learned Respiratory Dynamics | Nov 12, 2024 | Decision Making, Management | Code Available | 0 |
| Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Nov 12, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| PageRank Bandits for Link Prediction | Nov 3, 2024 | Decision Making, Graph Learning | Code Available | 0 |
| LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation | Nov 1, 2024 | Logical Reasoning, Sequential Decision Making | Code Available | 1 |
| EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | Oct 31, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Oct 28, 2024 | Computational Efficiency, Decision Making | —Unverified | 0 |
| Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Oct 26, 2024 | Active Learning, Blocking | —Unverified | 0 |
| Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting | Oct 25, 2024 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks | Oct 25, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision Making, Offline RL | Code Available | 0 |
| Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning | Oct 22, 2024 | Decision Making, Diversity | —Unverified | 0 |
| Hierarchical Upper Confidence Bounds for Constrained Online Learning | Oct 22, 2024 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Oct 16, 2024 | Attribute, counterfactual | Code Available | 0 |
| SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Oct 16, 2024 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement Learning, Scheduling | —Unverified | 0 |
| Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | Oct 14, 2024 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Efficient Reinforcement Learning with Large Language Model Priors | Oct 10, 2024 | Bayesian Inference, Decision Making | —Unverified | 0 |
| Offline Hierarchical Reinforcement Learning via Inverse Optimization | Oct 10, 2024 | Decision Making, Hierarchical Reinforcement Learning | —Unverified | 0 |
| On the Modeling Capabilities of Large Language Models for Sequential Decision Making | Oct 8, 2024 | Decision Making, Diversity | —Unverified | 0 |
| DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback | Oct 8, 2024 | Math, Sequential Decision Making | Code Available | 1 |
| DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | Oct 7, 2024 | Multi-Armed Bandits, Sequential Decision Making | —Unverified | 0 |
| Preference Optimization as Probabilistic Inference | Oct 5, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Minimax-optimal trust-aware multi-armed bandits | Oct 4, 2024 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory | Oct 3, 2024 | Representation Learning, Sequential Decision Making | Code Available | 0 |
| Adaptive teachers for amortized samplers | Oct 2, 2024 | Decision Making, Efficient Exploration | Code Available | 0 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Sep 26, 2024 | Bayesian Optimization, Change Detection | —Unverified | 0 |
| Collaborative Comic Generation: Integrating Visual Narrative Theories with AI Models for Enhanced Creativity | Sep 25, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Learning Utilities from Demonstrations in Markov Decision Processes | Sep 25, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Reference Points, Risk-Taking Behavior, and Competitive Outcomes in Sequential Settings | Sep 20, 2024 | counterfactual, Decision Making | —Unverified | 0 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Sep 13, 2024 | Sequential Decision Making, World Knowledge | —Unverified | 0 |
| Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation | Sep 11, 2024 | Decision Making, Hierarchical Reinforcement Learning | —Unverified | 0 |
| HierLLM: Hierarchical Large Language Model for Question Recommendation | Sep 10, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting | Sep 9, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Sep 9, 2024 | D4RL, Decision Making | —Unverified | 0 |
| An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 |