| Sliding-Window Thompson Sampling for Non-Stationary Settings | Sep 8, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A naive aggregation algorithm for improving generalization in a class of learning problems | Sep 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| A Sequential Decision-Making Model for Perimeter Identification | Sep 4, 2024 | Decision Makingmodel | —Unverified | 0 |
| Temporal Elections: Welfare, Strategyproofness, and Proportionality | Aug 24, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| How to Measure Human-AI Prediction Accuracy in Explainable AI Systems | Aug 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Aug 22, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Aug 20, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Contextual Bandits for Unbounded Context Distributions | Aug 19, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Aug 19, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 5 |
| Enhancing Heterogeneous Multi-Agent Cooperation in Decentralized MARL via GNN-driven Intrinsic Rewards | Aug 12, 2024 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Meta Clustering of Neural Bandits | Aug 10, 2024 | ClusteringDecision Making | —Unverified | 0 |
| Structure and Reduction of MCTS for Explainable-AI | Aug 10, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Aug 8, 2024 | Sequential Decision Making | —Unverified | 0 |
| Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Aug 6, 2024 | Bayesian OptimizationMeta-Learning | —Unverified | 0 |
| RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning | Aug 6, 2024 | Combinatorial OptimizationGraph Neural Network | CodeCode Available | 1 |
| Reinforcement Learning applied to Insurance Portfolio Pursuit | Aug 1, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| How to Choose a Reinforcement-Learning Algorithm | Jul 30, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Sustainable Energy: A Survey | Jul 26, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Assessing AI Utility: The Random Guesser Test for Sequential Decision-Making Systems | Jul 25, 2024 | Decision MakingRecommendation Systems | —Unverified | 0 |
| Adversarially Robust Decision Transformer | Jul 25, 2024 | Adversarial RobustnessSequential Decision Making | CodeCode Available | 0 |
| Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Jul 25, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning Meets Visual Odometry | Jul 22, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 3 |
| Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Jul 20, 2024 | AllAutonomous Driving | —Unverified | 0 |
| Scalable Exploration via Ensemble++ | Jul 18, 2024 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Managing Risk using Rolling Forecasts in Energy-Limited and Stochastic Energy Systems | Jul 18, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent | Jul 16, 2024 | Decision MakingMinecraft | —Unverified | 0 |
| Exploration Unbound | Jul 16, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Jul 15, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Preserving the Privacy of Reward Functions in MDPs through Deception | Jul 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | Jul 12, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Jul 10, 2024 | Decision MakingFairness | CodeCode Available | 0 |
| MDP Geometry, Normalization and Reward Balancing Solvers | Jul 9, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs | Jul 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Maximizing utility in multi-agent environments by anticipating the behavior of other learners | Jul 5, 2024 | Sequential Decision Making | —Unverified | 0 |
| Short-Long Policy Evaluation with Novel Actions | Jul 4, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Exploring a Physics-Informed Decision Transformer for Distribution System Restoration: Methodology and Performance Analysis | Jun 30, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Tradeoffs When Considering Deep Reinforcement Learning for Contingency Management in Advanced Air Mobility | Jun 28, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Operator World Models for Reinforcement Learning | Jun 28, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Instance Temperature Knowledge Distillation | Jun 27, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 0 |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Jun 24, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Jun 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Jun 21, 2024 | Decision MakingManagement | CodeCode Available | 0 |
| MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading | Jun 20, 2024 | Algorithmic TradingDecision Making | CodeCode Available | 2 |
| Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond | Jun 19, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | Jun 19, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Efficient Sequential Decision Making with Large Language Models | Jun 17, 2024 | Decision MakingModel Selection | —Unverified | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Model Adaptation for Time Constrained Embodied Control | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |