Deep Reinforcement Learning for Traffic Light Control in Intelligent Transportation Systems Feb 4, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Generalization of Deep Reinforcement Learning for Jammer-Resilient Frequency and Power Allocation Feb 4, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Developing Driving Strategies Efficiently: A Skill-Based Hierarchical Reinforcement Learning Approach Feb 4, 2023 Autonomous Driving Hierarchical Reinforcement Learning
— Unverified 0Online Reinforcement Learning in Non-Stationary Context-Driven Environments Feb 4, 2023 MuJoCo reinforcement-learning
Code Code Available 0Reinforcement Learning in Low-Rank MDPs with Density Features Feb 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning with History-Dependent Dynamic Contexts Feb 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcing User Retention in a Billion Scale Short Video Recommender System Feb 3, 2023 Recommendation Systems reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties Feb 3, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Online Error Detection in Cyber-Physical Systems Feb 3, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Distributional constrained reinforcement learning for supply chain optimization Feb 3, 2023 Distributional Reinforcement Learning Policy Gradient Methods
Code Code Available 0ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints Feb 2, 2023 OpenAI Gym Reinforcement Learning (RL)
— Unverified 0Lower Bounds for Learning in Revealing POMDPs Feb 2, 2023 Reinforcement Learning (RL)
— Unverified 0Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition Feb 2, 2023 Diversity Q-Learning
— Unverified 0MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks Feb 2, 2023 Reinforcement Learning (RL)
— Unverified 0ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs Feb 2, 2023 continuous-control Continuous Control
— Unverified 0Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms Feb 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Sample Complexity of Kernel-Based Q-Learning Feb 1, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Multi-zone HVAC Control with Model-Based Deep Reinforcement Learning Feb 1, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Selective Uncertainty Propagation in Offline RL Feb 1, 2023 Offline RL reinforcement-learning
— Unverified 0Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO) Feb 1, 2023 MuJoCo reinforcement-learning
— Unverified 0QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing Feb 1, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Collaborating with language models for embodied reasoning Feb 1, 2023 In-Context Learning Language Modeling
— Unverified 0Combining Deep Reinforcement Learning and Search with Generative Models for Game-Theoretic Opponent Modeling Feb 1, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0A Reinforcement Learning Framework for Dynamic Mediation Analysis Jan 31, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning Jan 31, 2023 Active Learning Computational Efficiency
Code Code Available 0CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning Jan 31, 2023 Decoder reinforcement-learning
Code Code Available 0Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding Jan 31, 2023 Atari Games Evolutionary Algorithms
— Unverified 0Towards interpretable quantum machine learning via single-photon quantum walks Jan 31, 2023 Decision Making Quantum Machine Learning
— Unverified 0Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks Jan 31, 2023 Blocking Graph Neural Network
— Unverified 0Scaling laws for single-agent reinforcement learning Jan 31, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Skill Decision Transformer Jan 31, 2023 D4RL Descriptive
Code Code Available 0Scalable Grid-Aware Dynamic Matching using Deep Reinforcement Learning Jan 31, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning Jan 31, 2023 Edge-computing Management
— Unverified 0Planning Multiple Epidemic Interventions with Reinforcement Learning Jan 30, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0STEEL: Singularity-aware Reinforcement Learning Jan 30, 2023 Off-policy evaluation reinforcement-learning
— Unverified 0V2N Service Scaling with Deep Reinforcement Learning Jan 30, 2023 Deep Reinforcement Learning Edge-computing
— Unverified 0Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies Jan 30, 2023 Data Augmentation Feature Engineering
Code Code Available 0Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents Jan 30, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0PAC-Bayesian Soft Actor-Critic Learning Jan 30, 2023 Reinforcement Learning (RL)
Code Code Available 0Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem Jan 30, 2023 Management reinforcement-learning
— Unverified 0Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation Jan 30, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning Jan 30, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs Jan 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Deep Reinforcement Learning Framework for Optimizing Congestion Control in Data Centers Jan 29, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Autonomous Satellite Docking via Adaptive Optimal Output Rregulation: A Reinforcement Learning Approach Jan 29, 2023 Position reinforcement-learning
— Unverified 0Sample Efficient Deep Reinforcement Learning via Local Planning Jan 29, 2023 Deep Reinforcement Learning Montezuma's Revenge
— Unverified 0STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning Jan 28, 2023 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Turbulence control in plane Couette flow using low-dimensional neural ODE-based models and deep reinforcement learning Jan 28, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning Jan 28, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Towards Learning Rubik's Cube with N-tuple-based Reinforcement Learning Jan 28, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0