Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs Mar 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Semi-Markov Offline Reinforcement Learning for Healthcare Mar 17, 2022 Offline RL reinforcement-learning
Code Code Available 0A Deep Reinforcement Learning-Based Caching Strategy for IoT Networks with Transient Data Mar 16, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks Mar 16, 2022 Offline RL reinforcement-learning
Code Code Available 0A Survey of Multi-Agent Deep Reinforcement Learning with Communication Mar 16, 2022 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning Mar 16, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies Mar 16, 2022 Autonomous Driving Autonomous Vehicles
— Unverified 0Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents Mar 16, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Latent-Variable Advantage-Weighted Policy Optimization for Offline RL Mar 16, 2022 continuous-control Continuous Control
Code Code Available 1Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act Mar 16, 2022 Atari Games Decision Making
— Unverified 0CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning Mar 16, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration Mar 16, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning Mar 15, 2022 Diagnostic reinforcement-learning
— Unverified 0Multi-View Dreaming: Multi-View World Model with Contrastive Learning Mar 15, 2022 Contrastive Learning reinforcement-learning
— Unverified 0Zipfian environments for Reinforcement Learning Mar 15, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling Mar 15, 2022 Reinforcement Learning (RL)
— Unverified 0Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning Mar 15, 2022 Collision Avoidance reinforcement-learning
— Unverified 0A Differentiable Approach to Combinatorial Optimization using Dataless Neural Networks Mar 15, 2022 Combinatorial Optimization Community Detection
— Unverified 0An Introduction to Multi-Agent Reinforcement Learning and Review of its Application to Autonomous Mobility Mar 15, 2022 Autonomous Vehicles Multi-agent Reinforcement Learning
— Unverified 0L2Explorer: A Lifelong Reinforcement Learning Assessment Environment Mar 14, 2022 Continual Learning Lifelong learning
Code Code Available 0Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation Mar 14, 2022 Autonomous Driving Gaussian Processes
— Unverified 0FRL-FI: Transient Fault Analysis for Federated Reinforcement Learning-Based Navigation Systems Mar 14, 2022 Fault Detection reinforcement-learning
— Unverified 0The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications Mar 14, 2022 Multi-Agent Path Finding Multi-agent Reinforcement Learning
— Unverified 0Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning Mar 14, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Uncertainty Estimation for Language Reward Models Mar 14, 2022 Active Learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Optimal Control of a District Cooling Energy Plant Mar 14, 2022 Model Predictive Control Q-Learning
— Unverified 0Orchestrated Value Mapping for Reinforcement Learning Mar 14, 2022 Ensemble Learning Q-Learning
Code Code Available 0Calibration of Derivative Pricing Models: a Multi-Agent Reinforcement Learning Perspective Mar 14, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning Mar 13, 2022 Offline RL reinforcement-learning
— Unverified 0Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation Mar 12, 2022 Federated Learning Hyperparameter Optimization
— Unverified 0The Health Gym: Synthetic Health-Related Datasets for the Development of Reinforcement Learning Algorithms Mar 12, 2022 BIG-bench Machine Learning Generative Adversarial Network
Code Code Available 1Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems Mar 12, 2022 Graph Attention reinforcement-learning
— Unverified 0Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task Mar 11, 2022 Deep Reinforcement Learning Imitation Learning
Code Code Available 0Active Phase-Encode Selection for Slice-Specific Fast MR Scanning Using a Transformer-Based Deep Reinforcement Learning Framework Mar 11, 2022 Deep Reinforcement Learning Image Reconstruction
— Unverified 0Deep Binary Reinforcement Learning for Scalable Verification Mar 11, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Graph Neural Networks for Relational Inductive Bias in Vision-based Deep Reinforcement Learning of Robot Control Mar 11, 2022 Deep Reinforcement Learning Graph Neural Network
— Unverified 0A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets Mar 11, 2022 BIG-bench Machine Learning Management
— Unverified 0Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism Mar 11, 2022 Decision Making reinforcement-learning
— Unverified 0Reinforcement Learning for Linear Quadratic Control is Vulnerable Under Cost Manipulation Mar 11, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Random Ensemble Reinforcement Learning for Traffic Signal Control Mar 10, 2022 Ensemble Learning reinforcement-learning
— Unverified 0Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control Mar 10, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Learning Torque Control for Quadrupedal Locomotion Mar 10, 2022 Position Reinforcement Learning (RL)
— Unverified 0Action-Constrained Reinforcement Learning for Frame-Level Bit Allocation in HEVC/H.265 through Frank-Wolfe Policy Optimization Mar 10, 2022 Reinforcement Learning (RL)
— Unverified 0Artificial Intelligence in Vehicular Wireless Networks: A Case Study Using ns-3 Mar 10, 2022 Reinforcement Learning (RL)
— Unverified 0Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework Mar 10, 2022 Data Augmentation Multi-agent Reinforcement Learning
— Unverified 0SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning Mar 9, 2022 Deep Reinforcement Learning Minecraft
Code Code Available 0Multi-robot Cooperative Pursuit via Potential Field-Enhanced Reinforcement Learning Mar 9, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Neuro-symbolic Natural Logic with Introspective Revision for Natural Language Inference Mar 9, 2022 Natural Language Inference reinforcement-learning
Code Code Available 0Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset trading Mar 9, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1Gym-saturation: an OpenAI Gym environment for saturation provers Mar 9, 2022 OpenAI Gym Reinforcement Learning (RL)
— Unverified 0