Evolving Reinforcement Learning Environment to Minimize Learner's Achievable Reward: An Application on Hardening Active Directory Systems Apr 8, 2023 Diversity Management
— Unverified 0DREAM: Adaptive Reinforcement Learning based on Attention Mechanism for Temporal Knowledge Graph Reasoning Apr 8, 2023 Knowledge Graphs Missing Elements
— Unverified 0Efficient bimanual handover and rearrangement via symmetry-aware actor-critic learning Apr 7, 2023 Reinforcement Learning (RL)
Code Code Available 0Continuous Input Embedding Size Search For Recommender Systems Apr 7, 2023 Recommendation Systems Reinforcement Learning (RL)
— Unverified 0DiffMimic: Efficient Motion Mimicking with Differentiable Physics Apr 6, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 2AutoRL Hyperparameter Landscapes Apr 5, 2023 AutoML Hyperparameter Optimization
Code Code Available 0Persuading to Prepare for Quitting Smoking with a Virtual Coach: Using States and User Characteristics to Predict Behavior Apr 5, 2023 Reinforcement Learning (RL)
— Unverified 0A Multiagent CyberBattleSim for RL Cyber Operation Agents Apr 3, 2023 CyberBattleSim Reinforcement Learning (RL)
— Unverified 0Quantitative Trading using Deep Q Learning Apr 3, 2023 Q-Learning reinforcement-learning
— Unverified 0Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents Apr 3, 2023 Deep Reinforcement Learning Offline RL
— Unverified 0A Tutorial Introduction to Reinforcement Learning Apr 3, 2023 Q-Learning reinforcement-learning
— Unverified 0Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning Apr 3, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1Enabling A Network AI Gym for Autonomous Cyber Agents Apr 3, 2023 Deep Reinforcement Learning Offline RL
— Unverified 0Managing power grids through topology actions: A comparative study between advanced rule-based and reinforcement learning agents Apr 3, 2023 Management Reinforcement Learning (RL)
Code Code Available 1Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning Apr 2, 2023 Decision Making Model-based Reinforcement Learning
— Unverified 0Restarted Bayesian Online Change-point Detection for Non-Stationary Markov Decision Processes Apr 1, 2023 Change Point Detection Reinforcement Learning (RL)
— Unverified 0On Context Distribution Shift in Task Representation Learning for Offline Meta RL Apr 1, 2023 continuous-control Continuous Control
Code Code Available 0Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning Apr 1, 2023 PAIR TRADING reinforcement-learning
— Unverified 0Multi-view Tensor Graph Neural Networks Through Reinforced Aggregation Apr 1, 2023 Graph Representation Learning Reinforcement Learning (RL)
Code Code Available 1Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization Mar 31, 2023 Offline RL Q-Learning
— Unverified 0Accelerating exploration and representation learning with offline pre-training Mar 31, 2023 Decision Making NetHack
— Unverified 0Language Models can Solve Computer Tasks Mar 30, 2023 Language Modelling Large Language Model
Code Code Available 2When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning Mar 30, 2023 Reinforcement Learning (RL)
— Unverified 0MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations Mar 30, 2023 Decision Making Imitation Learning
Code Code Available 0Learning in Factored Domains with Information-Constrained Visual Representations Mar 30, 2023 Reinforcement Learning (RL) Representation Learning
— Unverified 0On the Analysis of Computational Delays in Reinforcement Learning-based Rate Adaptation Algorithms Mar 30, 2023 Reinforcement Learning (RL)
— Unverified 0Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions Mar 30, 2023 Diversity Offline RL
— Unverified 0Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning Mar 29, 2023 GPU reinforcement-learning
Code Code Available 2Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks Mar 29, 2023 Minecraft Multi-Task Learning
— Unverified 0Does Sparsity Help in Learning Misspecified Linear Bandits? Mar 29, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0On-line reinforcement learning for optimization of real-life energy trading strategy Mar 28, 2023 energy trading reinforcement-learning
— Unverified 0Planning with Sequence Models through Iterative Energy Minimization Mar 28, 2023 Language Modeling Language Modelling
— Unverified 0Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization Mar 28, 2023 D4RL Offline RL
Code Code Available 1Multi-Flow Transmission in Wireless Interference Networks: A Convergent Graph Learning Approach Mar 27, 2023 Graph Learning Graph Neural Network
— Unverified 0Robust Risk-Aware Option Hedging Mar 27, 2023 Reinforcement Learning (RL)
— Unverified 0Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning Mar 27, 2023 Collision Avoidance Deep Reinforcement Learning
— Unverified 0Inverse Reinforcement Learning without Reinforcement Learning Mar 26, 2023 continuous-control Continuous Control
Code Code Available 1Control of synaptic plasticity via the fusion of reinforcement learning and unsupervised learning in neural networks Mar 26, 2023 Reinforcement Learning (RL)
— Unverified 0Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments Mar 24, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0marl-jax: Multi-Agent Reinforcement Leaning Framework Mar 24, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Optimal Transport for Offline Imitation Learning Mar 24, 2023 D4RL Decision Making
Code Code Available 1Learning to Operate in Open Worlds by Adapting Planning Models Mar 24, 2023 Reinforcement Learning (RL)
— Unverified 0Communication Load Balancing via Efficient Inverse Reinforcement Learning Mar 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Reuse for Communication Load Balancing in Unseen Traffic Scenarios Mar 22, 2023 Reinforcement Learning (RL)
— Unverified 0Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning Mar 22, 2023 Autonomous Vehicles Management
— Unverified 0A Hierarchical Hybrid Learning Framework for Multi-agent Trajectory Prediction Mar 22, 2023 Autonomous Vehicles Motion Planning
— Unverified 0Deep RL with Hierarchical Action Exploration for Dialogue Generation Mar 22, 2023 Dialogue Generation Offline RL
— Unverified 0Synthetic Health-related Longitudinal Data with Mixed-type Variables Generated using Diffusion Models Mar 22, 2023 Reinforcement Learning (RL)
— Unverified 0Beam Management Driven by Radio Environment Maps in O-RAN Architecture Mar 21, 2023 Management Reinforcement Learning (RL)
— Unverified 0Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale Mar 20, 2023 Imitation Learning reinforcement-learning
— Unverified 0