Model-based reinforcement learning for protein backbone design May 3, 2024 model Model-based Reinforcement Learning
— Unverified 0Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach May 3, 2024 Q-Learning reinforcement-learning
— Unverified 0Tabular and Deep Reinforcement Learning for Gittins Index May 2, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks May 2, 2024 Language Modeling Language Modelling
— Unverified 0Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation May 2, 2024 Machine Translation NMT
— Unverified 0Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk May 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning-Guided Semi-Supervised Learning May 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Learning Force Control for Legged Manipulation May 2, 2024 Reinforcement Learning (RL)
— Unverified 0FLAME: Factuality-Aware Alignment for Large Language Models May 2, 2024 Hallucination Instruction Following
— Unverified 0Constrained Reinforcement Learning Under Model Mismatch May 2, 2024 model reinforcement-learning
— Unverified 0Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning May 1, 2024 Language Modeling Language Modelling
— Unverified 0Queue-based Eco-Driving at Roundabouts with Reinforcement Learning May 1, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Towards Generalist Robot Learning from Internet Video: A Survey Apr 30, 2024 Natural Language Understanding Reinforcement Learning (RL)
— Unverified 0Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning Apr 30, 2024 Reinforcement Learning (RL)
— Unverified 0Learning to Communicate Functional States with Nonverbal Expressions for Improved Human-Robot Collaboration Apr 30, 2024 Reinforcement Learning (RL)
Code Code Available 0Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning Apr 30, 2024 Reinforcement Learning (RL) Text Generation
Code Code Available 0Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies Apr 29, 2024 Knowledge Distillation reinforcement-learning
— Unverified 0Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs Apr 29, 2024 Diagnostic General Knowledge
— Unverified 0Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty Apr 29, 2024 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning Problem Solving with Large Language Models Apr 29, 2024 Q-Learning reinforcement-learning
— Unverified 0Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly Apr 26, 2024 Contact-rich Manipulation Offline RL
— Unverified 0Enhancing Privacy and Security of Autonomous UAV Navigation Apr 26, 2024 Autonomous Navigation Disaster Response
— Unverified 0Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review Apr 26, 2024 Decision Making reinforcement-learning
— Unverified 0EEG_RL-Net: Enhancing EEG MI Classification through Reinforcement Learning-Optimised Graph Neural Networks Apr 26, 2024 Classification EEG
— Unverified 0Offline Reinforcement Learning with Behavioral Supervisor Tuning Apr 25, 2024 Offline RL reinforcement-learning
— Unverified 0Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks Apr 25, 2024 Fairness Multi-Armed Bandits
— Unverified 0GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL Apr 24, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling Apr 24, 2024 Reinforcement Learning (RL)
— Unverified 0DPO: A Differential and Pointwise Control Approach to Reinforcement Learning Apr 24, 2024 Benchmarking reinforcement-learning
— Unverified 0An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Apr 23, 2024 image-classification Image Classification
— Unverified 0Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot Apr 23, 2024 Reinforcement Learning (RL)
— Unverified 0Using deep reinforcement learning to promote sustainable human behaviour on a common pool resource problem Apr 23, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League Environments Apr 23, 2024 Motion Planning Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems Apr 23, 2024 Reinforcement Learning (RL)
Code Code Available 0Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras Apr 22, 2024 Disentanglement reinforcement-learning
Code Code Available 0Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation Apr 22, 2024 continuous-control Continuous Control
— Unverified 0Fairness Incentives in Response to Unfair Dynamic Pricing Apr 22, 2024 Fairness Reinforcement Learning (RL)
— Unverified 0Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories Apr 22, 2024 Edge-computing Reinforcement Learning (RL)
— Unverified 0An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems Apr 19, 2024 Efficient Exploration Multi-Task Learning
— Unverified 0Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty Apr 19, 2024 Q-Learning reinforcement-learning
— Unverified 0Data-Incremental Continual Offline Reinforcement Learning Apr 19, 2024 Continual Learning Offline RL
— Unverified 0Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs Apr 19, 2024 Knowledge Graphs reinforcement-learning
— Unverified 0TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents Apr 18, 2024 energy management Offline RL
Code Code Available 0Actor-Critic Reinforcement Learning with Phased Actor Apr 18, 2024 Policy Gradient Methods reinforcement-learning
— Unverified 0LTL-Constrained Policy Optimization with Cycle Experience Replay Apr 17, 2024 continuous-control Continuous Control
— Unverified 0Learn to Tour: Operator Design For Solution Feasibility Mapping in Pickup-and-delivery Traveling Salesman Problem Apr 17, 2024 Reinforcement Learning (RL) Traveling Salesman Problem
— Unverified 0Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Apr 17, 2024 Language Modeling Language Modelling
— Unverified 0Physics-informed Actor-Critic for Coordination of Virtual Inertia from Power Distribution Systems Apr 17, 2024 Reinforcement Learning (RL)
— Unverified 0Achieving Constant Regret in Linear Markov Decision Processes Apr 16, 2024 Reinforcement Learning (RL)
— Unverified 0Simplex Decomposition for Portfolio Allocation Constraints in Reinforcement Learning Apr 16, 2024 Portfolio Optimization reinforcement-learning
— Unverified 0