Intrinsic Rewards for Exploration without Harm from Observational Noise: A Simulation Study Based on the Free Energy Principle May 13, 2024 Efficient Exploration Navigate
— Unverified 0Hype or Heuristic? Quantum Reinforcement Learning for Join Order Optimisation May 13, 2024 Low-latency processing reinforcement-learning
Code Code Available 0CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization May 13, 2024 Bayesian Optimization Reinforcement Learning (RL)
Code Code Available 0Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models May 13, 2024 Imitation Learning reinforcement-learning
— Unverified 0Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback May 13, 2024 Reinforcement Learning (RL)
— Unverified 0Neural Network Compression for Reinforcement Learning Tasks May 13, 2024 Neural Network Compression reinforcement-learning
— Unverified 0Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning May 12, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Fairness in Reinforcement Learning: A Survey May 11, 2024 Autonomous Vehicles Fairness
— Unverified 0Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies May 10, 2024 Reinforcement Learning (RL)
— Unverified 0Improving Targeted Molecule Generation through Language Model Fine-Tuning Via Reinforcement Learning May 10, 2024 Drug Design Language Modeling
— Unverified 0Dominion: A New Frontier for AI Research May 10, 2024 Reinforcement Learning (RL)
— Unverified 0Value Augmented Sampling for Language Model Alignment and Personalization May 10, 2024 Language Modeling Language Modelling
Code Code Available 1An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models May 9, 2024 Hierarchical Reinforcement Learning Management
— Unverified 0Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning May 8, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach May 7, 2024 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems May 7, 2024 CPU GPU
Code Code Available 0ACEGEN: Reinforcement learning of generative chemical agents for drug discovery May 7, 2024 Benchmarking Decision Making
Code Code Available 3Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies May 7, 2024 Evolutionary Algorithms Reinforcement Learning (RL)
— Unverified 0DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model May 7, 2024 Language Modeling Language Modelling
Code Code Available 9Improving Offline Reinforcement Learning with Inaccurate Simulators May 7, 2024 D4RL Generative Adversarial Network
— Unverified 0Human-centric Reward Optimization for Reinforcement Learning-based Automated Driving using Large Language Models May 7, 2024 In-Context Learning Reinforcement Learning (RL)
Code Code Available 1Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows May 6, 2024 Causal Inference counterfactual
— Unverified 0Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning May 6, 2024 Reinforcement Learning (RL)
Code Code Available 2Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints May 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics May 4, 2024 continuous-control Continuous Control
Code Code Available 0UDUC: An Uncertainty-driven Approach for Learning-based Robust Control May 4, 2024 Contrastive Learning Model Predictive Control
— Unverified 0Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning May 3, 2024 Deep Reinforcement Learning Object Tracking
— Unverified 0Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning May 3, 2024 Reinforcement Learning (RL)
— Unverified 0Proximal Curriculum with Task Correlations for Deep Reinforcement Learning May 3, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Learning Robust Autonomous Navigation and Locomotion for Wheeled-Legged Robots May 3, 2024 Autonomous Navigation Navigate
— Unverified 0A Model-based Multi-Agent Personalized Short-Video Recommender System May 3, 2024 Recommendation Systems Reinforcement Learning (RL)
— Unverified 0Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach May 3, 2024 Q-Learning reinforcement-learning
— Unverified 0Simulating the Economic Impact of Rationality through Reinforcement Learning and Agent-Based Modelling May 3, 2024 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1Model-based reinforcement learning for protein backbone design May 3, 2024 model Model-based Reinforcement Learning
— Unverified 0Learning Optimal Deterministic Policies with Stochastic Policy Gradients May 3, 2024 Reinforcement Learning (RL)
— Unverified 0Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk May 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation May 2, 2024 MuJoCo Reinforcement Learning (RL)
Code Code Available 5Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation May 2, 2024 Machine Translation NMT
— Unverified 0Reinforcement Learning-Guided Semi-Supervised Learning May 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Constrained Reinforcement Learning Under Model Mismatch May 2, 2024 model reinforcement-learning
— Unverified 0Tabular and Deep Reinforcement Learning for Gittins Index May 2, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0FLAME: Factuality-Aware Alignment for Large Language Models May 2, 2024 Hallucination Instruction Following
— Unverified 0Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks May 2, 2024 Language Modeling Language Modelling
— Unverified 0Learning Force Control for Legged Manipulation May 2, 2024 Reinforcement Learning (RL)
— Unverified 0Queue-based Eco-Driving at Roundabouts with Reinforcement Learning May 1, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO May 1, 2024 MuJoCo Reinforcement Learning (RL)
Code Code Available 1Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning May 1, 2024 Language Modeling Language Modelling
— Unverified 0Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning Apr 30, 2024 Reinforcement Learning (RL)
— Unverified 0Towards Generalist Robot Learning from Internet Video: A Survey Apr 30, 2024 Natural Language Understanding Reinforcement Learning (RL)
— Unverified 0Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning Apr 30, 2024 Reinforcement Learning (RL) Text Generation
Code Code Available 0