Horizon-Free Regret for Linear Markov Decision Processes. Mar 15, 2024. Tags: LEMMA, Reinforcement Learning (RL). Status: Unverified.
Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning. Mar 14, 2024. Tags: Offline RL, Reinforcement Learning (RL). Status: Unverified.
Meta-operators for Enabling Parallel Planning Using Deep Reinforcement Learning. Mar 13, 2024. Tags: Deep Reinforcement Learning, reinforcement-learning. Status: Unverified.
Multi-Objective Optimization Using Adaptive Distributed Reinforcement Learning. Mar 13, 2024. Tags: Cloud Computing, Few-Shot Learning. Status: Unverified.
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis. Mar 13, 2024. Tags: Policy Gradient Methods, Reinforcement Learning (RL). Status: Unverified.
TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning. Mar 13, 2024. Tags: reinforcement-learning, Reinforcement Learning. Status: Code Available.
Learning to Describe for Predicting Zero-shot Drug-Drug Interactions. Mar 13, 2024. Tags: Language Modeling, Language Modelling. Status: Code Available.
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback. Mar 13, 2024. Tags: Language Modelling, Large Language Model. Status: Unverified.
Adaptive Gain Scheduling using Reinforcement Learning for Quadcopter Control. Mar 12, 2024. Tags: reinforcement-learning, Reinforcement Learning. Status: Code Available.
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective. Mar 12, 2024. Tags: D4RL, reinforcement-learning. Status: Code Available.
Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning. Mar 11, 2024. Tags: Denoising, Diagnostic. Status: Unverified.
In-context Exploration-Exploitation for Reinforcement Learning. Mar 11, 2024. Tags: Bayesian Inference, Bayesian Optimization. Status: Unverified.
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation. Mar 11, 2024. Tags: Recommendation Systems, Reinforcement Learning (RL). Status: Unverified.
ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment. Mar 11, 2024. Tags: Multi-Armed Bandits, Reinforcement Learning (RL). Status: Unverified.
Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts. Mar 11, 2024. Tags: Mixture-of-Experts, Reinforcement Learning (RL). Status: Unverified.
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning. Mar 11, 2024. Tags: Reinforcement Learning (RL). Status: Unverified.
(N,K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model. Mar 11, 2024. Tags: Benchmarking, Language Modeling. Status: Unverified.
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models. Mar 11, 2024. Tags: Prompt Engineering, Reinforcement Learning (RL). Status: Unverified.
Distributional Successor Features Enable Zero-Shot Policy Optimization. Mar 10, 2024. Tags: reinforcement-learning, Reinforcement Learning. Status: Unverified.
PEaRL: Personalized Privacy of Human-Centric Systems using Early-Exit Reinforcement Learning. Mar 9, 2024. Tags: Reinforcement Learning (RL). Status: Unverified.
Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning. Mar 9, 2024. Tags: Decision Making, Offline RL. Status: Unverified.
Enhancing Classification Performance via Reinforcement Learning for Feature Selection. Mar 9, 2024. Tags: Classification, feature selection. Status: Unverified.
Enhancing Multi-Hop Knowledge Graph Reasoning through Reward Shaping Techniques. Mar 9, 2024. Tags: Knowledge Graphs, Navigate. Status: Unverified.
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem. Mar 8, 2024. Tags: Decision Making, Reinforcement Learning (RL). Status: Unverified.
Switching the Loss Reduces the Cost in Batch Reinforcement Learning. Mar 8, 2024. Tags: reinforcement-learning, Reinforcement Learning. Status: Unverified.
Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection. Mar 8, 2024. Tags: Anomaly Detection, Reinforcement Learning (RL). Status: Unverified.
Why Online Reinforcement Learning is Causal. Mar 7, 2024. Tags: counterfactual, Offline RL. Status: Unverified.
Zero-shot cross-modal transfer of Reinforcement Learning policies through a Global Workspace. Mar 7, 2024. Tags: Attribute, Contrastive Learning. Status: Code Available.
Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy. Mar 7, 2024. Tags: Language Modeling, Language Modelling. Status: Unverified.
Noisy Spiking Actor Network for Exploration. Mar 7, 2024. Tags: continuous-control, Continuous Control. Status: Unverified.
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning. Mar 7, 2024. Tags: counterfactual, Form. Status: Unverified.
Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation. Mar 7, 2024. Tags: Reinforcement Learning (RL). Status: Unverified.
A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage. Mar 7, 2024. Tags: Efficient Exploration, Reinforcement Learning (RL). Status: Unverified.
Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning. Mar 6, 2024. Tags: Deep Reinforcement Learning, Navigate. Status: Unverified.
Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations. Mar 6, 2024. Tags: Q-Learning, Reinforcement Learning (RL). Status: Code Available.
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL. Mar 6, 2024. Tags: Atari Games, Deep Reinforcement Learning. Status: Unverified.
Language Guided Exploration for RL Agents in Text Environments. Mar 5, 2024. Tags: Decision Making, Language Modeling. Status: Unverified.
Koopman-Assisted Reinforcement Learning. Mar 4, 2024. Tags: reinforcement-learning, Reinforcement Learning. Status: Unverified.
Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning. Mar 4, 2024. Tags: Atari Games, continuous-control. Status: Unverified.
Twisting Lids Off with Two Hands. Mar 4, 2024. Tags: Deep Reinforcement Learning, reinforcement-learning. Status: Unverified.
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks. Mar 3, 2024. Tags: Diversity, reinforcement-learning. Status: Unverified.
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey. Mar 2, 2024. Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Status: Unverified.
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning. Mar 1, 2024. Tags: Reinforcement Learning (RL). Status: Unverified.
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning. Mar 1, 2024. Tags: Multi-Objective Reinforcement Learning, reinforcement-learning. Status: Unverified.
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks. Mar 1, 2024. Tags: Deep Reinforcement Learning, Reinforcement Learning (RL). Status: Unverified.
Robust Policy Learning via Offline Skill Diffusion. Mar 1, 2024. Tags: Decoder, Imitation Learning. Status: Unverified.
RL-GPT: Integrating Reinforcement Learning and Code-as-policy. Feb 29, 2024. Tags: Minecraft, reinforcement-learning. Status: Unverified.
Offline Fictitious Self-Play for Competitive Games. Feb 29, 2024. Tags: Offline RL, Reinforcement Learning (RL). Status: Unverified.
Investigating Gender Fairness in Machine Learning-driven Personalized Care for Chronic Pain. Feb 29, 2024. Tags: Decision Making, Fairness. Status: Unverified.
Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment. Feb 28, 2024. Tags: reinforcement-learning, Reinforcement Learning (RL). Status: Unverified.