Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective Mar 17, 2024 Problem Decomposition Reinforcement Learning (RL)
— Unverified 0Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial Games Mar 16, 2024 Autonomous Navigation Efficient Exploration
Code Code Available 1The Fallacy of Minimizing Cumulative Regret in the Sequential Task Setting Mar 16, 2024 Reinforcement Learning (RL)
— Unverified 0Neural-Kernel Conditional Mean Embeddings Mar 16, 2024 Deep Learning Density Estimation
— Unverified 0Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC Mar 16, 2024 Decision Making Edge-computing
— Unverified 0ViSaRL: Visual Reinforcement Learning Guided by Human Saliency Mar 16, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Horizon-Free Regret for Linear Markov Decision Processes Mar 15, 2024 LEMMA Reinforcement Learning (RL)
— Unverified 0EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning Mar 15, 2024 Natural Language Understanding reinforcement-learning
Code Code Available 0Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Mar 14, 2024 Math Reinforcement Learning (RL)
Code Code Available 2Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning Mar 14, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Meta-operators for Enabling Parallel Planning Using Deep Reinforcement Learning Mar 13, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis Mar 13, 2024 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Multi-Objective Optimization Using Adaptive Distributed Reinforcement Learning Mar 13, 2024 Cloud Computing Few-Shot Learning
— Unverified 0TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning Mar 13, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning to Describe for Predicting Zero-shot Drug-Drug Interactions Mar 13, 2024 Language Modeling Language Modelling
Code Code Available 0LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments Mar 13, 2024 Decision Making Language Modeling
Code Code Available 2HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback Mar 13, 2024 Language Modelling Large Language Model
— Unverified 0Adaptive Gain Scheduling using Reinforcement Learning for Quadcopter Control Mar 12, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective Mar 12, 2024 D4RL reinforcement-learning
Code Code Available 0ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment Mar 11, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts Mar 11, 2024 Mixture-of-Experts Reinforcement Learning (RL)
— Unverified 0(N,K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model Mar 11, 2024 Benchmarking Language Modeling
— Unverified 0RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models Mar 11, 2024 Prompt Engineering Reinforcement Learning (RL)
— Unverified 0Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning Mar 11, 2024 Reinforcement Learning (RL)
— Unverified 0In-context Exploration-Exploitation for Reinforcement Learning Mar 11, 2024 Bayesian Inference Bayesian Optimization
— Unverified 0Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning Mar 11, 2024 Denoising Diagnostic
— Unverified 0CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation Mar 11, 2024 Recommendation Systems Reinforcement Learning (RL)
— Unverified 0Distributional Successor Features Enable Zero-Shot Policy Optimization Mar 10, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0PEaRL: Personalized Privacy of Human-Centric Systems using Early-Exit Reinforcement Learning Mar 9, 2024 Reinforcement Learning (RL)
— Unverified 0Enhancing Classification Performance via Reinforcement Learning for Feature Selection Mar 9, 2024 Classification feature selection
— Unverified 0Enhancing Multi-Hop Knowledge Graph Reasoning through Reward Shaping Techniques Mar 9, 2024 Knowledge Graphs Navigate
— Unverified 0Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning Mar 9, 2024 Decision Making Offline RL
— Unverified 0Switching the Loss Reduces the Cost in Batch Reinforcement Learning Mar 8, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection Mar 8, 2024 Anomaly Detection Reinforcement Learning (RL)
— Unverified 0Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem Mar 8, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage Mar 7, 2024 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning Mar 7, 2024 counterfactual Form
— Unverified 0Zero-shot cross-modal transfer of Reinforcement Learning policies through a Global Workspace Mar 7, 2024 Attribute Contrastive Learning
Code Code Available 0Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy Mar 7, 2024 Language Modeling Language Modelling
— Unverified 0Noisy Spiking Actor Network for Exploration Mar 7, 2024 continuous-control Continuous Control
— Unverified 0Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation Mar 7, 2024 Reinforcement Learning (RL)
— Unverified 0Why Online Reinforcement Learning is Causal Mar 7, 2024 counterfactual Offline RL
— Unverified 0Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems Mar 6, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations Mar 6, 2024 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning Mar 6, 2024 Deep Reinforcement Learning Navigate
— Unverified 0Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Mar 6, 2024 Atari Games Deep Reinforcement Learning
— Unverified 0Language Guided Exploration for RL Agents in Text Environments Mar 5, 2024 Decision Making Language Modeling
— Unverified 0SplAgger: Split Aggregation for Meta-Reinforcement Learning Mar 5, 2024 continuous-control Continuous Control
Code Code Available 1Twisting Lids Off with Two Hands Mar 4, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning Mar 4, 2024 Atari Games continuous-control
— Unverified 0