Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models | Jan 15, 2024 | Form, Reinforcement Learning (RL) | — Unverified 0
Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization | Jan 14, 2024 | Language Modeling, Language Modelling | — Unverified 0
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation | Jan 14, 2024 | Language Modeling, Language Modelling | — Unverified 0
Discovering Command and Control Channels Using Reinforcement Learning | Jan 13, 2024 | reinforcement-learning, Reinforcement Learning | — Unverified 0
BP(λ): Online Learning via Synthetic Gradients | Jan 13, 2024 | Reinforcement Learning (RL) | — Unverified 0
UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution | Jan 12, 2024 | Multi-agent Reinforcement Learning, Recommendation Systems | — Unverified 0
Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Case Study | Jan 12, 2024 | Efficient Exploration, Reinforcement Learning (RL) | — Unverified 0
Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care | Jan 11, 2024 | Q-Learning, reinforcement-learning | — Unverified 0
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents | Jan 11, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Code Available 1
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint | Jan 11, 2024 | Question Answering, Reinforcement Learning (RL) | Code Code Available 1
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards | Jan 11, 2024 | continuous-control, Continuous Control | Code Code Available 0
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation | Jan 11, 2024 | Image Generation, Reinforcement Learning (RL) | — Unverified 0
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | model, Offline RL | — Unverified 0
Reinforcement Learning for Optimizing RAG for Domain Chatbots | Jan 10, 2024 | Chatbot, Question Answering | — Unverified 0
Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling | Jan 10, 2024 | reinforcement-learning, Reinforcement Learning | — Unverified 0
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces | Jan 10, 2024 | reinforcement-learning, Reinforcement Learning (RL) | — Unverified 0
An Information Theoretic Approach to Interaction-Grounded Learning | Jan 10, 2024 | Decoder, reinforcement-learning | — Unverified 0
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Jan 9, 2024 | Imputation, Reinforcement Learning (RL) | — Unverified 0
Deep Reinforcement Learning for Multi-Truck Vehicle Routing Problems with Multi-Leg Demand Routes | Jan 8, 2024 | Decoder, Deep Reinforcement Learning | — Unverified 0
Behavioural Cloning in VizDoom | Jan 8, 2024 | Behavioural cloning, Imitation Learning | — Unverified 0
Long-term Safe Reinforcement Learning with Binary Feedback | Jan 8, 2024 | reinforcement-learning, Reinforcement Learning | — Unverified 0
Using reinforcement learning to improve drone-based inference of greenhouse gas fluxes | Jan 8, 2024 | Reinforcement Learning (RL) | Code Code Available 0
NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds | Jan 7, 2024 | Autonomous Vehicles, Benchmarking | — Unverified 0
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision Making, Diversity | — Unverified 0
Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning | Jan 5, 2024 | reinforcement-learning, Reinforcement Learning (RL) | — Unverified 0
Adaptive Discounting of Training Time Attacks | Jan 5, 2024 | Reinforcement Learning (RL) | — Unverified 0
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty | Jan 5, 2024 | Decision Making, Reinforcement Learning (RL) | — Unverified 0
A comprehensive survey of research towards AI-enabled unmanned aerial systems in pre-, active-, and post-wildfire management | Jan 4, 2024 | Management, Reinforcement Learning (RL) | — Unverified 0
Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach | Jan 4, 2024 | Decision Making, Imitation Learning | — Unverified 0
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning | Jan 4, 2024 | Atari Games, Distributional Reinforcement Learning | Code Code Available 0
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL | Jan 3, 2024 | Continual Learning, reinforcement-learning | — Unverified 0
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning | Jan 1, 2024 | Decision Making, Diversity | — Unverified 0
Improving Unsupervised Hierarchical Representation with Reinforcement Learning | Jan 1, 2024 | reinforcement-learning, Reinforcement Learning | Code Code Available 0
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Jan 1, 2024 | Offline RL, Reinforcement Learning (RL) | Code Code Available 0
DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning | Jan 1, 2024 | Reinforcement Learning (RL) | Code Code Available 1
Regularized Parameter Uncertainty for Improving Generalization in Reinforcement Learning | Jan 1, 2024 | Out-of-Distribution Generalization, reinforcement-learning | — Unverified 0
Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach | Jan 1, 2024 | Q-Learning, reinforcement-learning | — Unverified 0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Jan 1, 2024 | continuous-control, Continuous Control | — Unverified 0
Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning | Jan 1, 2024 | Autonomous Vehicles, Deep Reinforcement Learning | Code Code Available 0
Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise | Dec 31, 2023 | Reinforcement Learning (RL) | — Unverified 0
Online Symbolic Music Alignment with Offline Reinforcement Learning | Dec 31, 2023 | Dynamic Time Warping, Offline RL | Code Code Available 1
Laboratory Experiments of Model-based Reinforcement Learning for Adaptive Optics Control | Dec 30, 2023 | Model-based Reinforcement Learning, reinforcement-learning | Code Code Available 0
Causal State Distillation for Explainable Reinforcement Learning | Dec 30, 2023 | reinforcement-learning, Reinforcement Learning | Code Code Available 0
Design Space Exploration of Approximate Computing Techniques with a Reinforcement Learning Approach | Dec 29, 2023 | reinforcement-learning, Reinforcement Learning (RL) | — Unverified 0
Resilient Constrained Reinforcement Learning | Dec 28, 2023 | Decision Making, reinforcement-learning | — Unverified 0
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity | Dec 28, 2023 | Reinforcement Learning (RL) | Code Code Available 0
Generalizable Visual Reinforcement Learning with Segment Anything Model | Dec 28, 2023 | Data Augmentation, model | Code Code Available 1
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e | Dec 28, 2023 | Reinforcement Learning (RL) | — Unverified 0
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems | Dec 27, 2023 | channel selection, Model Selection | — Unverified 0
Conversational Question Answering with Reformulations over Knowledge Graph | Dec 27, 2023 | Conversational Question Answering, Knowledge Graphs | — Unverified 0