UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution | Jan 12, 2024 | Multi-agent Reinforcement Learning, Recommendation Systems | Unverified | 0
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | model, Offline RL | Unverified | 0
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation | Jan 11, 2024 | Image Generation, Reinforcement Learning (RL) | Unverified | 0
Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care | Jan 11, 2024 | Q-Learning, reinforcement-learning | Unverified | 0
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards | Jan 11, 2024 | continuous-control, Continuous Control | Code Available | 0
Reinforcement Learning for Optimizing RAG for Domain Chatbots | Jan 10, 2024 | Chatbot, Question Answering | Unverified | 0
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces | Jan 10, 2024 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
An Information Theoretic Approach to Interaction-Grounded Learning | Jan 10, 2024 | Decoder, reinforcement-learning | Unverified | 0
Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling | Jan 10, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Jan 9, 2024 | Imputation, Reinforcement Learning (RL) | Unverified | 0
Using reinforcement learning to improve drone-based inference of greenhouse gas fluxes | Jan 8, 2024 | Reinforcement Learning (RL) | Code Available | 0
Behavioural Cloning in VizDoom | Jan 8, 2024 | Behavioural cloning, Imitation Learning | Unverified | 0
Long-term Safe Reinforcement Learning with Binary Feedback | Jan 8, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Deep Reinforcement Learning for Multi-Truck Vehicle Routing Problems with Multi-Leg Demand Routes | Jan 8, 2024 | Decoder, Deep Reinforcement Learning | Unverified | 0
NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds | Jan 7, 2024 | Autonomous Vehicles, Benchmarking | Unverified | 0
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision Making, Diversity | Unverified | 0
Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning | Jan 5, 2024 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Adaptive Discounting of Training Time Attacks | Jan 5, 2024 | Reinforcement Learning (RL) | Unverified | 0
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty | Jan 5, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning | Jan 4, 2024 | Atari Games, Distributional Reinforcement Learning | Code Available | 0
A comprehensive survey of research towards AI-enabled unmanned aerial systems in pre-, active-, and post-wildfire management | Jan 4, 2024 | Management, Reinforcement Learning (RL) | Unverified | 0
Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach | Jan 4, 2024 | Decision Making, Imitation Learning | Unverified | 0
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL | Jan 3, 2024 | Continual Learning, reinforcement-learning | Unverified | 0
Improving Unsupervised Hierarchical Representation with Reinforcement Learning | Jan 1, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning | Jan 1, 2024 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Jan 1, 2024 | continuous-control, Continuous Control | Unverified | 0
Regularized Parameter Uncertainty for Improving Generalization in Reinforcement Learning | Jan 1, 2024 | Out-of-Distribution Generalization, reinforcement-learning | Unverified | 0
POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Jan 1, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0
Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach | Jan 1, 2024 | Q-Learning, reinforcement-learning | Unverified | 0
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning | Jan 1, 2024 | Decision Making, Diversity | Unverified | 0
Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise | Dec 31, 2023 | Reinforcement Learning (RL) | Unverified | 0
Laboratory Experiments of Model-based Reinforcement Learning for Adaptive Optics Control | Dec 30, 2023 | Model-based Reinforcement Learning, reinforcement-learning | Code Available | 0
Causal State Distillation for Explainable Reinforcement Learning | Dec 30, 2023 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Design Space Exploration of Approximate Computing Techniques with a Reinforcement Learning Approach | Dec 29, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e | Dec 28, 2023 | Reinforcement Learning (RL) | Unverified | 0
Resilient Constrained Reinforcement Learning | Dec 28, 2023 | Decision Making, reinforcement-learning | Unverified | 0
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity | Dec 28, 2023 | Reinforcement Learning (RL) | Code Available | 0
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems | Dec 27, 2023 | channel selection, Model Selection | Unverified | 0
Conversational Question Answering with Reformulations over Knowledge Graph | Dec 27, 2023 | Conversational Question Answering, Knowledge Graphs | Unverified | 0
General Method for Solving Four Types of SAT Problems | Dec 27, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Learning Online Policies for Person Tracking in Multi-View Environments | Dec 26, 2023 | Human Detection, Reinforcement Learning (RL) | Unverified | 0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration | Dec 26, 2023 | Deep Reinforcement Learning, Edge-computing | Unverified | 0
Agent based modelling for continuously varying supply chains | Dec 24, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization | Dec 23, 2023 | Quantization, Reinforcement Learning (RL) | Unverified | 0
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning | Dec 23, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning | Dec 23, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic | Dec 23, 2023 | Management, Q-Learning | Unverified | 0
Mutual Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand Ride Pooling | Dec 23, 2023 | Reinforcement Learning (RL) | Unverified | 0
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning | Dec 22, 2023 | AI Agent, Reinforcement Learning (RL) | Unverified | 0
REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback | Dec 22, 2023 | Bilevel Optimization, continuous-control | Unverified | 0