Distributional Reinforcement Learning with Online Risk-awareness Adaption Oct 8, 2023 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data Oct 8, 2023 Autonomous Driving Q-Learning
Code Code Available 0Learning Generalizable Agents via Saliency-Guided Features Decorrelation Oct 8, 2023 Reinforcement Learning (RL)
— Unverified 0GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models Oct 8, 2023 GPU Reinforcement Learning (RL)
Code Code Available 1Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration Oct 7, 2023 Offline RL reinforcement-learning
— Unverified 0Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API Oct 7, 2023 Decoder document understanding
— Unverified 0Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach Oct 7, 2023 Decision Making reinforcement-learning
— Unverified 0Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets Oct 6, 2023 D4RL Decision Making
Code Code Available 1Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning Oct 6, 2023 Multi-agent Reinforcement Learning Offline RL
— Unverified 0Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison Oct 6, 2023 Continuous Control reinforcement-learning
— Unverified 0Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning Oct 6, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems Oct 6, 2023 Navigate Recommendation Systems
— Unverified 0Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC Oct 5, 2023 Model Predictive Control Q-Learning
— Unverified 0RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels Oct 5, 2023 Bayesian Optimization Meta-Learning
— Unverified 0LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework Oct 5, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Resilient Legged Local Navigation: Learning to Traverse with Compromised Perception End-to-End Oct 5, 2023 Anomaly Detection CPU
— Unverified 0Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning Oct 5, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0How the level sampling process impacts zero-shot generalisation in deep reinforcement learning Oct 5, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms Oct 5, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design Oct 4, 2023 Deep Reinforcement Learning General Reinforcement Learning
Code Code Available 1Proximal Policy Optimization-Based Reinforcement Learning Approach for DC-DC Boost Converter Control: A Comparative Evaluation Against Traditional Control Techniques Oct 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis Oct 4, 2023 Code Generation Deep Reinforcement Learning
— Unverified 0Multi-Agent Reinforcement Learning for Power Grid Topology Optimization Oct 4, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Neural architecture impact on identifying temporally extended Reinforcement Learning tasks Oct 4, 2023 Deep Reinforcement Learning image-classification
— Unverified 0Searching for High-Value Molecules Using Reinforcement Learning and Transformers Oct 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own Oct 4, 2023 Quantization reinforcement-learning
— Unverified 0Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Oct 4, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0PGDQN: Preference-Guided Deep Q-Network Oct 3, 2023 Atari Games Benchmarking
Code Code Available 1Navigating Uncertainty in ESG Investing Oct 3, 2023 Navigate Reinforcement Learning (RL)
— Unverified 0On Representation Complexity of Model-based and Model-free Reinforcement Learning Oct 3, 2023 model MuJoCo
— Unverified 0Prioritized Soft Q-Decomposition for Lexicographic Reinforcement Learning Oct 3, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback Oct 3, 2023 Deep Reinforcement Learning Q-Learning
— Unverified 0AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Oct 3, 2023 Attribute Reinforcement Learning (RL)
— Unverified 0Towards a Unified Framework for Sequential Decision Making Oct 3, 2023 Bayesian Inference Decision Making
— Unverified 0Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency Oct 3, 2023 Reinforcement Learning (RL)
Code Code Available 0Blending Imitation and Reinforcement Learning for Robust Policy Improvement Oct 3, 2023 Imitation Learning reinforcement-learning
— Unverified 0REMEDI: REinforcement learning-driven adaptive MEtabolism modeling of primary sclerosing cholangitis DIsease progression Oct 2, 2023 Reinforcement Learning (RL)
— Unverified 0Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning Oct 2, 2023 Offline RL reinforcement-learning
— Unverified 0Improving Dialogue Management: Quality Datasets vs Models Oct 2, 2023 Dialog Learning Dialogue Management
Code Code Available 0From Bandits Model to Deep Deterministic Policy Gradient, Reinforcement Learning with Contextual Information Oct 1, 2023 Decision Making reinforcement-learning
— Unverified 0Adaptive Control of an Inverted Pendulum by a Reinforcement Learning-based LQR Method Sep 30, 2023 Benchmarking Reinforcement Learning (RL)
— Unverified 0Controlling Neural Style Transfer with Deep Reinforcement Learning Sep 30, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Improving Planning with Large Language Models: A Modular Agentic Architecture Sep 30, 2023 In-Context Learning Reinforcement Learning (RL)
Code Code Available 1Learning to Rewrite Prompts for Personalized Text Generation Sep 29, 2023 Language Modelling Large Language Model
— Unverified 0Reinforcement Learning for Node Selection in Branch-and-Bound Sep 29, 2023 Graph Neural Network reinforcement-learning
— Unverified 0Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness Sep 29, 2023 Offline RL reinforcement-learning
Code Code Available 0AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback Sep 29, 2023 Common Sense Reasoning Decision Making
Code Code Available 1Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Sep 29, 2023 Image Generation Offline RL
Code Code Available 1ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery Sep 29, 2023 Contrastive Learning Diversity
Code Code Available 0A Quantum States Preparation Method Based on Difference-Driven Reinforcement Learning Sep 29, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0