Twisting Lids Off with Two Hands | Mar 4, 2024 | Deep Reinforcement Learning reinforcement-learning | Unverified 0
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks | Mar 3, 2024 | Diversity reinforcement-learning | Unverified 0
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey | Mar 2, 2024 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | Unverified 0
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Mar 2, 2024 | Math Misconceptions | Code Available 1
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data | Mar 1, 2024 | continuous-control Continuous Control | Code Available 2
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning | Mar 1, 2024 | Multi-Objective Reinforcement Learning reinforcement-learning | Unverified 0
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks | Mar 1, 2024 | Deep Reinforcement Learning Reinforcement Learning (RL) | Unverified 0
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning | Mar 1, 2024 | Reinforcement Learning (RL) | Unverified 0
Robust Policy Learning via Offline Skill Diffusion | Mar 1, 2024 | Decoder Imitation Learning | Unverified 0
Large Language Models are Learnable Planners for Long-Term Recommendation | Feb 29, 2024 | Decision Making Language Modelling | Code Available 1
Offline Fictitious Self-Play for Competitive Games | Feb 29, 2024 | Offline RL Reinforcement Learning (RL) | Unverified 0
Curiosity-driven Red-teaming for Large Language Models | Feb 29, 2024 | Red Teaming Reinforcement Learning (RL) | Code Available 2
Investigating Gender Fairness in Machine Learning-driven Personalized Care for Chronic Pain | Feb 29, 2024 | Decision Making Fairness | Unverified 0
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Feb 29, 2024 | Language Modeling Language Modelling | Code Available 3
RL-GPT: Integrating Reinforcement Learning and Code-as-policy | Feb 29, 2024 | Minecraft reinforcement-learning | Unverified 0
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation | Feb 28, 2024 | Distributional Reinforcement Learning reinforcement-learning | Unverified 0
Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment | Feb 28, 2024 | reinforcement-learning Reinforcement Learning (RL) | Unverified 0
Learning to Program Variational Quantum Circuits with Fast Weights | Feb 27, 2024 | Quantum Machine Learning Reinforcement Learning (RL) | Unverified 0
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use | Feb 27, 2024 | Reinforcement Learning (RL) | Code Available 0
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning | Feb 27, 2024 | Reinforcement Learning (RL) Safe Reinforcement Learning | Unverified 0
Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test | Feb 26, 2024 | Autonomous Driving Reinforcement Learning (RL) | Unverified 0
Flexible Robust Beamforming for Multibeam Satellite Downlink using Reinforcement Learning | Feb 26, 2024 | Deep Reinforcement Learning reinforcement-learning | Code Available 1
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Feb 26, 2024 | GPU Minecraft | Code Available 3
QF-tuner: Breaking Tradition in Reinforcement Learning | Feb 26, 2024 | OpenAI Gym Q-Learning | Unverified 0
Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials | Feb 26, 2024 | reinforcement-learning Reinforcement Learning | Unverified 0
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Feb 26, 2024 | Imitation Learning MuJoCo | Unverified 0
Feedback Efficient Online Fine-Tuning of Diffusion Models | Feb 26, 2024 | reinforcement-learning Reinforcement Learning | Code Available 2
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Feb 25, 2024 | 3D Reconstruction Active 3D Reconstruction | Code Available 2
How Can LLM Guide RL? A Value-Based Approach | Feb 25, 2024 | Decision Making Reinforcement Learning (RL) | Code Available 1
Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning | Feb 24, 2024 | Bayesian Optimization Bilevel Optimization | Unverified 0
AltGraph: Redesigning Quantum Circuits Using Generative Graph Models for Efficient Optimization | Feb 23, 2024 | Reinforcement Learning (RL) | Unverified 0
HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding | Feb 23, 2024 | Imitation Learning Reinforcement Learning (RL) | Code Available 1
Foundation Policies with Hilbert Representations | Feb 23, 2024 | Reinforcement Learning (RL) Unsupervised Pre-training | Code Available 2
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RL reinforcement-learning | Unverified 0
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems | Feb 23, 2024 | Recommendation Systems Reinforcement Learning (RL) | Code Available 2
PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Feb 23, 2024 | Language Modeling Language Modelling | Unverified 0
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation | Feb 23, 2024 | Reinforcement Learning (RL) | Code Available 0
Reinforcement Learning with Elastic Time Steps | Feb 22, 2024 | reinforcement-learning Reinforcement Learning | Unverified 0
Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning | Feb 22, 2024 | CPU GPU | Unverified 0
Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation | Feb 21, 2024 | Multi-Objective Reinforcement Learning Reinforcement Learning (RL) | Code Available 0
Learning Dual-arm Object Rearrangement for Cartesian Robots | Feb 21, 2024 | Computational Efficiency Object | Unverified 0
Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark | Feb 21, 2024 | Reinforcement Learning (RL) | Unverified 0
Reinforcement learning-assisted quantum architecture search for variational quantum algorithms | Feb 21, 2024 | reinforcement-learning Reinforcement Learning | Unverified 0
AttackGNN: Red-Teaming GNNs in Hardware Security Using Reinforcement Learning | Feb 21, 2024 | Graph Neural Network Red Teaming | Unverified 0
Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning | Feb 21, 2024 | Cross-Modal Retrieval Image Captioning | Code Available 1
Deep Hedging with Market Impact | Feb 20, 2024 | Deep Reinforcement Learning reinforcement-learning | Unverified 0
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning | Feb 20, 2024 | reinforcement-learning Reinforcement Learning | Unverified 0
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs | Feb 20, 2024 | Decision Making Reinforcement Learning (RL) | Code Available 1
Antifragile Perimeter Control: Anticipating and Gaining from Disruptions with Reinforcement Learning | Feb 20, 2024 | Deep Reinforcement Learning Model Predictive Control | Unverified 0
XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques | Feb 20, 2024 | Decision Making Reinforcement Learning (RL) | Code Available 1