Deterministic Exploration via Stationary Bellman Error Maximization Oct 31, 2024 Reinforcement Learning (RL)
— Unverified 0Noise as a Double-Edged Sword: Reinforcement Learning Exploits Randomized Defenses in Neural Networks Oct 31, 2024 Reinforcement Learning (RL)
— Unverified 0A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement Learning Oct 31, 2024 Reinforcement Learning (RL)
Code Code Available 0Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Oct 31, 2024 Diversity Informativeness
Code Code Available 0Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity Oct 31, 2024 MuJoCo Q-Learning
Code Code Available 1Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers Oct 31, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1Stepping Out of the Shadows: Reinforcement Learning in Shadow Mode Oct 30, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning Oct 30, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Oct 30, 2024 General Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 2Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation Oct 30, 2024 Offline RL Q-Learning
— Unverified 0Offline Behavior Distillation Oct 30, 2024 D4RL Reinforcement Learning (RL)
Code Code Available 0SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving Oct 30, 2024 Autonomous Driving Imitation Learning
— Unverified 0Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning Oct 30, 2024 D4RL reinforcement-learning
— Unverified 0Self-Driving Car Racing: Application of Deep Reinforcement Learning Oct 30, 2024 AI Agent Autonomous Driving
— Unverified 0Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback Oct 30, 2024 Decision Making Language Modeling
Code Code Available 1PC-Gym: Benchmark Environments For Process Control Problems Oct 29, 2024 Benchmarking Chemical Process
Code Code Available 2A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Oct 29, 2024 Mamba Reinforcement Learning (RL)
Code Code Available 1Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Oct 29, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Learning Successor Features the Simple Way Oct 29, 2024 Continual Learning Deep Reinforcement Learning
Code Code Available 1A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications Oct 28, 2024 Multi-agent Reinforcement Learning OpenAI Gym
— Unverified 0Bilevel Model for Electricity Market Mechanism Optimisation via Quantum Computing Enhanced Reinforcement Learning Oct 28, 2024 Bilevel Optimization Reinforcement Learning (RL)
— Unverified 0The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure Oct 28, 2024 Reinforcement Learning (RL) Transfer Reinforcement Learning
— Unverified 0LongReward: Improving Long-context Large Language Models with AI Feedback Oct 28, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 2ODRL: A Benchmark for Off-Dynamics Reinforcement Learning Oct 28, 2024 Benchmarking reinforcement-learning
Code Code Available 2Getting By Goal Misgeneralization With a Little Help From a Mentor Oct 28, 2024 Reinforcement Learning (RL)
— Unverified 0FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents Oct 28, 2024 Fairness reinforcement-learning
Code Code Available 0GFlowNet Fine-tuning for Diverse Correct Solutions in Mathematical Reasoning Tasks Oct 26, 2024 Diversity Mathematical Reasoning
— Unverified 0Off-Policy Selection for Initiating Human-Centric Experimental Design Oct 26, 2024 Experimental Design Reinforcement Learning (RL)
— Unverified 0OGBench: Benchmarking Offline Goal-Conditioned RL Oct 26, 2024 Benchmarking reinforcement-learning
Code Code Available 3Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning Oct 26, 2024 Reinforcement Learning (RL)
— Unverified 0Random Policy Enables In-Context Reinforcement Learning within Trust Horizons Oct 25, 2024 In-Context Learning In-Context Reinforcement Learning
— Unverified 0On-Robot Reinforcement Learning with Goal-Contrastive Rewards Oct 25, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting Oct 25, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Provably Adaptive Average Reward Reinforcement Learning for Metric Spaces Oct 25, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression Oct 25, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 1AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design Oct 25, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 0Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors Oct 25, 2024 Reinforcement Learning (RL) Small Language Model
— Unverified 0Adversarial Environment Design via Regret-Guided Diffusion Models Oct 25, 2024 Deep Reinforcement Learning Diversity
— Unverified 0PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds Oct 24, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance Oct 24, 2024 D4RL reinforcement-learning
— Unverified 0Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning Oct 24, 2024 Autonomous Driving Autonomous Racing
— Unverified 0Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes Oct 23, 2024 Management Q-Learning
— Unverified 0The Hive Mind is a Single Reinforcement Learning Agent Oct 23, 2024 Attribute Decision Making
— Unverified 0Primal-Dual Spectral Representation for Off-policy Evaluation Oct 23, 2024 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration Oct 23, 2024 Efficient Exploration Reinforcement Learning (RL)
Code Code Available 1Dynamic Spectrum Access for Ambient Backscatter Communication-assisted D2D Systems with Quantum Reinforcement Learning Oct 23, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Process Supervision-Guided Policy Optimization for Code Generation Oct 23, 2024 Code Generation Reinforcement Learning (RL)
— Unverified 0Learning Versatile Skills with Curriculum Masking Oct 23, 2024 Decision Making Offline RL
Code Code Available 0Multi-Modal Transformer and Reinforcement Learning-based Beam Management Oct 22, 2024 Beam Prediction Management
— Unverified 0Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts Oct 22, 2024 Reinforcement Learning (RL)
— Unverified 0