Maximum Entropy Hindsight Experience Replay Oct 31, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Oct 31, 2024 Diversity Informativeness
Code Code Available 0A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement Learning Oct 31, 2024 Reinforcement Learning (RL)
Code Code Available 0Deterministic Exploration via Stationary Bellman Error Maximization Oct 31, 2024 Reinforcement Learning (RL)
— Unverified 0Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation Oct 30, 2024 Offline RL Q-Learning
— Unverified 0Stepping Out of the Shadows: Reinforcement Learning in Shadow Mode Oct 30, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning Oct 30, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Self-Driving Car Racing: Application of Deep Reinforcement Learning Oct 30, 2024 AI Agent Autonomous Driving
— Unverified 0Offline Behavior Distillation Oct 30, 2024 D4RL Reinforcement Learning (RL)
Code Code Available 0SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving Oct 30, 2024 Autonomous Driving Imitation Learning
— Unverified 0Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning Oct 30, 2024 D4RL reinforcement-learning
— Unverified 0Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Oct 29, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications Oct 28, 2024 Multi-agent Reinforcement Learning OpenAI Gym
— Unverified 0Getting By Goal Misgeneralization With a Little Help From a Mentor Oct 28, 2024 Reinforcement Learning (RL)
— Unverified 0FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents Oct 28, 2024 Fairness reinforcement-learning
Code Code Available 0The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure Oct 28, 2024 Reinforcement Learning (RL) Transfer Reinforcement Learning
— Unverified 0Bilevel Model for Electricity Market Mechanism Optimisation via Quantum Computing Enhanced Reinforcement Learning Oct 28, 2024 Bilevel Optimization Reinforcement Learning (RL)
— Unverified 0Off-Policy Selection for Initiating Human-Centric Experimental Design Oct 26, 2024 Experimental Design Reinforcement Learning (RL)
— Unverified 0GFlowNet Fine-tuning for Diverse Correct Solutions in Mathematical Reasoning Tasks Oct 26, 2024 Diversity Mathematical Reasoning
— Unverified 0Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning Oct 26, 2024 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting Oct 25, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Random Policy Enables In-Context Reinforcement Learning within Trust Horizons Oct 25, 2024 In-Context Learning In-Context Reinforcement Learning
— Unverified 0AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design Oct 25, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 0On-Robot Reinforcement Learning with Goal-Contrastive Rewards Oct 25, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Adversarial Environment Design via Regret-Guided Diffusion Models Oct 25, 2024 Deep Reinforcement Learning Diversity
— Unverified 0Provably Adaptive Average Reward Reinforcement Learning for Metric Spaces Oct 25, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors Oct 25, 2024 Reinforcement Learning (RL) Small Language Model
— Unverified 0SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance Oct 24, 2024 D4RL reinforcement-learning
— Unverified 0PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds Oct 24, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning Oct 24, 2024 Autonomous Driving Autonomous Racing
— Unverified 0Primal-Dual Spectral Representation for Off-policy Evaluation Oct 23, 2024 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes Oct 23, 2024 Management Q-Learning
— Unverified 0Dynamic Spectrum Access for Ambient Backscatter Communication-assisted D2D Systems with Quantum Reinforcement Learning Oct 23, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Learning Versatile Skills with Curriculum Masking Oct 23, 2024 Decision Making Offline RL
Code Code Available 0The Hive Mind is a Single Reinforcement Learning Agent Oct 23, 2024 Attribute Decision Making
— Unverified 0Process Supervision-Guided Policy Optimization for Code Generation Oct 23, 2024 Code Generation Reinforcement Learning (RL)
— Unverified 0Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks Oct 22, 2024 Federated Learning Meta-Learning
— Unverified 0Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts Oct 22, 2024 Reinforcement Learning (RL)
— Unverified 0DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning Oct 22, 2024 Ensemble Learning reinforcement-learning
— Unverified 0Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards Oct 22, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning Oct 22, 2024 Autonomous Driving Multi-agent Reinforcement Learning
— Unverified 0DyPNIPP: Predicting Environment Dynamics for RL-based Robust Informative Path Planning Oct 22, 2024 Reinforcement Learning (RL)
— Unverified 0Multi-Modal Transformer and Reinforcement Learning-based Beam Management Oct 22, 2024 Beam Prediction Management
— Unverified 0Curriculum Reinforcement Learning for Complex Reward Functions Oct 22, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Benchmarking Smoothness and Reducing High-Frequency Oscillations in Continuous Control Policies Oct 22, 2024 Benchmarking continuous-control
— Unverified 0Offline reinforcement learning for job-shop scheduling problems Oct 21, 2024 Combinatorial Optimization Deep Learning
— Unverified 0Reinforcement Learning for Dynamic Memory Allocation Oct 20, 2024 Management reinforcement-learning
Code Code Available 0Training Language Models to Critique With Multi-agent Feedback Oct 20, 2024 Reinforcement Learning (RL)
— Unverified 0MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning Oct 19, 2024 Deep Reinforcement Learning Mixture-of-Experts
— Unverified 0Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control Oct 19, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0