Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning Feb 7, 2025 continuous-control Continuous Control
— Unverified 0Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits Feb 7, 2025 Informativeness Offline RL
— Unverified 0Convergent NMPC-based Reinforcement Learning Using Deep Expected Sarsa and Nonlinear Temporal Difference Learning Feb 7, 2025 Reinforcement Learning (RL)
— Unverified 0Adversarially-Robust TD Learning with Markovian Data: Finite-Time Rates and Fundamental Limits Feb 7, 2025 Adversarial Robustness Reinforcement Learning (RL)
— Unverified 0Towards Smarter Sensing: 2D Clutter Mitigation in RL-Driven Cognitive MIMO Radar Feb 7, 2025 Integrated sensing and communication Reinforcement Learning (RL)
— Unverified 0Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization Feb 7, 2025 counterfactual Decision Making
— Unverified 0Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning Feb 6, 2025 Dataset Generation MuJoCo
— Unverified 0Illuminating Spaces: Deep Reinforcement Learning and Laser-Wall Partitioning for Architectural Layout Generation Feb 6, 2025 Deep Reinforcement Learning Layout Design
— Unverified 0Autotelic Reinforcement Learning: Exploring Intrinsic Motivations for Skill Acquisition in Open-Ended Environments Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0LLM Alignment as Retriever Optimization: An Information Retrieval Perspective Feb 6, 2025 Information Retrieval Misinformation
— Unverified 0Transforming Multimodal Models into Action Models for Radiotherapy Feb 6, 2025 Anatomy Few-Shot Learning
— Unverified 0Mirror Descent Actor Critic via Bounded Advantage Learning Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning Based Prediction of PID Controller Gains for Quadrotor UAVs Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0AI-driven materials design: a mini-review Feb 5, 2025 Evolutionary Algorithms Reinforcement Learning (RL)
— Unverified 0OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds Feb 5, 2025 Few-Shot Learning Imitation Learning
— Unverified 0Underwater Soft Fin Flapping Motion with Deep Neural Network Based Surrogate Model Feb 5, 2025 Reinforcement Learning (RL)
Code Code Available 0Optimizing Electric Vehicles Charging using Large Language Models and Graph Neural Networks Feb 5, 2025 Reinforcement Learning (RL)
— Unverified 0Calibrated Unsupervised Anomaly Detection in Multivariate Time-series using Reinforcement Learning Feb 5, 2025 Anomaly Detection Reinforcement Learning (RL)
— Unverified 0VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play Feb 4, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control Feb 4, 2025 Reinforcement Learning (RL)
— Unverified 0RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation Feb 4, 2025 Drone navigation Reinforcement Learning (RL)
— Unverified 0Circular Microalgae-Based Carbon Control for Net Zero Feb 4, 2025 Reinforcement Learning (RL)
Code Code Available 0Brief analysis of DeepSeek R1 and it's implications for Generative AI Feb 4, 2025 GPU Mixture-of-Experts
— Unverified 0Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning Feb 3, 2025 Meta Reinforcement Learning reinforcement-learning
— Unverified 0Preference VLM: Leveraging VLMs for Scalable Preference-Based Reinforcement Learning Feb 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0ACECODER: Acing Coder RL via Automated Test-Case Synthesis Feb 3, 2025 HumanEval mbpp
— Unverified 0Reinforcement Learning for Long-Horizon Interactive LLM Agents Feb 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Dynamic object goal pushing with mobile manipulators through model-free constrained reinforcement learning Feb 3, 2025 Friction Object
— Unverified 0Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning Feb 3, 2025 Meta-Learning Offline RL
— Unverified 0The Differences Between Direct Alignment Algorithms are a Blur Feb 3, 2025 Language Modeling Language Modelling
— Unverified 0Reinforcement Learning with Segment Feedback Feb 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer Feb 2, 2025 Reinforcement Learning (RL) Video Generation
— Unverified 0Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs Feb 1, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network Feb 1, 2025 continuous-control Continuous Control
— Unverified 0Recursive generalized type-2 fuzzy radial basis function neural networks for joint position estimation and adaptive EMG-based impedance control of lower limb exoskeletons Feb 1, 2025 Electromyography (EMG) GPU
Code Code Available 0A Differentiated Reward Method for Reinforcement Learning based Multi-Vehicle Cooperative Decision-Making Algorithms Feb 1, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0O-MAPL: Offline Multi-agent Preference Learning Jan 31, 2025 Reinforcement Learning (RL) SMAC
— Unverified 0RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception Jan 31, 2025 Reinforcement Learning (RL) Spatial Reasoning
— Unverified 0Towards Physiologically Sensible Predictions via the Rule-based Reinforcement Learning Layer Jan 31, 2025 Reinforcement Learning (RL)
— Unverified 0Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning Jan 31, 2025 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks Jan 31, 2025 Reinforcement Learning (RL) Scheduling
— Unverified 0B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning Jan 30, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Model-Free RL Agents Demonstrate System 1-Like Intentionality Jan 30, 2025 Jurisprudence Reinforcement Learning (RL)
— Unverified 0Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay Jan 30, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning Jan 29, 2025 Navigate Reinforcement Learning (RL)
— Unverified 0Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information Jan 29, 2025 Meta-Learning reinforcement-learning
— Unverified 0A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning Jan 29, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems Jan 29, 2025 Knowledge Distillation Natural Language Understanding
— Unverified 0Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care Jan 28, 2025 Reinforcement Learning (RL)
— Unverified 0RLPP: A Residual Method for Zero-Shot Real-World Autonomous Racing on Scaled Platforms Jan 28, 2025 Autonomous Racing Reinforcement Learning (RL)
Code Code Available 0