B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning Jan 30, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay Jan 30, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Model-Free RL Agents Demonstrate System 1-Like Intentionality Jan 30, 2025 Jurisprudence Reinforcement Learning (RL)
— Unverified 0RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems Jan 29, 2025 Knowledge Distillation Natural Language Understanding
— Unverified 0Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information Jan 29, 2025 Meta-Learning reinforcement-learning
— Unverified 0Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning Jan 29, 2025 continuous-control Continuous Control
Code Code Available 1From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning Jan 29, 2025 Navigate Reinforcement Learning (RL)
— Unverified 0A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning Jan 29, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care Jan 28, 2025 Reinforcement Learning (RL)
— Unverified 0RLPP: A Residual Method for Zero-Shot Real-World Autonomous Racing on Scaled Platforms Jan 28, 2025 Autonomous Racing Reinforcement Learning (RL)
Code Code Available 0Exploratory Mean-Variance Portfolio Optimization with Regime-Switching Market Dynamics Jan 28, 2025 Portfolio Optimization Reinforcement Learning (RL)
— Unverified 0SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Jan 28, 2025 Arithmetic Reasoning Memorization
— Unverified 0Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning Jan 28, 2025 Federated Learning Knowledge Distillation
— Unverified 0Improving Vision-Language-Action Model with Online Reinforcement Learning Jan 28, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Jan 28, 2025 Reinforcement Learning (RL)
— Unverified 0xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking Jan 28, 2025 Reinforcement Learning (RL) Safety Alignment
Code Code Available 1Safe Reinforcement Learning for Real-World Engine Control Jan 28, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0MPC4RL -- A Software Package for Reinforcement Learning based on Model Predictive Control Jan 27, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback Jan 27, 2025 Offline RL Reinforcement Learning (RL)
— Unverified 0FuzzyLight: A Robust Two-Stage Fuzzy Approach for Traffic Signal Control Works in Real Cities Jan 27, 2025 compressed sensing Reinforcement Learning (RL)
— Unverified 0Benchmarking Quantum Reinforcement Learning Jan 27, 2025 Benchmarking reinforcement-learning
Code Code Available 0Towards General-Purpose Model-Free Reinforcement Learning Jan 27, 2025 model reinforcement-learning
— Unverified 0Selective Experience Sharing in Reinforcement Learning Enhances Interference Management Jan 27, 2025 Management Multi-agent Reinforcement Learning
— Unverified 0Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning Jan 26, 2025 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Learning-Enhanced Safeguard Control for High-Relative-Degree Systems: Robust Optimization under Disturbances and Faults Jan 26, 2025 Reinforcement Learning (RL) Safe Exploration
— Unverified 0Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning Jan 25, 2025 Answer Generation Multi-agent Reinforcement Learning
Code Code Available 2EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning Jan 25, 2025 Benchmarking Evolutionary Algorithms
Code Code Available 7Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction Jan 25, 2025 Decoder Machine Translation
— Unverified 0Data Center Cooling System Optimization Using Offline Reinforcement Learning Jan 25, 2025 Graph Neural Network Offline RL
— Unverified 0Towards Efficient Multi-Objective Optimisation for Real-World Power Grid Topology Control Jan 24, 2025 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Age and Power Minimization via Meta-Deep Reinforcement Learning in UAV Networks Jan 24, 2025 Deep Reinforcement Learning Meta-Learning
— Unverified 0Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework Jan 24, 2025 Q-Learning Reinforcement Learning (RL)
— Unverified 0An Attentive Graph Agent for Topology-Adaptive Cyber Defence Jan 24, 2025 Graph Attention Graph Neural Network
Code Code Available 1Large Language Model driven Policy Exploration for Recommender Systems Jan 23, 2025 Language Modeling Language Modelling
— Unverified 0To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning Jan 22, 2025 Management Reinforcement Learning (RL)
Code Code Available 0Kimi k1.5: Scaling Reinforcement Learning with LLMs Jan 22, 2025 Math reinforcement-learning
Code Code Available 7State Combinatorial Generalization In Decision Making With Conditional Diffusion Models Jan 22, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning with Hybrid Intrinsic Reward Model Jan 22, 2025 Deep Reinforcement Learning Diversity
— Unverified 0Adaptive Data Exploitation in Deep Reinforcement Learning Jan 22, 2025 Computational Efficiency Deep Reinforcement Learning
Code Code Available 0Exploring the Technology Landscape through Topic Modeling, Expert Involvement, and Reinforcement Learning Jan 22, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization Jan 22, 2025 Evolutionary Algorithms Meta-Learning
— Unverified 0Evolution and The Knightian Blindspot of Machine Learning Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Jan 22, 2025 Mathematical Reasoning Multi-task Language Understanding
Code Code Available 15AdaWM: Adaptive World Model based Planning for Autonomous Driving Jan 22, 2025 Autonomous Driving Model-based Reinforcement Learning
— Unverified 0Reinforcement Learning Constrained Beam Search for Parameter Optimization of Paper Drying Under Flexible Constraints Jan 21, 2025 Combinatorial Optimization reinforcement-learning
— Unverified 0RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Jan 21, 2025 Autonomous Driving Object Recognition
— Unverified 0Extend Adversarial Policy Against Neural Machine Translation via Unknown Token Jan 21, 2025 Machine Translation NMT
— Unverified 0Improving thermal state preparation of Sachdev-Ye-Kitaev model with reinforcement learning on quantum hardware Jan 20, 2025 Reinforcement Learning (RL)
Code Code Available 0