Generalization in Monitored Markov Decision Processes (Mon-MDPs) May 13, 2025 Reinforcement Learning (RL)
— Unverified 0Preference Optimization for Combinatorial Optimization Problems May 13, 2025 Combinatorial Optimization Reinforcement Learning (RL)
— Unverified 0Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning May 13, 2025 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning May 13, 2025 Meta-Learning Reinforcement Learning (RL)
— Unverified 0Adaptive Diffusion Policy Optimization for Robotic Manipulation May 13, 2025 continuous-control Continuous Control
Code Code Available 0DSADF: Thinking Fast and Slow for Decision Making May 13, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning May 12, 2025 Image Generation Reinforcement Learning (RL)
— Unverified 0DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward May 12, 2025 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 0Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review May 12, 2025 Active Learning Bayesian Inference
— Unverified 0INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning May 12, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0The Exploratory Multi-Asset Mean-Variance Portfolio Selection using Reinforcement Learning May 12, 2025 Reinforcement Learning (RL)
— Unverified 0Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains May 12, 2025 continuous-control Continuous Control
— Unverified 0Design and Experimental Test of Datatic Approximate Optimal Filter in Nonlinear Dynamic Systems May 11, 2025 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Learning Value of Information towards Joint Communication and Control in 6G V2X May 11, 2025 Autonomous Vehicles Decision Making
— Unverified 0FACET: Force-Adaptive Control via Impedance Reference Tracking for Legged Robots May 11, 2025 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning (RL) Meets Urban Climate Modeling: Investigating the Efficacy and Impacts of RL-Based HVAC Control May 11, 2025 Reinforcement Learning (RL)
— Unverified 0X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real May 11, 2025 Domain Adaptation Imitation Learning
— Unverified 0Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach May 10, 2025 Autonomous Driving Offline RL
— Unverified 0Balancing Progress and Safety: A Novel Risk-Aware Objective for RL in Autonomous Driving May 10, 2025 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback May 10, 2025 Reinforcement Learning (RL)
— Unverified 0LineFlow: A Framework to Learn Active Control of Production Lines May 10, 2025 Reinforcement Learning (RL)
Code Code Available 0Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients May 9, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach May 9, 2025 Decision Making Pose Estimation
— Unverified 0Interaction-Aware Parameter Privacy-Preserving Data Sharing in Coupled Systems via Particle Filter Reinforcement Learning May 9, 2025 Decision Making Privacy Preserving
— Unverified 0Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning May 9, 2025 D4RL Offline RL
— Unverified 0Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs May 8, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Multi-agent Embodied AI: Advances and Future Directions May 8, 2025 Navigate Reinforcement Learning (RL)
— Unverified 0On Corruption-Robustness in Performative Reinforcement Learning May 8, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0USPR: Learning a Unified Solver for Profiled Routing May 8, 2025 Computational Efficiency Decoder
Code Code Available 0Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach May 8, 2025 D4RL Decision Making
— Unverified 0RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles May 8, 2025 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Enhancing Reinforcement Learning for the Floorplanning of Analog ICs with Beam Search May 8, 2025 Reinforcement Learning (RL)
— Unverified 0Large Language Models are Autonomous Cyber Defenders May 7, 2025 Reinforcement Learning (RL)
Code Code Available 0Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers May 7, 2025 Math Reinforcement Learning (RL)
— Unverified 0Extending a Quantum Reinforcement Learning Exploration Policy with Flags to Connect Four May 7, 2025 Reinforcement Learning (RL)
— Unverified 0Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization May 7, 2025 Reinforcement Learning (RL)
— Unverified 0Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions May 7, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making May 6, 2025 Decision Making General Knowledge
— Unverified 0AMO: Adaptive Motion Optimization for Hyper-Dexterous Humanoid Whole-Body Control May 6, 2025 Imitation Learning Reinforcement Learning (RL)
— Unverified 0The Steganographic Potentials of Language Models May 6, 2025 Reinforcement Learning (RL)
— Unverified 0Deep Q-Network (DQN) multi-agent reinforcement learning (MARL) for Stock Trading May 6, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Actor-Critics Can Achieve Optimal Sample Efficiency May 6, 2025 Reinforcement Learning (RL)
— Unverified 0Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems May 6, 2025 Reinforcement Learning (RL) Scheduling
— Unverified 0Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning May 5, 2025 Reinforcement Learning (RL) Scheduling
— Unverified 0Online Phase Estimation of Human Oscillatory Motions using Deep Learning May 5, 2025 Deep Learning Position
— Unverified 0EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning May 5, 2025 Ensemble Learning Large Language Model
Code Code Available 0Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study May 4, 2025 Offline RL Reinforcement Learning (RL)
— Unverified 0Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning May 4, 2025 Reinforcement Learning (RL) Retrieval
— Unverified 0Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning May 3, 2025 D4RL Offline RL
— Unverified 0A Generalised and Adaptable Reinforcement Learning Stopping Method May 3, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 0