Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope? Feb 18, 2025 Benchmarking Blocking
Code Code Available 1Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning Feb 18, 2025 Combinatorial Optimization Deep Reinforcement Learning
Code Code Available 0Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making Feb 17, 2025 Decision Making Ethics
— Unverified 0Learning Plasma Dynamics and Robust Rampdown Trajectories with Predict-First Experiments at TCV Feb 17, 2025 Reinforcement Learning (RL)
— Unverified 0Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning Feb 17, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation Feb 17, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 1Scaling Test-Time Compute Without Verification or RL is Suboptimal Feb 17, 2025 Math Reinforcement Learning (RL)
— Unverified 0CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning Feb 17, 2025 MuJoCo Reinforcement Learning (RL)
— Unverified 0FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control Feb 17, 2025 Imitation Learning reinforcement-learning
— Unverified 0Intersectional Fairness in Reinforcement Learning with Large State and Constraint Spaces Feb 17, 2025 Fairness Reinforcement Learning (RL)
— Unverified 0Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning Feb 17, 2025 Deep Reinforcement Learning Deformable Object Manipulation
— Unverified 0VLP: Vision-Language Preference Learning for Embodied Manipulation Feb 17, 2025 Reinforcement Learning (RL)
— Unverified 0FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Feb 17, 2025 Decision Making parameter-efficient fine-tuning
— Unverified 0Evaluating the Paperclip Maximizer: Are RL-Based Language Models More Likely to Pursue Instrumental Goals? Feb 16, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 0Scalable Multi-Agent Offline Reinforcement Learning and the Role of Information Feb 16, 2025 Informativeness Reinforcement Learning (RL)
— Unverified 0Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents Feb 15, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Tackling the Zero-Shot Reinforcement Learning Loss Directly Feb 15, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation Feb 14, 2025 Reinforcement Learning (RL)
— Unverified 0Dynamic Reinforcement Learning for Actors Feb 14, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Causal Information Prioritization for Efficient Reinforcement Learning Feb 14, 2025 continuous-control Continuous Control
— Unverified 0Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning Feb 14, 2025 Reinforcement Learning (RL) Skills Assessment
Code Code Available 2BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds Feb 14, 2025 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations Feb 14, 2025 Atari Games Game of Go
— Unverified 0Digi-Q: Learning Q-Value Functions for Training Device-Control Agents Feb 13, 2025 Q-Learning Reinforcement Learning (RL)
Code Code Available 2Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches Feb 13, 2025 D4RL Offline RL
— Unverified 0Safe Reinforcement Learning-based Control for Hydrogen Diesel Dual-Fuel Engines Feb 13, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0A Survey of Reinforcement Learning for Optimization in Automation Feb 13, 2025 Meta-Learning Navigate
— Unverified 0Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems Feb 12, 2025 Reinforcement Learning (RL)
Code Code Available 1A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective Feb 12, 2025 Feature Engineering feature selection
— Unverified 0COMBO-Grasp: Learning Constraint-Based Manipulation for Bimanual Occluded Grasping Feb 12, 2025 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning Feb 12, 2025 regression Reinforcement Learning (RL)
— Unverified 0A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards Feb 12, 2025 Reinforcement Learning (RL)
— Unverified 0Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Clusters Feb 12, 2025 Cloud Computing Reinforcement Learning (RL)
— Unverified 0Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning Feb 11, 2025 Decision Making reinforcement-learning
— Unverified 0A Survey of In-Context Reinforcement Learning Feb 11, 2025 In-Context Reinforcement Learning reinforcement-learning
— Unverified 0Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol Feb 11, 2025 Model Selection Off-policy evaluation
— Unverified 0Active Advantage-Aligned Online Reinforcement Learning with Offline Data Feb 11, 2025 Offline RL reinforcement-learning
Code Code Available 0Optimal Actuator Attacks on Autonomous Vehicles Using Reinforcement Learning Feb 11, 2025 Autonomous Vehicles reinforcement-learning
— Unverified 0Exploratory Diffusion Model for Unsupervised Reinforcement Learning Feb 11, 2025 Efficient Exploration model
— Unverified 0Towards a Formal Theory of the Need for Competence via Computational Intrinsic Motivation Feb 11, 2025 Reinforcement Learning (RL)
— Unverified 0Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning Feb 11, 2025 Reinforcement Learning (RL)
— Unverified 0Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Feb 10, 2025 Math Mathematical Reasoning
Code Code Available 2Smell of Source: Learning-Based Odor Source Localization with Molecular Communication Feb 10, 2025 Computational Efficiency Disaster Response
— Unverified 0Select before Act: Spatially Decoupled Action Repetition for Continuous Control Feb 10, 2025 continuous-control Continuous Control
— Unverified 0A view on learning robust goal-conditioned value functions: Interplay between RL and MPC Feb 10, 2025 Model Predictive Control Reinforcement Learning (RL)
Code Code Available 0Intelligent Offloading in Vehicular Edge Computing: A Comprehensive Review of Deep Reinforcement Learning Approaches and Architectures Feb 10, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models Feb 8, 2025 Conformal Prediction Decision Making
Code Code Available 0Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning Feb 8, 2025 Combinatorial Optimization Computational Efficiency
— Unverified 0Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization Feb 7, 2025 counterfactual Decision Making
— Unverified 0Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits Feb 7, 2025 Informativeness Offline RL
— Unverified 0