NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models | Mar 13, 2025 | Imitation Learning, Reinforcement Learning (RL)
DeepSeek-Inspired Exploration of RL-based LLMs and Synergy with Wireless Networks: A Survey | Mar 13, 2025 | Edge-computing, Intelligent Communication
Scalable Evaluation of Online Facilitation Strategies via Synthetic Simulation of Discussions | Mar 13, 2025 | Reinforcement Learning (RL) | Code available
H2-MARL: Multi-Agent Reinforcement Learning for Pareto Optimality in Hospital Capacity Strain and Human Mobility during Epidemic | Mar 13, 2025 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL)
SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process | Mar 13, 2025 | Reinforcement Learning (RL)
SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models | Mar 13, 2025 | Reinforcement Learning (RL), World Knowledge
Edge AI-Powered Real-Time Decision-Making for Autonomous Vehicles in Adverse Weather Conditions | Mar 12, 2025 | Autonomous Navigation, Autonomous Vehicles
Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving | Mar 12, 2025 | Automated Theorem Proving, Reinforcement Learning (RL)
Large-scale Regional Traffic Signal Control Based on Single-Agent Reinforcement Learning | Mar 12, 2025 | Reinforcement Learning (RL), Traffic Signal Control
Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds | Mar 12, 2025 | Deep Reinforcement Learning, Knowledge Distillation
Solving Bayesian inverse problems with diffusion priors and off-policy RL | Mar 12, 2025 | Reinforcement Learning (RL)
Optimisation of the Accelerator Control by Reinforcement Learning: A Simulation-Based Approach | Mar 12, 2025 | Reinforcement Learning (RL)
Evaluating Reinforcement Learning Safety and Trustworthiness in Cyber-Physical Systems | Mar 12, 2025 | reinforcement-learning, Reinforcement Learning
MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics | Mar 12, 2025 | Benchmarking, GPU
Balancing SoC in Battery Cells using Safe Action Perturbations | Mar 11, 2025 | Deep Reinforcement Learning, Reinforcement Learning (RL)
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning | Mar 11, 2025 | Disentanglement, Reinforcement Learning (RL)
Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision Making, Reinforcement Learning (RL)
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents | Mar 11, 2025 | Navigate, Reinforcement Learning (RL)
MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models | Mar 11, 2025 | Large Language Model, Mixture-of-Experts
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents | Mar 11, 2025 | Management, Reinforcement Learning (RL)
A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models | Mar 11, 2025 | Decision Making, global-optimization
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model | Mar 11, 2025 | Reinforcement Learning (RL)
Adaptive routing protocols for determining optimal paths in AI multi-agent systems: a priority- and learning-enhanced approach | Mar 10, 2025 | Reinforcement Learning (RL)
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning | Mar 10, 2025 | Math, Meta Reinforcement Learning
Efficient Neural Clause-Selection Reinforcement | Mar 10, 2025 | Automated Theorem Proving, CPU
Automated Proof of Polynomial Inequalities via Reinforcement Learning | Mar 9, 2025 | reinforcement-learning, Reinforcement Learning | Code available
GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | Mar 9, 2025 | Card Games, Diversity
Dynamic Load Balancing for EV Charging Stations Using Reinforcement Learning and Demand Prediction | Mar 9, 2025 | Graph Neural Network, Reinforcement Learning (RL)
A Novel Multi-Objective Reinforcement Learning Algorithm for Pursuit-Evasion Game | Mar 9, 2025 | Multi-Objective Reinforcement Learning, Q-Learning
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models | Mar 9, 2025 | Anomaly Detection, Mamba | Code available
Probabilistic Shielding for Safe Reinforcement Learning | Mar 9, 2025 | reinforcement-learning, Reinforcement Learning
UAV-Assisted Coverage Hole Detection Using Reinforcement Learning in Urban Cellular Networks | Mar 9, 2025 | Reinforcement Learning (RL)
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning | Mar 8, 2025 | Bayesian Optimization, Deep Reinforcement Learning
Synergizing AI and Digital Twins for Next-Generation Network Optimization, Forecasting, and Security | Mar 8, 2025 | Federated Learning, Reinforcement Learning (RL)
Vairiational Stochastic Games | Mar 8, 2025 | Reinforcement Learning (RL), Variational Inference
Tractable Representations for Convergent Approximation of Distributional HJB Equations | Mar 7, 2025 | Reinforcement Learning (RL)
Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks | Mar 7, 2025 | Q-Learning, Reinforcement Learning (RL)
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning | Mar 7, 2025 | Offline RL, reinforcement-learning | Code available
Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation | Mar 7, 2025 | Multi-agent Reinforcement Learning, reinforcement-learning
Multi-Fidelity Policy Gradient Algorithms | Mar 7, 2025 | Reinforcement Learning (RL)
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation | Mar 7, 2025 | Deep Reinforcement Learning, Out-of-Distribution Detection
Data-Efficient Learning from Human Interventions for Mobile Robots | Mar 6, 2025 | Imitation Learning, Reinforcement Learning (RL)
Provably Correct Automata Embeddings for Optimal Automata-Conditioned Reinforcement Learning | Mar 6, 2025 | reinforcement-learning, Reinforcement Learning
Lessons learned from field demonstrations of model predictive control and reinforcement learning for residential and commercial HVAC: A review | Mar 6, 2025 | Model Predictive Control, Reinforcement Learning (RL) | Code available
Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models | Mar 6, 2025 | Motion Planning, reinforcement-learning
Can We Optimize Deep RL Policy Weights as Trajectory Modeling? | Mar 6, 2025 | Deep Reinforcement Learning, Reinforcement Learning (RL)
Energy-Weighted Flow Matching for Offline Reinforcement Learning | Mar 6, 2025 | Offline RL, reinforcement-learning
Rebalanced Multimodal Learning with Data-aware Unimodal Sampling | Mar 5, 2025 | Reinforcement Learning (RL)
Quantitative Resilience Modeling for Autonomous Cyber Defense | Mar 4, 2025 | Reinforcement Learning (RL)
Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models | Mar 4, 2025 | Reinforcement Learning (RL)