MPCritic: A plug-and-play MPC architecture for reinforcement learning Apr 1, 2025 Model Predictive Control Reinforcement Learning (RL)
Code Code Available 1ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Mar 27, 2025 Question Answering RAG
Code Code Available 1NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios Mar 25, 2025 Benchmarking Offline RL
Code Code Available 1Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training Mar 24, 2025 Diversity Large Language Model
Code Code Available 1Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation Mar 17, 2025 Mathematical Reasoning Reinforcement Learning (RL)
Code Code Available 1TERL: Large-Scale Multi-Target Encirclement Using Transformer-Enhanced Reinforcement Learning Mar 16, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 1Regulatory DNA sequence Design with Reinforcement Learning Mar 11, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 1VisRL: Intention-Driven Visual Perception via Reinforced Reasoning Mar 10, 2025 Reinforcement Learning (RL) Visual Reasoning
Code Code Available 1Discrete Codebook World Models for Continuous Control Mar 1, 2025 continuous-control Continuous Control
Code Code Available 1Reinforcement learning with combinatorial actions for coupled restless bandits Mar 1, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 1VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model Feb 26, 2025 Reinforcement Learning (RL)
Code Code Available 1Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning Feb 26, 2025 In-Context Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1Generating π-Functional Molecules Using STGG+ with Active Learning Feb 20, 2025 Active Learning reinforcement-learning
Code Code Available 1Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope? Feb 18, 2025 Benchmarking Blocking
Code Code Available 1Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation Feb 17, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 1Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems Feb 12, 2025 Reinforcement Learning (RL)
Code Code Available 1DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails Feb 7, 2025 Reinforcement Learning (RL) Synthetic Data Generation
Code Code Available 1Analytical Lyapunov Function Discovery: An RL-based Generative Approach Feb 4, 2025 Reinforcement Learning (RL) valid
Code Code Available 1GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic Environments Feb 3, 2025 Efficient Exploration Graph Neural Network
Code Code Available 1SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments Jan 31, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 1Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning Jan 29, 2025 continuous-control Continuous Control
Code Code Available 1xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking Jan 28, 2025 Reinforcement Learning (RL) Safety Alignment
Code Code Available 1An Attentive Graph Agent for Topology-Adaptive Cyber Defence Jan 24, 2025 Graph Attention Graph Neural Network
Code Code Available 1From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training Jan 10, 2025 Reinforcement Learning (RL)
Code Code Available 1Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies Jan 6, 2025 Decision Making Deep Reinforcement Learning
Code Code Available 1Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation Dec 29, 2024 Reinforcement Learning (RL)
Code Code Available 1Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference Dec 18, 2024 Reinforcement Learning (RL)
Code Code Available 1RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement Dec 16, 2024 Reinforcement Learning (RL)
Code Code Available 1Entropy-Regularized Process Reward Model Dec 15, 2024 GSM8K Math
Code Code Available 1Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning Dec 15, 2024 Decision Making Large Language Model
Code Code Available 1Are Expressive Models Truly Necessary for Offline RL? Dec 15, 2024 D4RL Offline RL
Code Code Available 1Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer Dec 10, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model Dec 7, 2024 D4RL model
Code Code Available 1Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning Dec 5, 2024 Large Language Model Meta Reinforcement Learning
Code Code Available 1AI-Driven Day-to-Day Route Choice Dec 4, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Multi-Agent Environments for Vehicle Routing Problems Nov 21, 2024 Benchmarking reinforcement-learning
Code Code Available 1LEDRO: LLM-Enhanced Design Space Reduction and Optimization for Analog Circuits Nov 19, 2024 Bayesian Optimization Reinforcement Learning (RL)
Code Code Available 1Doubly Mild Generalization for Offline Reinforcement Learning Nov 12, 2024 MuJoCo Offline RL
Code Code Available 1Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC Nov 6, 2024 Computational Efficiency Deep Reinforcement Learning
Code Code Available 1Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity Oct 31, 2024 MuJoCo Q-Learning
Code Code Available 1Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers Oct 31, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback Oct 30, 2024 Decision Making Language Modeling
Code Code Available 1Learning Successor Features the Simple Way Oct 29, 2024 Continual Learning Deep Reinforcement Learning
Code Code Available 1A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Oct 29, 2024 Mamba Reinforcement Learning (RL)
Code Code Available 1Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression Oct 25, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 1Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration Oct 23, 2024 Efficient Exploration Reinforcement Learning (RL)
Code Code Available 1Reinforced Imitative Trajectory Planning for Urban Automated Driving Oct 21, 2024 Imitation Learning reinforcement-learning
Code Code Available 1Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning Oct 17, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Safety Filtering While Training: Improving the Performance and Sample Efficiency of Reinforcement Learning Agents Oct 15, 2024 Reinforcement Learning (RL)
Code Code Available 1Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient Oct 11, 2024 Mamba Model-based Reinforcement Learning
Code Code Available 1