Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning. Mar 24, 2025. Tags: Language Modeling; Language Modelling. Status: Unverified.
Evolutionary Policy Optimization. Mar 24, 2025. Tags: Diversity; Evolutionary Algorithms. Status: Unverified.
Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer Learning. Mar 24, 2025. Tags: Continual Learning; Deep Reinforcement Learning. Status: Code Available.
RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation. Mar 24, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models. Mar 24, 2025. Tags: Autonomous Driving; Reinforcement Learning (RL). Status: Unverified.
Parental Guidance: Efficient Lifelong Learning through Evolutionary Distillation. Mar 24, 2025. Tags: Continual Learning; Diversity. Status: Unverified.
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning. Mar 24, 2025. Tags: Decision Making; Hierarchical Reinforcement Learning. Status: Unverified.
Adaptive Multi-Fidelity Reinforcement Learning for Variance Reduction in Engineering Design Optimization. Mar 23, 2025. Tags: Reinforcement Learning (RL); Scheduling. Status: Unverified.
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization. Mar 23, 2025. Tags: Reinforcement Learning (RL); Response Generation. Status: Unverified.
ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data. Mar 23, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree. Mar 23, 2025. Tags: Decision Making; Deep Reinforcement Learning. Status: Unverified.
A Roadmap Towards Improving Multi-Agent Reinforcement Learning With Causal Discovery And Inference. Mar 22, 2025. Tags: Causal Discovery; Multi-agent Reinforcement Learning. Status: Unverified.
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation. Mar 22, 2025. Tags: Image Generation; Reinforcement Learning (RL). Status: Unverified.
Transferable Latent-to-Latent Locomotion Policy for Efficient and Versatile Motion Control of Diverse Legged Robots. Mar 22, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent. Mar 21, 2025. Tags: Large Language Model; Privacy Preserving. Status: Unverified.
Curriculum RL meets Monte Carlo Planning: Optimization of a Real World Container Management Problem. Mar 21, 2025. Tags: Collision Avoidance; Management. Status: Code Available.
Causally Aligned Curriculum Learning. Mar 21, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
UAS Visual Navigation in Large and Unseen Environments via a Meta Agent. Mar 20, 2025. Tags: Incremental Learning; Meta Reinforcement Learning. Status: Unverified.
OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning. Mar 20, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models. Mar 20, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Unverified.
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming. Mar 20, 2025. Tags: Combinatorial Optimization; reinforcement-learning. Status: Code Available.
Grammar and Gameplay-aligned RL for Game Description Generation with LLMs. Mar 20, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Unverified.
RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models. Mar 20, 2025. Tags: Image Generation; Medical Image Generation. Status: Unverified.
Behaviour Discovery and Attribution for Explainable Reinforcement Learning. Mar 19, 2025. Tags: Offline RL; reinforcement-learning. Status: Unverified.
Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat. Mar 19, 2025. Tags: Decision Making; Reinforcement Learning (RL). Status: Unverified.
Empowering Medical Multi-Agents with Clinical Consultation Flow for Dynamic Diagnosis. Mar 19, 2025. Tags: Decision Making; Diagnostic. Status: Unverified.
Comprehensive Review of Reinforcement Learning for Medical Ultrasound Imaging. Mar 19, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Unverified.
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning. Mar 19, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Code Available.
LogLLaMA: Transformer-based log anomaly detection with LLaMA. Mar 19, 2025. Tags: Anomaly Detection; Reinforcement Learning (RL). Status: Unverified.
Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning. Mar 19, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better. Mar 19, 2025. Tags: Attribute; Reinforcement Learning (RL). Status: Unverified.
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities. Mar 19, 2025. Tags: Reinforcement Learning (RL); Self-Supervised Learning. Status: Unverified.
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning. Mar 19, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Unverified.
Pauli Network Circuit Synthesis with Reinforcement Learning. Mar 18, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Unverified.
CTSAC: Curriculum-Based Transformer Soft Actor-Critic for Goal-Oriented Robot Exploration. Mar 18, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Unverified.
Revealing higher-order neural representations of uncertainty with the Noise Estimation through Reinforcement-based Diffusion (NERD) model. Mar 18, 2025. Tags: Denoising; Noise Estimation. Status: Unverified.
A Reinforcement Learning-Driven Transformer GAN for Molecular Generation. Mar 17, 2025. Tags: Drug Discovery; reinforcement-learning. Status: Unverified.
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation. Mar 17, 2025. Tags: Imitation Learning; Object. Status: Unverified.
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games. Mar 17, 2025. Tags: Atari Games; Q-Learning. Status: Unverified.
Synchronous vs Asynchronous Reinforcement Learning in a Real World Robot. Mar 17, 2025. Tags: Decision Making; Reinforcement Learning (RL). Status: Unverified.
Dynamic Angle Selection in X-Ray CT: A Reinforcement Learning Approach to Optimal Stopping. Mar 16, 2025. Tags: Computed Tomography (CT); Experimental Design. Status: Unverified.
Evaluation-Time Policy Switching for Offline Reinforcement Learning. Mar 15, 2025. Tags: Behavioural cloning; Offline RL. Status: Unverified.
Adaptive Torque Control of Exoskeletons under Spasticity Conditions via Reinforcement Learning. Mar 14, 2025. Tags: Deep Reinforcement Learning; Reinforcement Learning (RL). Status: Unverified.
Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning. Mar 14, 2025. Tags: Deep Reinforcement Learning; Q-Learning. Status: Unverified.
Learning to reset in target search problems. Mar 14, 2025. Tags: reinforcement-learning; Reinforcement Learning. Status: Code Available.
Dynamic Obstacle Avoidance with Bounded Rationality Adversarial Reinforcement Learning. Mar 14, 2025. Tags: Benchmarking; Navigate. Status: Unverified.
Reinforcement Learning-Based Controlled Switching Approach for Inrush Current Minimization in Power Transformers. Mar 14, 2025. Tags: Reinforcement Learning (RL). Status: Unverified.
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches. Mar 14, 2025. Tags: Imitation Learning; reinforcement-learning. Status: Unverified.
DeepSeek-Inspired Exploration of RL-based LLMs and Synergy with Wireless Networks: A Survey. Mar 13, 2025. Tags: Edge-computing; Intelligent Communication. Status: Unverified.
Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model. Mar 13, 2025. Tags: Language Modeling; Language Modelling. Status: Unverified.