SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 72017225 of 15113 papers

TitleStatusHype
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning0
Towards Learning to Imitate from a Single Video Demonstration0
Visual Imitation with Reinforcement Learning using Recurrent Siamese Networks0
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization0
Visual-Policy Learning through Multi-Camera View to Single-Camera View Knowledge Distillation for Robot Manipulation Tasks0
Visual processing in context of reinforcement learning0
Visual Radial Basis Q-Network0
Visual Rationalizations in Deep Reinforcement Learning for Atari Games0
Visual search and recognition for robot task execution and monitoring0
Visual Semantic Planning using Deep Successor Representations0
Visual Sensor Network Reconfiguration with Deep Reinforcement Learning0
Software Simulation and Visualization of Quantum Multi-Drone Reinforcement Learning0
Visual-Tactile Multimodality for Following Deformable Linear Objects Using Reinforcement Learning0
Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator0
Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter0
ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data0
Vizarel: A System to Help Better Understand RL Agents0
VLMLight: Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning0
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making0
VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving0
VLP: Vision-Language Preference Learning for Embodied Manipulation0
VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving0
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control0
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement0
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play0
Show:102550
← PrevPage 289 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified