SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
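The interaction loop described above can be illustrated with a minimal sketch. It assumes a hypothetical five-state corridor in which the agent starts at state 0 and receives a reward of 1.0 on reaching state 4, and it uses tabular Q-learning with epsilon-greedy exploration as one common way to learn a policy. The environment, the function names (`step`, `train`, `greedy_policy`), and all hyperparameters are illustrative assumptions, not part of any listed paper or benchmark.

```python
import random

N_STATES = 5        # hypothetical corridor: states 0..4, state 4 is the goal
GAMMA = 0.9         # discount factor weighting long-term reward
MOVES = (-1, +1)    # action 0 moves left, action 1 moves right


def step(state, action):
    """Toy environment: returns (next_state, reward, done)."""
    next_state = min(max(state + MOVES[action], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (illustrative settings)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore: random action
            else:
                best = max(q[state])       # exploit: best action, ties broken randomly
                action = rng.choice([a for a in (0, 1) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrap from the best next-state value unless the episode ended.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Best action index for each non-terminal state."""
    return [max((0, 1), key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the learned greedy policy should move right in every non-terminal state, since that is the only way to reach the reward. Real tasks typically replace the table with a function approximator and the corridor with a simulator or a physical system.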

Papers

Showing 9251-9300 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| Reinforcement Learning Agents for Ubisoft's Roller Champions | | 0 |
| A Deep Reinforcement Learning Approach for Ramp Metering Based on Traffic Video Data | | 0 |
| Deep Reinforcement Learning for Long Term Hydropower Production Scheduling | | 0 |
| Deep Reinforcement Learning for Stock Portfolio Optimization | | 0 |
| Interactive Search Based on Deep Reinforcement Learning | | 0 |
| Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation | | 0 |
| Semi-Supervised Off Policy Reinforcement Learning | | 0 |
| Transfer Learning for Efficient Iterative Safety Validation | | 0 |
| MLComp: A Methodology for Machine Learning-based Performance Estimation and Adaptive Selection of Pareto-Optimal Compiler Optimization Sequences | | 0 |
| Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory | Code | 0 |
| Emergence of Different Modes of Tool Use in a Reaching and Dragging Task | | 0 |
| Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem | Code | 1 |
| NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments | Code | 1 |
| The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems | | 0 |
| Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning | Code | 1 |
| Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning | | 0 |
| Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation | | 0 |
| Reset-Free Lifelong Learning with Skill-Space Planning | Code | 1 |
| Efficient Reservoir Management through Deep Reinforcement Learning | | 0 |
| GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning | Code | 1 |
| Battery Model Calibration with Deep Reinforcement Learning | | 0 |
| Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning | | 0 |
| Multi-agent navigation based on deep reinforcement learning and traditional pathfinding algorithm | | 0 |
| RLOC: Terrain-Aware Legged Locomotion using Reinforcement Learning and Optimal Control | Code | 1 |
| Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation | | 0 |
| ACN-Sim: An Open-Source Simulator for Data-Driven Electric Vehicle Charging Research | Code | 1 |
| Demonstration-efficient Inverse Reinforcement Learning in Procedurally Generated Environments | | 0 |
| Neural Dynamic Policies for End-to-End Sensorimotor Learning | | 0 |
| Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation | | 0 |
| Model-Agnostic Learning to Meta-Learn | | 0 |
| Partially Connected Automated Vehicle Cooperative Control Strategy with a Deep Reinforcement Learning Approach | | 0 |
| DeepCrawl: Deep Reinforcement Learning for Turn-based Strategy Games | | 0 |
| Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design | Code | 0 |
| Dynamic RAN Slicing for Service-Oriented Vehicular Networks via Constrained Learning | | 0 |
| Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning | | 0 |
| Sample Complexity of Policy Gradient Finding Second-Order Stationary Points | | 0 |
| Pareto Deterministic Policy Gradients and Its Application in 5G Massive MIMO Networks | | 0 |
| A Safe Reinforcement Learning Architecture for Antenna Tilt Optimisation | | 0 |
| Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER | | 0 |
| Driving-Policy Adaptive Safeguard for Autonomous Vehicles Using Reinforcement Learning | | 0 |
| Coinbot: Intelligent Robotic Coin Bag Manipulation Using Deep Reinforcement Learning And Machine Teaching | | 0 |
| Are Gradient-based Saliency Maps Useful in Deep Reinforcement Learning? | | 0 |
| BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis | | 0 |
| Improving the Naturalness and Diversity of Referring Expression Generation models using Minimum Risk Training | | 0 |
| Improving Neural Machine Translation for Sanskrit-English | | 0 |
| Combining Cognitive Modeling and Reinforcement Learning for Clarification in Dialogue | | 0 |
| Answer-driven Deep Question Generation based on Reinforcement Learning | | 0 |
| A Learning-Exploring Method to Generate Diverse Paraphrases with Multi-Objective Deep Reinforcement Learning | | 0 |
| ExpanRL: Hierarchical Reinforcement Learning for Course Concept Expansion in MOOCs | | 0 |
| Text Simplification with Reinforcement Learning Using Supervised Rewards on Grammaticality, Meaning Preservation, and Simplicity | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |