SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1140111450 of 15113 papers

TitleStatusHype
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement LearningCode0
PolicyGNN: Aggregation Optimization for Graph Neural Networks0
Predicting Goal-directed Attention Control Using Inverse-Reinforcement Learning0
Preventing Imitation Learning with Adversarial Policy Ensembles0
A Deep Reinforcement Learning Approach to Concurrent Bilateral Negotiation0
Locally Private Distributed Reinforcement Learning0
Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles0
Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning0
Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning0
Distal Explanations for Model-free Explainable Reinforcement Learning0
Data-driven control of micro-climate in buildings: an event-triggered reinforcement learning approach0
Real-time calibration of coherent-state receivers: learning by trial and errorCode0
Some Insights into Lifelong Reinforcement Learning SystemsCode0
Rotation, Translation, and Cropping for Zero-Shot GeneralizationCode0
Unsupervised Program Synthesis for Images By Sampling Without Replacement0
Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey0
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement LearningCode0
Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning0
Computing the Feedback Capacity of Finite State Channels using Reinforcement LearningCode0
Constrained Upper Confidence Reinforcement Learning0
Tractable Reinforcement Learning of Signal Temporal Logic ObjectivesCode0
Sentiment and Knowledge Based Algorithmic Trading with Deep Reinforcement Learning0
Multitask radiological modality invariant landmark localization using deep reinforcement learningCode0
Following Instructions by Imagining and Reaching Visual Goals0
Deep Reinforcement Learning based Blind mmWave MIMO Beam Alignment0
End-to-End Vision-Based Adaptive Cruise Control (ACC) Using Deep Reinforcement Learning0
EgoMap: Projective mapping and structured egocentric memory for Deep RL0
Pricing commodity swing options0
Multi-objective Neural Architecture Search via Non-stationary Policy Gradient0
Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework0
Reducing Non-Normative Text Generation from Language Models0
Local Policy Optimization for Trajectory-Centric Reinforcement Learning0
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal BabblingCode0
Reinforcement Learning Based Vehicle-cell Association Algorithm for Highly Mobile Millimeter Wave Communication0
On Solving Cooperative MARL Problems with a Few Good Experiences0
Unsupervisedly Learned Representations: Should the Quest be Over?0
Lyceum: An efficient and scalable ecosystem for robot learning0
Emergence of Pragmatics from Referential Game between Theory of Mind AgentsCode0
Cooperative Highway Work Zone Merge Control based on Reinforcement Learning in A Connected and Automated Environment0
Intelligent Bandwidth Allocation for Latency Management in NG-EPON using Reinforcement Learning Methods0
Improving Interaction Quality Estimation with BiLSTMs and the Impact on Dialogue Policy Learning0
Memristor Hardware-Friendly Reinforcement Learning0
Reinforcement Learning with Probabilistically Complete Exploration0
Nested-Wasserstein Self-Imitation Learning for Sequence Generation0
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback0
A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions0
Learning Options from Demonstration using Skill Segmentation0
BNAS:An Efficient Neural Architecture Search Approach Using Broad Scalable Architecture0
cube2net: Efficient Query-Specific Network Construction with Data Cube Organization0
Multi-agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning0
Show:102550
← PrevPage 229 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified