SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The main challenge is balancing the exploitation of current estimates against gathering information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
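
The trade-off described above can be illustrated on a toy multi-armed bandit. The following is a minimal sketch using epsilon-greedy action selection, a simple baseline chosen here for illustration only; it is not the method of the source paper cited above. The function name `epsilon_greedy_bandit` and all of its parameters (`true_means`, `steps`, `epsilon`, `seed`) are hypothetical.

```python
import random


def epsilon_greedy_bandit(true_means, steps=2000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on a Bernoulli bandit (illustrative sketch only).

    true_means: assumed success probability of each arm (hypothetical input).
    Returns per-arm pull counts, per-arm value estimates, and total reward.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: gather information about a uniformly random arm.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best current estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean for the pulled arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    counts, estimates, total = epsilon_greedy_bandit([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("value estimates:", [round(e, 3) for e in estimates])
    print("total reward:", total)
```

With `epsilon=0.0` the agent only exploits: it never revisits arms whose estimates start out low, which is the failure mode that exploration strategies aim to avoid. Larger values of `epsilon` spend more pulls on information gathering at the cost of immediate reward.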

Papers

Showing 301–350 of 514 papers

Title | Status | Hype
Sparse graphs using exchangeable random measures | | 0
Misspecification-robust likelihood-free inference in high dimensions | | 0
√n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank | | 0
Structured exploration in the finite horizon linear quadratic dual control problem | | 0
Successor-Predecessor Intrinsic Exploration | | 0
Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery | | 0
TANDEM: Learning Joint Exploration and Decision Making with Tactile Sensors | | 0
Targeting the partition function of chemically disordered materials with a generative approach based on inverse variational autoencoders | | 0
Task-agnostic Exploration in Reinforcement Learning | | 0
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning | | 0
The Eigenoption-Critic Framework | | 0
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors | | 0
The Role of Coverage in Online Reinforcement Learning | | 0
The University of Cambridge Russian-English System at WMT13 | | 0
Thompson Sampling Algorithms for Cascading Bandits | | 0
TopoNav: Topological Navigation for Efficient Exploration in Sparse Reward Environments | | 0
Towards A Unified Agent with Foundation Models | | 0
Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation | | 0
Reinforcement Learning in Credit Scoring and Underwriting | | 0
Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand | | 0
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning | | 0
VASE: Variational Assorted Surprise Exploration for Reinforcement Learning | | 0
VDSC: Enhancing Exploration Timing with Value Discrepancy and State Counts | | 0
Vector Quantization using the Improved Differential Evolution Algorithm for Image Compression | | 0
Virtual Action Actor-Critic Framework for Exploration (Student Abstract) | | 0
Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning | | 0
Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation | | 0
Volumetric Spanners: an Efficient Exploration Basis for Learning | | 0
Weakly-Supervised Reinforcement Learning for Controllable Behavior | | 0
When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms | | 0
Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects | | 0
WoMAP: World Models For Embodied Open-Vocabulary Object Localization | | 0
World Models with Hints of Large Language Models for Goal Achieving | | 0
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance | | 0
Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | | 0
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts | | 0
FIT-SLAM -- Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments | | 0
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences | | 0
Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for MCMC | | 0
Fractional Langevin Monte Carlo: Exploring Lévy Driven Stochastic Differential Equations for Markov Chain Monte Carlo | | 0
FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching | | 0
From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | | 0
From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning | | 0
GFlowNets for AI-Driven Scientific Discovery | | 0
GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices | | 0
Goal-oriented Trajectories for Efficient Exploration | | 0
Go-Browse: Training Web Agents with Structured Exploration | | 0
Go-Explore for Residential Energy Management | | 0
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | | 0

No leaderboard results yet.