SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The central challenge is balancing exploitation of current value estimates against gathering information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
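
The trade-off described above can be illustrated on a toy multi-armed bandit. The sketch below uses UCB1, a standard textbook rule, and not the randomized value function method of the cited source. The function name `run_ucb1`, the Bernoulli reward model, and all parameters are assumptions made for illustration only.

```python
import math
import random


def run_ucb1(true_means, steps, seed=0):
    """Play a Bernoulli bandit with UCB1; return pull counts and estimates.

    true_means: hypothetical success probability of each arm (illustrative).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm has been pulled
    estimates = [0.0] * n_arms   # running mean reward of each arm

    for t in range(1, steps + 1):
        if t <= n_arms:
            # Pull every arm once so that each estimate is defined.
            arm = t - 1
        else:
            # Exploitation term (current estimate) plus an exploration
            # bonus that shrinks as an arm is pulled more often.
            scores = [
                estimates[a] + math.sqrt(2.0 * math.log(t) / counts[a])
                for a in range(n_arms)
            ]
            arm = max(range(n_arms), key=scores.__getitem__)

        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return counts, estimates


if __name__ == "__main__":
    counts, estimates = run_ucb1([0.1, 0.9], steps=2000)
    print("pull counts:", counts)
    print("estimated means:", estimates)
```

In this sketch the bonus term drives the agent to keep sampling rarely tried arms, while the running mean pulls it toward the arm that currently looks best. Deep reinforcement learning methods face the same tension over far larger state and action spaces.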

Papers

Showing 451-500 of 514 papers

| Title | Status | Hype |
| --- | --- | --- |
| Exploration by Uncertainty in Reward Space | | 0 |
| Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain | | 0 |
| Exploration in Model-based Reinforcement Learning with Randomized Reward | | 0 |
| Exploration of the search space of Gaussian graphical models for paired data | | 0 |
| Exploration via Epistemic Value Estimation | | 0 |
| Exploratory Diffusion Model for Unsupervised Reinforcement Learning | | 0 |
| Explore until Confident: Efficient Exploration for Embodied Question Answering | | 0 |
| Exploring More When It Needs in Deep Reinforcement Learning | | 0 |
| Cognitive Planning for Object Goal Navigation using Generative AI Models | | 0 |
| Extended Formulations for Online Linear Bandit Optimization | | 0 |
| FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | | 0 |
| Fast exploration and learning of latent graphs with aliased observations | | 0 |
| Feature and Instance Joint Selection: A Reinforcement Learning Perspective | | 0 |
| Feature Engineering for Predictive Modeling using Reinforcement Learning | | 0 |
| Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | | 0 |
| Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts | | 0 |
| FIT-SLAM -- Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments | | 0 |
| f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences | | 0 |
| Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for MCMC | | 0 |
| Fractional Langevin Monte Carlo: Exploring Lévy Driven Stochastic Differential Equations for Markov Chain Monte Carlo | | 0 |
| FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching | | 0 |
| From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | | 0 |
| From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0 |
| GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning | | 0 |
| GFlowNets for AI-Driven Scientific Discovery | | 0 |
| GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices | | 0 |
| Goal-oriented Trajectories for Efficient Exploration | | 0 |
| Go-Browse: Training Web Agents with Structured Exploration | | 0 |
| Go-Explore for Residential Energy Management | | 0 |
| GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | | 0 |
| Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion | | 0 |
| Guarded Policy Optimization with Imperfect Online Demonstrations | | 0 |
| Guided Exploration for Efficient Relational Model Learning | | 0 |
| Hands-Free Segmentation of Medical Volumes via Binary Inputs | | 0 |
| Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning | | 0 |
| HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression | | 0 |
| HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold3 | | 0 |
| Hierarchical reinforcement learning for efficient exploration and transfer | | 0 |
| HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings | | 0 |
| Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning | | 0 |
| IB-MVS: An Iterative Algorithm for Deep Multi-View Stereo based on Binary Decisions | | 0 |
| Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks | | 0 |
| Impact of detecting clinical trial elements in exploration of COVID-19 literature | | 0 |
| Implicit Generative Modeling for Efficient Exploration | | 0 |
| Improving a State-of-the-Art Heuristic for the Minimum Latency Problem with Data Mining | | 0 |
| Incentivizing Exploration with Selective Data Disclosure | | 0 |
| Inferring Hierarchical Structure in Multi-Room Maze Environments | | 0 |
| Information Content Exploration | | 0 |
| Interpretable SHAP-bounded Bayesian Optimization for Underwater Acoustic Metamaterial Coating Design | | 0 |
| Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search | | 0 |

No leaderboard results yet.