SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 201250 of 514 papers

TitleStatusHype
Improving a State-of-the-Art Heuristic for the Minimum Latency Problem with Data Mining0
Deep density networks and uncertainty in recommender systems0
Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models0
FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs0
Fast exploration and learning of latent graphs with aliased observations0
Feature and Instance Joint Selection: A Reinforcement Learning Perspective0
Deep exploration by novelty-pursuit with maximum state entropy0
Feature Engineering for Predictive Modeling using Reinforcement Learning0
Efficient exploration of zero-sum stochastic games0
Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks0
Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces0
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts0
DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network0
FIT-SLAM -- Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments0
Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization0
ActiveGAMER: Active GAussian Mapping through Efficient Rendering0
Impact of detecting clinical trial elements in exploration of COVID-19 literature0
Fractional Langevin Monte Carlo: Exploring Lévy Driven Stochastic Differential Equations for Markov Chain Monte Carlo0
FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching0
From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making0
From proprioception to long-horizon planning in novel environments: A hierarchical RL model0
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning0
Incentivizing Exploration with Selective Data Disclosure0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
A Straightforward Gradient-Based Approach for High-Tc Superconductor Design: Leveraging Domain Knowledge via Adaptive Constraints0
Computing low-thrust transfers in the asteroid belt, a comparison between astrodynamical manipulations and a machine learning approach0
Efficient Exploration of Gradient Space for Online Learning to Rank0
Efficient Exploration in Resource-Restricted Reinforcement Learning0
Computational Discovery of Microstructured Composites with Optimal Stiffness-Toughness Trade-Offs0
GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices0
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving0
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads0
Goal-oriented Trajectories for Efficient Exploration0
Diffusion Models Meet Contextual Bandits with Large Action Spaces0
Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm0
Efficient Exploration in Continuous-time Model-based Reinforcement Learning0
Go-Explore for Residential Energy Management0
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering0
Comprehensive decision-strategy space exploration for efficient territorial planning strategies0
Guarded Policy Optimization with Imperfect Online Demonstrations0
Guided Exploration for Efficient Relational Model Learning0
Hands-Free Segmentation of Medical Volumes via Binary Inputs0
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning0
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression0
HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold30
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance0
Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path0
Efficient Exploration in Binary and Preferential Bayesian Optimization0
A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search0
Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications0
Show:102550
← PrevPage 5 of 11Next →

No leaderboard results yet.