SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The central challenge is balancing exploitation of current value estimates against gathering information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
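The trade-off described above can be illustrated with a small bandit example. The following is a minimal sketch, not taken from the cited paper: it runs the standard UCB1 rule on a hypothetical three-armed Bernoulli bandit. The function name `ucb1`, the arm means, and the step count are assumptions chosen for illustration.

```python
import math
import random


def ucb1(arm_means, steps, seed=0):
    """Run UCB1 on a Bernoulli bandit and return the pull count per arm.

    arm_means are hypothetical success probabilities, hidden from the agent.
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms    # how often each arm has been pulled
    totals = [0.0] * n_arms  # summed reward collected from each arm

    for t in range(1, steps + 1):
        if t <= n_arms:
            # Pull every arm once so each empirical mean is defined.
            arm = t - 1
        else:
            # Exploitation term: empirical mean reward of the arm.
            # Exploration term: bonus that shrinks as the arm is pulled more.
            arm = max(
                range(n_arms),
                key=lambda a: totals[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward
    return counts


counts = ucb1([0.2, 0.5, 0.8], steps=2000)
print(counts)
```

Arms that look poor still receive occasional pulls because their bonus grows with log(t), while most pulls should concentrate on the arm with the highest estimated reward.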

Papers

Showing 201–250 of 514 papers

Title | Status | Hype
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution | - | 0
Successor-Predecessor Intrinsic Exploration | - | 0
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models | - | 0
Joint Falsification and Fidelity Settings Optimization for Validation of Safety-Critical Systems: A Theoretical Analysis | - | 0
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization | - | 0
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning | Code | 0
Fast exploration and learning of latent graphs with aliased observations | - | 0
Exploration of the search space of Gaussian graphical models for paired data | - | 0
Policy Mirror Descent Inherently Explores Action Space | - | 0
Exploration via Epistemic Value Estimation | - | 0
Guarded Policy Optimization with Imperfect Online Demonstrations | - | 0
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization | - | 0
Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation | - | 0
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret | - | 0
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot | Code | 2
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization | - | 0
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning | - | 0
Computational Discovery of Microstructured Composites with Optimal Stiffness-Toughness Trade-Offs | - | 0
GFlowNets for AI-Driven Scientific Discovery | - | 0
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation | - | 0
Embodied Agents for Efficient Exploration and Smart Scene Description | - | 0
Exploration in Model-based Reinforcement Learning with Randomized Reward | - | 0
Strangeness-driven Exploration in Multi-Agent Reinforcement Learning | Code | 0
SHIRO: Soft Hierarchical Reinforcement Learning | - | 0
Generative Colorization of Structured Mobile Web Pages | Code | 1
Reinforcement Learning in Credit Scoring and Underwriting | - | 0
Efficient Exploration in Resource-Restricted Reinforcement Learning | - | 0
Learn to Explore: on Bootstrapping Interactive Data Exploration with Meta-learning | - | 0
CURO: Curriculum Learning for Relative Overgeneralization | - | 0
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression | - | 0
Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions | - | 0
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control | - | 0
Efficient Exploration using Model-Based Quality-Diversity with Gradients | - | 0
Active Exploration based on Information Gain by Particle Filter for Efficient Spatial Concept Formation | - | 0
Exploring through Random Curiosity with General Value Functions | Code | 0
Planning to the Information Horizon of BAMDPs via Epistemic State Abstraction | - | 0
Design of Convolutional Extreme Learning Machines for Vision-Based Navigation Around Small Bodies | - | 0
GeoThermalCloud: Machine Learning for Geothermal Resource Exploration | Code | 1
Deep Active Ensemble Sampling For Image Classification | - | 0
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward | Code | 0
The Role of Coverage in Online Reinforcement Learning | - | 0
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps | Code | 1
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning | - | 0
Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning | - | 0
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning | Code | 0
SC-Explorer: Incremental 3D Scene Completion for Safe and Efficient Exploration Mapping and Planning | Code | 1
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations | Code | 0
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks | - | 0
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks | - | 0
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning | - | 0

No leaderboard results yet.