SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 101150 of 514 papers

TitleStatusHype
Evolutionary Reinforcement Learning via Cooperative Coevolution0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
Deep Exploration via Randomized Value Functions0
DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network0
Exploratory Diffusion Model for Unsupervised Reinforcement Learning0
BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale0
Design of Convolutional Extreme Learning Machines for Vision-Based Navigation Around Small Bodies0
Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning0
Differentially Evolving Memory Ensembles: Pareto Optimization based on Computational Intelligence for Embedded Memories on a System Level0
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models0
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning0
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation0
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving0
Diffusion Models Meet Contextual Bandits with Large Action Spaces0
End-Effect Exploration Drive for Effective Motor Learning0
Directed Exploration for Reinforcement Learning0
Directed Exploration in PAC Model-Free Reinforcement Learning0
DISCO-10M: A Large-Scale Music Dataset0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration0
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance0
Discovering Context Specific Causal Relationships0
BooVI: Provably Efficient Bootstrapped Value Iteration0
Distilling Realizable Students from Unrealizable Teachers0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning0
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning0
Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis0
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning0
Bandit Algorithms for Tree Search0
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning0
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling0
Reinforcement Learning for Causal Discovery without Acyclicity Constraints0
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization0
Embodied Agents for Efficient Exploration and Smart Scene Description0
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation0
Bag of Policies for Distributional Deep Exploration0
Bridging Text and Crystal Structures: Literature-driven Contrastive Learning for Materials Science0
A Human Mixed Strategy Approach to Deep Reinforcement Learning0
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model0
A Web-scale system for scientific knowledge exploration0
Efficient Policy Space Response Oracles0
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable0
Context-Dependent Upper-Confidence Bounds for Directed Exploration0
Active Model Estimation in Markov Decision Processes0
Constrained Hybrid Metaheuristic Algorithm for Probabilistic Neural Networks Learning0
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation0
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization0
Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation0
Show:102550
← PrevPage 3 of 11Next →

No leaderboard results yet.