The split Gibbs sampler revisited: improvements to its algorithmic structure and augmented target distribution Jun 28, 2022 Data Augmentation Deblurring
Code Code Available 0Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation Jun 22, 2022 Efficient Exploration Object
— Unverified 0A Langevin-like Sampler for Discrete Distributions Jun 20, 2022 Efficient Exploration Text Generation
Code Code Available 1Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback Jun 13, 2022 Computational Efficiency Efficient Exploration
— Unverified 0On Preemption and Learning in Stochastic Scheduling May 31, 2022 Efficient Exploration Scheduling
Code Code Available 0Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems May 31, 2022 Efficient Exploration reinforcement-learning
— Unverified 0Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions May 28, 2022 Arithmetic Reasoning Efficient Exploration
Code Code Available 1Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration May 27, 2022 Efficient Exploration graph partitioning
Code Code Available 1Personalized Algorithmic Recourse with Preference Elicitation May 27, 2022 Efficient Exploration
Code Code Available 0SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning May 26, 2022 continuous-control Continuous Control
— Unverified 0The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure May 20, 2022 Efficient Exploration Policy Gradient Methods
Code Code Available 1Feature and Instance Joint Selection: A Reinforcement Learning Perspective May 12, 2022 Efficient Exploration feature selection
— Unverified 0Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games May 1, 2022 Deep Reinforcement Learning Efficient Exploration
Code Code Available 0On Machine Learning-Driven Surrogates for Sound Transmission Loss Simulations Apr 25, 2022 BIG-bench Machine Learning Decision Making
Code Code Available 0A Variational Approach to Bayesian Phylogenetic Inference Apr 16, 2022 Efficient Exploration Variational Inference
Code Code Available 0Efficient Exploration via First-Person Behavior Cloning Assisted Rapidly-Exploring Random Trees Mar 23, 2022 Efficient Exploration
— Unverified 0TANDEM: Learning Joint Exploration and Decision Making with Tactile Sensors Mar 1, 2022 Decision Making Efficient Exploration
— Unverified 0Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share? Feb 24, 2022 Efficient Exploration Transfer Learning
Code Code Available 0Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation Feb 23, 2022 Efficient Exploration Navigate
Code Code Available 2Learning Causal Overhypotheses through Exploration in Children and Computational Models Feb 21, 2022 Causal Inference Efficient Exploration
— Unverified 0A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search Feb 11, 2022 Atari Games Decision Making
— Unverified 0Online Decision Transformer Feb 11, 2022 D4RL Efficient Exploration
Code Code Available 2Lagrangian Manifold Monte Carlo on Monge Patches Feb 1, 2022 Efficient Exploration
Code Code Available 0Efficient Policy Space Response Oracles Jan 28, 2022 Efficient Exploration
— Unverified 0Learning to Act with Affordance-Aware Multimodal Neural SLAM Jan 24, 2022 Efficient Exploration Test unseen
Code Code Available 0Synthesizing explainable counterfactual policies for algorithmic recourse with program synthesis Jan 18, 2022 counterfactual Efficient Exploration
Code Code Available 0Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand Jan 3, 2022 Efficient Exploration
— Unverified 0JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning Dec 7, 2021 Efficient Exploration Hierarchical Reinforcement Learning
— Unverified 0A Fast and Scalable Polyatomic Frank-Wolfe Algorithm for the LASSO Dec 6, 2021 compressed sensing Efficient Exploration
Code Code Available 0BooVI: Provably Efficient Bootstrapped Value Iteration Dec 1, 2021 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0NovelD: A Simple yet Effective Exploration Criterion Dec 1, 2021 Atari Games Deep Reinforcement Learning
Code Code Available 1HelixMO: Sample-Efficient Molecular Optimization in Scene-Sensitive Latent Space Nov 30, 2021 Drug Design Drug Discovery
— Unverified 0IB-MVS: An Iterative Algorithm for Deep Multi-View Stereo based on Binary Decisions Nov 29, 2021 3D Reconstruction Efficient Exploration
— Unverified 0Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration Nov 22, 2021 Efficient Exploration Multi-agent Reinforcement Learning
Code Code Available 1Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning Nov 18, 2021 Efficient Exploration reinforcement-learning
Code Code Available 0Discovering and Exploiting Sparse Rewards in a Learned Behavior Space Nov 2, 2021 Efficient Exploration
Code Code Available 0Bayesian optimization of distributed neurodynamical controller models for spatial navigation Oct 31, 2021 Bayesian Optimization Efficient Exploration
— Unverified 0Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives Oct 28, 2021 Efficient Exploration reinforcement-learning
— Unverified 0Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization Oct 27, 2021 Efficient Exploration Multi-Armed Bandits
Code Code Available 0Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning Oct 26, 2021 Efficient Exploration Hierarchical Reinforcement Learning
Code Code Available 1Map Induction: Compositional spatial submap learning for efficient exploration in novel environments Oct 23, 2021 Efficient Exploration Program induction
Code Code Available 0Hierarchical Skills for Efficient Exploration Oct 20, 2021 continuous-control Continuous Control
Code Code Available 1More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences Oct 20, 2021 Efficient Exploration Open-Ended Question Answering
— Unverified 0Balancing Value Underestimation and Overestimation with Realistic Actor-Critic Oct 19, 2021 continuous-control Continuous Control
Code Code Available 0Efficient Exploration in Binary and Preferential Bayesian Optimization Oct 18, 2021 Bayesian Optimization Efficient Exploration
— Unverified 0Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization Oct 10, 2021 continuous-control Continuous Control
— Unverified 0Reinforcement Learning in Reward-Mixing MDPs Oct 7, 2021 Efficient Exploration reinforcement-learning
— Unverified 0Divide and Explore: Multi-Agent Separate Exploration with Shared Intrinsic Motivations Sep 29, 2021 Distributed Computing Efficient Exploration
— Unverified 0Learning to Solve Combinatorial Problems via Efficient Exploration Sep 29, 2021 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning Sep 29, 2021 Deep Reinforcement Learning Efficient Exploration
Code Code Available 1