Designing Rewards for Fast Learning May 30, 2022 Q-Learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning with a Terminator May 30, 2022 Autonomous Driving reinforcement-learning
Code Code Available 0Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning May 30, 2022 Multiple Instance Learning Reinforcement Learning (RL)
Code Code Available 0Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets May 30, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning May 30, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Stock Trading Optimization through Model-based Reinforcement Learning with Resistance Support Relative Strength May 30, 2022 Decision Making Model-based Reinforcement Learning
— Unverified 0Multi-Agent Reinforcement Learning is a Sequence Modeling Problem May 30, 2022 Decision Making MuJoCo
Code Code Available 2SEREN: Knowing When to Explore and When to Exploit May 30, 2022 MuJoCo Reinforcement Learning (RL)
— Unverified 0RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch May 30, 2022 Continuous Control Deep Reinforcement Learning
Code Code Available 1Learning Open Domain Multi-hop Search Using Reinforcement Learning May 30, 2022 Information Retrieval Reading Comprehension
— Unverified 0Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning May 30, 2022 Data Poisoning Deep Reinforcement Learning
Code Code Available 0GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization May 30, 2022 Computational Efficiency Marketing
— Unverified 0Learning Security Strategies through Game Play and Optimal Stopping May 29, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Provable Benefits of Representational Transfer in Reinforcement Learning May 29, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1On the Robustness of Safe Reinforcement Learning under Observational Perturbations May 29, 2022 Adversarial Attack reinforcement-learning
Code Code Available 1Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning May 29, 2022 Continuous Control Deep Reinforcement Learning
— Unverified 0Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories May 28, 2022 Imitation Learning reinforcement-learning
Code Code Available 1Task-Agnostic Continual Reinforcement Learning: Gaining Insights and Overcoming Challenges May 28, 2022 Continual Learning Continuous Control
Code Code Available 1Survival Analysis on Structured Data using Deep Reinforcement Learning May 28, 2022 Deep Learning Deep Reinforcement Learning
— Unverified 0Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning May 28, 2022 Continuous Control Model-based Reinforcement Learning
— Unverified 0Tutorial on Course-of-Action (COA) Attack Search Methods in Computer Networks May 27, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Off-Beat Multi-Agent Reinforcement Learning May 27, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Non-Markovian policies occupancy measures May 27, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters May 27, 2022 D4RL Offline RL
— Unverified 0Provably Sample-Efficient RL with Side Information about Latent Dynamics May 27, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration May 27, 2022 Efficient Exploration graph partitioning
Code Code Available 1GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis May 27, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive Radios Resource Allocation May 27, 2022 Deep Reinforcement Learning Q-Learning
— Unverified 0IGLU 2022: Interactive Grounded Language Understanding in a Collaborative Environment at NeurIPS 2022 May 27, 2022 Natural Language Understanding Reinforcement Learning (RL)
Code Code Available 0Double Deep Q Networks for Sensor Management in Space Situational Awareness May 27, 2022 Management reinforcement-learning
— Unverified 0KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal May 27, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0FedFormer: Contextual Federation with Attention in Reinforcement Learning May 27, 2022 Federated Learning reinforcement-learning
Code Code Available 1Feudal Multi-Agent Reinforcement Learning with Adaptive Network Partition for Traffic Signal Control May 27, 2022 Graph Neural Network Multi-agent Reinforcement Learning
— Unverified 0Does DQN Learn? May 26, 2022 Q-Learning reinforcement-learning
— Unverified 0DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning May 26, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Dynamic Network Reconfiguration for Entropy Maximization using Deep Reinforcement Learning May 26, 2022 Deep Reinforcement Learning Navigate
Code Code Available 0Reinforcement Learning Approach for Mapping Applications to Dataflow-Based Coarse-Grained Reconfigurable Array May 26, 2022 Graph Attention Graph Neural Network
Code Code Available 0Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes May 26, 2022 Causal Inference Offline RL
— Unverified 0Physics-Guided Hierarchical Reward Mechanism for Learning-Based Robotic Grasping May 26, 2022 Computational Efficiency Deep Reinforcement Learning
— Unverified 0SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning May 26, 2022 continuous-control Continuous Control
— Unverified 0Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification May 26, 2022 Classification Domain Adaptation
Code Code Available 0RACE: A Reinforcement Learning Framework for Improved Adaptive Control of NoC Channel Buffers May 26, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency May 26, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A Fair Federated Learning Framework With Reinforcement Learning May 26, 2022 Fairness Federated Learning
— Unverified 0Constrained Reinforcement Learning for Short Video Recommendation May 26, 2022 Recommendation Systems reinforcement-learning
— Unverified 0Scalable Multi-Agent Model-Based Reinforcement Learning May 25, 2022 Mamba model
Code Code Available 1Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function May 25, 2022 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Multimodal Knowledge Alignment with Reinforcement Learning May 25, 2022 Audio captioning Language Modeling
Code Code Available 1Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments May 25, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning May 25, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 2