AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement Learning Jan 24, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning Jan 24, 2023 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0SMART: Self-supervised Multi-task pretrAining with contRol Transformers Jan 24, 2023 Decision Making Imitation Learning
— Unverified 0Story Shaping: Teaching Agents Human-like Behavior with Stories Jan 24, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Model Based Reinforcement Learning with Non-Gaussian Environment Dynamics and its Application to Portfolio Optimization Jan 23, 2023 Algorithmic Trading Decision Making
— Unverified 0Forecaster-aided User Association and Load Balancing in Multi-band Mobile Networks Jan 23, 2023 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Learning to View: Decision Transformers for Active Object Detection Jan 23, 2023 Active Object Detection Motion Planning
— Unverified 0The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning Jan 21, 2023 Lifelong learning reinforcement-learning
Code Code Available 0Quasi-optimal Reinforcement Learning with Continuous Actions Jan 21, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement learning-based estimation for partial differential equations Jan 20, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna Tuning Jan 20, 2023 Graph Neural Network Multi-agent Reinforcement Learning
— Unverified 0Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning Jan 20, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Multi-Armed Bandits and Quantum Channel Oracles Jan 20, 2023 Multi-Armed Bandits reinforcement-learning
— Unverified 0Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning Jan 20, 2023 continuous-control Continuous Control
— Unverified 0Generative Slate Recommendation with Reinforcement Learning Jan 20, 2023 Recommendation Systems reinforcement-learning
— Unverified 0Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets Jan 20, 2023 Deep Reinforcement Learning Management
— Unverified 0Domain-adapted Learning and Imitation: DRL for Power Arbitrage Jan 19, 2023 Imitation Learning reinforcement-learning
— Unverified 0Advanced Scaling Methods for VNF deployment with Reinforcement Learning Jan 19, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Generalization through Diversity: Improving Unsupervised Environment Design Jan 19, 2023 Decision Making Diversity
— Unverified 0Domain-adapted Learning and Interpretability: DRL for Gas Trading Jan 19, 2023 Deep Reinforcement Learning Ensemble Learning
— Unverified 0A Survey of Meta-Reinforcement Learning Jan 19, 2023 Deep Reinforcement Learning Meta Reinforcement Learning
— Unverified 0Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient Jan 19, 2023 Decision Making reinforcement-learning
— Unverified 0Multi-compartment Neuron and Population Encoding Powered Spiking Neural Network for Deep Distributional Reinforcement Learning Jan 18, 2023 Atari Games Distributional Reinforcement Learning
— Unverified 0PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav Jan 18, 2023 Imitation Learning Navigate
Code Code Available 1Human-Timescale Adaptation in an Open-Ended Task Space Jan 18, 2023 In-Context Learning Meta Reinforcement Learning
— Unverified 0A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles Jan 17, 2023 Autonomous Vehicles Reinforcement Learning (RL)
Code Code Available 1DQNAS: Neural Architecture Search using Reinforcement Learning Jan 17, 2023 Face Recognition Neural Architecture Search
— Unverified 0Learning to solve arithmetic problems with a virtual abacus Jan 17, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Heterogeneous Multi-Robot Reinforcement Learning Jan 17, 2023 Graph Neural Network Multi-agent Reinforcement Learning
Code Code Available 2Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness Jan 17, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Sim-Anchored Learning for On-the-Fly Adaptation Jan 17, 2023 Reinforcement Learning (RL)
Code Code Available 0Show me what you want: Inverse reinforcement learning to automatically design robot swarms by demonstration Jan 17, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Neuro-Symbolic World Models for Adapting to Open World Novelty Jan 16, 2023 Decision Making reinforcement-learning
— Unverified 0Neuro-symbolic Meta Reinforcement Learning for Trading Jan 15, 2023 Decision Making Meta Reinforcement Learning
— Unverified 0CogReact: A Reinforced Framework to Model Human Cognitive Reaction Modulated by Dynamic Intervention Jan 15, 2023 Deep Reinforcement Learning Logical Reasoning
— Unverified 0Reinforcement Learning for Protocol Synthesis in Resource-Constrained Wireless Sensor and IoT Networks Jan 14, 2023 Fairness reinforcement-learning
— Unverified 0PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets Jan 14, 2023 Management Mixture-of-Experts
— Unverified 0Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures Jan 14, 2023 Q-Learning reinforcement-learning
— Unverified 0Deep-Reinforcement-Learning-based Path Planning for Industrial Robots using Distance Sensors as Observation Jan 14, 2023 Deep Reinforcement Learning Industrial Robots
Code Code Available 1First Three Years of the International Verification of Neural Networks Competition (VNN-COMP) Jan 14, 2023 image-classification Image Classification
— Unverified 0Decentralized model-free reinforcement learning in stochastic games with average-reward objective Jan 13, 2023 Q-Learning reinforcement-learning
— Unverified 0Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity Jan 13, 2023 Q-Learning reinforcement-learning
— Unverified 0A Constrained-Optimization Approach to the Execution of Prioritized Stacks of Learned Multi-Robot Tasks Jan 13, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Multi-Target Landmark Detection with Incomplete Images via Reinforcement Learning and Shape Prior Jan 13, 2023 Medical Image Analysis Reinforcement Learning (RL)
— Unverified 0Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning Jan 13, 2023 Decision Making reinforcement-learning
— Unverified 0Mutation Testing of Deep Reinforcement Learning Based on Real Faults Jan 13, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Safe Policy Improvement for POMDPs via Finite-State Controllers Jan 12, 2023 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning-based Joint Handover and Beam Tracking in Millimeter-wave Networks Jan 12, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Predictive World Models from Real-World Partial Observations Jan 12, 2023 Continual Learning Open-Ended Question Answering
Code Code Available 0Asynchronous training of quantum reinforcement learning Jan 12, 2023 Decision Making Quantum Machine Learning
— Unverified 0