Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning Nov 12, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0 Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization Nov 12, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0 Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning Nov 12, 2024 Imitation Learning Offline RL
— Unverified 0 Robust Offline Reinforcement Learning for Non-Markovian Decision Processes Nov 12, 2024 Dataset Distillation reinforcement-learning
— Unverified 0 QuadWBG: Generalizable Quadrupedal Whole-Body Grasping Nov 11, 2024 Reinforcement Learning (RL) Transparent objects
— Unverified 0 Reinforcement learning for Quantum Tiq-Taq-Toe Nov 10, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0 CROPS: A Deployable Crop Management System Over All Possible State Availabilities Nov 9, 2024 All Management
— Unverified 0 Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings Nov 8, 2024 Decoder Machine Translation
— Unverified 0 Emergent Cooperative Strategies for Multi-Agent Shepherding via Reinforcement Learning Nov 8, 2024 Reinforcement Learning (RL)
— Unverified 0 Improving Multi-Domain Task-Oriented Dialogue System with Offline Reinforcement Learning Nov 8, 2024 Language Modeling Language Modelling
— Unverified 0 Plasticity Loss in Deep Reinforcement Learning: A Survey Nov 7, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping Nov 7, 2024 Deep Reinforcement Learning Motion Planning
— Unverified 0 Sharp Analysis for KL-Regularized Contextual Bandits and RLHF Nov 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0 A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model Nov 7, 2024 Language Modeling Language Modelling
— Unverified 0 Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games Nov 7, 2024 Meta-Learning Reinforcement Learning (RL)
Code Code Available 0 Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations Nov 7, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0 Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning Nov 7, 2024 Multi-agent Reinforcement Learning Q-Learning
Code Code Available 0 Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity Nov 7, 2024 Diversity Meta Reinforcement Learning
Code Code Available 0 Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning Nov 7, 2024 Offline RL Policy Gradient Methods
— Unverified 0 Opportunities of Reinforcement Learning in South Africa's Just Transition Nov 6, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0 Approximate Equivariance in Reinforcement Learning Nov 6, 2024 continuous-control Continuous Control
— Unverified 0 A Comparative Study of Deep Reinforcement Learning for Crop Production Management Nov 6, 2024 Deep Reinforcement Learning Management
— Unverified 0 Interpretable and Efficient Data-driven Discovery and Control of Distributed Systems Nov 6, 2024 Dimensionality Reduction Reinforcement Learning (RL)
— Unverified 0 Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data Nov 6, 2024 Reinforcement Learning (RL) Transfer Reinforcement Learning
Code Code Available 0 Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC Nov 6, 2024 Computational Efficiency Deep Reinforcement Learning
Code Code Available 1 An Open-source Sim2Real Approach for Sensor-independent Robot Navigation in a Grid Nov 5, 2024 Autonomous Navigation Reinforcement Learning (RL)
Code Code Available 0 Pre-trained Visual Dynamics Representations for Efficient Policy Learning Nov 5, 2024 Reinforcement Learning (RL) Video Prediction
— Unverified 0 Embedding Safety into RL: A New Take on Trust Region Methods Nov 5, 2024 Reinforcement Learning (RL)
— Unverified 0 When to Localize? A Risk-Constrained Reinforcement Learning Approach Nov 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0 Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation Nov 5, 2024 Fault Detection In-Context Learning
— Unverified 0 N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs Nov 4, 2024 In-Context Learning Reinforcement Learning (RL)
— Unverified 0 Risk-sensitive control as inference with Rényi divergence Nov 4, 2024 Reinforcement Learning (RL) Variational Inference
Code Code Available 0 Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis Nov 4, 2024 Reinforcement Learning (RL)
— Unverified 0 Simulation of Nanorobots with Artificial Intelligence and Reinforcement Learning for Advanced Cancer Cell Detection and Tracking Nov 4, 2024 Cell Detection Navigate
Code Code Available 0 So You Think You Can Scale Up Autonomous Robot Data Collection? Nov 4, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0 Diversity Progress for Goal Selection in Discriminability-Motivated RL Nov 3, 2024 Diversity Reinforcement Learning (RL)
— Unverified 0 GITSR: Graph Interaction Transformer-based Scene Representation for Multi Vehicle Collaborative Decision-making Nov 3, 2024 Decision Making Graph Neural Network
— Unverified 0 Hedging and Pricing Structured Products Featuring Multiple Underlying Assets Nov 2, 2024 Distributional Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0 Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization Nov 2, 2024 Reinforcement Learning (RL)
— Unverified 0 StepCountJITAI: simulation environment for RL with application to physical activity adaptive intervention Nov 1, 2024 Reinforcement Learning (RL)
Code Code Available 0 A Review of Reinforcement Learning in Financial Applications Nov 1, 2024 Benchmarking Decision Making
— Unverified 0 Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation Nov 1, 2024 Knowledge Distillation Reinforcement Learning (RL)
— Unverified 0 AI-based traffic analysis in digital twin networks Nov 1, 2024 Fairness Federated Learning
— Unverified 0 Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayes Theory Nov 1, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0 Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions Nov 1, 2024 Bayesian Inference Offline RL
Code Code Available 0 Effective ML Model Versioning in Edge Networks Nov 1, 2024 model reinforcement-learning
— Unverified 0 EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization Oct 31, 2024 Bayesian Optimization Decision Making
— Unverified 0 Scalable Reinforcement Post-Training Beyond Static Human Prompts: Evolving Alignment via Asymmetric Self-Play Oct 31, 2024 Reinforcement Learning (RL)
— Unverified 0 Maximum Entropy Hindsight Experience Replay Oct 31, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0 Deterministic Exploration via Stationary Bellman Error Maximization Oct 31, 2024 Reinforcement Learning (RL)
— Unverified 0