Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks Jun 26, 2024 Autonomous Vehicles Decision Making
— Unverified 0Preference Elicitation for Offline Reinforcement Learning Jun 26, 2024 Offline RL reinforcement-learning
— Unverified 0GenRL: Multimodal-foundation world models for generalization in embodied agents Jun 26, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 2Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control Jun 26, 2024 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Combining Automated Optimisation of Hyperparameters and Reward Shape Jun 26, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback Jun 25, 2024 Reinforcement Learning (RL) Sentence
Code Code Available 0Leveraging Reinforcement Learning in Red Teaming for Advanced Ransomware Attack Simulations Jun 25, 2024 Red Teaming Reinforcement Learning (RL)
— Unverified 0Privacy Preserving Reinforcement Learning for Population Processes Jun 25, 2024 Privacy Preserving reinforcement-learning
— Unverified 0The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game Jun 25, 2024 Decision Making Imitation Learning
— Unverified 0Human-Object Interaction from Human-Level Instructions Jun 25, 2024 Common Sense Reasoning Human-Object Interaction Detection
— Unverified 0EXTRACT: Efficient Policy Learning by Extracting Transferable Robot Skills from Offline Data Jun 25, 2024 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems Jun 24, 2024 Autonomous Vehicles Reinforcement Learning (RL)
— Unverified 0Decentralized RL-Based Data Transmission Scheme for Energy Efficient Harvesting Jun 24, 2024 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning via Auxiliary Task Distillation Jun 24, 2024 Object Rearrangement reinforcement-learning
Code Code Available 0KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning Jun 24, 2024 Hierarchical Reinforcement Learning Knowledge Graphs
Code Code Available 0Confidence Aware Inverse Constrained Reinforcement Learning Jun 24, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Uncertainty-Aware Reward-Free Exploration with General Function Approximation Jun 24, 2024 Reinforcement Learning (RL)
Code Code Available 0Memory-Enhanced Neural Solvers for Efficient Adaptation in Combinatorial Optimization Jun 24, 2024 Combinatorial Optimization Reinforcement Learning (RL)
Code Code Available 1OCALM: Object-Centric Assessment with Language Models Jun 24, 2024 Object Reinforcement Learning (RL)
— Unverified 0Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary Jun 23, 2024 Card Games Reinforcement Learning (RL)
Code Code Available 0Diffusion Spectral Representation for Reinforcement Learning Jun 23, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Multistep Criticality Search and Power Shaping in Microreactors with Reinforcement Learning Jun 22, 2024 energy management Reinforcement Learning (RL)
— Unverified 0Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models Jun 22, 2024 Reinforcement Learning (RL) SMAC
Code Code Available 0SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning Jun 21, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertainty Jun 21, 2024 Computational Efficiency Mamba
— Unverified 0Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning Jun 21, 2024 Reinforcement Learning (RL)
— Unverified 0Direct Multi-Turn Preference Optimization for Language Agents Jun 21, 2024 Reinforcement Learning (RL)
Code Code Available 2MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading Jun 20, 2024 Algorithmic Trading Decision Making
Code Code Available 2Beyond Optimism: Exploration With Partially Observable Rewards Jun 20, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 0Revealing the learning process in reinforcement learning agents through attention-oriented metrics Jun 20, 2024 Reinforcement Learning (RL)
— Unverified 0Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing Jun 20, 2024 Autonomous Driving Data Augmentation
— Unverified 0Resource Optimization for Tail-Based Control in Wireless Networked Control Systems Jun 20, 2024 GPR Prediction
— Unverified 0Optimizing Novelty of Top-k Recommendations using Large Language Models and Reinforcement Learning Jun 20, 2024 Product Recommendation Reinforcement Learning (RL)
— Unverified 0RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold Jun 20, 2024 Math Reinforcement Learning (RL)
Code Code Available 1Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization Jun 20, 2024 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Equivariant Offline Reinforcement Learning Jun 20, 2024 Offline RL Q-Learning
— Unverified 0Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue Jun 20, 2024 Dialogue State Tracking Reinforcement Learning (RL)
— Unverified 0Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond Jun 19, 2024 Decision Making reinforcement-learning
— Unverified 0Optimizing Wireless Discontinuous Reception via MAC Signaling Learning Jun 19, 2024 Reinforcement Learning (RL)
— Unverified 0Oralytics Reinforcement Learning Algorithm Jun 19, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Imagining In-distribution States: How Predictable Robot Behavior Can Enable User Control Over Learned Policies Jun 19, 2024 Reinforcement Learning (RL)
Code Code Available 0Physics-informed Imitative Reinforcement Learning for Real-world Driving Jun 18, 2024 Autonomous Driving Imitation Learning
— Unverified 0Adaptive Safe Reinforcement Learning-Enabled Optimization of Battery Fast-Charging Protocols Jun 18, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0Quantum Compiling with Reinforcement Learning on a Superconducting Processor Jun 18, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering Minimal Reinforcement Learning Environments Jun 18, 2024 continuous-control Continuous Control
Code Code Available 1Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning Jun 18, 2024 Autonomous Navigation Reinforcement Learning (RL)
— Unverified 0Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback Jun 18, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0A Systematization of the Wagner Framework: Graph Theory Conjectures and Reinforcement Learning Jun 18, 2024 Reinforcement Learning (RL) Systematic Generalization
Code Code Available 0More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling Jun 18, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Run Time Assured Reinforcement Learning for Six Degree-of-Freedom Spacecraft Inspection Jun 17, 2024 Reinforcement Learning (RL)
— Unverified 0