Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models Dec 18, 2024 HumanEval Imitation Learning
— Unverified 0Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference Dec 18, 2024 Reinforcement Learning (RL)
Code Code Available 1Harvesting energy from turbulent winds with Reinforcement Learning Dec 18, 2024 Model Predictive Control reinforcement-learning
— Unverified 0CLIP-RLDrive: Human-Aligned Autonomous Driving via CLIP-Based Reward Shaping in Reinforcement Learning Dec 17, 2024 Autonomous Driving Autonomous Vehicles
— Unverified 0Guiding Generative Protein Language Models with Reinforcement Learning Dec 17, 2024 Diversity reinforcement-learning
Code Code Available 2Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning Dec 17, 2024 Form reinforcement-learning
Code Code Available 0Learning Visuotactile Estimation and Control for Non-prehensile Manipulation under Occlusions Dec 17, 2024 Reinforcement Learning (RL)
— Unverified 0ParMod: A Parallel and Modular Framework for Learning Non-Markovian Tasks Dec 17, 2024 NMT Reinforcement Learning (RL)
— Unverified 0Multi-Task Reinforcement Learning for Quadrotors Dec 17, 2024 Autonomous Racing reinforcement-learning
— Unverified 0Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency Dec 17, 2024 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Using machine learning to inform harvest control rule design in complex fishery settings Dec 16, 2024 Bayesian Optimization Management
Code Code Available 0Equivariant Action Sampling for Reinforcement Learning and Planning Dec 16, 2024 continuous-control Continuous Control
— Unverified 0MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning Dec 16, 2024 Data Augmentation Reinforcement Learning (RL)
— Unverified 0Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents Dec 16, 2024 Autonomous Driving Language Modeling
— Unverified 0Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation Dec 16, 2024 Deep Reinforcement Learning GPU
— Unverified 0MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization Dec 16, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement Dec 16, 2024 Reinforcement Learning (RL)
Code Code Available 1Entropy-Regularized Process Reward Model Dec 15, 2024 GSM8K Math
Code Code Available 1Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning Dec 15, 2024 Decision Making Large Language Model
Code Code Available 1Are Expressive Models Truly Necessary for Offline RL? Dec 15, 2024 D4RL Offline RL
Code Code Available 1Adaptive Reward Design for Reinforcement Learning Dec 14, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Continuous-time optimal investment with portfolio constraints: a reinforcement learning approach Dec 14, 2024 Reinforcement Learning (RL)
— Unverified 0Automated Driving with Evolution Capability: A Reinforcement Learning Method with Monotonic Performance Enhancement Dec 14, 2024 Decision Making reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Scalable Multiagent Spacecraft Inspection Dec 13, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Reward Machine Inference for Robotic Manipulation Dec 13, 2024 Reinforcement Learning (RL)
— Unverified 0Physics Instrument Design with Reinforcement Learning Dec 13, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0PickLLM: Context-Aware RL-Assisted Large Language Model Routing Dec 12, 2024 Language Modeling Language Modelling
— Unverified 0From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning Dec 12, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer Dec 12, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Quantum-Train-Based Distributed Multi-Agent Reinforcement Learning Dec 12, 2024 Distributed Computing Multi-agent Reinforcement Learning
— Unverified 0Radiology Report Generation via Multi-objective Preference Optimization Dec 12, 2024 Multi-Objective Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning Dec 11, 2024 Autonomous Driving Offline RL
Code Code Available 0Coarse-to-Fine: A Dual-Phase Channel-Adaptive Method for Wireless Image Transmission Dec 11, 2024 Reinforcement Learning (RL)
— Unverified 0SINERGYM -- A virtual testbed for building energy optimization with Reinforcement Learning Dec 11, 2024 continuous-control Continuous Control
Code Code Available 3Ask1: Development and Reinforcement Learning-Based Control of a Custom Quadruped Robot Dec 11, 2024 Reinforcement Learning (RL)
— Unverified 0Preference Adaptive and Sequential Text-to-Image Generation Dec 10, 2024 Image Generation Language Modeling
— Unverified 0Optimizing Sensor Redundancy in Sequential Decision-Making Problems Dec 10, 2024 Decision Making OpenAI Gym
— Unverified 0Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control Dec 10, 2024 motion retargeting Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer Dec 10, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data Dec 10, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 2Progressive-Resolution Policy Distillation: Leveraging Coarse-Resolution Simulations for Time-Efficient Fine-Resolution Policy Learning Dec 10, 2024 Reinforcement Learning (RL)
— Unverified 0Swarm Behavior Cloning Dec 10, 2024 Decision Making Imitation Learning
— Unverified 0ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks Dec 9, 2024 GPU Imitation Learning
Code Code Available 2Skill-Enhanced Reinforcement Learning Acceleration from Demonstrations Dec 9, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone Dec 9, 2024 global-optimization Imitation Learning
— Unverified 0Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Dec 9, 2024 Reinforcement Learning (RL)
— Unverified 0Mean--Variance Portfolio Selection by Continuous-Time Reinforcement Learning: Algorithms, Regret Analysis, and Empirical Study Dec 8, 2024 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application Dec 8, 2024 Management Reinforcement Learning (RL)
— Unverified 0M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model Dec 7, 2024 D4RL model
Code Code Available 1Learning Soft Driving Constraints from Vectorized Scene Embeddings while Imitating Expert Trajectories Dec 7, 2024 Imitation Learning Motion Planning
— Unverified 0