| DeLF: Designing Learning Environments with Foundation Models | Jan 17, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Interactions between dynamic team composition and coordination: An agent-based modeling approach | Jan 11, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Graph Q-Learning for Combinatorial Optimization | Jan 11, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision MakingDiversity | —Unverified | 0 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach | Jan 4, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Jan 3, 2024 | Decision MakingHeuristic Search | CodeCode Available | 0 |
| Harnessing the Power of Federated Learning in Federated Contextual Bandits | Dec 26, 2023 | Decision MakingFederated Learning | CodeCode Available | 0 |
| Solving Long-run Average Reward Robust MDPs via Stochastic Games | Dec 21, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing | Dec 21, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Parameterized Projected Bellman Operator | Dec 20, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Robust Active Measuring under Model Uncertainty | Dec 18, 2023 | Decision Makingmodel | CodeCode Available | 0 |
| Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Dec 18, 2023 | Decision MakingKnowledge Graphs | CodeCode Available | 0 |
| Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints | Dec 16, 2023 | Decision MakingFairness | —Unverified | 0 |
| Risk-Aware Continuous Control with Neural Contextual Bandits | Dec 15, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A Customizable Generator for Comic-Style Visual Narrative | Dec 14, 2023 | ARCDecision Making | —Unverified | 0 |
| Learning adaptive planning representations with natural language guidance | Dec 13, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| LLF-Bench: Benchmark for Interactive Learning from Language Feedback | Dec 11, 2023 | Information RetrievalOpenAI Gym | CodeCode Available | 1 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| A Review of Cooperation in Multi-agent Learning | Dec 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making | Dec 8, 2023 | Decision MakingFairness | CodeCode Available | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision MakingDistributed Optimization | —Unverified | 0 |
| Generalization to New Sequential Decision Making Tasks with In-Context Learning | Dec 6, 2023 | Decision MakingDiversity | —Unverified | 0 |
| Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym | Dec 6, 2023 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games | Dec 4, 2023 | Atari GamesDecision Making | —Unverified | 0 |
| Learning Curricula in Open-Ended Worlds | Dec 3, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities | Nov 30, 2023 | Decision MakingDrug Discovery | —Unverified | 0 |
| TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration | Nov 26, 2023 | Decision MakingDecoder | CodeCode Available | 0 |
| History Filtering in Imperfect Information Games: Algorithms and Complexity | Nov 24, 2023 | Card GamesDecision Making | —Unverified | 0 |
| Learning Dynamic Selection and Pricing of Out-of-Home Deliveries | Nov 23, 2023 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Nov 22, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Code Models are Zero-shot Precondition Reasoners | Nov 16, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Likelihood Ratio Confidence Sets for Sequential Decision Making | Nov 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search | Nov 6, 2023 | Decision MakingGraph Generation | —Unverified | 0 |
| Safe Sequential Optimization for Switching Environments | Nov 3, 2023 | Bayesian OptimizationChange Point Detection | —Unverified | 0 |
| Using General Value Functions to Learn Domain-Backed Inventory Management Policies | Nov 3, 2023 | Decision MakingManagement | —Unverified | 0 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision MakingOffline RL | CodeCode Available | 1 |
| Regret-Minimization Algorithms for Multi-Agent Cooperative Learning Systems | Oct 30, 2023 | Cloud ComputingDecision Making | —Unverified | 0 |
| High-Dimensional Prediction for Sequential Decision Making | Oct 26, 2023 | Combinatorial OptimizationConformal Prediction | —Unverified | 0 |
| Robust Visual Imitation Learning with Inverse Dynamics Representations | Oct 22, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models | Oct 22, 2023 | Decision MakingIn-Context Learning | —Unverified | 0 |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Oct 20, 2023 | Decision MakingMulti-Task Learning | —Unverified | 0 |
| Eureka: Human-Level Reward Design via Coding Large Language Models | Oct 19, 2023 | Decision MakingIn-Context Learning | CodeCode Available | 4 |
| Auction-Based Scheduling | Oct 18, 2023 | Decision MakingFairness | —Unverified | 0 |
| Partially Observable Stochastic Games with Neural Perception Mechanisms | Oct 17, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |