| Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback | Jan 17, 2024 | Decision MakingLearning-To-Rank | —Unverified | 0 |
| Graph Q-Learning for Combinatorial Optimization | Jan 11, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Interactions between dynamic team composition and coordination: An agent-based modeling approach | Jan 11, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision MakingDiversity | —Unverified | 0 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach | Jan 4, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Jan 3, 2024 | Decision MakingHeuristic Search | CodeCode Available | 0 |
| Harnessing the Power of Federated Learning in Federated Contextual Bandits | Dec 26, 2023 | Decision MakingFederated Learning | CodeCode Available | 0 |
| Solving Long-run Average Reward Robust MDPs via Stochastic Games | Dec 21, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing | Dec 21, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Parameterized Projected Bellman Operator | Dec 20, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Robust Active Measuring under Model Uncertainty | Dec 18, 2023 | Decision Makingmodel | CodeCode Available | 0 |
| Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Dec 18, 2023 | Decision MakingKnowledge Graphs | CodeCode Available | 0 |
| Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints | Dec 16, 2023 | Decision MakingFairness | —Unverified | 0 |
| Risk-Aware Continuous Control with Neural Contextual Bandits | Dec 15, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A Customizable Generator for Comic-Style Visual Narrative | Dec 14, 2023 | ARCDecision Making | —Unverified | 0 |
| Learning adaptive planning representations with natural language guidance | Dec 13, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| LLF-Bench: Benchmark for Interactive Learning from Language Feedback | Dec 11, 2023 | Information RetrievalOpenAI Gym | CodeCode Available | 1 |
| A Review of Cooperation in Multi-agent Learning | Dec 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making | Dec 8, 2023 | Decision MakingFairness | CodeCode Available | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision MakingDistributed Optimization | —Unverified | 0 |
| Generalization to New Sequential Decision Making Tasks with In-Context Learning | Dec 6, 2023 | Decision MakingDiversity | —Unverified | 0 |
| Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym | Dec 6, 2023 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games | Dec 4, 2023 | Atari GamesDecision Making | —Unverified | 0 |