| Parameterized Projected Bellman Operator | Dec 20, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Robust Active Measuring under Model Uncertainty | Dec 18, 2023 | Decision Makingmodel | CodeCode Available | 0 |
| Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Dec 18, 2023 | Decision MakingKnowledge Graphs | CodeCode Available | 0 |
| Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints | Dec 16, 2023 | Decision MakingFairness | —Unverified | 0 |
| Risk-Aware Continuous Control with Neural Contextual Bandits | Dec 15, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A Customizable Generator for Comic-Style Visual Narrative | Dec 14, 2023 | ARCDecision Making | —Unverified | 0 |
| Learning adaptive planning representations with natural language guidance | Dec 13, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| LLF-Bench: Benchmark for Interactive Learning from Language Feedback | Dec 11, 2023 | Information RetrievalOpenAI Gym | CodeCode Available | 1 |
| A Review of Cooperation in Multi-agent Learning | Dec 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |