| Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Cross-Prediction-Powered Inference | Sep 28, 2023 | Decision MakingMissing Labels | CodeCode Available | 2 |
| DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models | Sep 28, 2023 | 10-shot image generation1 Image, 2*2 Stitchi | CodeCode Available | 2 |
| ExpeL: LLM Agents Are Experiential Learners | Aug 20, 2023 | Decision MakingTransfer Learning | CodeCode Available | 2 |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Aug 17, 2023 | Decision MakingHallucination | CodeCode Available | 2 |
| BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents | Aug 11, 2023 | BenchmarkingDecision Making | CodeCode Available | 2 |
| Cumulative Reasoning with Large Language Models | Aug 8, 2023 | Decision MakingLogical Reasoning | CodeCode Available | 2 |
| Global birdsong embeddings enable superior transfer learning for bioacoustic classification | Jul 12, 2023 | Audio ClassificationDecision Making | CodeCode Available | 2 |
| Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX | Jun 16, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 2 |
| Adversarial attacks and defenses in explainable artificial intelligence: A survey | Jun 6, 2023 | Decision MakingExplainable artificial intelligence | CodeCode Available | 2 |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 2 |
| Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | May 31, 2023 | Decision MakingGeneral Knowledge | CodeCode Available | 2 |
| Training Diffusion Models with Reinforcement Learning | May 22, 2023 | Decision MakingDenoising | CodeCode Available | 2 |
| AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models | Apr 13, 2023 | Decision MakingMath | CodeCode Available | 2 |
| Large AI Models in Health Informatics: Applications, Challenges, and the Future | Mar 21, 2023 | Decision MakingDrug Discovery | CodeCode Available | 2 |
| Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlow | Feb 16, 2023 | Active LearningBayesian Optimization | CodeCode Available | 2 |
| Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning | Feb 6, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 2 |
| ADAPT: Action-aware Driving Caption Transformer | Feb 1, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Towards Reasoning in Large Language Models: A Survey | Dec 20, 2022 | Decision MakingSurvey | CodeCode Available | 2 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Dungeons and Data: A Large-Scale NetHack Dataset | Nov 1, 2022 | Decision MakingNetHack | CodeCode Available | 2 |
| PlanT: Explainable Planning Transformers via Object-Level Representations | Oct 25, 2022 | CARLA longest6Decision Making | CodeCode Available | 2 |
| Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Oct 13, 2022 | Atari GamesDecision Making | CodeCode Available | 2 |
| HierarchicalForecast: A Reference Framework for Hierarchical Forecasting in Python | Jul 7, 2022 | BIG-bench Machine LearningDecision Making | CodeCode Available | 2 |
| WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents | Jul 4, 2022 | Decision MakingImitation Learning | CodeCode Available | 2 |