| AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents | Jul 5, 2024 | Decision MakingMulti-hop Question Answering | CodeCode Available | 2 |
| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities | Sep 30, 2024 | Decision Making | CodeCode Available | 2 |
| Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning | Dec 12, 2024 | Decision Making | CodeCode Available | 2 |
| iVideoGPT: Interactive VideoGPTs are Scalable World Models | May 24, 2024 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 2 |
| Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Feb 15, 2024 | AllDecision Making | CodeCode Available | 2 |
| A Review of Safe Reinforcement Learning: Methods, Theory and Applications | May 20, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design | Nov 23, 2023 | Decision MakingLanguage Modelling | CodeCode Available | 2 |
| Large AI Models in Health Informatics: Applications, Challenges, and the Future | Mar 21, 2023 | Decision MakingDrug Discovery | CodeCode Available | 2 |
| FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Jul 9, 2024 | Decision Making | CodeCode Available | 2 |
| Fairness Evaluation for Uplift Modeling in the Absence of Ground Truth | Feb 12, 2024 | counterfactualDecision Making | CodeCode Available | 2 |
| Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | May 31, 2023 | Decision MakingGeneral Knowledge | CodeCode Available | 2 |
| Global birdsong embeddings enable superior transfer learning for bioacoustic classification | Jul 12, 2023 | Audio ClassificationDecision Making | CodeCode Available | 2 |
| GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Dec 13, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Oct 21, 2024 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 2 |
| LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks | May 23, 2024 | Decision Making | CodeCode Available | 2 |
| LVBench: An Extreme Long Video Understanding Benchmark | Jun 12, 2024 | Decision MakingVideo Understanding | CodeCode Available | 2 |
| Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Mar 18, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 2 |
| Embodied LLM Agents Learn to Cooperate in Organized Teams | Mar 19, 2024 | Decision MakingHuman Agent Collaboration | CodeCode Available | 2 |
| MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation | Oct 5, 2023 | BenchmarkingDecision Making | CodeCode Available | 2 |
| Mechanistic understanding and validation of large AI models with SemanticLens | Jan 9, 2025 | Decision Making | CodeCode Available | 2 |
| Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Apr 15, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| Dungeons and Data: A Large-Scale NetHack Dataset | Nov 1, 2022 | Decision MakingNetHack | CodeCode Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 |
| Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| ExpeL: LLM Agents Are Experiential Learners | Aug 20, 2023 | Decision MakingTransfer Learning | CodeCode Available | 2 |
| Do As I Can, Not As I Say: Grounding Language in Robotic Affordances | Apr 4, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | May 26, 2025 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 2 |
| Doe-1: Closed-Loop Autonomous Driving with Large World Model | Dec 12, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Disentangling Memory and Reasoning Ability in Large Language Models | Nov 20, 2024 | Decision MakingRetrieval | CodeCode Available | 2 |
| Aligning Superhuman AI with Human Behavior: Chess as a Model System | Jun 2, 2020 | Decision Making | CodeCode Available | 2 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine LearningClassification | CodeCode Available | 2 |
| DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Nov 18, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models | Apr 13, 2023 | Decision MakingMath | CodeCode Available | 2 |
| Digital Player: Evaluating Large Language Models based Human-like Agent in Games | Feb 28, 2025 | Decision Making | CodeCode Available | 2 |
| Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Dec 16, 2020 | Combinatorial OptimizationDecision Making | CodeCode Available | 2 |
| DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Feb 28, 2024 | Contrastive LearningDecision Making | CodeCode Available | 2 |
| Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Feb 21, 2024 | Decision MakingImitation Learning | CodeCode Available | 2 |
| Diffusion Actor-Critic with Entropy Regulator | May 24, 2024 | Decision MakingMuJoCo | CodeCode Available | 2 |
| DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models | Sep 28, 2023 | 10-shot image generation1 Image, 2*2 Stitchi | CodeCode Available | 2 |
| Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Oct 3, 2023 | Action GenerationAutonomous Driving | CodeCode Available | 2 |
| Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | May 24, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Nov 6, 2024 | Decision MakingDiversity | CodeCode Available | 2 |
| Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Apr 25, 2024 | Decision MakingSpecificity | CodeCode Available | 2 |
| Context is Key: A Benchmark for Forecasting with Essential Textual Information | Oct 24, 2024 | Decision MakingTime Series | CodeCode Available | 2 |
| Agentic Knowledgeable Self-awareness | Apr 4, 2025 | Decision Making | CodeCode Available | 2 |
| Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR | Nov 1, 2017 | counterfactualDecision Making | CodeCode Available | 2 |
| ADAPT: Action-aware Driving Caption Transformer | Feb 1, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Mar 12, 2025 | Decision MakingVision-Language-Action | CodeCode Available | 2 |