| A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-Making | Oct 31, 2024 | Decision MakingDiagnostic | CodeCode Available | 3 |
| Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Oct 9, 2024 | BenchmarkingDecision Making | CodeCode Available | 3 |
| Sentiment Reasoning for Healthcare | Jul 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 |
| Reinforcement Learning Meets Visual Odometry | Jul 22, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 3 |
| ACEGEN: Reinforcement learning of generative chemical agents for drug discovery | May 7, 2024 | BenchmarkingDecision Making | CodeCode Available | 3 |
| Evolve Cost-aware Acquisition Functions Using Large Language Models | Apr 25, 2024 | Bayesian OptimizationDecision Making | CodeCode Available | 3 |
| MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making | Apr 22, 2024 | Decision MakingMedical Diagnosis | CodeCode Available | 3 |
| Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Apr 9, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making | Apr 6, 2024 | Decision Making | CodeCode Available | 3 |
| Behavior Generation with Latent Actions | Mar 5, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 3 |
| Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Feb 21, 2024 | Decision MakingDecoder | CodeCode Available | 3 |
| UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction | Feb 19, 2024 | Decision MakingManagement | CodeCode Available | 3 |
| SPO: Sequential Monte Carlo Policy Optimisation | Feb 12, 2024 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 3 |
| V-IRL: Grounding Virtual Intelligence in Real Life | Feb 5, 2024 | Decision Making | CodeCode Available | 3 |
| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action GenerationDecision Making | CodeCode Available | 3 |
| Evaluating Language Model Agency through Negotiations | Jan 9, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning | Jun 5, 2023 | Benchmarking | CodeCode Available | 3 |
| Hierarchical Prompting Assists Large Language Model on Web Navigation | May 23, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Planning with Diffusion for Flexible Behavior Synthesis | May 20, 2022 | Decision MakingDenoising | CodeCode Available | 3 |
| Attention is not not Explanation | Aug 13, 2019 | Decision MakingDiagnostic | CodeCode Available | 3 |
| NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments | Jun 30, 2025 | Decision MakingVision and Language Navigation | CodeCode Available | 2 |
| CausalPFN: Amortized Causal Effect Estimation via In-Context Learning | Jun 9, 2025 | Decision MakingHeterogeneous Treatment Effect Estimation | CodeCode Available | 2 |
| Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | May 26, 2025 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 2 |
| Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey | Apr 29, 2025 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Apr 15, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| Agentic Knowledgeable Self-awareness | Apr 4, 2025 | Decision Making | CodeCode Available | 2 |
| MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Mar 26, 2025 | Decision MakingRAG | CodeCode Available | 2 |
| CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Mar 12, 2025 | Decision MakingVision-Language-Action | CodeCode Available | 2 |
| V-Max: A Reinforcement Learning Framework for Autonomous Driving | Mar 11, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Mar 11, 2025 | Decision MakingInteractive Segmentation | CodeCode Available | 2 |
| SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Mar 11, 2025 | Decision MakingInteractive Segmentation | CodeCode Available | 2 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action GenerationDecision Making | CodeCode Available | 2 |
| Digital Player: Evaluating Large Language Models based Human-like Agent in Games | Feb 28, 2025 | Decision Making | CodeCode Available | 2 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision MakingDiagnostic | CodeCode Available | 2 |
| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| On the Guidance of Flow Matching | Feb 4, 2025 | Decision MakingImage Generation | CodeCode Available | 2 |
| OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Jan 14, 2025 | Code Generationcounterfactual | CodeCode Available | 2 |
| LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Jan 14, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission Generation | Jan 9, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Mechanistic understanding and validation of large AI models with SemanticLens | Jan 9, 2025 | Decision Making | CodeCode Available | 2 |
| PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Jan 6, 2025 | Decision Making | CodeCode Available | 2 |
| LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models | Jan 5, 2025 | Decision MakingRAG | CodeCode Available | 2 |
| GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Dec 13, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning | Dec 12, 2024 | Decision Making | CodeCode Available | 2 |
| Doe-1: Closed-Loop Autonomous Driving with Large World Model | Dec 12, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| GPD-1: Generative Pre-training for Driving | Dec 11, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Nov 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Natural Language Reinforcement Learning | Nov 21, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 2 |
| Disentangling Memory and Reasoning Ability in Large Language Models | Nov 20, 2024 | Decision MakingRetrieval | CodeCode Available | 2 |