| Agentic Knowledgeable Self-awareness | Apr 4, 2025 | Decision Making | CodeCode Available | 2 |
| MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Mar 26, 2025 | Decision MakingRAG | CodeCode Available | 2 |
| CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Mar 12, 2025 | Decision MakingVision-Language-Action | CodeCode Available | 2 |
| V-Max: A Reinforcement Learning Framework for Autonomous Driving | Mar 11, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Mar 11, 2025 | Decision MakingInteractive Segmentation | CodeCode Available | 2 |
| SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Mar 11, 2025 | Decision MakingInteractive Segmentation | CodeCode Available | 2 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action GenerationDecision Making | CodeCode Available | 2 |
| Digital Player: Evaluating Large Language Models based Human-like Agent in Games | Feb 28, 2025 | Decision Making | CodeCode Available | 2 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision MakingDiagnostic | CodeCode Available | 2 |
| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| On the Guidance of Flow Matching | Feb 4, 2025 | Decision MakingImage Generation | CodeCode Available | 2 |
| OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Jan 14, 2025 | Code Generationcounterfactual | CodeCode Available | 2 |
| LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Jan 14, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission Generation | Jan 9, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Mechanistic understanding and validation of large AI models with SemanticLens | Jan 9, 2025 | Decision Making | CodeCode Available | 2 |
| PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Jan 6, 2025 | Decision Making | CodeCode Available | 2 |
| LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models | Jan 5, 2025 | Decision MakingRAG | CodeCode Available | 2 |
| GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Dec 13, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning | Dec 12, 2024 | Decision Making | CodeCode Available | 2 |
| Doe-1: Closed-Loop Autonomous Driving with Large World Model | Dec 12, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| GPD-1: Generative Pre-training for Driving | Dec 11, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Nov 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Natural Language Reinforcement Learning | Nov 21, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 2 |
| Disentangling Memory and Reasoning Ability in Large Language Models | Nov 20, 2024 | Decision MakingRetrieval | CodeCode Available | 2 |