| PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Jan 6, 2025 | Decision Making | CodeCode Available | 2 |
| LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models | Jan 5, 2025 | Decision MakingRAG | CodeCode Available | 2 |
| GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Dec 13, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning | Dec 12, 2024 | Decision Making | CodeCode Available | 2 |
| Doe-1: Closed-Loop Autonomous Driving with Large World Model | Dec 12, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| GPD-1: Generative Pre-training for Driving | Dec 11, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Nov 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Natural Language Reinforcement Learning | Nov 21, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 2 |
| Disentangling Memory and Reasoning Ability in Large Language Models | Nov 20, 2024 | Decision MakingRetrieval | CodeCode Available | 2 |