| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision Making, Language Modeling | Code Available | 4 |
| Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web | Aug 26, 2024 | Decision Making, Multi-class Classification | Code Available | 4 |
| Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents | Aug 13, 2024 | Decision Making | Code Available | 4 |
| Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | May 6, 2024 | Autonomous Driving, Decision Making | Code Available | 4 |
| OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | May 2, 2024 | Autonomous Driving, counterfactual | Code Available | 4 |
| AutoWebGLM: A Large Language Model-based Web Navigating Agent | Apr 4, 2024 | Decision Making, Language Modeling | Code Available | 4 |
| A Survey on Large Language Model-Based Game Agents | Apr 2, 2024 | Decision Making, Language Modeling | Code Available | 4 |
| Eureka: Human-Level Reward Design via Coding Large Language Models | Oct 19, 2023 | Decision Making, In-Context Learning | Code Available | 4 |
| Cognitive Architectures for Language Agents | Sep 5, 2023 | Decision Making | Code Available | 4 |
| AgentBench: Evaluating LLMs as Agents | Aug 7, 2023 | Decision Making, Instruction Following | Code Available | 4 |