| AutoWebGLM: A Large Language Model-based Web Navigating Agent | Apr 4, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| A Survey on Large Language Model-Based Game Agents | Apr 2, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| Eureka: Human-Level Reward Design via Coding Large Language Models | Oct 19, 2023 | Decision MakingIn-Context Learning | CodeCode Available | 4 |
| Cognitive Architectures for Language Agents | Sep 5, 2023 | Decision Making | CodeCode Available | 4 |
| AgentBench: Evaluating LLMs as Agents | Aug 7, 2023 | Decision MakingInstruction Following | CodeCode Available | 4 |
| TorchRL: A data-driven decision-making library for PyTorch | Jun 1, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 4 |
| pgmpy: A Python Toolkit for Bayesian Networks | Apr 17, 2023 | Causal DiscoveryCausal Identification | CodeCode Available | 4 |
| Reflexion: Language Agents with Verbal Reinforcement Learning | Mar 20, 2023 | Decision MakingHumanEval | CodeCode Available | 4 |
| Mastering Diverse Domains through World Models | Jan 10, 2023 | Atari Games 100kDecision Making | CodeCode Available | 4 |
| Constitutional AI: Harmlessness from AI Feedback | Dec 15, 2022 | Decision Making | CodeCode Available | 4 |
| ReAct: Synergizing Reasoning and Acting in Language Models | Oct 6, 2022 | Decision MakingFact Verification | CodeCode Available | 4 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Apr 9, 2025 | 2kDecision Making | CodeCode Available | 3 |
| Playing Non-Embedded Card-Based Games with Reinforcement Learning | Apr 7, 2025 | Board GamesDecision Making | CodeCode Available | 3 |
| Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB | Apr 1, 2025 | Decision MakingRAG | CodeCode Available | 3 |
| Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective | Mar 24, 2025 | Decision Making | CodeCode Available | 3 |
| A Survey on the Optimization of Large Language Model-based Agents | Mar 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Mar 5, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Automated Hypothesis Validation with Agentic Sequential Falsifications | Feb 14, 2025 | Decision MakingHallucination | CodeCode Available | 3 |
| Rethinking Early Stopping: Refine, Then Calibrate | Jan 31, 2025 | Decision Making | CodeCode Available | 3 |
| MineStudio: A Streamlined Package for Minecraft AI Agent Development | Dec 24, 2024 | AI AgentDecision Making | CodeCode Available | 3 |
| Embodied CoT Distillation From LLM To Off-the-shelf Agents | Dec 16, 2024 | Decision MakingIn-Context Learning | CodeCode Available | 3 |
| AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games | Dec 14, 2024 | Decision Making | CodeCode Available | 3 |
| Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models | Nov 29, 2024 | Decision MakingRAG | CodeCode Available | 3 |
| Game-theoretic LLM: Agent Workflow for Negotiation Games | Nov 8, 2024 | Decision Making | CodeCode Available | 3 |