| Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective | Mar 24, 2025 | Decision Making | CodeCode Available | 3 |
| A Survey on the Optimization of Large Language Model-based Agents | Mar 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Mar 5, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Automated Hypothesis Validation with Agentic Sequential Falsifications | Feb 14, 2025 | Decision MakingHallucination | CodeCode Available | 3 |
| Rethinking Early Stopping: Refine, Then Calibrate | Jan 31, 2025 | Decision Making | CodeCode Available | 3 |
| MineStudio: A Streamlined Package for Minecraft AI Agent Development | Dec 24, 2024 | AI AgentDecision Making | CodeCode Available | 3 |
| Embodied CoT Distillation From LLM To Off-the-shelf Agents | Dec 16, 2024 | Decision MakingIn-Context Learning | CodeCode Available | 3 |
| AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games | Dec 14, 2024 | Decision Making | CodeCode Available | 3 |
| Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models | Nov 29, 2024 | Decision MakingRAG | CodeCode Available | 3 |
| Game-theoretic LLM: Agent Workflow for Negotiation Games | Nov 8, 2024 | Decision Making | CodeCode Available | 3 |