| PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC | Feb 20, 2025 | Decision Making | CodeCode Available | 9 | 5 |
| FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | May 23, 2024 | AI AgentDecision Making | CodeCode Available | 9 | 5 |
| Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research | Nov 7, 2024 | AI AgentDecision Making | CodeCode Available | 9 | 5 |
| Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion | Jul 1, 2024 | Decision MakingPrediction | CodeCode Available | 9 | 5 |
| RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning | Apr 24, 2025 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 7 | 5 |
| Better than classical? The subtle art of benchmarking quantum machine learning models | Mar 11, 2024 | BenchmarkingBinary Classification | CodeCode Available | 7 | 5 |
| Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models | Oct 6, 2023 | Decision MakingRetrieval | CodeCode Available | 6 | 5 |
| Large Language Model based Multi-Agents: A Survey of Progress and Challenges | Jan 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 5 | 5 |
| GraphCast: Learning skillful medium-range global weather forecasting | Dec 24, 2022 | Decision MakingWeather Forecasting | CodeCode Available | 5 | 5 |
| LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models | Apr 10, 2024 | Decision Making | CodeCode Available | 5 | 5 |
| Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Aug 19, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 5 | 5 |
| Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B | Jun 11, 2024 | Decision MakingGSM8K | CodeCode Available | 5 | 5 |
| Maia-2: A Unified Model for Human-AI Alignment in Chess | Sep 30, 2024 | Decision Making | CodeCode Available | 5 | 5 |
| Tree of Thoughts: Deliberate Problem Solving with Large Language Models | May 17, 2023 | Arithmetic ReasoningDecision Making | CodeCode Available | 5 | 5 |
| Neural Fields in Robotics: A Survey | Oct 26, 2024 | 3D ReconstructionAutonomous Driving | CodeCode Available | 5 | 5 |
| Deep Lake: a Lakehouse for Deep Learning | Sep 22, 2022 | Decision MakingDeep Learning | CodeCode Available | 5 | 5 |
| Differentiable Tree Search Network | Jan 22, 2024 | Decision MakingInductive Bias | CodeCode Available | 5 | 5 |
| TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools | Mar 14, 2025 | AI AgentDecision Making | CodeCode Available | 5 | 5 |
| GenCast: Diffusion-based ensemble forecasting for medium-range weather | Dec 25, 2023 | Decision MakingWeather Forecasting | CodeCode Available | 5 | 5 |
| Mastering Diverse Domains through World Models | Jan 10, 2023 | Atari Games 100kDecision Making | CodeCode Available | 4 | 5 |
| AgentBench: Evaluating LLMs as Agents | Aug 7, 2023 | Decision MakingInstruction Following | CodeCode Available | 4 | 5 |
| Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | May 6, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 4 | 5 |
| A Survey on Large Language Model-Based Game Agents | Apr 2, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 | 5 |
| TorchRL: A data-driven decision-making library for PyTorch | Jun 1, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 4 | 5 |
| Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web | Aug 26, 2024 | Decision MakingMulti-class Classification | CodeCode Available | 4 | 5 |
| ReAct: Synergizing Reasoning and Acting in Language Models | Oct 6, 2022 | Decision MakingFact Verification | CodeCode Available | 4 | 5 |
| AutoWebGLM: A Large Language Model-based Web Navigating Agent | Apr 4, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 | 5 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 4 | 5 |
| Reflexion: Language Agents with Verbal Reinforcement Learning | Mar 20, 2023 | Decision MakingHumanEval | CodeCode Available | 4 | 5 |
| OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | May 2, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 4 | 5 |
| OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model | Mar 30, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 4 | 5 |
| Constitutional AI: Harmlessness from AI Feedback | Dec 15, 2022 | Decision Making | CodeCode Available | 4 | 5 |
| Eureka: Human-Level Reward Design via Coding Large Language Models | Oct 19, 2023 | Decision MakingIn-Context Learning | CodeCode Available | 4 | 5 |
| Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents | Aug 13, 2024 | Decision Making | CodeCode Available | 4 | 5 |
| Cognitive Architectures for Language Agents | Sep 5, 2023 | Decision Making | CodeCode Available | 4 | 5 |
| pgmpy: A Python Toolkit for Bayesian Networks | Apr 17, 2023 | Causal DiscoveryCausal Identification | CodeCode Available | 4 | 5 |
| LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning | Jun 5, 2023 | Benchmarking | CodeCode Available | 3 | 5 |
| MineStudio: A Streamlined Package for Minecraft AI Agent Development | Dec 24, 2024 | AI AgentDecision Making | CodeCode Available | 3 | 5 |
| Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB | Apr 1, 2025 | Decision MakingRAG | CodeCode Available | 3 | 5 |
| Sentiment Reasoning for Healthcare | Jul 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 | 5 |
| Hierarchical Prompting Assists Large Language Model on Web Navigation | May 23, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 3 | 5 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Apr 9, 2025 | 2kDecision Making | CodeCode Available | 3 | 5 |
| Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Feb 21, 2024 | Decision MakingDecoder | CodeCode Available | 3 | 5 |
| Game-theoretic LLM: Agent Workflow for Negotiation Games | Nov 8, 2024 | Decision Making | CodeCode Available | 3 | 5 |
| Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making | Apr 6, 2024 | Decision Making | CodeCode Available | 3 | 5 |
| ACEGEN: Reinforcement learning of generative chemical agents for drug discovery | May 7, 2024 | BenchmarkingDecision Making | CodeCode Available | 3 | 5 |
| Evaluating Language Model Agency through Negotiations | Jan 9, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 3 | 5 |
| A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-Making | Oct 31, 2024 | Decision MakingDiagnostic | CodeCode Available | 3 | 5 |
| Automated Hypothesis Validation with Agentic Sequential Falsifications | Feb 14, 2025 | Decision MakingHallucination | CodeCode Available | 3 | 5 |
| Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models | Nov 29, 2024 | Decision MakingRAG | CodeCode Available | 3 | 5 |