| Constitutional AI: Harmlessness from AI Feedback | Dec 15, 2022 | Decision Making | Code Available | 4 | 5 |
| AgentBench: Evaluating LLMs as Agents | Aug 7, 2023 | Decision Making, Instruction Following | Code Available | 4 | 5 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision Making, Language Modeling | Code Available | 4 | 5 |
| Cognitive Architectures for Language Agents | Sep 5, 2023 | Decision Making | Code Available | 4 | 5 |
| Mastering Diverse Domains through World Models | Jan 10, 2023 | Atari Games 100k, Decision Making | Code Available | 4 | 5 |
| pgmpy: A Python Toolkit for Bayesian Networks | Apr 17, 2023 | Causal Discovery, Causal Identification | Code Available | 4 | 5 |
| Behavior Generation with Latent Actions | Mar 5, 2024 | Autonomous Driving, Decision Making | Code Available | 3 | 5 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Apr 9, 2025 | 2k, Decision Making | Code Available | 3 | 5 |
| Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models | Nov 29, 2024 | Decision Making, RAG | Code Available | 3 | 5 |
| ACEGEN: Reinforcement learning of generative chemical agents for drug discovery | May 7, 2024 | Benchmarking, Decision Making | Code Available | 3 | 5 |