| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 4 | 5 |
| ChatHaruhi: Reviving Anime Character in Reality via Large Language Model | Aug 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Jun 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| Safurai 001: New Qualitative Approach for Code LLM Evaluation | Sep 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Oct 2, 2024 | Arithmetic ReasoningLarge Language Model | CodeCode Available | 4 | 5 |
| Multimodal Table Understanding | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Apr 1, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 3 | 5 |
| Multi-agent Architecture Search via Agentic Supernet | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Apr 9, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 3 | 5 |
| Evaluation Report on MCP Servers | Apr 15, 2025 | Large Language Model | CodeCode Available | 3 | 5 |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 | 5 |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Apr 8, 2024 | Image GenerationImage-to-Image Translation | CodeCode Available | 3 | 5 |
| DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Mar 11, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 | 5 |
| Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Feb 8, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 | 5 |
| Odyssey: Empowering Minecraft Agents with Open-World Skills | Jul 22, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 | 5 |
| MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Feb 6, 2025 | DiagnosticLarge Language Model | CodeCode Available | 3 | 5 |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Oct 14, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 3 | 5 |
| M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Mar 31, 2024 | Image-text RetrievalLanguage Modeling | CodeCode Available | 3 | 5 |
| LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Aug 10, 2024 | GPULanguage Modelling | CodeCode Available | 3 | 5 |
| APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts | Jun 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Feb 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | May 29, 2025 | Large Language Modelscientific discovery | CodeCode Available | 3 | 5 |
| Detecting hallucinations in large language models using semantic entropy | Jun 19, 2024 | Large Language ModelQuestion Answering | CodeCode Available | 3 | 5 |