| Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data | Apr 3, 2023 | ChatbotLanguage Modeling | CodeCode Available | 4 |
| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Mar 24, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 4 |
| Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference | Mar 8, 2023 | Hyperparameter OptimizationLanguage Modeling | CodeCode Available | 4 |
| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 |
| Fast Transformer Decoding: One Write-Head is All You Need | Nov 6, 2019 | AllLanguage Modelling | CodeCode Available | 4 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Jun 22, 2025 | GPUImage Generation | CodeCode Available | 3 |
| FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Jun 9, 2025 | Large Language Model | CodeCode Available | 3 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 |
| BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | May 29, 2025 | Large Language Modelscientific discovery | CodeCode Available | 3 |
| Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models | May 1, 2025 | Large Language Model | CodeCode Available | 3 |
| Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Apr 15, 2025 | Automated Theorem ProvingLarge Language Model | CodeCode Available | 3 |
| Evaluation Report on MCP Servers | Apr 15, 2025 | Large Language Model | CodeCode Available | 3 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression | Mar 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Survey on the Optimization of Large Language Model-based Agents | Mar 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Mar 13, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 3 |
| Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Mar 5, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction | Feb 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Prompt-to-Leaderboard | Feb 20, 2025 | ChatbotLanguage Modeling | CodeCode Available | 3 |
| Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks | Feb 18, 2025 | graph constructionLarge Language Model | CodeCode Available | 3 |
| Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving | Feb 11, 2025 | Automated Theorem ProvingLarge Language Model | CodeCode Available | 3 |
| Multi-agent Architecture Search via Agentic Supernet | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Feb 6, 2025 | DiagnosticLarge Language Model | CodeCode Available | 3 |