| CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Mar 25, 2022 | Code Generation, HumanEval | Code Available | 6 | 5 |
| Efficient Memory Management for Large Language Model Serving with PagedAttention | Sep 12, 2023 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| Gorilla: Large Language Model Connected with Massive APIs | May 24, 2023 | Hallucination, Language Modeling | Code Available | 6 | 5 |
| CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Mar 31, 2023 | Instruction Following, Language Modeling | Code Available | 6 | 5 |
| FinGPT: Open-Source Financial Large Language Models | Jun 9, 2023 | Algorithmic Trading, Language Modeling | Code Available | 6 | 5 |
| Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Aug 23, 2023 | Image Generation, Image to text | Code Available | 6 | 5 |
| LAB: Large-Scale Alignment for ChatBots | Mar 2, 2024 | Instruction Following, Language Modeling | Code Available | 5 | 5 |
| PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Dec 16, 2023 | CPU, GPU | Code Available | 5 | 5 |
| ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Mar 8, 2024 | Denoising, Image Generation | Code Available | 5 | 5 |
| AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning | Jun 2, 2025 | AI Agent, Diversity | Code Available | 5 | 5 |
| FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Mar 13, 2023 | CPU, GPU | Code Available | 5 | 5 |
| DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Apr 30, 2025 | Automated Theorem Proving, Large Language Model | Code Available | 5 | 5 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Feb 25, 2025 | Language Modeling, Language Modelling | Code Available | 5 | 5 |
| DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Jan 11, 2024 | Language Modelling, Large Language Model | Code Available | 5 | 5 |
| GRUtopia: Dream General Robots in a City at Scale | Jul 15, 2024 | Language Modelling, Large Language Model | Code Available | 5 | 5 |
| MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Feb 1, 2024 | Embodied Question Answering, Language Modeling | Code Available | 5 | 5 |
| R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning | Mar 7, 2025 | Emotion Recognition, Language Modeling | Code Available | 5 | 5 |
| Generating Physically Stable and Buildable LEGO Designs from Text | May 8, 2025 | 3D Generation, Large Language Model | Code Available | 5 | 5 |
| InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Feb 28, 2025 | Audio Generation, Form | Code Available | 5 | 5 |
| Datasets for Large Language Models: A Comprehensive Survey | Feb 28, 2024 | Language Modelling, Large Language Model | Code Available | 5 | 5 |
| MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Apr 13, 2024 | Diversity, Language Modeling | Code Available | 5 | 5 |
| FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration | Jan 24, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 5 | 5 |
| MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Feb 23, 2024 | Language Modeling, Language Modelling | Code Available | 5 | 5 |
| Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Jun 28, 2023 | Hallucination, Knowledge Graphs | Code Available | 5 | 5 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPU, Language Modeling | Code Available | 5 | 5 |