| Retrieval-Augmented Generation for AI-Generated Content: A Survey | Feb 29, 2024 | Information Retrieval, Large Language Model | Code Available | 5 |
| Datasets for Large Language Models: A Comprehensive Survey | Feb 28, 2024 | Language Modelling, Large Language Model | Code Available | 5 |
| MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Feb 23, 2024 | Language Modeling, Language Modelling | Code Available | 5 |
| MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Feb 1, 2024 | Embodied Question Answering, Language Modeling | Code Available | 5 |
| Executable Code Actions Elicit Better LLM Agents | Feb 1, 2024 | Language Modelling, Large Language Model | Code Available | 5 |
| Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs | Jan 22, 2024 | Diffusion Personalization Tuning Free, Image Generation | Code Available | 5 |
| Large Language Model based Multi-Agents: A Survey of Progress and Challenges | Jan 21, 2024 | Decision Making, Language Modeling | Code Available | 5 |
| Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding | Jan 15, 2024 | Language Modeling, Language Modelling | Code Available | 5 |
| DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Jan 11, 2024 | Language Modelling, Large Language Model | Code Available | 5 |
| Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects | Jan 7, 2024 | Language Modeling, Language Modelling | Code Available | 5 |
| StarVector: Generating Scalable Vector Graphics Code from Images and Text | Dec 17, 2023 | Code Generation, Language Modeling | Code Available | 5 |
| PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Dec 16, 2023 | CPU, GPU | Code Available | 5 |
| Weakly Supervised Detection of Hallucinations in LLM Activations | Dec 5, 2023 | Hallucination, Language Modeling | Code Available | 5 |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Oct 11, 2023 | Hallucination, Language Modeling | Code Available | 5 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | Oct 11, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| The Rise and Potential of Large Language Model Based Agents: A Survey | Sep 14, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Jun 28, 2023 | Hallucination, Knowledge Graphs | Code Available | 5 |
| FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Mar 13, 2023 | CPU, GPU | Code Available | 5 |
| Seed-Coder: Let the Code Model Curate Data for Itself | Jun 4, 2025 | Code Completion, Code Generation | Code Available | 4 |
| ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding | Jun 2, 2025 | 3D Generation, Large Language Model | Code Available | 4 |
| A Survey of LLM × DATA | May 24, 2025 | Large Language Model, Management | Code Available | 4 |
| lmgame-Bench: How Good are LLMs at Playing Games? | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 4 |
| VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model | May 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 4 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision Making, Language Modeling | Code Available | 4 |
| R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning | Feb 24, 2025 | Language Modeling, Language Modelling | Code Available | 4 |