| Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking | Mar 14, 2025 | AllLarge Language Model | CodeCode Available | 13 |
| Autonomous Agents for Collaborative Task under Information Asymmetry | Jun 21, 2024 | Language ModellingLarge Language Model | CodeCode Available | 13 |
| Zep: A Temporal Knowledge Graph Architecture for Agent Memory | Jan 20, 2025 | Large Language ModelRAG | CodeCode Available | 12 |
| HybridFlow: A Flexible and Efficient RLHF Framework | Sep 28, 2024 | Large Language Model | CodeCode Available | 11 |
| JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation | Nov 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 11 |
| CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | May 23, 2025 | Automatic Speech RecognitionEmotion Recognition | CodeCode Available | 11 |
| DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence | Jan 25, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 11 |
| CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens | Jul 7, 2024 | Language ModellingLarge Language Model | CodeCode Available | 11 |
| Scaling Synthetic Data Creation with 1,000,000,000 Personas | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 11 |
| OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | May 29, 2025 | Large Language Model | CodeCode Available | 11 |
| IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System | Feb 8, 2025 | DecoderLanguage Modeling | CodeCode Available | 11 |
| CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion | May 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Apr 3, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 9 |
| SkyReels-V2: Infinite-length Film Generative Model | Apr 17, 2025 | Large Language Modelmodel | CodeCode Available | 9 |
| FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | May 23, 2024 | AI AgentDecision Making | CodeCode Available | 9 |
| LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model | Jun 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Mar 26, 2024 | GPUGSM8K | CodeCode Available | 9 |
| PowerInfer-2: Fast Large Language Model Inference on a Smartphone | Jun 10, 2024 | CPULanguage Modeling | CodeCode Available | 9 |
| Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Apr 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Jul 2, 2024 | GPULanguage Modelling | CodeCode Available | 9 |
| AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents | Feb 9, 2025 | Large Language ModelRAG | CodeCode Available | 9 |
| MiniCPM4: Ultra-Efficient LLMs on End Devices | Jun 9, 2025 | Large Language Model | CodeCode Available | 9 |
| Moshi: a speech-text foundation model for real-time dialogue | Sep 17, 2024 | Action DetectionActivity Detection | CodeCode Available | 9 |
| Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition | Jul 17, 2023 | DecoderLanguage Modeling | CodeCode Available | 8 |
| Elixir: Train a Large Language Model on a Small GPU Cluster | Dec 10, 2022 | CPUGPU | CodeCode Available | 7 |