| Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens | Mar 3, 2025 | Attribute, text-to-speech | Code Available | 11 |
| Yi: Open Foundation Models by 01.AI | Mar 7, 2024 | Attribute, Chatbot | Code Available | 9 |
| aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Oct 17, 2024 | Attribute, Code Completion | Code Available | 7 |
| Learning Flow Fields in Attention for Controllable Person Image Generation | Dec 11, 2024 | Attribute, Image Generation | Code Available | 5 |
| OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Dec 10, 2024 | Attribute, Benchmarking | Code Available | 5 |
| IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Oct 9, 2024 | Attribute, Image Generation | Code Available | 5 |
| Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following | Nov 28, 2023 | Attribute, Denoising | Code Available | 5 |
| XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation | Jun 26, 2025 | Attribute, Image Generation | Code Available | 4 |
| Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free | May 10, 2025 | Attribute, Mixture-of-Experts | Code Available | 4 |
| Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement | Nov 10, 2024 | Attribute, Image Generation | Code Available | 4 |