| EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models | Dec 10, 2024 | | CodeCode Available | 5 |
| Online Iterative Reinforcement Learning from Human Feedback with General Preference Model | Feb 11, 2024 | | CodeCode Available | 5 |
| Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions | Jan 7, 2024 | BenchmarkingImage Segmentation | CodeCode Available | 5 |
| aeon: a Python toolkit for learning from time series | Jun 20, 2024 | Anomaly DetectionModel Selection | CodeCode Available | 5 |
| Controllable Generation with Text-to-Image Diffusion Models: A Survey | Mar 7, 2024 | Denoising | CodeCode Available | 5 |
| Datasets for Large Language Models: A Comprehensive Survey | Feb 28, 2024 | Language ModellingLarge Language Model | CodeCode Available | 5 |
| Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Jan 16, 2024 | 3D ReconstructionFace Generation | CodeCode Available | 5 |
| Make Your LLM Fully Utilize the Context | Apr 25, 2024 | 4kInformation Retrieval | CodeCode Available | 5 |
| Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training | May 23, 2023 | Contrastive LearningSelf-Supervised Learning | CodeCode Available | 5 |
| Unified Training of Universal Time Series Forecasting Transformers | Feb 4, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 5 |
| InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework | Apr 16, 2025 | Image Generation | CodeCode Available | 5 |
| TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis | Oct 21, 2024 | Anomaly DetectionImputation | CodeCode Available | 5 |
| Learning Flow Fields in Attention for Controllable Person Image Generation | Dec 11, 2024 | AttributeImage Generation | CodeCode Available | 5 |
| MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Apr 13, 2024 | DiversityLanguage Modeling | CodeCode Available | 5 |
| OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens | Mar 2, 2026 | | —Unverified | 4 |
| Unified Personalized Reward Model for Vision Generation | Feb 10, 2026 | | —Unverified | 4 |
| Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills | Mar 9, 2026 | | —Unverified | 4 |
| Reinforcement Learning via Self-Distillation | Feb 16, 2026 | | —Unverified | 4 |
| SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks | Mar 13, 2026 | | —Unverified | 4 |
| ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models | Feb 18, 2026 | | —Unverified | 4 |
| Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery | Mar 18, 2026 | | —Unverified | 4 |
| QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining | Feb 6, 2026 | | —Unverified | 4 |
| VideoWorld 2: Learning Transferable Knowledge from Real-world Videos | Feb 10, 2026 | | —Unverified | 4 |
| R-Zero: Self-Evolving Reasoning LLM from Zero Data | Feb 13, 2026 | | —Unverified | 4 |
| ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs | Jan 24, 2026 | | —Unverified | 4 |
| Precise Object and Effect Removal with Adaptive Target-Aware Attention | Mar 16, 2026 | | —Unverified | 4 |
| MOSS-TTS Technical Report | Mar 18, 2026 | | —Unverified | 4 |
| SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations | Feb 2, 2026 | | —Unverified | 4 |
| MotionStream: Real-Time Video Generation with Interactive Motion Controls | Mar 5, 2026 | | —Unverified | 4 |
| On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models | Feb 3, 2026 | | —Unverified | 4 |
| Closing the Loop: Universal Repository Representation with RPG-Encoder | Feb 3, 2026 | | —Unverified | 4 |
| MOVA: Towards Scalable and Synchronized Video-Audio Generation | Feb 10, 2026 | | —Unverified | 4 |
| Cautious Weight Decay | Feb 24, 2026 | | —Unverified | 4 |
| Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs | Jan 22, 2026 | | —Unverified | 4 |
| Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching | Mar 17, 2026 | | —Unverified | 4 |
| SkillNet: Create, Evaluate, and Connect AI Skills | Feb 26, 2026 | | —Unverified | 4 |
| TTT3R: 3D Reconstruction as Test-Time Training | Mar 3, 2026 | | —Unverified | 4 |
| Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation | Feb 6, 2026 | | —Unverified | 4 |
| SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds | Jan 22, 2026 | | —Unverified | 4 |
| UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers | Mar 1, 2026 | | —Unverified | 4 |
| Utonia: Toward One Encoder for All Point Clouds | Mar 3, 2026 | | —Unverified | 4 |
| Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models | Jan 29, 2026 | | —Unverified | 4 |
| On the Theoretical Limitations of Embedding-Based Retrieval | Mar 12, 2026 | | —Unverified | 4 |
| MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator | Mar 16, 2026 | | —Unverified | 4 |
| Hyperagents | Mar 19, 2026 | | —Unverified | 4 |
| Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations | Feb 28, 2026 | | —Unverified | 4 |
| Masked Depth Modeling for Spatial Perception | Jan 25, 2026 | | —Unverified | 4 |
| AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research | Feb 6, 2026 | | —Unverified | 4 |
| Learning to Discover at Test Time | Feb 5, 2026 | | —Unverified | 4 |