| EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery | Mar 9, 2026 | | —Unverified | 5 |
| OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data | Feb 14, 2026 | | —Unverified | 5 |
| MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE | Feb 4, 2026 | | —Unverified | 5 |
| Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length | Mar 16, 2026 | | —Unverified | 5 |
| InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery | Feb 9, 2026 | | —Unverified | 5 |
| FireRed-Image-Edit-1.0 Technical Report | Feb 12, 2026 | | —Unverified | 5 |
| SAMTok: Representing Any Mask with Two Words | Jan 22, 2026 | | —Unverified | 5 |
| CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning | Feb 5, 2026 | | —Unverified | 5 |
| World Action Models are Zero-shot Policies | Feb 17, 2026 | | —Unverified | 5 |
| Helios: Real Real-Time Long Video Generation Model | Mar 4, 2026 | | —Unverified | 5 |
| Rethinking the Design of Reinforcement Learning-Based Deep Research Agents | Feb 21, 2026 | | —Unverified | 5 |
| Kimi K2.5: Visual Agentic Intelligence | Feb 2, 2026 | | —Unverified | 5 |
| Training Large Language Models to Reason in a Continuous Latent Space | Dec 9, 2024 | Logical Reasoning | CodeCode Available | 5 |
| YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception | Jun 21, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 5 |
| YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications | Sep 7, 2022 | GPUObject Detection | CodeCode Available | 5 |
| FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification | Oct 14, 2024 | Image Generation | CodeCode Available | 5 |
| OminiControl2: Efficient Conditioning for Diffusion Transformers | Mar 11, 2025 | Conditional Image GenerationDenoising | CodeCode Available | 5 |
| Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B | Jun 11, 2024 | Decision MakingGSM8K | CodeCode Available | 5 |
| Semantic Operators: A Declarative Model for Rich, AI-based Data Processing | Jul 16, 2024 | Extreme Multi-Label ClassificationFact Checking | CodeCode Available | 5 |
| OMG-Seg: Is One Model Good Enough For All Segmentation? | Jan 18, 2024 | AllDecoder | CodeCode Available | 5 |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Oct 11, 2023 | HallucinationLanguage Modeling | CodeCode Available | 5 |
| TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting | May 23, 2024 | Future predictionTime Series | CodeCode Available | 5 |
| MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI | Nov 27, 2023 | Complex Query AnsweringLogical Reasoning | CodeCode Available | 5 |
| SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition | May 21, 2025 | | CodeCode Available | 5 |
| Masked Completion via Structured Diffusion with White-Box Transformers | Apr 3, 2024 | Representation Learning | CodeCode Available | 5 |
| Inpaint Anything: Segment Anything Meets Image Inpainting | Apr 13, 2023 | Image Inpainting | CodeCode Available | 5 |
| Extreme Compression of Large Language Models via Additive Quantization | Jan 11, 2024 | CPUGPU | CodeCode Available | 5 |
| Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning | Jul 8, 2024 | | CodeCode Available | 5 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPULanguage Modeling | CodeCode Available | 5 |
| CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling | May 26, 2024 | | CodeCode Available | 5 |
| Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities | May 5, 2025 | Image GenerationSurvey | CodeCode Available | 5 |
| Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation | Apr 16, 2023 | Instruction Following | CodeCode Available | 5 |
| MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search | Dec 24, 2024 | | CodeCode Available | 5 |
| CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models | Jul 21, 2024 | AllFashion Synthesis | CodeCode Available | 5 |
| Arbitrary-steps Image Super-resolution via Diffusion Inversion | Dec 12, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 5 |
| SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks | Apr 15, 2024 | Quantization | CodeCode Available | 5 |
| SymbolicAI: A framework for logic-based approaches combining generative models and solvers | Feb 1, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 5 |
| That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design | Nov 15, 2024 | Deep Reinforcement Learning | CodeCode Available | 5 |
| GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation | Nov 27, 2024 | Depth EstimationDiversity | CodeCode Available | 5 |
| Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction | May 31, 2024 | Speech Synthesis | CodeCode Available | 5 |
| A quantum semantic framework for natural language processing | Jun 11, 2025 | | CodeCode Available | 5 |
| Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers | May 10, 2024 | | CodeCode Available | 5 |
| The Path To Autonomous Cyber Defense | Apr 12, 2024 | | CodeCode Available | 5 |
| CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians | Apr 1, 2024 | 3DGS3D Scene Reconstruction | CodeCode Available | 5 |
| pyvene: A Library for Understanding and Improving PyTorch Models via Interventions | Mar 12, 2024 | Model Editing | CodeCode Available | 5 |
| Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond | Apr 11, 2023 | Text to 3D | CodeCode Available | 5 |
| Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values | Jun 30, 2022 | Additive modelsBIG-bench Machine Learning | CodeCode Available | 5 |
| Magic Clothing: Controllable Garment-Driven Image Synthesis | Apr 15, 2024 | Image Generation | CodeCode Available | 5 |
| MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation | Jun 25, 2024 | DiversityNatural Language Understanding | CodeCode Available | 5 |