| Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions | Jan 7, 2024 | BenchmarkingImage Segmentation | CodeCode Available | 5 | 5 |
| aeon: a Python toolkit for learning from time series | Jun 20, 2024 | Anomaly DetectionModel Selection | CodeCode Available | 5 | 5 |
| Controllable Generation with Text-to-Image Diffusion Models: A Survey | Mar 7, 2024 | Denoising | CodeCode Available | 5 | 5 |
| Datasets for Large Language Models: A Comprehensive Survey | Feb 28, 2024 | Language ModellingLarge Language Model | CodeCode Available | 5 | 5 |
| Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 | 5 |
| Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Jan 16, 2024 | 3D ReconstructionFace Generation | CodeCode Available | 5 | 5 |
| Make Your LLM Fully Utilize the Context | Apr 25, 2024 | 4kInformation Retrieval | CodeCode Available | 5 | 5 |
| Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training | May 23, 2023 | Contrastive LearningSelf-Supervised Learning | CodeCode Available | 5 | 5 |
| DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos | Sep 3, 2024 | Depth EstimationDiversity | CodeCode Available | 5 | 5 |
| ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills | Feb 3, 2025 | | CodeCode Available | 5 | 5 |
| MambaIRv2: Attentive State Space Restoration | Nov 22, 2024 | Computational EfficiencyImage Restoration | CodeCode Available | 5 | 5 |
| WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Feb 8, 2024 | Conversational Web NavigationText Generation | CodeCode Available | 5 | 5 |
| VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models | Nov 20, 2024 | BenchmarkingImage Generation | CodeCode Available | 5 | 5 |
| Trust Regions for Explanations via Black-Box Probabilistic Certification | Feb 17, 2024 | | CodeCode Available | 5 | 5 |
| MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Feb 1, 2024 | Embodied Question AnsweringLanguage Modeling | CodeCode Available | 5 | 5 |
| EasyPhoto: Your Smart AI Photo Generator | Oct 7, 2023 | | CodeCode Available | 5 | 5 |
| Language Agents as Optimizable Graphs | Feb 26, 2024 | Prompt Engineering | CodeCode Available | 5 | 5 |
| Data-Juicer: A One-Stop Data Processing System for Large Language Models | Sep 5, 2023 | Distributed Computing | CodeCode Available | 5 | 5 |
| Training Large Language Models to Reason in a Continuous Latent Space | Dec 9, 2024 | Logical Reasoning | CodeCode Available | 5 | 5 |
| YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception | Jun 21, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 5 | 5 |
| YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications | Sep 7, 2022 | GPUObject Detection | CodeCode Available | 5 | 5 |
| FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification | Oct 14, 2024 | Image Generation | CodeCode Available | 5 | 5 |
| OminiControl2: Efficient Conditioning for Diffusion Transformers | Mar 11, 2025 | Conditional Image GenerationDenoising | CodeCode Available | 5 | 5 |
| Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B | Jun 11, 2024 | Decision MakingGSM8K | CodeCode Available | 5 | 5 |
| Semantic Operators: A Declarative Model for Rich, AI-based Data Processing | Jul 16, 2024 | Extreme Multi-Label ClassificationFact Checking | CodeCode Available | 5 | 5 |
| OMG-Seg: Is One Model Good Enough For All Segmentation? | Jan 18, 2024 | AllDecoder | CodeCode Available | 5 | 5 |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Oct 11, 2023 | HallucinationLanguage Modeling | CodeCode Available | 5 | 5 |
| TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting | May 23, 2024 | Future predictionTime Series | CodeCode Available | 5 | 5 |
| MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI | Nov 27, 2023 | Complex Query AnsweringLogical Reasoning | CodeCode Available | 5 | 5 |
| SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition | May 21, 2025 | | CodeCode Available | 5 | 5 |
| Masked Completion via Structured Diffusion with White-Box Transformers | Apr 3, 2024 | Representation Learning | CodeCode Available | 5 | 5 |
| Inpaint Anything: Segment Anything Meets Image Inpainting | Apr 13, 2023 | Image Inpainting | CodeCode Available | 5 | 5 |
| Extreme Compression of Large Language Models via Additive Quantization | Jan 11, 2024 | CPUGPU | CodeCode Available | 5 | 5 |
| Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning | Jul 8, 2024 | | CodeCode Available | 5 | 5 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPULanguage Modeling | CodeCode Available | 5 | 5 |
| CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling | May 26, 2024 | | CodeCode Available | 5 | 5 |
| Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities | May 5, 2025 | Image GenerationSurvey | CodeCode Available | 5 | 5 |
| Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation | Apr 16, 2023 | Instruction Following | CodeCode Available | 5 | 5 |
| MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 | 5 |
| Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search | Dec 24, 2024 | | CodeCode Available | 5 | 5 |
| CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models | Jul 21, 2024 | AllFashion Synthesis | CodeCode Available | 5 | 5 |
| Arbitrary-steps Image Super-resolution via Diffusion Inversion | Dec 12, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 5 | 5 |
| SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks | Apr 15, 2024 | Quantization | CodeCode Available | 5 | 5 |
| SymbolicAI: A framework for logic-based approaches combining generative models and solvers | Feb 1, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 5 | 5 |
| That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design | Nov 15, 2024 | Deep Reinforcement Learning | CodeCode Available | 5 | 5 |
| GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation | Nov 27, 2024 | Depth EstimationDiversity | CodeCode Available | 5 | 5 |
| Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction | May 31, 2024 | Speech Synthesis | CodeCode Available | 5 | 5 |
| A quantum semantic framework for natural language processing | Jun 11, 2025 | | CodeCode Available | 5 | 5 |
| Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers | May 10, 2024 | | CodeCode Available | 5 | 5 |
| The Path To Autonomous Cyber Defense | Apr 12, 2024 | | CodeCode Available | 5 | 5 |