| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | Decoder, Image Segmentation | Code Available | 2 |
| Enhancing LLM Reasoning with Reward-guided Tree Search | Nov 18, 2024 | Mathematical Reasoning | Code Available | 2 |
| SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | May 8, 2025 | 3DGS, Data Augmentation | Code Available | 2 |
| CityDreamer: Compositional Generative Model of Unbounded 3D Cities | Sep 1, 2023 | model, Scene Generation | Code Available | 2 |
| Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic | Apr 10, 2024 | GPU | Code Available | 2 |
| Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2 | May 24, 2024 | Data Augmentation, Diversity | Code Available | 2 |
| BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image Classification, Document Layout Analysis | Code Available | 2 |
| Twelve years of SAMtools and BCFtools | Dec 18, 2020 | | Code Available | 2 |
| EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics | Oct 1, 2024 | | Code Available | 2 |
| Deep Long-Tailed Learning: A Survey | Oct 9, 2021 | Survey | Code Available | 2 |
| Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models | Oct 21, 2019 | Data Augmentation, Natural Language Understanding | Code Available | 2 |
| ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing | Dec 19, 2024 | Mixture-of-Experts | Code Available | 2 |
| HumanMM: Global Human Motion Recovery from Multi-shot Videos | Mar 10, 2025 | Camera Pose Estimation, Motion Generation | Code Available | 2 |
| TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation | Oct 12, 2020 | Sign Language Recognition, Sign Language Translation | Code Available | 2 |
| An Overview of Deep Semi-Supervised Learning | Jun 9, 2020 | Deep Learning, image-classification | Code Available | 2 |
| Monster Mash: A Single-View Approach to Casual 3D Modeling and Animation | Dec 1, 2020 | Image Generation | Code Available | 2 |
| Deep Portfolio Theory | May 23, 2016 | | Code Available | 2 |
| A System for Real-Time Interactive Analysis of Deep Learning Training | Jan 5, 2020 | 3D Action Recognition, Diagnostic | Code Available | 2 |
| Context Encoding for Semantic Segmentation | Mar 23, 2018 | image-classification, Image Classification | Code Available | 2 |
| ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation | Jun 1, 2023 | Image Generation, Text to Image Generation | Code Available | 2 |
| Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence | Apr 8, 2020 | Sentence Embeddings, Topic Models | Code Available | 2 |
| SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search | Nov 5, 2021 | | Code Available | 2 |
| Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly | Jan 1, 2024 | Anomaly Detection | Code Available | 2 |
| PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jul 8, 2024 | Autonomous Driving, Image Generation | Code Available | 2 |
| Towards Unifying Feature Attribution and Counterfactual Explanations: Different Means to the Same End | Nov 10, 2020 | Causal Inference, counterfactual | Code Available | 2 |
| Neural Texture Extraction and Distribution for Controllable Person Image Synthesis | Apr 13, 2022 | Image Generation | Code Available | 2 |
| AMC: AutoML for Model Compression and Acceleration on Mobile Devices | Feb 10, 2018 | AutoML, GPU | Code Available | 2 |
| What do we learn from inverting CLIP models? | Mar 5, 2024 | | Code Available | 2 |
| EasyTPP: Towards Open Benchmarking Temporal Point Processes | Jul 16, 2023 | Benchmarking, Point Processes | Code Available | 2 |
| Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents | Feb 3, 2023 | Minecraft, Task Planning | Code Available | 2 |
| Technique Inference Engine: A Recommender Model to Support Cyber Threat Hunting | Mar 4, 2025 | | Code Available | 2 |
| Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications | Nov 29, 2023 | Autonomous Driving, Prediction | Code Available | 2 |
| Barbershop: GAN-based Image Compositing using Segmentation Masks | Jun 2, 2021 | | Code Available | 2 |
| On the limits of cross-domain generalization in automated X-ray prediction | Feb 6, 2020 | Diagnostic, Domain Generalization | Code Available | 2 |
| Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles | Dec 5, 2016 | Image Classification, regression | Community Verified (1 reproduction) | 2 |
| PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction | Mar 6, 2026 | | Unverified | 1 |
| KLASS: KL-Guided Fast Inference in Masked Diffusion Models | Mar 5, 2026 | | Unverified | 1 |
| TinyNav: End-to-End TinyML for Real-Time Autonomous Navigation on Microcontrollers | Mar 10, 2026 | | Unverified | 1 |
| CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization | Mar 6, 2026 | | Unverified | 1 |
| CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance | Mar 11, 2026 | | Unverified | 1 |
| SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise | Feb 13, 2026 | | Unverified | 1 |
| MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding | Feb 23, 2026 | | Unverified | 1 |
| ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents | Jan 30, 2026 | | Unverified | 1 |
| EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning | Feb 10, 2026 | | Unverified | 1 |
| GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing | Mar 12, 2026 | | Unverified | 1 |
| CUA-Skill: Develop Skills for Computer Using Agent | Feb 2, 2026 | | Unverified | 1 |
| SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation | Mar 14, 2026 | | Unverified | 1 |
| Revisiting Text Ranking in Deep Research | Feb 25, 2026 | | Unverified | 1 |
| RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning | Feb 21, 2026 | | Unverified | 1 |
| Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration | Mar 12, 2026 | | Unverified | 1 |