| DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation | Jun 25, 2025 | Code GenerationDenoising | CodeCode Available | 4 | 5 |
| DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio | May 11, 2022 | CPUData Augmentation | CodeCode Available | 4 | 5 |
| Efficient Few-Shot Learning Without Prompts | Sep 22, 2022 | Few-Shot LearningFew-Shot Text Classification | CodeCode Available | 4 | 5 |
| AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents | May 23, 2024 | Benchmarking | CodeCode Available | 4 | 5 |
| Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning | Nov 18, 2023 | Transfer Learning | CodeCode Available | 4 | 5 |
| Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering | Jan 12, 2024 | 3D Panoptic Segmentation3D Semantic Segmentation | CodeCode Available | 4 | 5 |
| Generalizable and Animatable Gaussian Head Avatar | Oct 10, 2024 | | CodeCode Available | 4 | 5 |
| Deep Industrial Image Anomaly Detection: A Survey | Jan 27, 2023 | Anomaly DetectionDeep Learning | CodeCode Available | 4 | 5 |
| PharMolixFM: All-Atom Foundation Models for Molecular Modeling and Generation | Mar 12, 2025 | AllDenoising | CodeCode Available | 4 | 5 |
| Transformer for Object Re-Identification: A Survey | Jan 13, 2024 | ObjectSurvey | CodeCode Available | 4 | 5 |
| FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Mar 19, 2024 | Translationvalid | CodeCode Available | 4 | 5 |
| FLEX: FLEXible Federated Learning Framework | Apr 9, 2024 | Federated Learning | CodeCode Available | 4 | 5 |
| Deep Multi-Frame Filtering for Hearing Aids | May 14, 2023 | Speech Enhancement | CodeCode Available | 4 | 5 |
| Neuralangelo: High-Fidelity Neural Surface Reconstruction | Jun 5, 2023 | Neural RenderingSurface Reconstruction | CodeCode Available | 4 | 5 |
| Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond | May 6, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 4 | 5 |
| PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor | Jan 1, 2024 | Object | CodeCode Available | 4 | 5 |
| Training Software Engineering Agents and Verifiers with SWE-Gym | Dec 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control | Mar 7, 2025 | Image InpaintingOptical Flow Estimation | CodeCode Available | 4 | 5 |
| pgmpy: A Python Toolkit for Bayesian Networks | Apr 17, 2023 | Causal DiscoveryCausal Identification | CodeCode Available | 4 | 5 |
| OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning | Dec 31, 2024 | BenchmarkingLogical Reasoning | CodeCode Available | 4 | 5 |
| Rethinking Inductive Biases for Surface Normal Estimation | Mar 1, 2024 | Surface Normal Estimation | CodeCode Available | 4 | 5 |
| UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation | Jun 3, 2024 | Image AnimationVideo Generation | CodeCode Available | 4 | 5 |
| InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write | Feb 8, 2024 | Derendering | CodeCode Available | 4 | 5 |
| Long-form factuality in large language models | Mar 27, 2024 | 16kForm | CodeCode Available | 4 | 5 |
| Molecular-driven Foundation Model for Oncologic Pathology | Jan 28, 2025 | BenchmarkingDiagnostic | CodeCode Available | 4 | 5 |
| Natural Language Generation | Feb 20, 2025 | Text Generation | CodeCode Available | 4 | 5 |
| Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Aug 1, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 4 | 5 |
| From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents | Jun 23, 2025 | Information RetrievalRetrieval | CodeCode Available | 4 | 5 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4kLanguage Modeling | CodeCode Available | 4 | 5 |
| 3D-aware Conditional Image Synthesis | Feb 16, 2023 | Image Generation | CodeCode Available | 4 | 5 |
| NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning | Mar 11, 2024 | Collision AvoidanceMotion Generation | CodeCode Available | 4 | 5 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | Jun 1, 2023 | Image ClassificationInstruction Following | CodeCode Available | 4 | 5 |
| MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark | Sep 4, 2024 | Optical Character Recognition (OCR) | CodeCode Available | 4 | 5 |
| Pen and Paper Exercises in Machine Learning | Jun 27, 2022 | BIG-bench Machine Learning | CodeCode Available | 4 | 5 |
| RewardBench: Evaluating Reward Models for Language Modeling | Mar 20, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 4 | 5 |
| Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model | Dec 1, 2022 | Colorizationcompressed sensing | CodeCode Available | 4 | 5 |
| Taming Rectified Flow for Inversion and Editing | Nov 7, 2024 | Image GenerationText-to-Image Generation | CodeCode Available | 4 | 5 |
| A Foundation Model for Zero-shot Logical Query Reasoning | Apr 10, 2024 | Complex Query AnsweringKnowledge Graph Completion | CodeCode Available | 4 | 5 |
| DoRA: Weight-Decomposed Low-Rank Adaptation | Feb 14, 2024 | parameter-efficient fine-tuning | CodeCode Available | 4 | 5 |
| Blind Image Deblurring with Unknown Kernel Size and Substantial Noise | Aug 18, 2022 | Blind Image DeblurringDeblurring | CodeCode Available | 4 | 5 |
| Human Motion Diffusion Model | Sep 29, 2022 | 3D Generationmodel | CodeCode Available | 4 | 5 |
| Fast Inference of Mixture-of-Experts Language Models with Offloading | Dec 28, 2023 | Mixture-of-ExpertsQuantization | CodeCode Available | 4 | 5 |
| Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model | Oct 23, 2023 | | CodeCode Available | 4 | 5 |
| BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation | Feb 16, 2024 | Knowledge DistillationQuantization | CodeCode Available | 4 | 5 |
| TerraTorch: The Geospatial Foundation Models Toolkit | Mar 26, 2025 | BenchmarkingDecoder | CodeCode Available | 4 | 5 |
| Video-R1: Reinforcing Video Reasoning in MLLMs | Mar 27, 2025 | MVBenchReinforcement Learning (RL) | CodeCode Available | 4 | 5 |
| SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement | Jun 9, 2025 | Music Generation | CodeCode Available | 4 | 5 |
| SpatialTrackerV2: 3D Point Tracking Made Easy | Jul 16, 2025 | 3D ReconstructionCamera Pose Estimation | CodeCode Available | 4 | 5 |
| Proactive Detection of Voice Cloning with Localized Watermarking | Jan 30, 2024 | Voice Cloning | CodeCode Available | 4 | 5 |
| Eliciting Latent Predictions from Transformers with the Tuned Lens | Mar 14, 2023 | Language Modelling | CodeCode Available | 4 | 5 |