| Distributed Prioritized Experience Replay | Mar 2, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 |
| PromptHMR: Promptable Human Mesh Recovery | Apr 8, 2025 | 3D Human Pose EstimationHuman Mesh Recovery | CodeCode Available | 3 |
| Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Nov 26, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| U-Net: Convolutional Networks for Biomedical Image Segmentation | May 18, 2015 | Cell SegmentationCell Tracking | CodeCode Available | 3 |
| History-Guided Video Diffusion | Feb 10, 2025 | Video Generation | CodeCode Available | 3 |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Apr 25, 2024 | GPU | CodeCode Available | 3 |
| Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval | Feb 17, 2025 | Information RetrievalRetrieval | CodeCode Available | 3 |
| Probabilistic Volumetric Fusion for Dense Monocular SLAM | Oct 3, 2022 | | CodeCode Available | 3 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine TranslationSegmentation | CodeCode Available | 3 |
| Discovered Policy Optimisation | Oct 11, 2022 | IngenuityMeta-Learning | CodeCode Available | 3 |
| MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | May 13, 2024 | Data AugmentationGSM8K | CodeCode Available | 3 |
| On Distillation of Guided Diffusion Models | Oct 6, 2022 | DenoisingImage Generation | CodeCode Available | 3 |
| SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Aug 26, 2024 | | CodeCode Available | 3 |
| SoundStream: An End-to-End Neural Audio Codec | Jul 7, 2021 | CPUDecoder | CodeCode Available | 3 |
| Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Feb 2, 2025 | Multi-Task Learning | CodeCode Available | 3 |
| On the Content Bias in Fréchet Video Distance | Apr 18, 2024 | Video Generation | CodeCode Available | 3 |
| Flow Matching for Generative Modeling | Oct 6, 2022 | Density EstimationImage Generation | CodeCode Available | 3 |
| W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training | Aug 7, 2021 | Contrastive LearningLanguage Modeling | CodeCode Available | 3 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Feb 16, 2024 | DenoisingRobot Manipulation | CodeCode Available | 3 |
| Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Jun 6, 2024 | 3D Generation | CodeCode Available | 3 |
| SkyMath: Technical Report | Oct 25, 2023 | GSM8KLanguage Modeling | CodeCode Available | 3 |
| XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters | May 19, 2023 | | CodeCode Available | 3 |
| Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning | Mar 26, 2025 | Few-Shot LearningVisual Reasoning | CodeCode Available | 3 |
| Designing and building the mlpack open-source machine learning library | Aug 17, 2017 | BIG-bench Machine Learning | CodeCode Available | 3 |
| One-step Diffusion with Distribution Matching Distillation | Nov 30, 2023 | | CodeCode Available | 3 |
| EAFormer: Scene Text Segmentation with Edge-Aware Transformers | Jul 24, 2024 | DecoderSegmentation | CodeCode Available | 3 |
| Accurate clinical and biomedical Named entity recognition at scale | Jul 19, 2022 | Clinical Concept ExtractionDe-identification | CodeCode Available | 3 |
| Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1 | Oct 3, 2024 | Scheduling | CodeCode Available | 3 |
| EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models | Feb 18, 2024 | Event ExtractionHallucination | CodeCode Available | 3 |
| LRM: Large Reconstruction Model for Single Image to 3D | Nov 8, 2023 | Image to 3DNeRF | CodeCode Available | 3 |
| GluonTS: Probabilistic Time Series Models in Python | Jun 12, 2019 | Anomaly DetectionTime Series | CodeCode Available | 3 |
| Practical Deep Reinforcement Learning Approach for Stock Trading | Nov 19, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 3 |
| CodeBLEU: a Method for Automatic Evaluation of Code Synthesis | Sep 22, 2020 | Code TranslationTranslation | CodeCode Available | 3 |
| Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Dec 5, 2024 | Multimodal ReasoningNatural Language Visual Grounding | CodeCode Available | 3 |
| Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Jun 10, 2024 | 3D Semantic SegmentationComputed Tomography (CT) | CodeCode Available | 3 |
| Text Embeddings Reveal (Almost) As Much As Text | Oct 10, 2023 | | CodeCode Available | 3 |
| dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching | May 17, 2025 | Denoising | CodeCode Available | 3 |
| SkillMimic: Learning Basketball Interaction Skills from Demonstrations | Aug 12, 2024 | DiversityHuman-Object Interaction Detection | CodeCode Available | 3 |
| DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Jan 28, 2025 | 3D Generation | CodeCode Available | 3 |
| MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding | Mar 18, 2025 | document understandingQuestion Answering | CodeCode Available | 3 |
| MiniViT: Compressing Vision Transformers with Weight Multiplexing | Apr 14, 2022 | DiversityImage Classification | CodeCode Available | 3 |
| SPMamba: State-space model is all you need in speech separation | Apr 2, 2024 | AllMamba | CodeCode Available | 3 |
| Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection | Aug 6, 2024 | audio moment retrievalHighlight Detection | CodeCode Available | 3 |
| Vision as LoRA | Mar 26, 2025 | | CodeCode Available | 3 |
| Deep Limit Order Book Forecasting | Mar 14, 2024 | Deep Learning | CodeCode Available | 3 |
| Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | Mar 14, 2024 | MambaMoment Retrieval | CodeCode Available | 3 |
| ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting | Jul 23, 2023 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 3 |
| EfficientFormer: Vision Transformers at MobileNet Speed | Jun 2, 2022 | | CodeCode Available | 3 |
| Demystify Mamba in Vision: A Linear Attention Perspective | May 26, 2024 | image-classificationImage Classification | CodeCode Available | 3 |
| Visual Large Language Models for Generalized and Specialized Applications | Jan 6, 2025 | Ethics | CodeCode Available | 3 |