| A Common Interface for Automatic Differentiation | May 8, 2025 | | CodeCode Available | 3 | 5 |
| GameGen-X: Interactive Open-world Game Video Generation | Nov 1, 2024 | Text-to-Video GenerationVideo Generation | CodeCode Available | 3 | 5 |
| Valley2: Exploring Multimodal Models with Scalable Vision-Language Design | Jan 10, 2025 | Image CaptioningLanguage Modeling | CodeCode Available | 3 | 5 |
| Measuring AI Ability to Complete Long Tasks | Mar 18, 2025 | Logical Reasoning | CodeCode Available | 3 | 5 |
| Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Jun 5, 2024 | MambaMedical Image Analysis | CodeCode Available | 3 | 5 |
| InterpretML: A Unified Framework for Machine Learning Interpretability | Sep 19, 2019 | Additive modelsBIG-bench Machine Learning | CodeCode Available | 3 | 5 |
| Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Jun 10, 2025 | 3D Lane Detection3D Object Detection | CodeCode Available | 3 | 5 |
| AlphaMath Almost Zero: Process Supervision without Process | May 6, 2024 | Mathematical ReasoningMath Word Problem Solving | CodeCode Available | 3 | 5 |
| Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models | May 29, 2025 | Autonomous DrivingDiagnostic | CodeCode Available | 3 | 5 |
| Normalizing Flows are Capable Generative Models | Dec 9, 2024 | Conditional Image GenerationDensity Estimation | CodeCode Available | 3 | 5 |
| 3D Photography using Context-aware Layered Depth Inpainting | Apr 9, 2020 | Novel View Synthesis | CodeCode Available | 3 | 5 |
| SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Aug 19, 2024 | image-classificationImage Classification | CodeCode Available | 3 | 5 |
| Focused Transformer: Contrastive Training for Context Scaling | Jul 6, 2023 | Contrastive Learning | CodeCode Available | 3 | 5 |
| Deep Neural Networks for Encrypted Inference with TFHE | Feb 13, 2023 | Privacy Preserving | CodeCode Available | 3 | 5 |
| Foundations of Large Language Models | Jan 16, 2025 | | CodeCode Available | 3 | 5 |
| MobileMamba: Lightweight Multi-Receptive Visual Mamba Network | Nov 24, 2024 | GPUMamba | CodeCode Available | 3 | 5 |
| Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography | Mar 26, 2024 | Anomaly DetectionLarge Language Model | CodeCode Available | 3 | 5 |
| EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation | May 11, 2024 | Computational EfficiencyDecoder | CodeCode Available | 3 | 5 |
| Expanding Language-Image Pretrained Models for General Video Recognition | Aug 4, 2022 | Action ClassificationAction Recognition | CodeCode Available | 3 | 5 |
| WavChat: A Survey of Spoken Dialogue Models | Nov 15, 2024 | speech-recognitionSpeech Recognition | CodeCode Available | 3 | 5 |
| PirateNets: Physics-informed Deep Learning with Residual Adaptive Networks | Feb 1, 2024 | Deep Learning | CodeCode Available | 3 | 5 |
| FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Mar 24, 2023 | 3D Hand Pose EstimationGPU | CodeCode Available | 3 | 5 |
| Style Aligned Image Generation via Shared Attention | Dec 4, 2023 | Image Generation | CodeCode Available | 3 | 5 |
| Camera Calibration via Circular Patterns: A Comprehensive Framework with Measurement Uncertainty and Unbiased Projection Model | Jun 20, 2025 | Camera Calibration | CodeCode Available | 3 | 5 |
| emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation | Dec 2, 2024 | AnatomyHand Pose Estimation | CodeCode Available | 3 | 5 |
| Separable Self-attention for Mobile Vision Transformers | Jun 6, 2022 | Image ClassificationObject Detection | CodeCode Available | 3 | 5 |
| Safety at Scale: A Comprehensive Survey of Large Model Safety | Feb 2, 2025 | Autonomous DrivingData Poisoning | CodeCode Available | 3 | 5 |
| ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks | Sep 1, 2018 | Face HallucinationGenerative Adversarial Network | CodeCode Available | 3 | 5 |
| CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph | Jun 16, 2024 | Drug DesignFairness | CodeCode Available | 3 | 5 |
| A Declarative System for Optimizing AI Workloads | May 23, 2024 | | CodeCode Available | 3 | 5 |
| MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction | Feb 17, 2025 | 2kAutonomous Driving | CodeCode Available | 3 | 5 |
| How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection | Aug 25, 2023 | Object Detection | CodeCode Available | 3 | 5 |
| PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models | Mar 8, 2020 | Face HallucinationHallucination | CodeCode Available | 3 | 5 |
| DETRs with Collaborative Hybrid Assignments Training | Nov 22, 2022 | DecoderInstance Segmentation | CodeCode Available | 3 | 5 |
| Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Nov 20, 2024 | GPUMME | CodeCode Available | 3 | 5 |
| SparseTSF: Modeling Long-term Time Series Forecasting with 1k Parameters | May 2, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 3 | 5 |
| SelfCodeAlign: Self-Alignment for Code Generation | Oct 31, 2024 | Code GenerationHumanEval | CodeCode Available | 3 | 5 |
| BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | May 29, 2025 | Large Language Modelscientific discovery | CodeCode Available | 3 | 5 |
| Scientific Large Language Models: A Survey on Biological & Chemical Domains | Jan 26, 2024 | scientific discoverySurvey | CodeCode Available | 3 | 5 |
| TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation | Feb 27, 2024 | Protein Design | CodeCode Available | 3 | 5 |
| Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought | Oct 3, 2022 | Mathematical ReasoningQuestion Answering | CodeCode Available | 3 | 5 |
| FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors | Jan 14, 2025 | Image to Video GenerationVideo Generation | CodeCode Available | 3 | 5 |
| Dopamine: A Research Framework for Deep Reinforcement Learning | Dec 14, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 3 | 5 |
| ModelScope Text-to-Video Technical Report | Aug 12, 2023 | DenoisingImage Generation | CodeCode Available | 3 | 5 |
| DocAgent: A Multi-Agent System for Automated Code Documentation Generation | Apr 11, 2025 | Code Documentation Generation | CodeCode Available | 3 | 5 |
| Geometric-aware Pretraining for Vision-centric 3D Object Detection | Apr 6, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 3 | 5 |
| Physics-Informed Diffusion Models | Mar 21, 2024 | Denoising | CodeCode Available | 3 | 5 |
| An end-to-end strategy for recovering a free-form potential from a snapshot of stellar coordinates | May 26, 2023 | FormSymbolic Regression | CodeCode Available | 3 | 5 |
| MELODI: Exploring Memory Compression for Long Contexts | Oct 4, 2024 | | CodeCode Available | 3 | 5 |
| Accelerating Production LLMs with Combined Token/Embedding Speculators | Apr 29, 2024 | | CodeCode Available | 3 | 5 |