| Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism | Nov 25, 2022 | GPU | CodeCode Available | 2 | 5 |
| Habitat: A Platform for Embodied AI Research | Apr 2, 2019 | BenchmarkingGPU | CodeCode Available | 2 | 5 |
| Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | Oct 19, 2024 | Conditional Image GenerationGPU | CodeCode Available | 2 | 5 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain AdaptationDomain Generalization | CodeCode Available | 2 | 5 |
| mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs | Dec 5, 2023 | GPULarge Language Model | CodeCode Available | 2 | 5 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPULanguage Modeling | CodeCode Available | 2 | 5 |
| Differentiable Voxelization and Mesh Morphing | Jul 15, 2024 | GPU | CodeCode Available | 2 | 5 |
| Full Parameter Fine-tuning for Large Language Models with Limited Resources | Jun 16, 2023 | GPUparameter-efficient fine-tuning | CodeCode Available | 2 | 5 |
| BiFormer: Vision Transformer with Bi-Level Routing Attention | Mar 15, 2023 | Computational EfficiencyGPU | CodeCode Available | 2 | 5 |
| Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis | Mar 3, 2024 | 3D Parameter-Efficient Fine-Tuning for ClassificationGPU | CodeCode Available | 2 | 5 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 | 5 |
| Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Mar 27, 2024 | 3D Generation3DGS | CodeCode Available | 2 | 5 |
| GPU Performance Portability needs Autotuning | Apr 30, 2025 | GPU | CodeCode Available | 2 | 5 |
| AutoFocus: Efficient Multi-Scale Inference | Dec 4, 2018 | GPU | CodeCode Available | 2 | 5 |
| FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation | Mar 4, 2023 | BenchmarkingGPU | CodeCode Available | 2 | 5 |
| Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation | Sep 2, 2024 | GPU | CodeCode Available | 2 | 5 |
| FP8-LM: Training FP8 Large Language Models | Oct 27, 2023 | GPU | CodeCode Available | 2 | 5 |
| Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity | Sep 19, 2023 | GPU | CodeCode Available | 2 | 5 |
| A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel | Jan 13, 2025 | GPU | CodeCode Available | 2 | 5 |
| FlashRNN: Optimizing Traditional RNNs on Modern Hardware | Dec 10, 2024 | GPULogical Reasoning | CodeCode Available | 2 | 5 |
| Invertible Diffusion Models for Compressed Sensing | Mar 25, 2024 | compressed sensingGPU | CodeCode Available | 2 | 5 |
| Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning | Aug 24, 2021 | CPUGPU | CodeCode Available | 2 | 5 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 | 5 |
| Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models | Jun 11, 2024 | DiversityGPU | CodeCode Available | 2 | 5 |
| Differentiable Forward Projector for X-ray Computed Tomography | Jul 11, 2023 | CT ReconstructionDeep Learning | CodeCode Available | 2 | 5 |
| FRA-RIR: Fast Random Approximation of the Image-source Method | Aug 8, 2022 | DenoisingGPU | CodeCode Available | 2 | 5 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPUGPU | CodeCode Available | 2 | 5 |
| $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources | Oct 30, 2024 | GPU | CodeCode Available | 2 | 5 |
| From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients | Jul 15, 2024 | GPU | CodeCode Available | 2 | 5 |
| KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation | Feb 21, 2025 | Audio GenerationFAD | CodeCode Available | 2 | 5 |
| CaRL: Learning Scalable Planning Policies with Simple Rewards | Apr 24, 2025 | Autonomous DrivingCARLA longest6 | CodeCode Available | 2 | 5 |
| Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification | Dec 1, 2024 | GPUVisual Question Answering | CodeCode Available | 2 | 5 |
| Me LLaMA: Foundation Large Language Models for Medical Applications | Feb 20, 2024 | Few-Shot LearningGPU | CodeCode Available | 2 | 5 |
| LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Jun 27, 2023 | Automated Theorem ProvingGPU | CodeCode Available | 2 | 5 |
| Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness | May 18, 2023 | CPUGPU | CodeCode Available | 2 | 5 |
| λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Feb 7, 2024 | Concept AlignmentGPU | CodeCode Available | 2 | 5 |
| BMInf: An Efficient Toolkit for Big Model Inference and Tuning | May 1, 2022 | CPUGPU | CodeCode Available | 2 | 5 |
| A Tensor Compiler for Unified Machine Learning Prediction Serving | Oct 9, 2020 | BIG-bench Machine LearningCPU | CodeCode Available | 2 | 5 |
| DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation | Mar 30, 2022 | GPU | CodeCode Available | 1 | 5 |
| Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | May 19, 2025 | GPUQuantization | CodeCode Available | 1 | 5 |
| Defocus Blur Detection via Depth Distillation | Jul 16, 2020 | DecoderDefocus Blur Detection | CodeCode Available | 1 | 5 |
| ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution | May 23, 2024 | GPUWeather Forecasting | CodeCode Available | 1 | 5 |
| Fine-tuning of sign language recognition models: a technical report | Feb 15, 2023 | Gesture RecognitionGPU | CodeCode Available | 1 | 5 |
| Fine-tuning giant neural networks on commodity hardware with automatic pipeline model parallelism | Jul 14, 2021 | GPUTransfer Learning | CodeCode Available | 1 | 5 |
| Fine-Tuning Pre-trained Transformers into Decaying Fast Weights | Oct 9, 2022 | GPU | CodeCode Available | 1 | 5 |
| FG-Net: Fast Large-Scale LiDAR Point Clouds Understanding Network Leveraging Correlated Feature Mining and Geometric-Aware Modelling | Dec 17, 2020 | 3D Part Segmentation3D Point Cloud Classification | CodeCode Available | 1 | 5 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPULarge Language Model | CodeCode Available | 1 | 5 |
| A Probabilistic Neuro-symbolic Layer for Algebraic Constraint Satisfaction | Mar 25, 2025 | GPU | CodeCode Available | 1 | 5 |
| Accelerating Evolutionary Neural Architecture Search via Multi-Fidelity Evaluation | Aug 10, 2021 | GPUNeural Architecture Search | CodeCode Available | 1 | 5 |
| Fill the K-Space and Refine the Image: Prompting for Dynamic and Multi-Contrast MRI Reconstruction | Sep 25, 2023 | GPUMRI Reconstruction | CodeCode Available | 1 | 5 |