| Efficient Forward Architecture Search | May 31, 2019 | feature selection, GPU | Code Available | 1 | 5 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPU, Large Language Model | Code Available | 1 | 5 |
| EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs | Nov 30, 2021 | GPU, Image Generation | Code Available | 1 | 5 |
| A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems | Jan 14, 2020 | C++ code, Code Generation | Code Available | 1 | 5 |
| Fully Convolutional Line Parsing | Apr 22, 2021 | GPU, Line Segment Detection | Code Available | 1 | 5 |
| Easy and Efficient Transformer : Scalable Inference Solution For large NLP model | Apr 26, 2021 | Decoder, GPU | Code Available | 1 | 5 |
| L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training | Aug 18, 2022 | CPU, GPU | Code Available | 1 | 5 |
| Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Nov 30, 2023 | GPU, Inductive Bias | Code Available | 1 | 5 |
| Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Feb 23, 2023 | GPU, NeRF | Code Available | 1 | 5 |
| LightViT: Towards Light-Weight Convolution-Free Vision Transformers | Jul 12, 2022 | GPU, image-classification | Code Available | 1 | 5 |
| Dynamic Sparse Training with Structured Sparsity | May 3, 2023 | CPU, GPU | Code Available | 1 | 5 |
| Dynamic Perceiver for Efficient Visual Recognition | Jun 20, 2023 | Action Recognition, Classification | Code Available | 1 | 5 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training | Jun 3, 2024 | Distributed Optimization, Federated Learning | Code Available | 1 | 5 |
| CommVQ: Commutative Vector Quantization for KV Cache Compression | Jun 23, 2025 | GPU, GSM8K | Code Available | 1 | 5 |
| Dynamic Pooling Improves Nanopore Base Calling Accuracy | May 16, 2021 | GPU | Code Available | 1 | 5 |
| KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Oct 28, 2024 | GPU, Knowledge Distillation | Code Available | 1 | 5 |
| Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms | May 8, 2021 | CPU, GPU | Code Available | 1 | 5 |
| Dynamic Mesh-Aware Radiance Fields | Sep 8, 2023 | GPU, NeRF | Code Available | 1 | 5 |
| Dynamic Structure Pruning for Compressing CNNs | Mar 17, 2023 | GPU | Code Available | 1 | 5 |
| A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image | Aug 27, 2019 | 3D Pose Estimation, Decoder | Code Available | 1 | 5 |
| Dynamic Low-Rank Sparse Adaptation for Large Language Models | Feb 20, 2025 | CPU, GPU | Code Available | 1 | 5 |
| KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow | Apr 11, 2020 | CPU, GPU | Code Available | 1 | 5 |
| Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices | Jan 17, 2024 | Dynamic neural networks, GPU | Code Available | 1 | 5 |
| BASNet: Boundary-Aware Salient Object Detection | Jun 1, 2019 | Camouflaged Object Segmentation, Decoder | Code Available | 1 | 5 |
| Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks | Aug 22, 2023 | Computational Efficiency, GPU | Code Available | 1 | 5 |
| Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps | Jun 8, 2022 | Autonomous Driving, Depth Estimation | Code Available | 1 | 5 |
| DVIS: Decoupled Video Instance Segmentation Framework | Jun 6, 2023 | Autonomous Driving, GPU | Code Available | 1 | 5 |
| AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations | Jun 19, 2021 | Code Generation, CPU | Code Available | 1 | 5 |
| DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features | Aug 12, 2020 | GPU, Loop Closure Detection | Code Available | 1 | 5 |
| Accelerating Vision-Language Pretraining with Free Language Modeling | Mar 24, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| Dynamic GPU Energy Optimization for Machine Learning Training Workloads | Jan 5, 2022 | BIG-bench Machine Learning, GPU | Code Available | 1 | 5 |
| JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning | Mar 17, 2024 | GPU, Management | Code Available | 1 | 5 |
| DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions | Jan 4, 2021 | GPU | Code Available | 1 | 5 |
| DTL: Disentangled Transfer Learning for Visual Recognition | Dec 13, 2023 | GPU, Transfer Learning | Code Available | 1 | 5 |
| JetSeg: Efficient Real-Time Semantic Segmentation Model for Low-Power GPU-Embedded Systems | May 19, 2023 | Decoder, GPU | Code Available | 1 | 5 |
| JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes | May 10, 2025 | Benchmarking, GPU | Code Available | 1 | 5 |
| JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image | Jul 9, 2020 | 3D Hand Pose Estimation, GPU | Code Available | 1 | 5 |
| Accelerating Translational Image Registration for HDR Images on GPU | Jul 13, 2020 | CPU, GPU | Code Available | 1 | 5 |
| Bag of Tricks for Inference-time Computation of LLM Reasoning | Feb 11, 2025 | GPU | Code Available | 1 | 5 |
| 3D Small Object Detection with Dynamic Spatial Pruning | May 5, 2023 | 3D Object Detection, Decoder | Code Available | 1 | 5 |
| DSNAS: Direct Neural Architecture Search without Parameter Retraining | Feb 21, 2020 | GPU, Neural Architecture Search | Code Available | 1 | 5 |
| JIT-Masker: Efficient Online Distillation for Background Matting | Jun 11, 2020 | GPU, Image Matting | Code Available | 1 | 5 |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Feb 27, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 1 | 5 |
| DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training | Feb 28, 2022 | GPU, Instance Segmentation | Code Available | 1 | 5 |
| DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Mar 4, 2025 | Computational Efficiency, CPU | Code Available | 1 | 5 |
| DreamShard: Generalizable Embedding Table Placement for Recommender Systems | Oct 5, 2022 | GPU, Recommendation Systems | Code Available | 1 | 5 |
| DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data | Apr 29, 2020 | GPU, Human Detection | Code Available | 1 | 5 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Mar 2, 2025 | CPU, GPU | Code Available | 1 | 5 |
| Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads | May 24, 2021 | CPU, GPU | Code Available | 1 | 5 |
| DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks | Mar 4, 2021 | GPU, NeRF | Code Available | 1 | 5 |