| DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features | Aug 12, 2020 | GPULoop Closure Detection | CodeCode Available | 1 | 5 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPULarge Language Model | CodeCode Available | 1 | 5 |
| A Lightweight CNN-Transformer Model for Learning Traveling Salesman Problems | May 3, 2023 | GPU | CodeCode Available | 1 | 5 |
| Bayesian Optimization for auto-tuning GPU kernels | Nov 26, 2021 | Bayesian OptimizationGPU | CodeCode Available | 1 | 5 |
| FL_PyTorch: optimization research simulator for federated learning | Feb 7, 2022 | Federated LearningGPU | CodeCode Available | 1 | 5 |
| Minuet: Accelerating 3D Sparse Convolutions on GPUs | Dec 1, 2023 | GPU | CodeCode Available | 1 | 5 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Oct 4, 2020 | GPUMuJoCo | CodeCode Available | 1 | 5 |
| Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling | Apr 14, 2021 | GPURe-Ranking | CodeCode Available | 1 | 5 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Mar 2, 2025 | CPUGPU | CodeCode Available | 1 | 5 |
| A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems | Jan 14, 2020 | C++ codeCode Generation | CodeCode Available | 1 | 5 |
| Dynamic Perceiver for Efficient Visual Recognition | Jun 20, 2023 | Action RecognitionClassification | CodeCode Available | 1 | 5 |
| Iterative Patch Selection for High-Resolution Image Recognition | Oct 24, 2022 | Autonomous DrivingGPU | CodeCode Available | 1 | 5 |
| CommVQ: Commutative Vector Quantization for KV Cache Compression | Jun 23, 2025 | GPUGSM8K | CodeCode Available | 1 | 5 |
| iUNets: Fully invertible U-Nets with Learnable Up- and Downsampling | May 11, 2020 | GPU | CodeCode Available | 1 | 5 |
| Keypoints Localization for Joint Vertebra Detection and Fracture Severity Quantification | May 25, 2020 | Computed Tomography (CT)GPU | CodeCode Available | 1 | 5 |
| Learning Rich Features at High-Speed for Single-Shot Object Detection | Oct 1, 2019 | GPUobject-detection | CodeCode Available | 1 | 5 |
| DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data | Apr 29, 2020 | GPUHuman Detection | CodeCode Available | 1 | 5 |
| DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training | Feb 28, 2022 | GPUInstance Segmentation | CodeCode Available | 1 | 5 |
| Dr. Top-k: Delegate-Centric Top-k on GPUs | Sep 16, 2021 | GPU | CodeCode Available | 1 | 5 |
| DreamShard: Generalizable Embedding Table Placement for Recommender Systems | Oct 5, 2022 | GPURecommendation Systems | CodeCode Available | 1 | 5 |
| From English To Foreign Languages: Transferring Pre-trained Language Models | Feb 18, 2020 | Dependency ParsingGPU | CodeCode Available | 1 | 5 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training | Jun 3, 2024 | Distributed OptimizationFederated Learning | CodeCode Available | 1 | 5 |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Feb 27, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 | 5 |
| InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction | Jan 23, 2024 | 3D Semantic Occupancy PredictionAutonomous Driving | CodeCode Available | 1 | 5 |