| DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions | Jan 4, 2021 | GPU | CodeCode Available | 1 | 5 |
| Label Supervised LLaMA Finetuning | Oct 2, 2023 | GPUnamed-entity-recognition | CodeCode Available | 1 | 5 |
| LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation | Jun 18, 2024 | GPUNatural Language Understanding | CodeCode Available | 1 | 5 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Mar 2, 2025 | CPUGPU | CodeCode Available | 1 | 5 |
| AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning | Feb 2, 2021 | GPU | CodeCode Available | 1 | 5 |
| KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing | Oct 24, 2024 | GPU | CodeCode Available | 1 | 5 |
| Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses | Jun 28, 2023 | CPUGPU | CodeCode Available | 1 | 5 |
| 3D Small Object Detection with Dynamic Spatial Pruning | May 5, 2023 | 3D Object DetectionDecoder | CodeCode Available | 1 | 5 |
| L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training | Aug 18, 2022 | CPUGPU | CodeCode Available | 1 | 5 |
| LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark | Jun 11, 2023 | GPU | CodeCode Available | 1 | 5 |