| Flash Invariant Point Attention | May 16, 2025 | GPU | CodeCode Available | 1 | 5 |
| Application-Oriented Benchmarking of Quantum Generative Learning Using QUARK | Aug 8, 2023 | BenchmarkingGPU | CodeCode Available | 1 | 5 |
| DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation | Mar 30, 2022 | GPU | CodeCode Available | 1 | 5 |
| Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality | Dec 21, 2024 | GPU | CodeCode Available | 1 | 5 |
| APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores | Jun 23, 2021 | GPUQuantization | CodeCode Available | 1 | 5 |
| APLA: A Simple Adaptation Method for Vision Transformers | Mar 14, 2025 | ClassificationGPU | CodeCode Available | 1 | 5 |
| Defocus Blur Detection via Depth Distillation | Jul 16, 2020 | DecoderDefocus Blur Detection | CodeCode Available | 1 | 5 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| Adaptively Placed Multi-Grid Scene Representation Networks for Large-Scale Data Visualization | Jul 16, 2023 | Data VisualizationGPU | CodeCode Available | 1 | 5 |