| FlashMLA-ETAP: Efficient Transpose Attention Pipeline for Accelerating MLA Inference on NVIDIA H20 GPUs | May 13, 2025 | GPU | CodeCode Available | 1 |
| A Practical Stereo Depth System for Smart Glasses | Nov 19, 2022 | CPUDepth Estimation | CodeCode Available | 1 |
| APQ: Joint Search for Network Architecture, Pruning and Quantization Policy | Jun 15, 2020 | GPUQuantization | CodeCode Available | 1 |
| AdaSplash: Adaptive Sparse Flash Attention | Feb 17, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and Inference | Sep 9, 2022 | CPUGPU | CodeCode Available | 1 |
| Accelerating DNN Training with Structured Data Gradient Pruning | Feb 1, 2022 | GPU | CodeCode Available | 1 |
| Beyond [cls]: Exploring the true potential of Masked Image Modeling representations | Dec 4, 2024 | GPUSelf-Supervised Learning | CodeCode Available | 1 |
| FOVEA: Foveated Image Magnification for Autonomous Navigation | Aug 27, 2021 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 1 |
| APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Classification | May 2, 2022 | 3D Classification3D Point Cloud Classification | CodeCode Available | 1 |
| Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II | Sep 26, 2021 | GPUStarcraft | CodeCode Available | 1 |
| Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models | Dec 19, 2023 | GPU | CodeCode Available | 1 |
| Application-Oriented Benchmarking of Quantum Generative Learning Using QUARK | Aug 8, 2023 | BenchmarkingGPU | CodeCode Available | 1 |
| Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality | Dec 21, 2024 | GPU | CodeCode Available | 1 |
| Fine-tuning of sign language recognition models: a technical report | Feb 15, 2023 | Gesture RecognitionGPU | CodeCode Available | 1 |
| APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores | Jun 23, 2021 | GPUQuantization | CodeCode Available | 1 |
| APLA: A Simple Adaptation Method for Vision Transformers | Mar 14, 2025 | ClassificationGPU | CodeCode Available | 1 |
| Fine-tuning giant neural networks on commodity hardware with automatic pipeline model parallelism | Jul 14, 2021 | GPUTransfer Learning | CodeCode Available | 1 |
| Fine-Tuning Pre-trained Transformers into Decaying Fast Weights | Oct 9, 2022 | GPU | CodeCode Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Adaptively Placed Multi-Grid Scene Representation Networks for Large-Scale Data Visualization | Jul 16, 2023 | Data VisualizationGPU | CodeCode Available | 1 |
| ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis | Jul 20, 2022 | CPUGPU | CodeCode Available | 1 |
| Adaptive Graph Diffusion Networks | Dec 30, 2020 | GPULink Prediction | CodeCode Available | 1 |
| Better Than Reference In Low Light Image Enhancement: Conditional Re-Enhancement Networks | Aug 26, 2020 | GPUImage Enhancement | CodeCode Available | 1 |
| Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | May 19, 2025 | GPUQuantization | CodeCode Available | 1 |