| FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods | Jun 15, 2023 | Benchmarking, Fairness | Code Available | 1 |
| Computationally Budgeted Continual Learning: What Does Matter? | Mar 20, 2023 | Continual Learning, GPU | Code Available | 1 |
| A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems | Jan 14, 2020 | C++ code, Code Generation | Code Available | 1 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPU, Language Modeling | Code Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPU, Language Modeling | Code Available | 1 |
| LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning | Nov 20, 2023 | GPU, Language Modeling | Code Available | 1 |
| APLA: A Simple Adaptation Method for Vision Transformers | Mar 14, 2025 | Classification, GPU | Code Available | 1 |
| A Runtime-Based Computational Performance Predictor for Deep Neural Network Training | Jan 31, 2021 | GPU | Code Available | 1 |
| APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores | Jun 23, 2021 | GPU, Quantization | Code Available | 1 |
| FFHNet: Generating Multi-Fingered Robotic Grasps for Unknown Objects in Real-time | May 23, 2022 | GPU, Grasp Generation | Code Available | 1 |