| Half Search Space is All You Need | May 19, 2025 | AllGPU | —Unverified | 0 |
| HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks | Jan 20, 2023 | GPULow-rank compression | —Unverified | 0 |
| Dynamic Contrastive Distillation for Image-Text Retrieval | Jul 4, 2022 | Contrastive LearningGPU | —Unverified | 0 |
| Binarized Convolutional Neural Networks for Efficient Inference on GPUs | Aug 1, 2018 | BinarizationGPU | —Unverified | 0 |
| A brief survey on deep belief networks and introducing a new object oriented toolbox (DeeBNet) | Aug 14, 2014 | General ClassificationGPU | —Unverified | 0 |
| DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference | Jun 2, 2023 | Collaborative InferenceCPU | —Unverified | 0 |
| DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs | Jun 25, 2025 | GPU | —Unverified | 0 |
| HAFLO: GPU-Based Acceleration for Federated Logistic Regression | Jul 29, 2021 | Federated LearningGPU | —Unverified | 0 |
| LANA: Latency Aware Network Acceleration | Jul 12, 2021 | CPUGPU | —Unverified | 0 |
| AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks | Feb 28, 2025 | CPUGPU | —Unverified | 0 |