| An Efficient Large Recommendation Model: Towards a Resource-Optimal Scaling Law | Feb 14, 2025 | Feature CompressionGPU | —Unverified | 0 |
| Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models | Apr 15, 2025 | DenoisingGPU | —Unverified | 0 |
| Accelerated Fingerprint Enhancement: A GPU-Optimized Mixed Architecture Approach | Jun 1, 2023 | GPU | —Unverified | 0 |
| Bringing regularized optimal transport to lightspeed: a splitting method adapted for GPUs | May 29, 2023 | Domain AdaptationGPU | —Unverified | 0 |
| GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable | Apr 10, 2025 | GPUMath | —Unverified | 0 |
| Efficient Memory Management for GPU-based Deep Learning Systems | Feb 19, 2019 | CPUDeep Learning | —Unverified | 0 |
| Efficient Machine Translation with Model Pruning and Quantization | Nov 1, 2021 | CPUDecoder | —Unverified | 0 |
| MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | Jul 16, 2024 | CPUGPU | —Unverified | 0 |
| Brief Announcement: On the Limits of Parallelizing Convolutional Neural Networks on GPUs | May 28, 2020 | GPU | —Unverified | 0 |
| A Convolutional Neural Network Cascade for Face Detection | Jun 1, 2015 | CPUFace Detection | —Unverified | 0 |