| Dynamic Sampling Rate: Harnessing Frame Coherence in Graphics Applications for Energy-Efficient GPUs | Feb 21, 2022 | GPU | —Unverified | 0 |
| Survey on Large Scale Neural Network Training | Feb 21, 2022 | GPUSurvey | —Unverified | 0 |
| Enabling On-Device Smartphone GPU based Training: Lessons Learned | Feb 21, 2022 | CPUGPU | —Unverified | 0 |
| Distributed Out-of-Memory NMF on CPU/GPU Architectures | Feb 19, 2022 | CPUDimensionality Reduction | CodeCode Available | 1 |
| Single UHD Image Dehazing via Interpretable Pyramid Network | Feb 17, 2022 | 4kGPU | CodeCode Available | 1 |
| BB-ML: Basic Block Performance Prediction using Machine Learning Techniques | Feb 16, 2022 | BIG-bench Machine LearningGPU | —Unverified | 0 |
| Aryl: An Elastic Cluster Scheduler for Deep Learning | Feb 16, 2022 | Deep LearningGPU | —Unverified | 0 |
| HiMA: A Fast and Scalable History-based Memory Access Engine for Differentiable Neural Computer | Feb 15, 2022 | GPU | —Unverified | 0 |
| Benchmarking of DL Libraries and Models on Mobile Devices | Feb 14, 2022 | BenchmarkingGPU | CodeCode Available | 1 |
| Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores | Feb 11, 2022 | GPUScheduling | —Unverified | 0 |
| FL_PyTorch: optimization research simulator for federated learning | Feb 7, 2022 | Federated LearningGPU | CodeCode Available | 1 |
| MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks | Feb 4, 2022 | GPU | CodeCode Available | 1 |
| The Ecological Footprint of Neural Machine Translation Systems | Feb 4, 2022 | GPUMachine Translation | CodeCode Available | 0 |
| Towards Training Reproducible Deep Learning Models | Feb 4, 2022 | Deep LearningGPU | CodeCode Available | 0 |
| Harmony: Overcoming the Hurdles of GPU Memory Capacity to Train Massive DNN Models on Commodity Servers | Feb 2, 2022 | CPUGPU | CodeCode Available | 1 |
| Accelerated Quality-Diversity through Massive Parallelism | Feb 2, 2022 | DiversityGPU | CodeCode Available | 2 |
| Giga-scale Kernel Matrix Vector Multiplication on GPU | Feb 2, 2022 | CPUGPU | CodeCode Available | 0 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 |
| Accelerating DNN Training with Structured Data Gradient Pruning | Feb 1, 2022 | GPU | CodeCode Available | 1 |
| Computational Scatter Correction for High-Resolution Flat-Panel CT Based on a Fast Monte Carlo Photon Transport Model | Jan 31, 2022 | Computed Tomography (CT)CT Reconstruction | —Unverified | 0 |
| SPDY: Accurate Pruning with Speedup Guarantees | Jan 31, 2022 | GPUModel Compression | CodeCode Available | 1 |
| Combining Local and Global Pose Estimation for Precise Tracking of Similar Objects | Jan 31, 2022 | GPUObject | —Unverified | 0 |
| Benchmarking Resource Usage for Efficient Distributed Deep Learning | Jan 28, 2022 | BenchmarkingDeep Learning | —Unverified | 0 |
| Prediction of GPU Failures Under Deep Learning Workloads | Jan 27, 2022 | Deep LearningGPU | —Unverified | 0 |
| ASFD: Automatic and Scalable Face Detector | Jan 26, 2022 | Face DetectionGPU | —Unverified | 0 |