| Bottleneck Analysis of Dynamic Graph Neural Network Inference on CPU and GPU | Oct 8, 2022 | CPUDiversity | CodeCode Available | 0 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPUzero-shot-classification | CodeCode Available | 0 |
| Learning Compression from Limited Unlabeled Data | Sep 1, 2018 | CPUGPU | CodeCode Available | 0 |
| Distributed Hierarchical GPU Parameter Server for Massive Scale Deep Learning Ads Systems | Mar 12, 2020 | CPUGPU | CodeCode Available | 0 |
| Solving ill-posed inverse problems using iterative deep neural networks | Apr 13, 2017 | GPU | CodeCode Available | 0 |
| BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet | May 27, 2017 | CPUGPU | CodeCode Available | 0 |
| Learning-based Application-Agnostic 3D NoC Design for Heterogeneous Manycore Systems | Oct 20, 2018 | CPUGPU | CodeCode Available | 0 |
| APE: A Data-Centric Benchmark for Efficient LLM Adaptation in Text Summarization | May 26, 2025 | GPUNews Summarization | CodeCode Available | 0 |
| Distributed Extra-gradient with Optimal Complexity and Communication Guarantees | Aug 17, 2023 | GPU | CodeCode Available | 0 |
| Characteristic Performance Study on Solving Oscillator ODEs via Soft-constrained Physics-informed Neural Network with Small Data | Aug 19, 2024 | CPUGPU | CodeCode Available | 0 |
| Solving the Resource Constrained Project Scheduling Problem Using the Parallel Tabu Search Designed for the CUDA Platform | Nov 13, 2017 | CPUGPU | CodeCode Available | 0 |
| EENA: Efficient Evolution of Neural Architecture | May 10, 2019 | General ClassificationGPU | CodeCode Available | 0 |
| YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems | Jul 26, 2023 | BenchmarkingCPU | CodeCode Available | 0 |
| A parallelized cellular Potts model that enables simulations at tissue scale | Dec 14, 2023 | GPU | CodeCode Available | 0 |
| Distilling the Knowledge of Romanian BERTs Using Multiple Teachers | Dec 23, 2021 | Dialect IdentificationGPU | CodeCode Available | 0 |
| Distilled GPT for Source Code Summarization | Aug 28, 2023 | Code SummarizationGPU | CodeCode Available | 0 |
| 4D-ROLLS: 4D Radar Occupancy Learning via LiDAR Supervision | May 20, 2025 | Autonomous VehiclesBEV Segmentation | CodeCode Available | 0 |
| Permutohedral Attention Module for Efficient Non-Local Neural Networks | Jul 1, 2019 | GPUSegmentation | CodeCode Available | 0 |
| PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs | Dec 23, 2023 | GPU | CodeCode Available | 0 |
| Learned D-AMP: Principled Neural Network based Compressive Image Recovery | Apr 21, 2017 | DenoisingGPU | CodeCode Available | 0 |
| AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles | Apr 30, 2019 | CPUGPU | CodeCode Available | 0 |
| Distance-Based Tree-Sliced Wasserstein Distance | Mar 14, 2025 | Computational EfficiencyGPU | CodeCode Available | 0 |
| Disjunctive Normal Networks | Dec 30, 2014 | General ClassificationGPU | CodeCode Available | 0 |
| BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget | Jun 10, 2019 | GPU | CodeCode Available | 0 |
| StructADMM: A Systematic, High-Efficiency Framework of Structured Weight Pruning for DNNs | Jul 29, 2018 | CPUGPU | CodeCode Available | 0 |