| Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning | Jan 3, 2022 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis | May 14, 2024 | 4kGPU | —Unverified | 0 |
| Challenges and Obstacles Towards Deploying Deep Learning Models on Mobile Devices | May 6, 2021 | Autonomous VehiclesDeep Learning | —Unverified | 0 |
| An OpenCL(TM) Deep Learning Accelerator on Arria 10 | Jan 13, 2017 | Deep LearningGPU | —Unverified | 0 |
| Finch: Prompt-guided Key-Value Cache Compression | Jul 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection | May 29, 2025 | GPUobject-detection | —Unverified | 0 |
| AdaCM: Adaptive ColorMLP for Real-Time Universal Photo-realistic Style Transfer | Dec 3, 2022 | 4kGPU | —Unverified | 0 |
| CenterAtt: Fast 2-stage Center Attention Network | Jun 19, 2021 | GPU | —Unverified | 0 |
| Energy efficiency in Edge TPU vs. embedded GPU for computer-aided medical imaging segmentation and classification | Nov 20, 2023 | ClassificationDiagnostic | —Unverified | 0 |
| Field Trial of a Flexible Real-time Software-defined GPU-based Optical Receiver | Nov 27, 2020 | GPU | —Unverified | 0 |
| FIKIT: Priority-Based Real-time GPU Multi-tasking Scheduling with Kernel Identification | Nov 17, 2023 | Cloud ComputingGPU | —Unverified | 0 |
| Finding Competitive Network Architectures Within a Day Using UCT | Dec 20, 2017 | GPUNeural Architecture Search | —Unverified | 0 |
| Findings of the WMT 2021 Shared Task on Efficient Translation | Nov 1, 2021 | CPUGPU | —Unverified | 0 |
| CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework | Jun 3, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | Feb 17, 2025 | BenchmarkingCode Summarization | —Unverified | 0 |
| FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | Jan 8, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Energy-based Tuning of Convolutional Neural Networks on Multi-GPUs | Aug 1, 2018 | GPUObject Recognition | —Unverified | 0 |
| CEEMS: A Resource Manager Agnostic Energy and Emissions Monitoring Stack | Dec 10, 2024 | CPUGPU | —Unverified | 0 |
| An NMF Perspective on Binary Hashing | Dec 1, 2015 | GPUQuantization | —Unverified | 0 |
| FFTLasso: Large-Scale LASSO in the Fourier Domain | Jul 1, 2017 | DenoisingDimensionality Reduction | —Unverified | 0 |
| CDPS: Constrained DTW-Preserving Shapelets | Sep 29, 2021 | ClusteringConstrained Clustering | —Unverified | 0 |
| Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention | Oct 18, 2021 | CPUEdge-computing | —Unverified | 0 |
| EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models | Sep 6, 2022 | BlockingGPU | —Unverified | 0 |
| Energy-Efficient Inference Accelerator for Memory-Augmented Neural Networks on an FPGA | May 21, 2018 | GPUQuestion Answering | —Unverified | 0 |
| AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | Jan 1, 2025 | GPUQuestion Answering | —Unverified | 0 |