| A Novel Low-cost FPGA-based Real-time Object Tracking System | Apr 16, 2018 | CPU, GPU | —Unverified | 0 |
| Characterizing Deep Learning Training Workloads on Alibaba-PAI | Oct 14, 2019 | Deep Learning, GPU | —Unverified | 0 |
| Characterizing Concurrency Mechanisms for NVIDIA GPUs under Deep Learning Workloads | Oct 1, 2021 | Deep Learning, GPU | —Unverified | 0 |
| A Novel Implicit Neural Representation for Volume Data | Mar 13, 2024 | GPU | —Unverified | 0 |
| Characterizing and Understanding HGNN Training on GPUs | Jul 16, 2024 | GPU, Recommendation Systems | —Unverified | 0 |
| Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures | Apr 16, 2025 | CPU, GPU | —Unverified | 0 |
| A Novel Framework for Neural Architecture Search in the Hill Climbing Domain | Feb 22, 2021 | GPU, Neural Architecture Search | —Unverified | 0 |
| Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and Algorithm Co-design | Sep 20, 2022 | CPU, GPU | —Unverified | 0 |
| A Novel DNN Training Framework via Data Sampling and Multi-Task Optimization | Jul 2, 2020 | GPU, Transfer Learning | —Unverified | 0 |
| Characterizing and Efficiently Accelerating Multimodal Generation Model Inference | Sep 30, 2024 | GPU, multimodal generation | —Unverified | 0 |
| A Novel Co-design Peta-scale Heterogeneous Cluster for Deep Learning Training | Feb 7, 2018 | GPU, Scheduling | —Unverified | 0 |
| Accelerating Deep Learning with Millions of Classes | Aug 1, 2020 | Classification, Deep Learning | —Unverified | 0 |
| Federated Fine-Tuning of LLMs on the Very Edge: The Good, the Bad, the Ugly | Oct 4, 2023 | Computational Efficiency, Edge-computing | —Unverified | 0 |
| FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model | Mar 29, 2024 | GPU, Pose Estimation | —Unverified | 0 |
| A Novel Breast Ultrasound Image Augmentation Method Using Advanced Neural Style Transfer: An Efficient and Explainable Approach | Oct 31, 2024 | GPU, Image Augmentation | —Unverified | 0 |
| Accelerating Clinical NLP at Scale with a Hybrid Framework with Reduced GPU Demands: A Case Study in Dementia Identification | Apr 16, 2025 | GPU | —Unverified | 0 |
| Channel Merging: Preserving Specialization for Merged Experts | Dec 18, 2024 | Code Generation, GPU | —Unverified | 0 |
| A Note on Deepfake Detection with Low-Resources | Jun 9, 2020 | DeepFake Detection, Face Swapping | —Unverified | 0 |
| 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Jul 9, 2024 | GPU | —Unverified | 0 |
| Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs | Mar 15, 2025 | GPU | —Unverified | 0 |
| An Optimized Union-Find Algorithm for Connected Components Labeling Using GPUs | Aug 28, 2017 | GPU | —Unverified | 0 |
| Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference | May 9, 2025 | CPU, GPU | —Unverified | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision Making, GPU | —Unverified | 0 |
| Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training | May 31, 2023 | GPU | —Unverified | 0 |
| Feature Pyramid Encoding Network for Real-time Semantic Segmentation | Sep 18, 2019 | Decoder, GPU | —Unverified | 0 |