| Human-Machine Collaborative Design for Accelerated Design of Compact Deep Neural Networks for Autonomous Driving | Sep 12, 2019 | Autonomous Driving, GPU | —Unverified | 0 |
| Hardware-conscious Hash-Joins on GPUs | Aug 11, 2019 | CPU, GPU | —Unverified | 0 |
| Deep Modulation Embedding | Feb 17, 2019 | GPU | —Unverified | 0 |
| L2PF -- Learning to Prune Faster | Jan 7, 2021 | Autonomous Driving, GPU | —Unverified | 0 |
| Hybrid Data-Model Parallel Training for Sequence-to-Sequence Recurrent Neural Network Machine Translation | Sep 2, 2019 | Decoder, GPU | —Unverified | 0 |
| L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference | Apr 24, 2025 | GPU | —Unverified | 0 |
| Hardware-Aware Graph Neural Network Automated Design for Edge Computing Platforms | Mar 20, 2023 | Edge-computing, GPU | —Unverified | 0 |
| Hardware and Software Platform Inference | Nov 7, 2024 | GPU, Large Language Model | —Unverified | 0 |
| Hybrid-Regressive Neural Machine Translation | Oct 19, 2022 | CPU, Decoder | —Unverified | 0 |
| DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network | Mar 5, 2023 | GPU, Image Classification | —Unverified | 0 |
| Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead | Dec 21, 2020 | Autonomous Driving, Benchmarking | —Unverified | 0 |
| Hardware Accelerator for Multi-Head Attention and Position-Wise Feed-Forward in the Transformer | Sep 18, 2020 | GPU, Position | —Unverified | 0 |
| A Hardware Evaluation Framework for Large Language Model Inference | Dec 5, 2023 | GPU, Language Modeling | —Unverified | 0 |
| Accelerating Sparse Graph Neural Networks with Tensor Core Optimization | Dec 16, 2024 | Computational Efficiency, GPU | —Unverified | 0 |
| DeRF: Decomposed Radiance Fields | Nov 25, 2020 | GPU, NeRF | —Unverified | 0 |
| Hardware Acceleration of LLMs: A comprehensive survey and comparison | Sep 5, 2024 | GPU, Survey | —Unverified | 0 |
| Hyper: Distributed Cloud Processing for Large-Scale Deep Learning Tasks | Oct 16, 2019 | CPU, Deep Learning | —Unverified | 0 |
| Hardware Acceleration of Lane Detection Algorithm: A GPU Versus FPGA Comparison | Dec 19, 2022 | Autonomous Driving, Edge Detection | —Unverified | 0 |
| Deep Local Video Feature for Action Recognition | Jan 25, 2017 | Action Recognition, GPU | —Unverified | 0 |
| HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection | Apr 3, 2016 | GPU, Object | —Unverified | 0 |
| DeServe: Towards Affordable Offline LLM Inference via Decentralization | Jan 4, 2025 | GPU, Language Modeling | —Unverified | 0 |
| Hardware Acceleration of Fully Quantized BERT for Efficient Natural Language Processing | Mar 4, 2021 | CPU, Edge-computing | —Unverified | 0 |
| Automatic Classification of Defective Photovoltaic Module Cells in Electroluminescence Images | Jul 8, 2018 | Anomaly Classification, GPU | —Unverified | 0 |
| Design equivariant neural networks for 3D point cloud | May 2, 2022 | GPU, Semantic Segmentation | —Unverified | 0 |
| KVDirect: Distributed Disaggregated LLM Inference | Dec 13, 2024 | GPU, Scheduling | —Unverified | 0 |