| High-Throughput SAT Sampling | Feb 12, 2025 | GPUvalid | CodeCode Available | 0 | 5 |
| High-Resolution Deep Convolutional Generative Adversarial Networks | Nov 17, 2017 | GPUImage Generation | CodeCode Available | 0 | 5 |
| Fast ES-RNN: A GPU Implementation of the ES-RNN Algorithm | Jul 7, 2019 | GPUTime Series | CodeCode Available | 0 | 5 |
| Parallel and in-process compilation of individuals for genetic programming on GPU | May 21, 2017 | GPU | CodeCode Available | 0 | 5 |
| Automatic Differentiation in PyTorch | Oct 28, 2017 | ClusteringCPU | CodeCode Available | 0 | 5 |
| Parallel Hyperparameter Optimization Of Spiking Neural Network | Mar 1, 2024 | Bayesian OptimizationGPU | CodeCode Available | 0 | 5 |
| ILP-M Conv: Optimize Convolution Algorithm for Single-Image Convolution Neural Network Inference on Mobile GPUs | Sep 6, 2019 | GPU | CodeCode Available | 0 | 5 |
| Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels | May 31, 2019 | CPUGPU | CodeCode Available | 0 | 5 |
| High Performance Computing Applied to Logistic Regression: A CPU and GPU Implementation Comparison | Aug 19, 2023 | Binary ClassificationCPU | CodeCode Available | 0 | 5 |
| Higher-Order Ratio Cycles for Fast and Globally Optimal Shape Matching | Jan 1, 2025 | GPUImage Segmentation | CodeCode Available | 0 | 5 |
| HighEr-Resolution Network for Image Demosaicing and Enhancing | Nov 19, 2019 | DemosaickingGPU | CodeCode Available | 0 | 5 |
| High-quality Task Division for Large-scale Entity Alignment | Aug 22, 2022 | Entity AlignmentGPU | CodeCode Available | 0 | 5 |
| Improving the Neural GPU Architecture for Algorithm Learning | Feb 28, 2017 | GPU | CodeCode Available | 0 | 5 |
| Posterior-Guided Neural Architecture Search | Jun 23, 2019 | GPUimage-classification | CodeCode Available | 0 | 5 |
| Faster object tracking pipeline for real time tracking | Nov 8, 2020 | GPUMulti-Object Tracking | —Unverified | 0 | 0 |
| Comparative Analysis of Open Source Frameworks for Machine Learning with Use Case in Single-Threaded and Multi-Threaded Modes | Jun 7, 2017 | BIG-bench Machine LearningCPU | —Unverified | 0 | 0 |
| Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning | Nov 19, 2024 | GPU | —Unverified | 0 | 0 |
| Architecture Search of Dynamic Cells for Semantic Video Segmentation | Apr 4, 2019 | GPUNeural Architecture Search | —Unverified | 0 | 0 |
| Faster Inference of Integer SWIN Transformer by Removing the GELU Activation | Feb 2, 2024 | GPUimage-classification | —Unverified | 0 | 0 |
| BNAS-v2: Memory-efficient and Performance-collapse-prevented Broad Neural Architecture Search | Sep 18, 2020 | GPUNeural Architecture Search | —Unverified | 0 | 0 |
| Comparative Analysis of CPU and GPU Profiling for Deep Learning Models | Sep 5, 2023 | CPUGPU | —Unverified | 0 | 0 |
| Faster and Smarter AutoAugment: Augmentation Policy Search Based on Dynamic Data-Clustering | Jan 1, 2021 | Autonomous DrivingClustering | —Unverified | 0 | 0 |
| Compact Neural Network Solutions to Laplace's Equation in a Nanofluidic Device | Oct 20, 2018 | GPU | —Unverified | 0 | 0 |
| Architectural Implications of Embedding Dimension during GCN on CPU and GPU | Dec 1, 2022 | CPUGPU | —Unverified | 0 | 0 |
| Fast Distributed Inference Serving for Large Language Models | May 10, 2023 | BlockingGPU | —Unverified | 0 | 0 |
| Semi-Dynamic Load Balancing: Efficient Distributed Learning in Non-Dedicated Environments | Jun 7, 2018 | CPUGPU | —Unverified | 0 | 0 |
| Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective | Feb 2, 2023 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| CompAct: Compressed Activations for Memory-Efficient LLM Training | Oct 20, 2024 | GPU | —Unverified | 0 | 0 |
| Fast DCTTS: Efficient Deep Convolutional Text-to-Speech | Apr 1, 2021 | Computational EfficiencyCPU | —Unverified | 0 | 0 |
| Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models | Oct 9, 2024 | GPU | —Unverified | 0 | 0 |
| Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization | Oct 19, 2021 | CPUGPU | —Unverified | 0 | 0 |
| Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Feb 11, 2025 | Autonomous DrivingComputational Efficiency | —Unverified | 0 | 0 |
| FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs | Dec 30, 2024 | GPUGraph Neural Network | —Unverified | 0 | 0 |
| Communication Optimization for Distributed Training: Architecture, Advances, and Opportunities | Mar 12, 2024 | GPU | —Unverified | 0 | 0 |
| Communication-Free Distributed GNN Training with Vertex Cut | Aug 6, 2023 | GPUNode Classification | —Unverified | 0 | 0 |
| Fast Back-Projection for Non-Line of Sight Reconstruction | Mar 6, 2017 | GPU | —Unverified | 0 | 0 |
| Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising | Jan 5, 2022 | Click-Through Rate PredictionCPU | —Unverified | 0 | 0 |
| ARAP-GS: Drag-driven As-Rigid-As-Possible 3D Gaussian Splatting Editing with Diffusion Prior | Apr 17, 2025 | 3DGSGPU | —Unverified | 0 | 0 |
| FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | Oct 22, 2024 | CPUGPU | —Unverified | 0 | 0 |
| Fast and Scalable Optimal Transport for Brain Tractograms | Jul 5, 2021 | GPU | —Unverified | 0 | 0 |
| A Random Gossip BMUF Process for Neural Language Modeling | Sep 19, 2019 | GPULanguage Modeling | —Unverified | 0 | 0 |
| Fast and Scalable Distributed Deep Convolutional Autoencoder for fMRI Big Data Analytics | Oct 24, 2017 | blind source separationDictionary Learning | —Unverified | 0 | 0 |
| Fast and Robust Hand Tracking Using Detection-Guided Optimization | Feb 12, 2016 | GPUPose Estimation | —Unverified | 0 | 0 |
| Fast and parallel decoding for transducer | Oct 31, 2022 | GPUspeech-recognition | —Unverified | 0 | 0 |
| Communication Contention Aware Scheduling of Multiple Deep Learning Training Jobs | Feb 24, 2020 | Deep LearningGPU | —Unverified | 0 | 0 |
| A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference | Nov 25, 2024 | CPUGPU | —Unverified | 0 | 0 |
| Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling | May 26, 2025 | GPUtext-to-speech | —Unverified | 0 | 0 |
| 3D helical CT Reconstruction with a Memory Efficient Learned Primal-Dual Architecture | May 24, 2022 | Computed Tomography (CT)CT Reconstruction | —Unverified | 0 | 0 |
| Fast and Efficient Once-For-All Networks for Diverse Hardware Deployment | Sep 29, 2021 | AllGPU | —Unverified | 0 | 0 |
| ComboNet: Combined 2D & 3D Architecture for Aorta Segmentation | Jun 9, 2020 | 3D ArchitectureGPU | —Unverified | 0 | 0 |