| Provably Bounding Neural Network Preimages | Feb 2, 2023 | Adversarial RobustnessGPU | CodeCode Available | 0 |
| 3D Anisotropic Hybrid Network: Transferring Convolutional Features from 2D Images to 3D Anisotropic Volumes | Nov 23, 2017 | GPULesion Detection | CodeCode Available | 0 |
| A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR Prediction | Nov 8, 2023 | BenchmarkingClick-Through Rate Prediction | CodeCode Available | 0 |
| Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part I | Sep 12, 2024 | BenchmarkingCPU | CodeCode Available | 0 |
| A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks | Dec 25, 2023 | GPUparameter-efficient fine-tuning | CodeCode Available | 0 |
| DeepShift: Towards Multiplication-Less Neural Networks | May 30, 2019 | Edge-computingGPU | CodeCode Available | 0 |
| Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction | Jul 4, 2021 | Atari GamesGPU | CodeCode Available | 0 |
| PruneTrain: Fast Neural Network Training by Dynamic Sparse Model Reconfiguration | Jan 26, 2019 | GPU | CodeCode Available | 0 |
| Deep Semantic Role Labeling with Self-Attention | Dec 5, 2017 | GPUNatural Language Understanding | CodeCode Available | 0 |
| Implementing a GPU-based parallel MAX-MIN Ant System | Jan 18, 2020 | Combinatorial OptimizationCPU | CodeCode Available | 0 |
| Implementation and Analysis of GPU Algorithms for Vecchia Approximation | Jul 3, 2024 | Gaussian ProcessesGPU | CodeCode Available | 0 |
| Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU | May 26, 2023 | CPUGPU | CodeCode Available | 0 |
| Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications | Aug 9, 2024 | Deep LearningGPU | CodeCode Available | 0 |
| YOLOX-PAI: An Improved YOLOX, Stronger and Faster than YOLOv6 | Aug 27, 2022 | GPUobject-detection | CodeCode Available | 0 |
| Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Oct 26, 2024 | CPUGPU | CodeCode Available | 0 |
| You Only Cache Once: Decoder-Decoder Architectures for Language Models | May 8, 2024 | DecoderGPU | CodeCode Available | 0 |
| Image Smoothing via Unsupervised Learning | Nov 7, 2018 | GPUImage Manipulation | CodeCode Available | 0 |
| Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs | Oct 29, 2024 | GPURecommendation Systems | CodeCode Available | 0 |
| Push: Concurrent Probabilistic Programming for Bayesian Deep Learning | Jun 10, 2023 | Bayesian InferenceDeep Learning | CodeCode Available | 0 |
| SpykeTorch: Efficient Simulation of Convolutional Spiking Neural Networks with at most one Spike per Neuron | Mar 6, 2019 | GPU | CodeCode Available | 0 |
| PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection | Aug 29, 2016 | CPUGeneral Classification | CodeCode Available | 0 |
| PVR: Patch-to-Volume Reconstruction for Large Area Motion Correction of Fetal MRI | Nov 22, 2016 | GPUMotion Compensation | CodeCode Available | 0 |
| ImageNet Training in Minutes | Sep 14, 2017 | 16kGPU | CodeCode Available | 0 |
| ImageNet Classification with Deep Convolutional Neural Networks | Dec 1, 2012 | General ClassificationGPU | CodeCode Available | 0 |
| DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimization in 3D-IC Design | Apr 4, 2025 | GPUKolmogorov-Arnold Networks | CodeCode Available | 0 |
| Image Classification with CondenseNeXt for ARM-Based Computing Platforms | Jun 26, 2021 | Autonomous DrivingClassification | CodeCode Available | 0 |
| PyHySCO: GPU-Enabled Susceptibility Artifact Distortion Correction in Seconds | Mar 15, 2024 | distortion correctionGPU | CodeCode Available | 0 |
| ICNet for Real-Time Semantic Segmentation on High-Resolution Images | Apr 27, 2017 | Dichotomous Image SegmentationGPU | CodeCode Available | 0 |
| Hyper-parameter Tuning for Adversarially Robust Models | Apr 5, 2023 | Adversarial RobustnessGPU | CodeCode Available | 0 |
| You Only Propagate Once: Accelerating Adversarial Training via Maximal Principle | May 2, 2019 | Adversarial DefenseGPU | CodeCode Available | 0 |
| HyP-DESPOT: A Hybrid Parallel Algorithm for Online Planning under Uncertainty | Feb 17, 2018 | Computational EfficiencyCPU | CodeCode Available | 0 |
| Pyro: Deep Universal Probabilistic Programming | Oct 18, 2018 | GPUProbabilistic Programming | CodeCode Available | 0 |
| A Neural Approach to Blind Motion Deblurring | Mar 15, 2016 | DeblurringGPU | CodeCode Available | 0 |
| Human-Level Control without Server-Grade Hardware | Nov 1, 2021 | Cloud ComputingCPU | CodeCode Available | 0 |
| HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens | May 29, 2020 | GPUNeural Architecture Search | CodeCode Available | 0 |
| Horovod: fast and easy distributed deep learning in TensorFlow | Feb 15, 2018 | Deep LearningGPU | CodeCode Available | 0 |
| Deep Neural Networks for Physics Analysis on low-level whole-detector data at the LHC | Nov 9, 2017 | CPUGPU | CodeCode Available | 0 |
| HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices | Nov 1, 2024 | Autonomous DrivingGPU | CodeCode Available | 0 |
| Deeply Learned Spectral Total Variation Decomposition | Jun 17, 2020 | GPU | CodeCode Available | 0 |
| Deep Learning Workload Scheduling in GPU Datacenters: Taxonomy, Challenges and Vision | May 24, 2022 | GPUScheduling | CodeCode Available | 0 |
| ILP-M Conv: Optimize Convolution Algorithm for Single-Image Convolution Neural Network Inference on Mobile GPUs | Sep 6, 2019 | GPU | CodeCode Available | 0 |
| Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment | May 2, 2024 | GPUNVIDIA Jetson Orin Nano | CodeCode Available | 0 |
| High-Throughput SAT Sampling | Feb 12, 2025 | GPUvalid | CodeCode Available | 0 |
| High-Resolution Deep Convolutional Generative Adversarial Networks | Nov 17, 2017 | GPUImage Generation | CodeCode Available | 0 |
| High-quality Task Division for Large-scale Entity Alignment | Aug 22, 2022 | Entity AlignmentGPU | CodeCode Available | 0 |
| DeepLearningKit - an GPU Optimized Deep Learning Framework for Apple's iOS, OS X and tvOS developed in Metal and Swift | May 15, 2016 | Deep LearningGPU | CodeCode Available | 0 |
| Towards Training Reproducible Deep Learning Models | Feb 4, 2022 | Deep LearningGPU | CodeCode Available | 0 |
| High Performance Computing Applied to Logistic Regression: A CPU and GPU Implementation Comparison | Aug 19, 2023 | Binary ClassificationCPU | CodeCode Available | 0 |
| HighEr-Resolution Network for Image Demosaicing and Enhancing | Nov 19, 2019 | DemosaickingGPU | CodeCode Available | 0 |
| word2ket: Space-efficient Word Embeddings inspired by Quantum Entanglement | Nov 12, 2019 | GPUWord Embeddings | CodeCode Available | 0 |