| MegDet: A Large Mini-Batch Object Detector | Nov 20, 2017 | GPUObject | CodeCode Available | 1 |
| A Batch Noise Contrastive Estimation Approach for Training Large Vocabulary Language Models | Aug 20, 2017 | GPUText Compression | CodeCode Available | 1 |
| SSH: Single Stage Headless Face Detector | Aug 14, 2017 | General ClassificationGPU | CodeCode Available | 1 |
| Deep Architectures for Neural Machine Translation | Jul 24, 2017 | DecoderGPU | CodeCode Available | 1 |
| Rgtsvm: Support Vector Machines on a GPU in R | Jun 17, 2017 | General ClassificationGPU | CodeCode Available | 1 |
| LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation | Jun 14, 2017 | GPUScene Understanding | CodeCode Available | 1 |
| Working hard to know your neighbor's margins: Local descriptor learning loss | May 30, 2017 | GPUImage Retrieval | CodeCode Available | 1 |
| Convolutional Sequence to Sequence Learning | May 8, 2017 | Bangla Spelling Error CorrectionCPU | CodeCode Available | 1 |
| OptNet: Differentiable Optimization as a Layer in Neural Networks | Mar 1, 2017 | Bilevel OptimizationGPU | CodeCode Available | 1 |
| Fast and Accurate Entity Recognition with Iterated Dilated Convolutions | Feb 7, 2017 | Computational EfficiencyGPU | CodeCode Available | 1 |
| BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images | Sep 21, 2016 | Bayesian InferenceCPU | CodeCode Available | 1 |
| Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network | Sep 16, 2016 | GPUImage Super-Resolution | CodeCode Available | 1 |
| Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising | Aug 13, 2016 | Color Image DenoisingDenoising | CodeCode Available | 1 |
| A Generic Inverted Index Framework for Similarity Search on the GPU - Technical Report | Mar 28, 2016 | GPU | CodeCode Available | 1 |
| Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 | Feb 9, 2016 | GPU | CodeCode Available | 1 |
| Asynchronous Methods for Deep Reinforcement Learning | Feb 4, 2016 | Atari GamesCPU | CodeCode Available | 1 |
| Bottom-Up and Top-Down Reasoning with Hierarchical Rectified Gaussians | Jul 21, 2015 | GPUPose Estimation | CodeCode Available | 1 |
| Towards Good Practices for Very Deep Two-Stream ConvNets | Jul 8, 2015 | Action RecognitionAction Recognition In Videos | CodeCode Available | 1 |
| Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks | Jun 4, 2015 | 2D Object DetectionGPU | CodeCode Available | 1 |
| Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs | Dec 22, 2014 | GPUimage-classification | CodeCode Available | 1 |
| Real-Time Grasp Detection Using Convolutional Neural Networks | Dec 9, 2014 | General ClassificationGPU | CodeCode Available | 1 |
| CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning | Jul 18, 2025 | Code GenerationGPU | —Unverified | 0 |
| DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model | Jul 17, 2025 | GPUMonocular Visual Odometry | —Unverified | 0 |
| Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models | Jul 17, 2025 | DenoisingGPU | —Unverified | 0 |
| Kevin: Multi-Turn RL for Generating CUDA Kernels | Jul 16, 2025 | GPUReinforcement Learning (RL) | —Unverified | 0 |