| MegDet: A Large Mini-Batch Object Detector | Nov 20, 2017 | GPU, Object | Code Available | 1 |
| A Batch Noise Contrastive Estimation Approach for Training Large Vocabulary Language Models | Aug 20, 2017 | GPU, Text Compression | Code Available | 1 |
| SSH: Single Stage Headless Face Detector | Aug 14, 2017 | General Classification, GPU | Code Available | 1 |
| Deep Architectures for Neural Machine Translation | Jul 24, 2017 | Decoder, GPU | Code Available | 1 |
| Rgtsvm: Support Vector Machines on a GPU in R | Jun 17, 2017 | General Classification, GPU | Code Available | 1 |
| LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation | Jun 14, 2017 | GPU, Scene Understanding | Code Available | 1 |
| Working hard to know your neighbor's margins: Local descriptor learning loss | May 30, 2017 | GPU, Image Retrieval | Code Available | 1 |
| Convolutional Sequence to Sequence Learning | May 8, 2017 | Bangla Spelling Error Correction, CPU | Code Available | 1 |
| OptNet: Differentiable Optimization as a Layer in Neural Networks | Mar 1, 2017 | Bilevel Optimization, GPU | Code Available | 1 |
| Fast and Accurate Entity Recognition with Iterated Dilated Convolutions | Feb 7, 2017 | Computational Efficiency, GPU | Code Available | 1 |
| BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images | Sep 21, 2016 | Bayesian Inference, CPU | Code Available | 1 |
| Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network | Sep 16, 2016 | GPU, Image Super-Resolution | Code Available | 1 |
| Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising | Aug 13, 2016 | Color Image Denoising, Denoising | Code Available | 1 |
| A Generic Inverted Index Framework for Similarity Search on the GPU - Technical Report | Mar 28, 2016 | GPU | Code Available | 1 |
| Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 | Feb 9, 2016 | GPU | Code Available | 1 |
| Asynchronous Methods for Deep Reinforcement Learning | Feb 4, 2016 | Atari Games, CPU | Code Available | 1 |
| Bottom-Up and Top-Down Reasoning with Hierarchical Rectified Gaussians | Jul 21, 2015 | GPU, Pose Estimation | Code Available | 1 |
| Towards Good Practices for Very Deep Two-Stream ConvNets | Jul 8, 2015 | Action Recognition, Action Recognition In Videos | Code Available | 1 |
| Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks | Jun 4, 2015 | 2D Object Detection, GPU | Code Available | 1 |
| Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs | Dec 22, 2014 | GPU, image-classification | Code Available | 1 |
| Real-Time Grasp Detection Using Convolutional Neural Networks | Dec 9, 2014 | General Classification, GPU | Code Available | 1 |
| CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning | Jul 18, 2025 | Code Generation, GPU | Unverified | 0 |
| DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model | Jul 17, 2025 | GPU, Monocular Visual Odometry | Unverified | 0 |
| Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models | Jul 17, 2025 | Denoising, GPU | Unverified | 0 |
| Kevin: Multi-Turn RL for Generating CUDA Kernels | Jul 16, 2025 | GPU, Reinforcement Learning (RL) | Unverified | 0 |
| Lightweight Model for Poultry Disease Detection from Fecal Images Using Multi-Color Space Feature Optimization and Machine Learning | Jul 14, 2025 | Computational Efficiency, Dimensionality Reduction | Unverified | 0 |
| DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Jul 14, 2025 | Decoder, GPU | Code Available | 0 |
| Scaling Attention to Very Long Sequences in Linear Time with Wavelet-Enhanced Random Spectral Attention (WERSA) | Jul 11, 2025 | GPU | Code Available | 0 |
| HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation | Jul 10, 2025 | GPU, Image Segmentation | Unverified | 0 |
| From large-eddy simulations to deep learning: A U-net model for fast urban canopy flow predictions | Jul 9, 2025 | GPU, L2 Regularization | Code Available | 0 |
| Artificial Generals Intelligence: Mastering Generals.io with Reinforcement Learning | Jul 9, 2025 | GPU, Multi-agent Reinforcement Learning | Unverified | 0 |
| Diffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data | Jul 8, 2025 | Dataset Condensation, GPU | Unverified | 0 |
| Real-Time Graph-based Point Cloud Networks on FPGAs via Stall-Free Deep Pipelining | Jul 7, 2025 | GPU | Code Available | 0 |
| SketchColour: Channel Concat Guided DiT-based Sketch-to-Colour Pipeline for 2D Animation | Jul 2, 2025 | GPU | Unverified | 0 |
| LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs | Jul 2, 2025 | CPU, GPU | Unverified | 0 |
| Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation | Jun 26, 2025 | GPU, Image Generation | Unverified | 0 |
| DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs | Jun 25, 2025 | GPU | Unverified | 0 |
| GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization | Jun 25, 2025 | GPU | Unverified | 0 |
| Omniwise: Predicting GPU Kernels Performance with LLMs | Jun 25, 2025 | GPU | Unverified | 0 |
| Virtual Memory for 3D Gaussian Splatting | Jun 24, 2025 | GPU, Novel View Synthesis | Unverified | 0 |
| Scaling Speculative Decoding with Lookahead Reasoning | Jun 24, 2025 | GPU, GSM8K | Code Available | 0 |
| TDACloud: Point Cloud Recognition Using Topological Data Analysis | Jun 23, 2025 | Autonomous Driving, GPU | Unverified | 0 |
| Let Your Video Listen to Your Music! | Jun 23, 2025 | GPU, Music Generation | Unverified | 0 |
| 4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time | Jun 23, 2025 | 4D reconstruction, GPU | Unverified | 0 |
| Survey of HPC in US Research Institutions | Jun 23, 2025 | Benchmarking, GPU | Unverified | 0 |
| Lightweight RGB-T Tracking with Mobile Vision Transformers | Jun 23, 2025 | GPU, Object Tracking | Unverified | 0 |
| Collaborative Texture Filtering | Jun 21, 2025 | GPU | Unverified | 0 |
| Beyond Blur: A Fluid Perspective on Generative Diffusion Models | Jun 20, 2025 | Diversity, GPU | Unverified | 0 |
| VeriLocc: End-to-End Cross-Architecture Register Allocation via LLM | Jun 20, 2025 | GPU | Unverified | 0 |
| Speeding up Local Optimization in Vehicle Routing with Tensor-based GPU Acceleration | Jun 20, 2025 | Attribute, Computational Efficiency | Unverified | 0 |