| dMath: Distributed Linear Algebra for DL | Nov 19, 2016 | GPU, Management | Unverified | 0 |
| Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU | Nov 18, 2016 | CPU, GPU | Code Available | 0 |
| End-to-end Learning of Cost-Volume Aggregation for Real-time Dense Stereo | Nov 17, 2016 | Deep Learning, GPU | Unverified | 0 |
| Guidefill: GPU Accelerated, Artist Guided Geometric Inpainting for 3D Conversion | Nov 16, 2016 | GPU | Unverified | 0 |
| Deep Convolutional Neural Network for Inverse Problems in Imaging | Nov 11, 2016 | GPU | Unverified | 0 |
| DiffSharp: An AD Library for .NET Languages | Nov 10, 2016 | GPU | Unverified | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | Nov 5, 2016 | CPU, Deep Learning | Unverified | 0 |
| GPU-based Pedestrian Detection for Autonomous Driving | Nov 5, 2016 | Autonomous Driving, CPU | Unverified | 0 |
| Extensions and Limitations of the Neural GPU | Nov 2, 2016 | GPU | Code Available | 0 |
| LightRNN: Memory and Computation-Efficient Recurrent Neural Networks | Oct 31, 2016 | GPU, Language Modeling | Unverified | 0 |
| TensorLy: Tensor Learning in Python | Oct 29, 2016 | CPU, GPU | Code Available | 0 |
| CuMF_SGD: Fast and Scalable Matrix Factorization | Oct 19, 2016 | CPU, GPU | Code Available | 0 |
| Streaming Normalization: Towards Simpler and More Biologically-plausible Normalizations for Online and Recurrent Learning | Oct 19, 2016 | GPU | Unverified | 0 |
| GPU-accelerated real-time stixel computation | Oct 13, 2016 | GPU | Code Available | 0 |
| Embedded real-time stereo estimation via Semi-Global Matching on the GPU | Oct 13, 2016 | Autonomous Vehicles, Disparity Estimation | Code Available | 0 |
| Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs | Oct 12, 2016 | Computational Efficiency, GPU | Unverified | 0 |
| SaberLDA: Sparsity-Aware Learning of Topic Models on GPUs | Oct 8, 2016 | CPU, GPU | Unverified | 0 |
| Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks | Sep 30, 2016 | General Classification, GPU | Code Available | 0 |
| Latent fingerprint minutia extraction using fully convolutional network | Sep 30, 2016 | GPU | Unverified | 0 |
| Training a Feedback Loop for Hand Pose Estimation | Sep 30, 2016 | GPU, Hand Pose Estimation | Unverified | 0 |
| Comprehensive Evaluation of OpenCL-based Convolutional Neural Network Accelerators in Xilinx and Altera FPGAs | Sep 29, 2016 | CPU, GPU | Unverified | 0 |
| Fast Single Shot Detection and Pose Estimation | Sep 19, 2016 | GPU, Object Tracking | Unverified | 0 |
| Poisson Noise Reduction with Higher-order Natural Image Prior Model | Sep 19, 2016 | Denoising, GPU | Unverified | 0 |
| Tracking Tensor Subspaces with Informative Random Sampling for Real-Time MR Imaging | Sep 14, 2016 | Diagnostic, GPU | Unverified | 0 |
| The CUDA LATCH Binary Descriptor: Because Sometimes Faster Means Better | Sep 13, 2016 | GPU | Code Available | 0 |