| At-Scale Sparse Deep Neural Network Inference with Efficient GPU Implementation | Jul 28, 2020 | GPU | Code Available | 0 | 5 |
| An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector | Sep 4, 2019 | Data Augmentation, GPU | Code Available | 0 | 5 |
| MobileDets: Searching for Object Detection Architectures for Mobile Accelerators | Apr 30, 2020 | CPU, GPU | Code Available | 0 | 5 |
| MLitB: Machine Learning in the Browser | Dec 8, 2014 | BIG-bench Machine Learning, Distributed Computing | Code Available | 0 | 5 |
| MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network | Jun 24, 2024 | GPU | Code Available | 0 | 5 |
| Efficient Gender Classification Using a Deep LDA-Pruned Net | Apr 20, 2017 | Classification, Gender Classification | Code Available | 0 | 5 |
| Efficient Featurized Image Pyramid Network for Single Shot Detector | Jun 1, 2019 | GPU | Code Available | 0 | 5 |
| Efficient Distillation of Classifier-Free Guidance using Adapters | Mar 10, 2025 | GPU | Code Available | 0 | 5 |
| Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Oct 14, 2024 | Autonomous Driving, GPU | Code Available | 0 | 5 |
| MIOpen: An Open Source Library For Deep Learning Primitives | Sep 30, 2019 | Deep Learning, GPU | Code Available | 0 | 5 |
| Efficient Differentiable Approximation of Generalized Low-rank Regularization | May 21, 2025 | GPU | Code Available | 0 | 5 |
| Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs | Feb 10, 2025 | GPU | Code Available | 0 | 5 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational Efficiency, CPU | Code Available | 0 | 5 |
| Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | Nov 11, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Anchor Space Optimal Transport as a Fast Solution to Multiple Optimal Transport Problems | Oct 24, 2023 | GPU | Code Available | 0 | 5 |
| Efficient Deep Learning for Stereo Matching | Jun 1, 2016 | Deep Learning, General Classification | Code Available | 0 | 5 |
| Anchors no more: Using peculiar velocities to constrain H_0 and the primordial Universe without calibrators | Apr 14, 2025 | GPU | Code Available | 0 | 5 |
| Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference | Mar 11, 2025 | GPU | Code Available | 0 | 5 |
| ALTIS: Modernizing GPGPU Benchmarking | Jun 25, 2019 | Benchmarking, GPU | Code Available | 0 | 5 |
| Efficient ConvNet for Real-time Semantic Segmentation | Jun 1, 2017 | GPU, Real-Time Semantic Segmentation | Code Available | 0 | 5 |
| Efficient Constituency Tree based Encoding for Natural Language to Bash Translation | Jul 1, 2022 | CPU, GPU | Code Available | 0 | 5 |
| Bottleneck Analysis of Dynamic Graph Neural Network Inference on CPU and GPU | Oct 8, 2022 | CPU, Diversity | Code Available | 0 | 5 |
| FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU | Apr 17, 2024 | Face Recognition, GPU | Code Available | 0 | 5 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPU, Language Modeling | Code Available | 0 | 5 |
| Efficient brain age prediction from 3D MRI volumes using 2D projections | Nov 10, 2022 | GPU | Code Available | 0 | 5 |