| An Efficient MCMC Approach to Energy Function Optimization in Protein Structure Prediction | Nov 6, 2022 | CPUDrug Design | CodeCode Available | 0 | 5 |
| Places205-VGGNet Models for Scene Recognition | Aug 7, 2015 | Computational EfficiencyGPU | CodeCode Available | 0 | 5 |
| Monte Carlo Convolution for Learning on Non-Uniformly Sampled Point Clouds | Jun 5, 2018 | GPUPoint Cloud Segmentation | CodeCode Available | 0 | 5 |
| 3D Anisotropic Hybrid Network: Transferring Convolutional Features from 2D Images to 3D Anisotropic Volumes | Nov 23, 2017 | GPULesion Detection | CodeCode Available | 0 | 5 |
| Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | Aug 10, 2023 | Decision MakingGPU | CodeCode Available | 0 | 5 |
| Multi-scale fully convolutional neural networks for histopathology image segmentation: from nuclear aberrations to the global tissue architecture | Sep 24, 2019 | GPUImage Segmentation | CodeCode Available | 0 | 5 |
| Efficient Multi-Organ Segmentation Using SpatialConfiguration-Net with Low GPU Memory Requirements | Nov 26, 2021 | GPUOrgan Segmentation | CodeCode Available | 0 | 5 |
| Efficient MPI-based Communication for GPU-Accelerated Dask Applications | Jan 21, 2021 | BlockingCPU | CodeCode Available | 0 | 5 |
| MoGA: Searching Beyond MobileNetV3 | Aug 4, 2019 | AutoMLCPU | CodeCode Available | 0 | 5 |
| MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching | Mar 12, 2025 | GPU | CodeCode Available | 0 | 5 |
| An Efficient Inference Frame for SMLM (Single-Molecule Localization Microscopy) | Oct 3, 2024 | Deep LearningGPU | CodeCode Available | 0 | 5 |
| Bridging the Gap of AutoGraph between Academia and Industry: Analysing AutoGraph Challenge at KDD Cup 2020 | Apr 6, 2022 | AutoMLGPU | CodeCode Available | 0 | 5 |
| MODNet-V: Improving Portrait Video Matting via Background Restoration | Sep 24, 2021 | GPUImage Matting | CodeCode Available | 0 | 5 |
| KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial Recomputation | Nov 26, 2024 | CPUGPU | CodeCode Available | 0 | 5 |
| Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task | Jul 9, 2024 | GPUText-to-Video Generation | CodeCode Available | 0 | 5 |
| Bridge the Gap Between Architecture Spaces via A Cross-Domain Predictor | Nov 1, 2022 | GPUNeural Architecture Search | CodeCode Available | 0 | 5 |
| MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU | Jun 3, 2017 | Activity RecognitionGPU | CodeCode Available | 0 | 5 |
| Accelerating Deterministic and Stochastic Binarized Neural Networks on FPGAs Using OpenCL | May 15, 2019 | GPU | CodeCode Available | 0 | 5 |
| Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Apr 9, 2021 | GPULanguage Modeling | CodeCode Available | 0 | 5 |
| Bridging Data Center AI Systems with Edge Computing for Actionable Information Retrieval | May 28, 2021 | BIG-bench Machine LearningEdge-computing | CodeCode Available | 0 | 5 |
| Efficient Large-scale Approximate Nearest Neighbor Search on the GPU | Feb 20, 2017 | CPUGPU | CodeCode Available | 0 | 5 |
| A Computing Kernel for Network Binarization on PyTorch | Nov 11, 2019 | BinarizationCPU | CodeCode Available | 0 | 5 |
| Efficient Joint Learning for Clinical Named Entity Recognition and Relation Extraction Using Fourier Networks: A Use Case in Adverse Drug Events | Feb 8, 2023 | GPUnamed-entity-recognition | CodeCode Available | 0 | 5 |
| Breast-NET: a lightweight DCNN model for breast cancer detection and grading using histological samples | Aug 10, 2024 | Breast Cancer DetectionBreast Cancer Histology Image Classification | CodeCode Available | 0 | 5 |
| CLTune: A Generic Auto-Tuner for OpenCL Kernels | Mar 19, 2017 | GPURolling Shutter Correction | CodeCode Available | 0 | 5 |
| At-Scale Sparse Deep Neural Network Inference with Efficient GPU Implementation | Jul 28, 2020 | GPU | CodeCode Available | 0 | 5 |
| An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector | Sep 4, 2019 | Data AugmentationGPU | CodeCode Available | 0 | 5 |
| MobileDets: Searching for Object Detection Architectures for Mobile Accelerators | Apr 30, 2020 | CPUGPU | CodeCode Available | 0 | 5 |
| MLitB: Machine Learning in the Browser | Dec 8, 2014 | BIG-bench Machine LearningDistributed Computing | CodeCode Available | 0 | 5 |
| MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network | Jun 24, 2024 | GPU | CodeCode Available | 0 | 5 |
| Efficient Gender Classification Using a Deep LDA-Pruned Net | Apr 20, 2017 | ClassificationGender Classification | CodeCode Available | 0 | 5 |
| Efficient Featurized Image Pyramid Network for Single Shot Detector | Jun 1, 2019 | GPU | CodeCode Available | 0 | 5 |
| Efficient Distillation of Classifier-Free Guidance using Adapters | Mar 10, 2025 | GPU | CodeCode Available | 0 | 5 |
| Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Oct 14, 2024 | Autonomous DrivingGPU | CodeCode Available | 0 | 5 |
| MIOpen: An Open Source Library For Deep Learning Primitives | Sep 30, 2019 | Deep LearningGPU | CodeCode Available | 0 | 5 |
| Efficient Differentiable Approximation of Generalized Low-rank Regularization | May 21, 2025 | GPU | CodeCode Available | 0 | 5 |
| Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs | Feb 10, 2025 | GPU | CodeCode Available | 0 | 5 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 | 5 |
| Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | Nov 11, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Anchor Space Optimal Transport as a Fast Solution to Multiple Optimal Transport Problems | Oct 24, 2023 | GPU | CodeCode Available | 0 | 5 |
| Efficient Deep Learning for Stereo Matching | Jun 1, 2016 | Deep LearningGeneral Classification | CodeCode Available | 0 | 5 |
| Anchors no more: Using peculiar velocities to constrain H_0 and the primordial Universe without calibrators | Apr 14, 2025 | GPU | CodeCode Available | 0 | 5 |
| Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference | Mar 11, 2025 | GPU | CodeCode Available | 0 | 5 |
| ALTIS: Modernizing GPGPU Benchmarking | Jun 25, 2019 | BenchmarkingGPU | CodeCode Available | 0 | 5 |
| Efficient ConvNet for Real-time Semantic Segmentation | Jun 1, 2017 | GPUReal-Time Semantic Segmentation | CodeCode Available | 0 | 5 |
| Efficient Constituency Tree based Encoding for Natural Language to Bash Translation | Jul 1, 2022 | CPUGPU | CodeCode Available | 0 | 5 |
| Bottleneck Analysis of Dynamic Graph Neural Network Inference on CPU and GPU | Oct 8, 2022 | CPUDiversity | CodeCode Available | 0 | 5 |
| FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU | Apr 17, 2024 | Face RecognitionGPU | CodeCode Available | 0 | 5 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPULanguage Modeling | CodeCode Available | 0 | 5 |
| Efficient brain age prediction from 3D MRI volumes using 2D projections | Nov 10, 2022 | GPU | CodeCode Available | 0 | 5 |