SOTAVerified

GPU

Papers

Showing 26012650 of 5629 papers

TitleStatusHype
DeepSperm: A robust and real-time bull sperm-cell detection in densely populated semen videos0
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks0
Recipes for Pre-training LLMs with MXFP80
iServe: An Intent-based Serving System for LLMs0
DeepSolarEye: Power Loss Prediction and Weakly Supervised Soiling Localization via Fully Convolutional Networks for Solar Panels0
Hierarchical Temporal Convolutional Networks for Dynamic Recommender Systems0
A High-Throughput Solver for Marginalized Graph Kernels on GPU0
Accelerating Sparse Matrix Operations in Neural Networks on Graphics Processing Units0
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach0
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU0
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models0
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline0
HGMR: Hierarchical Gaussian Mixtures for Adaptive 3D Registration0
Deep SCNN-based Real-time Object Detection for Self-driving Vehicles Using LiDAR Temporal Data0
AutoML for Multilayer Perceptron and FPGA Co-design0
Invertible Learned Primal-Dual0
Deep Scattering: Rendering Atmospheric Clouds with Radiance-Predicting Neural Networks0
DeepScale: Online Frame Size Adaptation for Multi-object Tracking on Smart Cameras and Edge Servers0
Automating Neural Architecture Design without Search0
Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach0
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs0
Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups0
A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition0
HessFormer: Hessians at Foundation Scale0
HEPPO: Hardware-Efficient Proximal Policy Optimization -- A Universal Pipelined Architecture for Generalized Advantage Estimation0
Deep Rigid Instance Scene Flow0
Automatic Skull Reconstruction by Deep Learnable Symmetry Enforcement0
A higher-order MRF based variational model for multiplicative noise reduction0
Deep operator network models for predicting post-burn contraction0
Automatic Segmentation of Pulmonary Lobes Using a Progressive Dense V-Network0
HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems0
HeteroEdge: Addressing Asymmetry in Heterogeneous Collaborative Autonomous Systems0
Heterogeneous Acceleration Pipeline for Recommendation System Training0
DeepRT: A Soft Real Time Scheduler for Computer Vision Applications on the Edge0
HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models0
Interactive Evidence Detection: train state-of-the-art model out-of-domain or simple model interactively?0
Hexcute: A Tile-based Programming Language with Automatic Layout and Task-Mapping Synthesis0
HG-Caffe: Mobile and Embedded Neural Network GPU (OpenCL) Inference Engine with FP16 Supporting0
InterTrain: Accelerating DNN Training using Input Interpolation0
HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices0
I/O Lower Bounds for Auto-tuning of Convolutions in CNNs0
Hierarchical Autoscaling for Large Language Model Serving with Chiron0
HDReason: Algorithm-Hardware Codesign for Hyperdimensional Knowledge Graph Reasoning0
Hierarchical Memory for Long Video QA0
HD-PiSSA: High-Rank Distributed Orthogonal Adaptation0
Accelerating SpMM Kernel with Cache-First Edge Sampling for Graph Neural Networks0
DeepNVM++: Cross-Layer Modeling and Optimization Framework of Non-Volatile Memories for Deep Learning0
Deep Slice Interpolation via Marginal Super-Resolution, Fusion and Refinement0
DeepNorm-A Deep Learning Approach to Text Normalization0
Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery0
Show:102550
← PrevPage 53 of 113Next →

No leaderboard results yet.