SOTAVerified

GPU

Papers

Showing 32513300 of 5629 papers

TitleStatusHype
Dynamic Sampling Rate: Harnessing Frame Coherence in Graphics Applications for Energy-Efficient GPUs0
Survey on Large Scale Neural Network Training0
Enabling On-Device Smartphone GPU based Training: Lessons Learned0
Distributed Out-of-Memory NMF on CPU/GPU ArchitecturesCode1
Single UHD Image Dehazing via Interpretable Pyramid NetworkCode1
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques0
Aryl: An Elastic Cluster Scheduler for Deep Learning0
HiMA: A Fast and Scalable History-based Memory Access Engine for Differentiable Neural Computer0
Benchmarking of DL Libraries and Models on Mobile DevicesCode1
Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores0
FL_PyTorch: optimization research simulator for federated learningCode1
MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural NetworksCode1
The Ecological Footprint of Neural Machine Translation SystemsCode0
Towards Training Reproducible Deep Learning ModelsCode0
Harmony: Overcoming the Hurdles of GPU Memory Capacity to Train Massive DNN Models on Commodity ServersCode1
Accelerated Quality-Diversity through Massive ParallelismCode2
Giga-scale Kernel Matrix Vector Multiplication on GPUCode0
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and DetectionCode2
Accelerating DNN Training with Structured Data Gradient PruningCode1
Computational Scatter Correction for High-Resolution Flat-Panel CT Based on a Fast Monte Carlo Photon Transport Model0
SPDY: Accurate Pruning with Speedup GuaranteesCode1
Combining Local and Global Pose Estimation for Precise Tracking of Similar Objects0
Benchmarking Resource Usage for Efficient Distributed Deep Learning0
Prediction of GPU Failures Under Deep Learning Workloads0
ASFD: Automatic and Scalable Face Detector0
Convolutional Xformers for VisionCode1
An Application of Pseudo-Log-Likelihoods to Natural Language Scoring0
Learning-Driven Lossy Image Compression; A Comprehensive Survey0
Accelerating Laue Depth Reconstruction Algorithm with CUDA0
GenGNN: A Generic FPGA Framework for Graph Neural Network AccelerationCode1
What can we learn from misclassified ImageNet images?0
GroupGazer: A Tool to Compute the Gaze per Participant in Groups with integrated Calibration to Map the Gaze Online to a Screen or Beamer Projection0
Learned Cone-Beam CT Reconstruction Using Neural Ordinary Differential Equations0
Building a Performance Model for Deep Learning Recommendation Model Training on GPUsCode1
GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge0
OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation0
Attention-based Proposals Refinement for 3D Object DetectionCode1
SunCast: Solar Irradiance Nowcasting from Geosynchronous Satellite Data0
Cross-stitched Multi-modal Encoders0
Re2G: Retrieve, Rerank, Generate0
StAnD: A Dataset of Linear Static Analysis ProblemsCode0
SympOCnet: Solving optimal control problems with applications to high-dimensional multi-agent path planning problemsCode0
GPU-accelerated partially linear multiuser detection for 5G and beyond URLLC systemsCode0
TransVOD: End-to-End Video Object Detection with Spatial-Temporal TransformersCode2
OCSampler: Compressing Videos to One Clip with Single-step SamplingCode1
SLISEMAP: Supervised dimensionality reduction through local explanationsCode1
A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs0
GhostNets on Heterogeneous Devices via Cheap OperationsCode0
GPU-Net: Lightweight U-Net with more diverse featuresCode1
BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split ComputingCode1
Show:102550
← PrevPage 66 of 113Next →

No leaderboard results yet.