SOTAVerified

GPU

Papers

Showing 19762000 of 5629 papers

TitleStatusHype
A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR PredictionCode0
CoDiCast: Conditional Diffusion Model for Global Weather Prediction with Uncertainty QuantificationCode0
Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence ModelingCode0
Efficient approximation of Earth Mover's Distance Based on Nearest Neighbor SearchCode0
MIOpen: An Open Source Library For Deep Learning PrimitivesCode0
Efficient and Robust Parallel DNN Training through Model Parallelism on Multi-GPU PlatformCode0
Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestrationCode0
Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM InferenceCode0
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate GradientsCode0
An Analysis of Neural Language Modeling at Multiple ScalesCode0
M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient PretrainingCode0
BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNetCode0
A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering TasksCode0
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep LearningCode0
MG-GCN: Scalable Multi-GPU GCN Training FrameworkCode0
FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPUCode0
Edge-Guided Occlusion Fading Reduction for a Light-Weighted Self-Supervised Monocular Depth EstimationCode0
BlockSwap: Fisher-guided Block Substitution for Network Compression on a BudgetCode0
METER: a mobile vision transformer architecture for monocular depth estimationCode0
BlockQNN: Efficient Block-wise Neural Network Architecture GenerationCode0
PIM-Opt: Demystifying Distributed Optimization Algorithms on a Real-World Processing-In-Memory SystemCode0
Message Scheduling for Performant, Many-Core Belief PropagationCode0
BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate BlocksCode0
Meta Networks for Neural Style TransferCode0
Memory-efficient Segmentation of High-resolution Volumetric MicroCT ImagesCode0
Show:102550
← PrevPage 80 of 226Next →

No leaderboard results yet.