SOTAVerified

GPU

Papers

Showing 1951-2000 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| At-Scale Sparse Deep Neural Network Inference with Efficient GPU Implementation | Code | 0 |
| An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector | Code | 0 |
| MoGA: Searching Beyond MobileNetV3 | Code | 0 |
| Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | Code | 0 |
| MODNet-V: Improving Portrait Video Matting via Background Restoration | Code | 0 |
| MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching | Code | 0 |
| Efficient Gender Classification Using a Deep LDA-Pruned Net | Code | 0 |
| NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference | Code | 0 |
| Efficient Featurized Image Pyramid Network for Single Shot Detector | Code | 0 |
| Efficient Distillation of Classifier-Free Guidance using Adapters | Code | 0 |
| MobileDets: Searching for Object Detection Architectures for Mobile Accelerators | Code | 0 |
| Efficient Differentiable Approximation of Generalized Low-rank Regularization | Code | 0 |
| MLitB: Machine Learning in the Browser | Code | 0 |
| Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | Code | 0 |
| Anchor Space Optimal Transport as a Fast Solution to Multiple Optimal Transport Problems | Code | 0 |
| Efficient Deep Learning for Stereo Matching | Code | 0 |
| Anchors no more: Using peculiar velocities to constrain H_0 and the primordial Universe without calibrators | Code | 0 |
| Efficient ConvNet for Real-time Semantic Segmentation | Code | 0 |
| Efficient Constituency Tree based Encoding for Natural Language to Bash Translation | Code | 0 |
| Bottleneck Analysis of Dynamic Graph Neural Network Inference on CPU and GPU | Code | 0 |
| Efficient brain age prediction from 3D MRI volumes using 2D projections | Code | 0 |
| Efficient block contrastive learning via parameter-free meta-node approximation | Code | 0 |
| MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network | Code | 0 |
| ALTIS: Modernizing GPGPU Benchmarking | Code | 0 |
| Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Code | 0 |
| A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR Prediction | Code | 0 |
| CoDiCast: Conditional Diffusion Model for Global Weather Prediction with Uncertainty Quantification | Code | 0 |
| Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence Modeling | Code | 0 |
| Efficient approximation of Earth Mover's Distance Based on Nearest Neighbor Search | Code | 0 |
| MIOpen: An Open Source Library For Deep Learning Primitives | Code | 0 |
| Efficient and Robust Parallel DNN Training through Model Parallelism on Multi-GPU Platform | Code | 0 |
| Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestration | Code | 0 |
| Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference | Code | 0 |
| Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients | Code | 0 |
| An Analysis of Neural Language Modeling at Multiple Scales | Code | 0 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Code | 0 |
| BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet | Code | 0 |
| A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks | Code | 0 |
| MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning | Code | 0 |
| MG-GCN: Scalable Multi-GPU GCN Training Framework | Code | 0 |
| FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU | Code | 0 |
| Edge-Guided Occlusion Fading Reduction for a Light-Weighted Self-Supervised Monocular Depth Estimation | Code | 0 |
| BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget | Code | 0 |
| METER: a mobile vision transformer architecture for monocular depth estimation | Code | 0 |
| BlockQNN: Efficient Block-wise Neural Network Architecture Generation | Code | 0 |
| PIM-Opt: Demystifying Distributed Optimization Algorithms on a Real-World Processing-In-Memory System | Code | 0 |
| Message Scheduling for Performant, Many-Core Belief Propagation | Code | 0 |
| BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks | Code | 0 |
| Meta Networks for Neural Style Transfer | Code | 0 |
| Memory-efficient Segmentation of High-resolution Volumetric MicroCT Images | Code | 0 |
Page 40 of 113

No leaderboard results yet.