SOTAVerified

GPU

Papers

Showing 1251-1275 of 5629 papers

Title | Status | Hype
A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism | | 0
vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving | Code | 3
Automated Road Safety: Enhancing Sign and Surface Damage Detection with AI | | 0
MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM | | 0
LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme | | 0
GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Code | 0
Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | | 0
Neural topology optimization: the good, the bad, and the ugly | | 0
Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference | | 0
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Code | 4
Forecasting GPU Performance for Deep Learning Training and Inference | Code | 2
Attention in SRAM on Tenstorrent Grayskull | Code | 1
LiNR: Model Based Neural Retrieval on GPUs at LinkedIn | | 0
WiNet: Wavelet-based Incremental Learning for Efficient Medical Image Registration | Code | 1
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark | Code | 1
SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization | | 0
FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Code | 1
Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale | Code | 2
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | | 0
RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models | | 0
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | | 0
Learning Multi-view Anomaly Detection | | 0
MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | | 0
Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | | 0
MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | | 0
Page 51 of 226

No leaderboard results yet.