SOTAVerified

GPU

Papers

Showing 421-430 of 5629 papers

Title | Status | Hype
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs | | 0
X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression | | 0
APLA: A Simple Adaptation Method for Vision Transformers | Code | 1
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers | | 0
Characterizing GPU Resilience and Impact on AI/HPC Systems | | 0
Distance-Based Tree-Sliced Wasserstein Distance | Code | 0
Cost-effective Deep Learning Infrastructure with NVIDIA GPU | Code | 0
LLMPerf: GPU Performance Modeling meets Large Language Models | Code | 0
OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models | | 0
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs | | 0
Page 43 of 563

No leaderboard results yet.