SOTAVerified

GPU

Papers

Showing 171-180 of 5629 papers

Title | Status | Hype
Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models | Code | 3
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models | Code | 3
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Code | 3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Code | 3
Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Code | 3
94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Code | 3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Code | 3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Code | 3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Code | 3
LinFusion: 1 GPU, 1 Minute, 16K Image | Code | 3

No leaderboard results yet.