SOTAVerified

GPU

Papers

Showing 531540 of 5629 papers

TitleStatusHype
An Experimental Study of SOTA LiDAR Segmentation Models0
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear DistillationCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference0
GPU Memory Usage Optimization for Backward Propagation in Deep Network Training0
Myna: Masking-Based Contrastive Learning of Musical RepresentationsCode1
SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic EmbeddingsCode0
Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer0
Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer GateCode0
AdaSplash: Adaptive Sparse Flash AttentionCode1
Show:102550
← PrevPage 54 of 563Next →

No leaderboard results yet.