SOTAVerified

GPU

Papers

Showing 18311840 of 5629 papers

TitleStatusHype
Fused3S: Fast Sparse Attention on Tensor CoresCode0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains0
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers0
SLAG: Scalable Language-Augmented Gaussian Splatting0
Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption0
Streaming Krylov-Accelerated Stochastic Gradient Descent0
Matrix Is All You Need0
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration0
Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference0
FloE: On-the-Fly MoE Inference on Memory-constrained GPU0
Show:102550
← PrevPage 184 of 563Next →

No leaderboard results yet.