SOTAVerified

GPU

Papers

Showing 231-240 of 5629 papers

Title | Status | Hype
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models | Code | 3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Code | 3
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Code | 3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Code | 3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Code | 3
Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Code | 3
S-LoRA: Serving Thousands of Concurrent LoRA Adapters | Code | 3
Punica: Multi-Tenant LoRA Serving | Code | 3
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs | Code | 3
Take the aTrain. Introducing an Interface for the Accessible Transcription of Interviews | Code | 3

No leaderboard results yet.