SOTAVerified

GPU

Papers

Showing 311320 of 5629 papers

TitleStatusHype
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian ProcessCode2
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal ModelsCode2
Streaming Video Question-Answering with In-context Video KV-Cache RetrievalCode2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio GenerationCode2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton OperatorsCode2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language ModelsCode2
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear DistillationCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Saving 77% of the Parameters in Large Language Models Technical ReportCode2
QuEST: Stable Training of LLMs with 1-Bit Weights and ActivationsCode2
Show:102550
← PrevPage 32 of 563Next →

No leaderboard results yet.