SOTAVerified

GPU

Papers

Showing 46514675 of 5629 papers

TitleStatusHype
Knowledge Extracted from Recurrent Deep Belief Network for Real Time Deterministic Control0
Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback0
KPNet: Towards Minimal Face Detector0
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference0
KunServe: Efficient Parameter-centric Memory Management for LLM Serving0
KurTail : Kurtosis-based LLM Quantization0
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization0
KVDirect: Distributed Disaggregated LLM Inference0
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs0
L2PF -- Learning to Prune Faster0
L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference0
Label Delay in Online Continual Learning0
Label-Looping: Highly Efficient Decoding for Transducers0
Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM0
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning0
LAMP: Learn A Motion Pattern for Few-Shot Video Generation0
LaneSegNet Design Study0
Language Modeling at Scale0
Language verY Rare for All0
Large Batch and Patch Size Training for Medical Image Segmentation0
Lillama: Large Language Models Compression via Low-Rank Feature Distillation0
Large, Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation0
Large Scale Artificial Neural Network Training Using Multi-GPUs0
Show:102550
← PrevPage 187 of 226Next →

No leaderboard results yet.