SOTAVerified

GPU

Papers

Showing 46514700 of 5629 papers

TitleStatusHype
Knowledge Extracted from Recurrent Deep Belief Network for Real Time Deterministic Control0
Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback0
KPNet: Towards Minimal Face Detector0
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference0
KunServe: Efficient Parameter-centric Memory Management for LLM Serving0
KurTail : Kurtosis-based LLM Quantization0
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization0
KVDirect: Distributed Disaggregated LLM Inference0
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs0
L2PF -- Learning to Prune Faster0
L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference0
Label Delay in Online Continual Learning0
Label-Looping: Highly Efficient Decoding for Transducers0
Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM0
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning0
LAMP: Learn A Motion Pattern for Few-Shot Video Generation0
LaneSegNet Design Study0
Language Modeling at Scale0
Language verY Rare for All0
Large Batch and Patch Size Training for Medical Image Segmentation0
Lillama: Large Language Models Compression via Low-Rank Feature Distillation0
Large, Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation0
Large Scale Artificial Neural Network Training Using Multi-GPUs0
Large-Scale Cox Process Inference using Variational Fourier Features0
Large-Scale Deep Learning on the YFCC100M Dataset0
Large-scale GPU-based network analysis of the human T-cell receptor repertoire0
Large-Scale Paralleled Sparse Principal Component Analysis0
Insights into Ordinal Embedding Algorithms: A Systematic Evaluation0
Large-Scale Stochastic Learning using GPUs0
Large-Scale Training System for 100-Million Classification at Alibaba0
LASSI: An LLM-based Automated Self-Correcting Pipeline for Translating Parallel Scientific Codes0
Latent fingerprint minutia extraction using fully convolutional network0
Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes0
Later-stage Minimum Bayes-Risk Decoding for Neural Machine Translation0
LATUP-Net: A Lightweight 3D Attention U-Net with Parallel Convolutions for Brain Tumor Segmentation0
Layered gradient accumulation and modular pipeline parallelism: fast and efficient training of large language models0
Layered Interpretation of Street View Images0
Layer-Parallel Training with GPU Concurrency of Deep Residual Neural Networks via Nonlinear Multigrid0
Layer Pruning on Demand with Intermediate CTC0
Layer-wise Adaptive Gradient Sparsification for Distributed Deep Learning with Convergence Guarantees0
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement0
LazyEviction: Lagged KV Eviction with Attention Pattern Observation for Efficient Long Reasoning0
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance0
LeanQuant: Accurate Large Language Model Quantization with Loss-Error-Aware Grid0
Learnability and Robustness of Shallow Neural Networks Learned With a Performance-Driven BP and a Variant PSO For Edge Decision-Making0
Learned Block-based Hybrid Image Compression0
Learned Cone-Beam CT Reconstruction Using Neural Ordinary Differential Equations0
Learned Image Compression with Generalized Octave Convolution and Cross-Resolution Parameter Estimation0
Show:102550
← PrevPage 94 of 113Next →

No leaderboard results yet.