SOTAVerified

GPU

Papers

Showing 15261550 of 5629 papers

TitleStatusHype
Mirage: A Multi-Level Superoptimizer for Tensor ProgramsCode7
Preble: Efficient Distributed Prompt Scheduling for LLM ServingCode2
You Only Cache Once: Decoder-Decoder Architectures for Language ModelsCode0
Vidur: A Large-Scale Simulation Framework For LLM InferenceCode4
Open Implementation and Study of BEST-RQ for Speech Processing0
A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields0
Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression0
DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid0
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM ServingCode4
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttentionCode3
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization0
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory SystemsCode0
Neural Graphics Texture Compression Supporting Random Access0
QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation0
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs0
Labeling supervised fine-tuning data with the scaling lawCode7
UniDEC : Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification0
Fast Algorithms for Spiking Neural Network Simulation with FPGAsCode0
SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-trainingCode0
Structural Pruning of Pre-trained Language Models via Neural Architecture SearchCode0
MTDT: A Multi-Task Deep Learning Digital Twin0
Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers0
FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network PotentialsCode2
Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge DeploymentCode0
Addressing Diverging Training Costs using BEVRestore for High-resolution Bird's Eye View Map Construction0
Show:102550
← PrevPage 62 of 226Next →

No leaderboard results yet.