SOTAVerified

GPU

Papers

Showing 19762000 of 5629 papers

TitleStatusHype
ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning0
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory0
MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Few-Step Synthesis0
AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications0
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs0
PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices0
Characterizing GPU Resilience and Impact on AI/HPC Systems0
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers0
Distance-Based Tree-Sliced Wasserstein DistanceCode0
X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression0
LLMPerf: GPU Performance Modeling meets Large Language ModelsCode0
Cost-effective Deep Learning Infrastructure with NVIDIA GPUCode0
OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models0
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs0
Speedy MASt3R0
MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based BatchingCode0
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference0
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge0
VideoScan: Enabling Efficient Streaming Video Understanding via Frame-level Semantic Carriers0
MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics0
Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM InferenceCode0
Accelerating MoE Model Inference with Expert Sharding0
TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting0
AdaptSR: Low-Rank Adaptation for Efficient and Scalable Real-World Super-Resolution0
Global Context Is All You Need for Parallel Efficient Tractography Parcellation0
Show:102550
← PrevPage 80 of 226Next →

No leaderboard results yet.