SOTAVerified

GPU

Papers

Showing 401-450 of 5629 papers

Title | Status | Hype
Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping | Code | 2
GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting | - | 0
SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs | - | 0
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding | Code | 2
ML-Triton, A Multi-Level Compilation and Language Extension to Triton GPU Programming | - | 0
Reducing Communication Overhead in Federated Learning for Network Anomaly Detection with Adaptive Client Selection | - | 0
Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Code | 2
Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks | Code | 1
Bolt3D: Generating 3D Scenes in Seconds | - | 0
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection | - | 0
DIFFVSGG: Diffusion-Driven Online Video Scene Graph Generation | Code | 1
Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation | - | 0
MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Few-Step Synthesis | - | 0
ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning | - | 0
AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications | - | 0
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory | - | 0
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Code | 2
RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds | Code | 2
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory | Code | 3
PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices | - | 0
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs | - | 0
X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression | - | 0
APLA: A Simple Adaptation Method for Vision Transformers | Code | 1
Characterizing GPU Resilience and Impact on AI/HPC Systems | - | 0
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers | - | 0
Distance-Based Tree-Sliced Wasserstein Distance | Code | 0
LLMPerf: GPU Performance Modeling meets Large Language Models | Code | 0
Cost-effective Deep Learning Infrastructure with NVIDIA GPU | Code | 0
OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models | - | 0
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs | - | 0
Speedy MASt3R | - | 0
Low Complexity Point Tracking of the Myocardium in 2D Echocardiography | Code | 1
VideoScan: Enabling Efficient Streaming Video Understanding via Frame-level Semantic Carriers | - | 0
MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching | Code | 0
MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics | - | 0
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference | - | 0
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | - | 0
TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting | - | 0
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models | Code | 2
Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference | Code | 0
Accelerating MoE Model Inference with Expert Sharding | - | 0
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization | Code | 2
Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices | - | 0
Fine-Tuning LLMs for Report Summarization: Analysis on Supervised and Unsupervised Data | - | 0
AdaptSR: Low-Rank Adaptation for Efficient and Scalable Real-World Super-Resolution | - | 0
Short-Term Load Forecasting for AI-Data Center | - | 0
AttFC: Attention Fully-Connected Layer for Large-Scale Face Recognition with One GPU | - | 0
Efficient Distillation of Classifier-Free Guidance using Adapters | Code | 0
Global Context Is All You Need for Parallel Efficient Tractography Parcellation | - | 0
A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation | - | 0
Page 9 of 113