SOTAVerified

GPU

Papers

Showing 376-400 of 5629 papers

Title | Status | Hype
ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model | | 0
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time | | 0
Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding | | 0
High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching | | 0
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization | Code | 7
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation | Code | 0
A Probabilistic Neuro-symbolic Layer for Algebraic Constraint Satisfaction | Code | 1
Scaling Down Text Encoders of Text-to-Image Diffusion Models | Code | 2
Improved Alignment of Modalities in Large Vision Language Models | | 0
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | | 0
Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | | 0
Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | | 0
Efficient Self-Supervised Adaptation for Medical Image Analysis | Code | 1
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache | Code | 2
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization | | 0
GRiNS: A Python Library for Simulating Gene Regulatory Network Dynamics | Code | 0
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining | Code | 3
Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Code | 0
WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference | Code | 0
Temporal Action Detection Model Compression by Progressive Block Drop | | 0
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models | | 0
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Code | 9
Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability | | 0
Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping | Code | 2
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation | | 0
