SOTAVerified

GPU

Papers

Showing 19511975 of 5629 papers

TitleStatusHype
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time0
Stochastic Engrams for Efficient Continual Learning with Binarized Neural Networks0
High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching0
Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding0
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary AdaptationCode0
Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification0
Improved Alignment of Modalities in Large Vision Language Models0
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch0
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization0
Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding0
GRiNS: A Python Library for Simulating Gene Regulatory Network DynamicsCode0
WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM InferenceCode0
Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial ImagesCode0
Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability0
Temporal Action Detection Model Compression by Progressive Block Drop0
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation0
V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms0
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models0
GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting0
SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs0
ML-Triton, A Multi-Level Compilation and Language Extension to Triton GPU Programming0
Reducing Communication Overhead in Federated Learning for Network Anomaly Detection with Adaptive Client Selection0
Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation0
Bolt3D: Generating 3D Scenes in Seconds0
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection0
Show:102550
← PrevPage 79 of 226Next →

No leaderboard results yet.