SOTAVerified

GPU

Papers

Showing 27012750 of 5629 papers

TitleStatusHype
S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs0
Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback0
STAT: Shrinking Transformers After Training0
MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models0
Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification0
Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World ClustersCode0
Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection0
Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model0
Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs0
TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation ModelsCode0
CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy0
GPU Based Differential Evolution: New Insights and Comparative Study0
The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol0
Apply Distributed CNN on Genomics to accelerate Transcription-Factor TAL1 Motif Prediction0
HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models0
A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning0
LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensemblesCode0
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity0
ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning0
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor OptimizationCode0
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models0
Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras0
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models0
Fast Bayesian Inference for Neutrino Non-Standard Interactions at Dark Matter Direct Detection ExperimentsCode0
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images0
Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex OptimizationCode0
ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation0
Personalized Residuals for Concept-Driven Text-to-Image Generation0
Parallelization of the K-Means Algorithm with Applications to Big Data Clustering0
ENOVA: Autoscaling towards Cost-effective and Stable Serverless LLM Serving0
Specialising and Analysing Instruction-Tuned and Byte-Level Language Models for Organic Reaction Prediction0
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining0
Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis0
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach0
Do Bayesian imaging methods report trustworthy probabilities?0
Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis0
Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation0
Sparse Sampling is All You Need for Fast Wrong-way Cycling Detection in CCTV Videos0
Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks0
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models0
Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection0
Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering0
You Only Cache Once: Decoder-Decoder Architectures for Language ModelsCode0
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory SystemsCode0
A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields0
DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid0
Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression0
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization0
Show:102550
← PrevPage 55 of 113Next →

No leaderboard results yet.