SOTAVerified

GPU

Papers

Showing 14511500 of 5629 papers

TitleStatusHype
STAT: Shrinking Transformers After Training0
MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models0
Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification0
Spatio-Spectral Graph Neural NetworksCode1
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM InferenceCode2
Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-MambaCode1
DiG: Scalable and Efficient Diffusion Models with Gated Linear AttentionCode2
Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World ClustersCode0
Scaling Laws and Compute-Optimal Training Beyond Fixed Training DurationsCode2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear AttentionCode2
Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection0
Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model0
Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training0
CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy0
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning AttentionCode3
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs0
TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation ModelsCode0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
Transformers Can Do Arithmetic with the Right EmbeddingsCode3
GPU Based Differential Evolution: New Insights and Comparative Study0
LoQT: Low-Rank Adapters for Quantized PretrainingCode2
The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol0
vHeat: Building Vision Models upon Heat ConductionCode3
Apply Distributed CNN on Genomics to accelerate Transcription-Factor TAL1 Motif Prediction0
LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensemblesCode0
HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models0
MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface DefectsCode1
A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning0
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity0
Looking Backward: Streaming Video-to-Video Translation with Feature BanksCode4
DAGER: Exact Gradient Inversion for Large Language ModelsCode1
Sparse Matrix in Large Language Model Fine-tuningCode1
ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning0
Fast inference with Kronecker-sparse matricesCode1
Fast Bayesian Inference for Neutrino Non-Standard Interactions at Dark Matter Direct Detection ExperimentsCode0
ArchesWeather: An efficient AI weather forecasting model at 1.5° resolutionCode1
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor OptimizationCode0
Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras0
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models0
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models0
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token IdentificationCode1
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and InferenceCode1
ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation0
Attention as an RNNCode1
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images0
Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex OptimizationCode0
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence FunctionsCode2
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM InferenceCode1
Personalized Residuals for Concept-Driven Text-to-Image Generation0
Parallelization of the K-Means Algorithm with Applications to Big Data Clustering0
Show:102550
← PrevPage 30 of 113Next →

No leaderboard results yet.