SOTAVerified

GPU

Papers

Showing 14511475 of 5629 papers

TitleStatusHype
STAT: Shrinking Transformers After Training0
MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models0
Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification0
Spatio-Spectral Graph Neural NetworksCode1
Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-MambaCode1
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM InferenceCode2
Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World ClustersCode0
DiG: Scalable and Efficient Diffusion Models with Gated Linear AttentionCode2
Scaling Laws and Compute-Optimal Training Beyond Fixed Training DurationsCode2
Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection0
Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model0
ViG: Linear-complexity Visual Sequence Learning with Gated Linear AttentionCode2
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning AttentionCode3
CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy0
Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training0
TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation ModelsCode0
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
Transformers Can Do Arithmetic with the Right EmbeddingsCode3
LoQT: Low-Rank Adapters for Quantized PretrainingCode2
GPU Based Differential Evolution: New Insights and Comparative Study0
vHeat: Building Vision Models upon Heat ConductionCode3
The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol0
Apply Distributed CNN on Genomics to accelerate Transcription-Factor TAL1 Motif Prediction0
MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface DefectsCode1
Show:102550
← PrevPage 59 of 226Next →

No leaderboard results yet.