SOTAVerified

GPU

Papers

Showing 2576-2600 of 5629 papers

| Title | Status | Hype |
|---|---|---|
| Textile Anomaly Detection: Evaluation of the State-of-the-Art for Automated Quality Inspection of Carpet | | 0 |
| HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline | | 0 |
| Keep the Cost Down: A Review on Methods to Optimize LLM's KV-Cache Consumption | Code | 0 |
| SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention | | 0 |
| A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism | | 0 |
| Automated Road Safety: Enhancing Sign and Surface Damage Detection with AI | | 0 |
| LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme | | 0 |
| MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM | | 0 |
| GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Code | 0 |
| Neural topology optimization: the good, the bad, and the ugly | | 0 |
| Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference | | 0 |
| Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | | 0 |
| LiNR: Model Based Neural Retrieval on GPUs at LinkedIn | | 0 |
| RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models | | 0 |
| SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization | | 0 |
| ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | | 0 |
| Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | | 0 |
| Characterizing and Understanding HGNN Training on GPUs | | 0 |
| MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | | 0 |
| Learning Multi-view Anomaly Detection | | 0 |
| PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | | 0 |
| MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | | 0 |
| SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation | | 0 |
| Differentiable Neural-Integrated Meshfree Method for Forward and Inverse Modeling of Finite Strain Hyperelasticity | Code | 0 |
| NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis | | 0 |
Page 104 of 226

No leaderboard results yet.