SOTAVerified

GPU

Papers

Showing 10511075 of 5629 papers

TitleStatusHype
FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search0
CNN Mixture-of-Depths0
INT-FlashAttention: Enabling Flash Attention for INT8 QuantizationCode2
Textless NLP -- Zero Resource Challenge with Low Resource Compute0
CAD: Memory Efficient Convolutional Adapter for Segment AnythingCode1
A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation0
Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference SpeedCode1
dnaGrinder: a lightweight and high-capacity genomic foundation model0
PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training0
TextToon: Real-Time Text Toonify Head Avatar from Single Video0
Efficient Tabular Data Preprocessing of ML Pipelines0
Benchmarking Edge AI Platforms for High-Performance ML Inference0
FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large ScaleCode1
A Realistic Simulation Framework for Analog/Digital Neuromorphic Architectures0
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
ProTEA: Programmable Transformer Encoder Acceleration on FPGA0
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs0
Drift to Remember0
On Importance of Pruning and Distillation for Efficient Low Resource NLP0
Optimizing RLHF Training for Large Language Models with Stage Fusion0
Occupancy-Based Dual ContouringCode2
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention0
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-MarquardtCode3
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMsCode1
Graph Convolutional Neural Networks as Surrogate Models for Climate Simulation0
Show:102550
← PrevPage 43 of 226Next →

No leaderboard results yet.