SOTAVerified

GPU

Papers

Showing 6170 of 5629 papers

TitleStatusHype
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse AttentionCode5
Group-in-Group Policy Optimization for LLM Agent TrainingCode5
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio GenerationCode5
Deep Lake: a Lakehouse for Deep LearningCode5
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language ModelsCode5
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a SecondCode5
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language ModelsCode5
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-ExpertsCode5
LLM.int8(): 8-bit Matrix Multiplication for Transformers at ScaleCode5
Point-E: A System for Generating 3D Point Clouds from Complex PromptsCode5
Show:102550
← PrevPage 7 of 563Next →

No leaderboard results yet.