SOTAVerified

GPU

Papers

Showing 71-80 of 5629 papers

Title | Status | Hype
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale | Code | 5
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second | Code | 5
Multi-head Temporal Latent Attention | Code | 4
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation | Code | 4
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit | Code | 4
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Code | 4
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float | Code | 4
LettuceDetect: A Hallucination Detection Framework for RAG Applications | Code | 4
Building reliable sim driving agents by scaling self-play | Code | 4
KernelBench: Can LLMs Write Efficient GPU Kernels? | Code | 4

No leaderboard results yet.