SOTAVerified

16k

Papers

Showing 110 of 146 papers

TitleStatusHype
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code IntelligenceCode9
Global Structure-from-Motion RevisitedCode7
Code Llama: Open Foundation Models for CodeCode6
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-AwarenessCode6
Learning to (Learn at Test Time): RNNs with Expressive Hidden StatesCode5
Long-form factuality in large language modelsCode4
FlashDMoE: Fast Distributed MoE in a Single KernelCode3
M+: Extending MemoryLLM with Scalable Long-Term MemoryCode3
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model TransformationCode3
LinFusion: 1 GPU, 1 Minute, 16K ImageCode3
Show:102550
← PrevPage 1 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Suprime21'"1Unverified