SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 11261150 of 177339 papers

TitleStatusHype
Focus Anywhere for Fine-grained Multi-page Document UnderstandingCode5
Improving Text-To-Audio Models with Synthetic CaptionsCode5
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical ReasoningCode5
DreamFusion: Text-to-3D using 2D DiffusionCode5
OmniV2V: Versatile Video Generation and Editing via Dynamic Content ManipulationCode5
4M-21: An Any-to-Any Vision Model for Tens of Tasks and ModalitiesCode5
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive RetrievalCode5
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal PromptsCode5
StarCoder: may the source be with you!Code5
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model ParametersCode5
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation ModelsCode5
Jamba-1.5: Hybrid Transformer-Mamba Models at ScaleCode5
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language ModelsCode5
SpinQuant: LLM quantization with learned rotationsCode5
Image Vectorization: a ReviewCode5
Zephyr: Direct Distillation of LM AlignmentCode5
ReLU Strikes Back: Exploiting Activation Sparsity in Large Language ModelsCode5
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationCode5
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and EditingCode5
3D Reconstruction with Spatial MemoryCode5
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query ParallelismCode5
Transformers without NormalizationCode5
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional TokensCode5
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction FollowingCode5
Show:102550
← PrevPage 46 of 7094Next →