SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 451-475 of 177,339 papers

Title | Status | Hype
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Code | 7
AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving | Code | 7
MAGI-1: Autoregressive Video Generation at Scale | Code | 7
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | Code | 7
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Code | 7
Kimi-Audio Technical Report | Code | 7
Bilateral Reference for High-Resolution Dichotomous Image Segmentation | Code | 7
EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming | Code | 7
AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems | Code | 7
StarCoder 2 and The Stack v2: The Next Generation | Code | 7
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming | Code | 7
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems | Code | 7
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers | Code | 7
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | Code | 7
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases | Code | 7
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance | Code | 7
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture | Code | 7
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters | Code | 7
MagicQuill: An Intelligent Interactive Image Editing System | Code | 7
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training | Code | 7
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding | Code | 7
Faster Video Diffusion with Trainable Sparse Attention | Code | 7
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? | Code | 7
EasySpider: A No-Code Visual System for Crawling the Web | Code | 7
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7
Page 19 of 7,094