SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers | 258,216 code links | 4,818 tasks

Papers

Showing 176-200 of 180,343 papers

Title | Status | Hype
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment | Code | 9
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Code | 9
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory | Code | 9
InternLM2 Technical Report | Code | 9
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception | Code | 9
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Code | 9
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model | Code | 9
UFO: A UI-Focused Agent for Windows OS Interaction | Code | 9
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Code | 9
RULER: What's the Real Context Size of Your Long-Context Language Models? | Code | 9
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Code | 9
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation | Code | 9
Overview of the Amphion Toolkit (v0.2) | Code | 9
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis | Code | 9
Agent Laboratory: Using LLM Agents as Research Assistants | Code | 9
Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled Ensemble | Code | 9
OpenVLA: An Open-Source Vision-Language-Action Model | Code | 9
Transformer Explainer: Interactive Learning of Text-Generative Models | Code | 9
SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile | Code | 9
Emerging Properties in Unified Multimodal Pretraining | Code | 9
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters | Code | 9
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers | Code | 9
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Code | 9
AgentRxiv: Towards Collaborative Autonomous Research | Code | 9
Natural language guidance of high-fidelity text-to-speech with synthetic annotations | Code | 9