SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 951975 of 177339 papers

TitleStatusHype
MobileVLM V2: Faster and Stronger Baseline for Vision Language ModelCode5
MV-Adapter: Multi-view Consistent Image Generation Made EasyCode5
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM WorkflowsCode5
DeepPhase: Periodic Autoencoders for Learning Motion Phase ManifoldsCode5
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language ModelingCode5
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech ModelCode5
Understanding R1-Zero-Like Training: A Critical PerspectiveCode5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive AnnotationsCode5
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to VerificationCode5
CogAgent: A Visual Language Model for GUI AgentsCode5
Transformer-Squared: Self-adaptive LLMsCode5
CogVLM: Visual Expert for Pretrained Language ModelsCode5
Aria: An Open Multimodal Native Mixture-of-Experts ModelCode5
Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model LearningCode5
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World DomainsCode5
A Brief Overview of AI Governance for Responsible Machine Learning SystemsCode5
Autoregressive Image Generation without Vector QuantizationCode5
Representing Long Volumetric Video with Temporal Gaussian HierarchyCode5
Scalable Diffusion Models with TransformersCode5
Awesome Multi-modal Object TrackingCode5
Trajectory Prediction Meets Large Language Models: A SurveyCode5
PaperBench: Evaluating AI's Ability to Replicate AI ResearchCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
GAM(e) changer or not? An evaluation of interpretable machine learning models based on additive model constraintsCode5
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and ProspectsCode5
Show:102550
← PrevPage 39 of 7094Next →