SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 1976-2000 of 177,340 papers

Title | Status | Hype
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model | Code | 4
Generalizable Humanoid Manipulation with 3D Diffusion Policies | Code | 4
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Code | 4
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images | Code | 4
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models | Code | 4
Multimodal Chain-of-Thought Reasoning in Language Models | Code | 4
Efficient Automated Deep Learning for Time Series Forecasting | Code | 4
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM | Code | 4
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | Code | 4
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems | Code | 4
GeoCalib: Learning Single-image Calibration with Geometric Optimization | Code | 4
ManimML: Communicating Machine Learning Architectures with Animation | Code | 4
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Code | 4
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Code | 4
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models | Code | 4
Reasoning with Language Model is Planning with World Model | Code | 4
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Code | 4
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks | Code | 4
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis | Code | 4
Flamingo: a Visual Language Model for Few-Shot Learning | Code | 4
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Code | 4
Prompt2Model: Generating Deployable Models from Natural Language Instructions | Code | 4
Sequential Models in the Synthetic Data Vault | Code | 4
UniTS: A Unified Multi-Task Time Series Model | Code | 4
YuLan: An Open-source Large Language Model | Code | 4