SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 28012825 of 661570 papers

TitleStatusHype
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling CapabilitiesCode3
HAC++: Towards 100X Compression of 3D Gaussian SplattingCode3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?Code3
The OpenLAM ChallengesCode3
CoverM: Read alignment statistics for metagenomicsCode3
CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal ConcatenationCode3
Infrared and Visible Image Fusion: From Data Compatibility to Task AdaptionCode3
Universal Actions for Enhanced Embodied Foundation ModelsCode3
A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant FrameworksCode3
X-Dyna: Expressive Dynamic Human Image AnimationCode3
OmniThink: Expanding Knowledge Boundaries in Machine Writing through ThinkingCode3
Foundations of Large Language ModelsCode3
DEFOM-Stereo: Depth Foundation Model Based Stereo MatchingCode3
Karatsuba Matrix Multiplication and its Efficient Custom Hardware ImplementationsCode3
FramePainter: Endowing Interactive Image Editing with Video Diffusion PriorsCode3
Do generative video models understand physical principles?Code3
In-situ graph reasoning and knowledge expansion using Graph-PReFLexORCode3
Lifelong Learning of Large Language Model based Agents: A RoadmapCode3
A General Framework for Inference-time Scaling and Steering of Diffusion ModelsCode3
ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing systemCode3
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMsCode3
BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster responseCode3
Valley2: Exploring Multimodal Models with Scalable Vision-Language DesignCode3
Relative Pose Estimation through Affine Corrections of Monocular Depth PriorsCode3
Show:102550
← PrevPage 113 of 26463Next →