SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 751775 of 659983 papers

TitleStatusHype
Measuring Taiwanese Mandarin Language UnderstandingCode5
Exploring GLU Expansion Ratios: A Study of Structured Pruning in LLaMA-3.2 ModelsCode5
OpenCodeInterpreter: Integrating Code Generation with Execution and RefinementCode5
LAB: Large-Scale Alignment for ChatBotsCode5
ReSearch: Learning to Reason with Search for LLMs via Reinforcement LearningCode5
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video GenerationCode5
OpenR: An Open Source Framework for Advanced Reasoning with Large Language ModelsCode5
MonST3R: A Simple Approach for Estimating Geometry in the Presence of MotionCode5
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic VideosCode5
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme PredictionsCode5
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion ModelsCode5
Uni-Mol2: Exploring Molecular Pretraining Model at ScaleCode5
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a SecondCode5
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language ModelingCode5
Zero-shot Image Editing with Reference ImitationCode5
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language ModelsCode5
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language ModelsCode5
Focus Anywhere for Fine-grained Multi-page Document UnderstandingCode5
Improving Text-To-Audio Models with Synthetic CaptionsCode5
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world VideosCode5
DreamFusion: Text-to-3D using 2D DiffusionCode5
OmniV2V: Versatile Video Generation and Editing via Dynamic Content ManipulationCode5
4M-21: An Any-to-Any Vision Model for Tens of Tasks and ModalitiesCode5
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive RetrievalCode5
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal PromptsCode5
Show:102550
← PrevPage 31 of 26400Next →