SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 676700 of 177339 papers

TitleStatusHype
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video UnderstandingCode5
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from TextCode5
GauStudio: A Modular Framework for 3D Gaussian Splatting and BeyondCode5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object DetectionCode5
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped NoiseCode5
TrustRAG: An Information Assistant with Retrieval Augmented GenerationCode5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity PreservingCode5
Parrot: Multilingual Visual Instruction TuningCode5
Improved Differentially Private Regression via Gradient BoostingCode5
AIDE: AI-Driven Exploration in the Space of CodeCode5
WizardLM: Empowering Large Language Models to Follow Complex InstructionsCode5
Ovis: Structural Embedding Alignment for Multimodal Large Language ModelCode5
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset CurationCode5
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and BeyondCode5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft ReasoningCode5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language ModelsCode5
Assessing Language Model Deployment with Risk CardsCode5
UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsCode5
SantaCoder: don't reach for the stars!Code5
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of ExpertsCode5
Evolutionary Optimization of Model Merging RecipesCode5
Automatic Interactive Evaluation for Large Language Models with State Aware Patient SimulatorCode5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal ModelsCode5
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank GradientsCode5
GraphCast: Learning skillful medium-range global weather forecastingCode5
Show:102550
← PrevPage 28 of 7094Next →