SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 876900 of 659983 papers

TitleStatusHype
OminiControl: Minimal and Universal Control for Diffusion TransformerCode5
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AICode5
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video UnderstandingCode5
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from TextCode5
GauStudio: A Modular Framework for 3D Gaussian Splatting and BeyondCode5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object DetectionCode5
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped NoiseCode5
TrustRAG: An Information Assistant with Retrieval Augmented GenerationCode5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity PreservingCode5
Parrot: Multilingual Visual Instruction TuningCode5
Improved Differentially Private Regression via Gradient BoostingCode5
AIDE: AI-Driven Exploration in the Space of CodeCode5
WizardLM: Empowering Large Language Models to Follow Complex InstructionsCode5
Ovis: Structural Embedding Alignment for Multimodal Large Language ModelCode5
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset CurationCode5
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and BeyondCode5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft ReasoningCode5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language ModelsCode5
Assessing Language Model Deployment with Risk CardsCode5
UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsCode5
SantaCoder: don't reach for the stars!Code5
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of ExpertsCode5
Evolutionary Optimization of Model Merging RecipesCode5
MoVQ: Modulating Quantized Vectors for High-Fidelity Image GenerationCode5
Automatic Interactive Evaluation for Large Language Models with State Aware Patient SimulatorCode5
Show:102550
← PrevPage 36 of 26400Next →