SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 5175 of 474278 papers

TitleStatusHype
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-trainingCode11
CogVideoX: Text-to-Video Diffusion Models with An Expert TransformerCode11
Eliza: A Web3 friendly AI Agent Operating SystemCode11
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and GenerationCode11
SWIFT:A Scalable lightWeight Infrastructure for Fine-TuningCode11
LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs from the Programming LanguageCode11
Pixtral 12BCode11
Structured 3D Latents for Scalable and Versatile 3D GenerationCode11
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V TrustworthinessCode11
Qwen2.5-VL Technical ReportCode11
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion ModelCode11
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language ModelsCode11
ROMAS: A Role-Based Multi-Agent System for Database monitoring and PlanningCode11
Agent S: An Open Agentic Framework that Uses Computers Like a HumanCode11
The AI Scientist: Towards Fully Automated Open-Ended Scientific DiscoveryCode11
WebLLM: A High-Performance In-Browser LLM Inference EngineCode11
Deep Time Series Models: A Comprehensive Survey and BenchmarkCode11
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model ScalingCode11
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting ControlCode11
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task AutomationCode11
Wan: Open and Advanced Large-Scale Video Generative ModelsCode11
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets GenerationCode11
SCORE: Systematic COnsistency and Robustness Evaluation for Large Language ModelsCode11
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language ModelsCode11
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language ModelsCode11
Show:102550
← PrevPage 3 of 18972Next →