SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 701750 of 659983 papers

TitleStatusHype
DanceGRPO: Unleashing GRPO on Visual GenerationCode5
UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsCode5
Generating Physically Stable and Buildable LEGO Designs from TextCode5
Continuous Thought MachinesCode5
ZeroSearch: Incentivize the Search Capability of LLMs without SearchingCode5
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video GenerationCode5
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and OpportunitiesCode5
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal DecompositionCode5
WebThinker: Empowering Large Reasoning Models with Deep Research CapabilityCode5
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble ScorersCode5
Reservoir-enhanced Segment Anything Model for Subsurface DiagnosisCode5
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse AttentionCode5
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer FrameworkCode5
Reinforcement Learning from Human FeedbackCode5
Pixel-SAIL: Single Transformer For Pixel-Grounded UnderstandingCode5
Kimi-VL Technical ReportCode5
M-Prometheus: A Suite of Open Multilingual LLM JudgesCode5
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video SegmentationCode5
PaperBench: Evaluating AI's Ability to Replicate AI ResearchCode5
Less-to-More Generalization: Unlocking More Controllability by In-Context GenerationCode5
HDVIO2.0: Wind and Disturbance Estimation with Hybrid Dynamics VIOCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic FaithfulnessCode5
Understanding R1-Zero-Like Training: A Critical PerspectiveCode5
ReSearch: Learning to Reason with Search for LLMs via Reinforcement LearningCode5
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of ToolsCode5
TikZero: Zero-Shot Text-Guided Graphics Program SynthesisCode5
Transformers without NormalizationCode5
FlowTok: Flowing Seamlessly Across Text and Image TokensCode5
OminiControl2: Efficient Conditioning for Diffusion TransformersCode5
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language ModelsCode5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement LearningCode5
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera ControlCode5
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music GenerationCode5
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and SuccessCode5
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-ExpertsCode5
UniDepthV2: Universal Monocular Metric Depth Estimation Made SimplerCode5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training ParadigmsCode5
Fractal Generative ModelsCode5
From System 1 to System 2: A Survey of Reasoning Large Language ModelsCode5
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
TrustRAG: An Information Assistant with Retrieval Augmented GenerationCode5
Magma: A Foundation Model for Multimodal AI AgentsCode5
AIDE: AI-Driven Exploration in the Space of CodeCode5
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?Code5
On the Computation of the Fisher Information in Continual LearningCode5
Time-series attribution maps with regularized contrastive learningCode5
Phantom: Subject-consistent video generation via cross-modal alignmentCode5
The Role of World Models in Shaping Autonomous Driving: A Comprehensive SurveyCode5
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge AdaptationCode5
Show:102550
← PrevPage 15 of 13200Next →