SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 26262650 of 661570 papers

TitleStatusHype
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion ModelsCode3
Frequency Dynamic Convolution for Dense Image PredictionCode3
Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena PerspectiveCode3
Retrieval Augmented Generation and Understanding in Vision: A Survey and New OutlookCode3
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from VideosCode3
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language PretrainingCode3
Multi-Modality Representation Learning for Antibody-Antigen Interactions PredictionCode3
NdLinear Is All You Need for Representation LearningCode3
Halton Scheduler For Masked Generative Image TransformerCode3
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn'tCode3
Unleashing Vecset Diffusion Model for Fast Shape GenerationCode3
XAttention: Block Sparse Attention with Antidiagonal ScoringCode3
NeuralFoil: An Airfoil Aerodynamics Analysis Tool Using Physics-Informed Machine LearningCode3
A Comprehensive Survey on Long Context Language ModelingCode3
Unreal-MAP: Unreal-Engine-Based General Platform for Multi-Agent Reinforcement LearningCode3
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning TasksCode3
Vision-Speech Models: Teaching Speech Models to Converse about ImagesCode3
TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane NetworksCode3
Measuring AI Ability to Complete Long TasksCode3
MDocAgent: A Multi-Modal Multi-Agent Framework for Document UnderstandingCode3
MoonCast: High-Quality Zero-Shot Podcast GenerationCode3
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking PortraitCode3
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy OptimizationCode3
VideoMind: A Chain-of-LoRA Agent for Long Video ReasoningCode3
Why Do Multi-Agent LLM Systems Fail?Code3
Show:102550
← PrevPage 106 of 26463Next →