SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 27262750 of 661570 papers

TitleStatusHype
Curie: Toward Rigorous and Automated Scientific Experimentation with AI AgentsCode3
Prompt-to-LeaderboardCode3
Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture GenerationCode3
Accelerating Neural Network Training: An Analysis of the AlgoPerf CompetitionCode3
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMsCode3
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise DataCode3
CrossOver: 3D Scene Cross-Modal AlignmentCode3
A Comprehensive Survey on Composed Image RetrievalCode3
Slamming: Training a Speech Language Model on One GPU in a DayCode3
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object ManipulationCode3
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song GenerationCode3
Soundwave: Less is More for Speech-Text Alignment in LLMsCode3
Personalized Image Generation with Deep Generative Models: A Decade SurveyCode3
PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational PathsCode3
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge NetworksCode3
LIMR: Less is More for RL ScalingCode3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information RetrievalCode3
TokenSkip: Controllable Chain-of-Thought Compression in LLMsCode3
Learning Getting-Up Policies for Real-World Humanoid RobotsCode3
Intuitive physics understanding emerges from self-supervised pretraining on natural videosCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
Stonefish: Supporting Machine Learning Research in Marine RoboticsCode3
Automated Hypothesis Validation with Agentic Sequential FalsificationsCode3
Text-guided Sparse Voxel Pruning for Efficient 3D Visual GroundingCode3
Strassen Multisystolic Array Hardware ArchitecturesCode3
Show:102550
← PrevPage 110 of 26463Next →