SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 50265050 of 661570 papers

TitleStatusHype
AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-benchCode2
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics EmulationCode2
MathOptAI.jl: Embed trained machine learning predictors into JuMP modelsCode2
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech EnhancementCode2
Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-TuningCode2
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real WorldCode2
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous EnvironmentsCode2
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement LearningCode2
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and CollisionsCode2
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow EstimationCode2
EAMamba: Efficient All-Around Vision State Space Model for Image RestorationCode2
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMsCode2
Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement LearningCode2
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement LearningCode2
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT ImprovementsCode2
Spatial Mental Modeling from Limited ViewsCode2
BMFM-DNA: A SNP-aware DNA foundation model to capture variant effectsCode2
DBConformer: Dual-Branch Convolutional Transformer for EEG DecodingCode2
EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing CorporaCode2
Curve-Aware Gaussian Splatting for 3D Parametric Curve ReconstructionCode2
ESMStereo: Enhanced ShuffleMixer Disparity Upsampling for Real-Time and Accurate Stereo MatchingCode2
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding ModelCode2
HumanOmniV2: From Understanding to Omni-Modal Reasoning with ContextCode2
Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and TrendsCode2
FairyGen: Storied Cartoon Video from a Single Child-Drawn CharacterCode2
Show:102550
← PrevPage 202 of 26463Next →