SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 2401-2425 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders | | 3 |
| ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking | | 3 |
| ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion | | 3 |
| Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI | Code | 3 |
| PhysX: Physical-Grounded 3D Asset Generation | Code | 3 |
| FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale | Code | 3 |
| Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving | Code | 3 |
| A Survey on Latent Reasoning | Code | 3 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Code | 3 |
| RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents | Code | 3 |
| No time to train! Training-Free Reference-Based Instance Segmentation | Code | 3 |
| Epona: Autoregressive Diffusion World Model for Autonomous Driving | Code | 3 |
| L0: Reinforcement Learning to Become General Agents | Code | 3 |
| Flash-VStream: Efficient Real-Time Understanding for Long Video Streams | Code | 3 |
| Ovis-U1 Technical Report | Code | 3 |
| FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language | Code | 3 |
| MMSearch-R1: Incentivizing LMMs to Search | Code | 3 |
| The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas | Code | 3 |
| Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models | Code | 3 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Code | 3 |
| TabArena: A Living Benchmark for Machine Learning on Tabular Data | Code | 3 |
| Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Code | 3 |
| Camera Calibration via Circular Patterns: A Comprehensive Framework with Measurement Uncertainty and Unbiased Projection Model | Code | 3 |
| Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details | Code | 3 |
| Discrete Diffusion in Large Language and Multimodal Models: A Survey | Code | 3 |
Page 97 of 26463