SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 50515075 of 661570 papers

TitleStatusHype
HumanOmniV2: From Understanding to Omni-Modal Reasoning with ContextCode2
FairyGen: Storied Cartoon Video from a Single Child-Drawn CharacterCode2
Language Modeling by Language ModelsCode2
OctoThinker: Mid-training Incentivizes Reinforcement Learning ScalingCode2
Stochastic Parameter DecompositionCode2
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket ConditioningCode2
Video Compression for Spatiotemporal Earth System DataCode2
ConStellaration: A dataset of QI-like stellarator plasma boundaries and optimization benchmarksCode2
MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction ModelsCode2
An ab initio foundation model of wavefunctions that accurately describes chemical bond breakingCode2
AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory ComputingCode2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics LearningCode2
Thought Anchors: Which LLM Reasoning Steps Matter?Code2
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation BoosterCode2
TAB: Unified Benchmarking of Time Series Anomaly Detection MethodsCode2
Graphs Meet AI Agents: Taxonomy, Progress, and Future OpportunitiesCode2
From Tiny Machine Learning to Tiny Deep Learning: A SurveyCode2
MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based AgentsCode2
RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and TrackingCode2
Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion ModelsCode2
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario GenerationCode2
RapFlow-TTS: Rapid and High-Fidelity Text-to-Speech with Improved Consistency Flow MatchingCode2
Watermarking Autoregressive Image GenerationCode2
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph RefinementCode2
Descriptor-based Foundation Models for Molecular Property PredictionCode2
Show:102550
← PrevPage 203 of 26463Next →