SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 10726-10750 of 177340 papers

Title | Status | Hype
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Code | 2
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering | Code | 2
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL | Code | 2
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Code | 2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Code | 2
A Survey on Industrial Anomalies Synthesis | Code | 2
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think | Code | 2
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video | Code | 2
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning | Code | 2
WritingBench: A Comprehensive Benchmark for Generative Writing | Code | 2
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing | Code | 2
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling | Code | 2
MegaMath: Pushing the Limits of Open Math Corpora | Code | 2
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction | Code | 2
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | Code | 2
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning | Code | 2
μPC: Scaling Predictive Coding to 100+ Layer Networks | Code | 2
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank | Code | 2
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features | Code | 2
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction | Code | 2
Play to Generalize: Learning to Reason Through Game Play | Code | 2
ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark | Code | 2
Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction | Code | 2
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Code | 2
LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation | Code | 2
Page 430 of 7094