SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1201-1250 of 659,983 papers

Title | Status | Hype
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models | Code | 5
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model | Code | 5
Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions | Code | 5
aeon: a Python toolkit for learning from time series | Code | 5
Controllable Generation with Text-to-Image Diffusion Models: A Survey | Code | 5
Datasets for Large Language Models: A Comprehensive Survey | Code | 5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding | Code | 5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Code | 5
Make Your LLM Fully Utilize the Context | Code | 5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training | Code | 5
Unified Training of Universal Time Series Forecasting Transformers | Code | 5
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework | Code | 5
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis | Code | 5
Learning Flow Fields in Attention for Controllable Person Image Generation | Code | 5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Code | 5
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens | - | 4
Unified Personalized Reward Model for Vision Generation | - | 4
Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills | - | 4
Reinforcement Learning via Self-Distillation | - | 4
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks | - | 4
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models | - | 4
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery | - | 4
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining | - | 4
VideoWorld 2: Learning Transferable Knowledge from Real-world Videos | - | 4
R-Zero: Self-Evolving Reasoning LLM from Zero Data | - | 4
ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs | - | 4
Precise Object and Effect Removal with Adaptive Target-Aware Attention | - | 4
MOSS-TTS Technical Report | - | 4
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations | - | 4
MotionStream: Real-Time Video Generation with Interactive Motion Controls | - | 4
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models | - | 4
Closing the Loop: Universal Repository Representation with RPG-Encoder | - | 4
MOVA: Towards Scalable and Synchronized Video-Audio Generation | - | 4
Cautious Weight Decay | - | 4
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs | - | 4
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching | - | 4
SkillNet: Create, Evaluate, and Connect AI Skills | - | 4
TTT3R: 3D Reconstruction as Test-Time Training | - | 4
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation | - | 4
SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds | - | 4
UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers | - | 4
Utonia: Toward One Encoder for All Point Clouds | - | 4
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models | - | 4
On the Theoretical Limitations of Embedding-Based Retrieval | - | 4
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator | - | 4
Hyperagents | - | 4
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations | - | 4
Masked Depth Modeling for Spatial Perception | - | 4
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research | - | 4
Learning to Discover at Test Time | - | 4