SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 10011025 of 177339 papers

TitleStatusHype
Online Iterative Reinforcement Learning from Human Feedback with General Preference ModelCode5
Segment Anything Model for Medical Image Segmentation: Current Applications and Future DirectionsCode5
aeon: a Python toolkit for learning from time seriesCode5
Controllable Generation with Text-to-Image Diffusion Models: A SurveyCode5
Datasets for Large Language Models: A Comprehensive SurveyCode5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative DecodingCode5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait SynthesisCode5
Make Your LLM Fully Utilize the ContextCode5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative TrainingCode5
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world VideosCode5
ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body SkillsCode5
MambaIRv2: Attentive State Space RestorationCode5
WebLINX: Real-World Website Navigation with Multi-Turn DialogueCode5
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative ModelsCode5
Trust Regions for Explanations via Black-Box Probabilistic CertificationCode5
MEIA: Multimodal Embodied Perception and Interaction in Unknown EnvironmentsCode5
EasyPhoto: Your Smart AI Photo GeneratorCode5
Language Agents as Optimizable GraphsCode5
Data-Juicer: A One-Stop Data Processing System for Large Language ModelsCode5
Training Large Language Models to Reason in a Continuous Latent SpaceCode5
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual PerceptionCode5
YOLOv6: A Single-Stage Object Detection Framework for Industrial ApplicationsCode5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture ModificationCode5
OminiControl2: Efficient Conditioning for Diffusion TransformersCode5
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8BCode5
Show:102550
← PrevPage 41 of 7094Next →