SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1426-1450 of 659,983 papers

Title | Status | Hype
Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography | Code | 4
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey | Code | 4
RGBD GS-ICP SLAM | Code | 4
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models | Code | 4
Exploring the Capabilities of Large Multimodal Models on Dense Text | Code | 4
CameraCtrl: Enabling Camera Control for Text-to-Video Generation | Code | 4
SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models | Code | 4
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers | Code | 4
Data quality dimensions for fair AI | Code | 4
AnyText: Multilingual Visual Text Generation And Editing | Code | 4
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders | Code | 4
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | Code | 4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing | Code | 4
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control | Code | 4
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models | Code | 4
Kubric: A scalable dataset generator | Code | 4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Code | 4
R^2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Code | 4
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Code | 4
RecBole 2.0: Towards a More Up-to-Date Recommendation Library | Code | 4
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Code | 4
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching | Code | 4
Long Context Transfer from Language to Vision | Code | 4
RealisDance: Equip controllable character animation with realistic hands | Code | 4
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Code | 4