SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 826–850 of 659,983 papers

Title | Status | Hype
CanViT: Toward Active-Vision Foundation Models | | 0
FullCircle: Effortless 3D Reconstruction from Casual 360° Captures | | 0
CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context | | 0
STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving | | 0
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models? | | 0
Single-Subject Multi-View MRI Super-Resolution via Implicit Neural Representations | | 0
LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation | | 0
CAM3R: Camera-Agnostic Model for 3D Reconstruction | | 0
Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature | | 0
Q-Tacit: Image Quality Assessment via Latent Visual Reasoning | | 0
Multi-Method Validation of Large Language Model Medical Translation Across High- and Low-Resource Languages | | 0
Overfitting and Generalizing with (PAC) Bayesian Prediction in Noisy Binary Classification | | 0
AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research | | 0
Pretext Matters: An Empirical Study of SSL Methods in Medical Imaging | | 0
MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping | | 0
Mixture of Demonstrations for Textual Graph Understanding and Question Answering | | 0
Upper Entropy for 2-Monotone Lower Probabilities | | 0
DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation | | 0
Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy Market Returns | | 0
Off-Policy Evaluation for Ranking Policies under Deterministic Logging Policies | | 0
GaussianSSC: Triplane-Guided Directional Gaussian Fields for 3D Semantic Completion | | 0
Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation | | 0
Effective Strategies for Asynchronous Software Engineering Agents | | 0
Learning Can Converge Stably to the Wrong Belief under Latent Reliability | | 0
Multinoulli Extension: A Lossless Continuous Relaxation for Partition-Constrained Subset Selection | | 0