SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 56265650 of 177340 papers

TitleStatusHype
Voice Conversion With Just Nearest NeighborsCode2
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with TransformersCode2
DreamLLM: Synergistic Multimodal Comprehension and CreationCode2
On-Device Domain GeneralizationCode2
Dynamic Early Exit in Reasoning ModelsCode2
Medical Vision Generalist: Unifying Medical Imaging Tasks in ContextCode2
AIR-Bench: Automated Heterogeneous Information Retrieval BenchmarkCode2
Revisiting Adversarial Training under Long-Tailed DistributionsCode2
Many-Shot In-Context Learning in Multimodal Foundation ModelsCode2
Towards Unified Keyframe Propagation ModelsCode2
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive TasksCode2
OS-Harm: A Benchmark for Measuring Safety of Computer Use AgentsCode2
A Versatile Framework for Multi-scene Person Re-identificationCode2
Measuring Massive Multitask Language UnderstandingCode2
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing GamesCode2
Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-TuningCode2
Tuning Large Neural Networks via Zero-Shot Hyperparameter TransferCode2
YOLO-UniOW: Efficient Universal Open-World Object DetectionCode2
Voxurf: Voxel-based Efficient and Accurate Neural Surface ReconstructionCode2
DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic SystemsCode2
CLRerNet: Improving Confidence of Lane Detection with LaneIoUCode2
Do we actually understand the impact of renewables on electricity prices? A causal inference approachCode2
Transformer Circuit Faithfulness Metrics are not RobustCode2
Retinexmamba: Retinex-based Mamba for Low-light Image EnhancementCode2
COVID-19 Image Data Collection: Prospective Predictions Are the FutureCode2
Show:102550
← PrevPage 226 of 7094Next →