SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 29012925 of 661570 papers

TitleStatusHype
Keypoint Promptable Re-IdentificationCode3
Proteus: A Self-Designing Range FilterCode3
SARATR-X: Toward Building A Foundation Model for SAR Target RecognitionCode3
AutoTimes: Autoregressive Time Series Forecasters via Large Language ModelsCode3
PromptKD: Unsupervised Prompt Distillation for Vision-Language ModelsCode3
Matbench Discovery -- A framework to evaluate machine learning crystal stability predictionsCode3
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language ModelsCode3
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image SegmentationCode3
Multimodal Foundation Models: From Specialists to General-Purpose AssistantsCode3
Aria-UI: Visual Grounding for GUI InstructionsCode3
Karatsuba Matrix Multiplication and its Efficient Custom Hardware ImplementationsCode3
VRT: A Video Restoration TransformerCode3
A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-MakingCode3
TinyAgent: Function Calling at the EdgeCode3
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language ModelsCode3
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time SeriesCode3
Towards An End-to-End Framework for Flow-Guided Video InpaintingCode3
Sintel: A Machine Learning Framework to Extract Insights from SignalsCode3
VideoCutLER: Surprisingly Simple Unsupervised Video Instance SegmentationCode3
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
Playing Non-Embedded Card-Based Games with Reinforcement LearningCode3
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech SeparationCode3
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative ModelsCode3
Evaluation Report on MCP ServersCode3
ChartGalaxy: A Dataset for Infographic Chart Understanding and GenerationCode3
Show:102550
← PrevPage 117 of 26463Next →