SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 28262850 of 177340 papers

TitleStatusHype
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous DrivingCode3
EfficientVMamba: Atrous Selective Scan for Light Weight Visual MambaCode3
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use CapabilitiesCode3
Accelerating Diffusion Transformers with Dual Feature CachingCode3
Keypoint Promptable Re-IdentificationCode3
Proteus: A Self-Designing Range FilterCode3
SARATR-X: Toward Building A Foundation Model for SAR Target RecognitionCode3
AutoTimes: Autoregressive Time Series Forecasters via Large Language ModelsCode3
PromptKD: Unsupervised Prompt Distillation for Vision-Language ModelsCode3
Matbench Discovery -- A framework to evaluate machine learning crystal stability predictionsCode3
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language ModelsCode3
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image SegmentationCode3
Multimodal Foundation Models: From Specialists to General-Purpose AssistantsCode3
Aria-UI: Visual Grounding for GUI InstructionsCode3
Karatsuba Matrix Multiplication and its Efficient Custom Hardware ImplementationsCode3
VRT: A Video Restoration TransformerCode3
A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-MakingCode3
TinyAgent: Function Calling at the EdgeCode3
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language ModelsCode3
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time SeriesCode3
Towards An End-to-End Framework for Flow-Guided Video InpaintingCode3
Sintel: A Machine Learning Framework to Extract Insights from SignalsCode3
VideoCutLER: Surprisingly Simple Unsupervised Video Instance SegmentationCode3
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
Playing Non-Embedded Card-Based Games with Reinforcement LearningCode3
Show:102550
← PrevPage 114 of 7094Next →