SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 26762700 of 177340 papers

TitleStatusHype
Open-Source Skull Reconstruction with MONAICode3
MMedAgent: Learning to Use Medical Tools with Multi-modal AgentCode3
DiarizationLM: Speaker Diarization Post-Processing with Large Language ModelsCode3
RelBench: A Benchmark for Deep Learning on Relational DatabasesCode3
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future DirectionsCode3
Learning Bipedal Walking On Planned Footsteps For Humanoid RobotsCode3
Large Language Monkeys: Scaling Inference Compute with Repeated SamplingCode3
ECG-FM: An Open Electrocardiogram Foundation ModelCode3
Hyper-YOLO: When Visual Object Detection Meets Hypergraph ComputationCode3
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised LearningCode3
SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear ComplexityCode3
CAD-Recode: Reverse Engineering CAD Code from Point CloudsCode3
EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-JudgeCode3
DeepfakeBench: A Comprehensive Benchmark of Deepfake DetectionCode3
FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity PredictionCode3
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation ModelsCode3
ImageFolder: Autoregressive Image Generation with Folded TokensCode3
ConsistI2V: Enhancing Visual Consistency for Image-to-Video GenerationCode3
Simple linear attention language models balance the recall-throughput tradeoffCode3
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation SystemCode3
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple FeaturesCode3
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified FlowCode3
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window TransformerCode3
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens IntactCode3
Show:102550
← PrevPage 108 of 7094Next →