SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 6576-6600 of 474,278 papers

Title | Status | Hype
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval | Code | 2
Reliable, Reproducible, and Really Fast Leaderboards with Evalica | Code | 2
Exploring Enhanced Contextual Information for Video-Level Object Tracking | Code | 2
AirMorph: Topology-Preserving Deep Learning for Pulmonary Airway Analysis | Code | 2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Code | 2
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt | Code | 2
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection | Code | 2
NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models | Code | 2
Physics-based battery model parametrisation from impedance data | Code | 2
Memory Efficient Matting with Adaptive Token Routing | Code | 2
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects | Code | 2
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Code | 2
GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs? | Code | 2
UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities | Code | 2
RemDet: Rethinking Efficient Model Design for UAV Object Detection | Code | 2
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Code | 2
Financial Fine-tuning a Large Time Series Model | Code | 2
EvalGIM: A Library for Evaluating Generative Image Models | Code | 2
Mr. DETR: Instructive Multi-Route Training for Detection Transformers | Code | 2
Efficient Large-Scale Traffic Forecasting with Transformers: A Spatial Data Management Perspective | Code | 2
AutoPatent: A Multi-Agent Framework for Automatic Patent Generation | Code | 2
Simple Guidance Mechanisms for Discrete Diffusion Models | Code | 2
Doe-1: Closed-Loop Autonomous Driving with Large World Model | Code | 2
Auto-Regressive Moving Diffusion Models for Time Series Forecasting | Code | 2
Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine | Code | 2
Page 264 of 18972