SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13511375 of 177339 papers

TitleStatusHype
Deep Patch Visual SLAMCode4
Towards Automated Circuit Discovery for Mechanistic InterpretabilityCode4
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement LearningCode4
TigerBot: An Open Multilingual Multitask LLMCode4
PLAID: An Efficient Engine for Late Interaction RetrievalCode4
Knowledge Fusion of Large Language ModelsCode4
TALENT: A Tabular Analytics and Learning ToolboxCode4
Osprey: Pixel Understanding with Visual Instruction TuningCode4
Let's Verify Step by StepCode4
Agent-as-a-Judge: Evaluate Agents with AgentsCode4
TUMTraf V2X Cooperative Perception DatasetCode4
Attention on the SphereCode4
Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian NoiseCode4
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy PredictionCode4
Vision-Language Models for Vision Tasks: A SurveyCode4
A Survey on Visual MambaCode4
End-to-end Autonomous Driving: Challenges and FrontiersCode4
TensoRF: Tensorial Radiance FieldsCode4
A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph DataCode4
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language TasksCode4
Generating Structured Outputs from Language Models: Benchmark and StudiesCode4
Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image SegmentationCode4
Radiative Gaussian Splatting for Efficient X-ray Novel View SynthesisCode4
Timer-XL: Long-Context Transformers for Unified Time Series ForecastingCode4
TRUE: Re-evaluating Factual Consistency EvaluationCode4
Show:102550
← PrevPage 55 of 7094Next →