SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 80518075 of 177340 papers

TitleStatusHype
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
MedM-VL: What Makes a Good Medical LVLM?Code2
Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained RewardsCode2
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkCode2
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion ModelsCode2
All for One and One for All: Improving Music Separation by Bridging NetworksCode2
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and RestorationCode2
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent SystemsCode2
Mixture of LoRA ExpertsCode2
Neighboring Autoregressive Modeling for Efficient Visual GenerationCode2
The Calysto Scheme ProjectCode2
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsCode2
Exploring Plain Vision Transformer Backbones for Object DetectionCode2
Twin-Merging: Dynamic Integration of Modular Expertise in Model MergingCode2
Hidden Biases of End-to-End Driving ModelsCode2
LaserMix for Semi-Supervised LiDAR Semantic SegmentationCode2
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source LocalizationCode2
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal ModelingCode2
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation EngineeringCode2
SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep ReconstructionCode2
Can Language Models Solve Olympiad Programming?Code2
Improving Autoformalization using Type CheckingCode2
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionCode2
Masked Autoencoders for Point Cloud Self-supervised LearningCode2
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation LearningCode2
Show:102550
← PrevPage 323 of 7094Next →