SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 98769900 of 177340 papers

TitleStatusHype
Sparse maximal update parameterization: A holistic approach to sparse training dynamicsCode2
Frustratingly Easy Test-Time Adaptation of Vision-Language ModelsCode2
Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identificationCode2
Scaling Laws and Compute-Optimal Training Beyond Fixed Training DurationsCode2
Benchmarking and Improving Detail Image CaptionCode2
WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting PointCode2
Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series ClassificationCode2
TabPedia: Towards Comprehensive Visual Table Understanding with Concept SynergyCode2
DroneVis: Versatile Computer Vision Library for DronesCode2
Neural Optimal Transport with Lagrangian CostsCode2
Generative Pre-trained Speech Language Model with Efficient Hierarchical TransformerCode2
Parameter-Inverted Image Pyramid NetworksCode2
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision TasksCode2
FRAG: Frequency Adapting Group for Diffusion Video EditingCode2
Towards Lifelong Learning of Large Language Models: A SurveyCode2
Needle In A Multimodal HaystackCode2
DafnyBench: A Benchmark for Formal Software VerificationCode2
Probing Synergistic High-Order Interaction in Infrared and Visible Image FusionCode2
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMsCode2
ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real WorldCode2
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language ModelsCode2
Can LLMs Learn by Teaching for Better Reasoning? A Preliminary StudyCode2
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?Code2
GC4NC: A Benchmark Framework for Graph Condensation on Node Classification with New InsightsCode2
Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular VideosCode2
Show:102550
← PrevPage 396 of 7094Next →