SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 35313540 of 474278 papers

TitleStatusHype
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools SegmentationCode3
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on WikipediaCode3
Deep Learning for Trajectory Data Management and Mining: A Survey and BeyondCode3
DeepCAVE: An Interactive Analysis Tool for Automated Machine LearningCode3
Plotly-Resampler: Effective Visual Analytics for Large Time SeriesCode3
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-MakingCode3
The Common Core OntologiesCode3
PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent TasksCode3
PANGAEA: A Global and Inclusive Benchmark for Geospatial Foundation ModelsCode3
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic AgentsCode3
Show:102550
← PrevPage 354 of 47428Next →