SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 33813390 of 474278 papers

TitleStatusHype
Are Language Models Actually Useful for Time Series Forecasting?Code3
Taming 3DGS: High-Quality Radiance Fields with Limited ResourcesCode3
A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion ModelsCode3
Consistency Models Made EasyCode3
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and BaselinesCode3
LLM4CP: Adapting Large Language Models for Channel PredictionCode3
^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network PotentialsCode3
Detecting hallucinations in large language models using semantic entropyCode3
GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual GenerationCode3
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM AgentsCode3
Show:102550
← PrevPage 339 of 47428Next →