SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 521530 of 659983 papers

TitleStatusHype
Judging LLM-as-a-Judge with MT-Bench and Chatbot ArenaCode7
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language ModelsCode7
Full Scaling Automation for Sustainable Development of Green Data CentersCode7
EasySpider: A No-Code Visual System for Crawling the WebCode7
Measuring Massive Multitask Chinese UnderstandingCode7
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language ModelsCode7
Low-code LLM: Graphical User Interface over Large Language ModelsCode7
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation ModelsCode7
LLaMA: Open and Efficient Foundation Language ModelsCode7
Show:102550
← PrevPage 53 of 65999Next →