SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 21112120 of 661570 papers

TitleStatusHype
Structured Pruning for Deep Convolutional Neural Networks: A surveyCode4
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judgeCode4
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing TasksCode4
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active AssistanceCode4
Orb: A Fast, Scalable Neural Network PotentialCode4
Spirit LM: Interleaved Spoken and Written Language ModelCode4
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world EnvironmentsCode4
SuperCorrect: Supervising and Correcting Language Models with Error-Driven InsightsCode4
I Think, Therefore I am: Benchmarking Awareness of Large Language Models Using AwareBenchCode4
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion TokensCode4
Show:102550
← PrevPage 212 of 66157Next →