SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 1251-1260 of 177340 papers

Title | Status | Hype
Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English | Code | 4
SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference | Code | 4
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation | Code | 4
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search | Code | 4
Conditional Prompt Learning for Vision-Language Models | Code | 4
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing | Code | 4
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 | Code | 4
FG-CLIP: Fine-Grained Visual and Textual Alignment | Code | 4
xLAM: A Family of Large Action Models to Empower AI Agent Systems | Code | 4
Self-Play Preference Optimization for Language Model Alignment | Code | 4
Page 126 of 17734