SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 711720 of 659983 papers

TitleStatusHype
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble ScorersCode5
Reservoir-enhanced Segment Anything Model for Subsurface DiagnosisCode5
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse AttentionCode5
Reinforcement Learning from Human FeedbackCode5
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer FrameworkCode5
Pixel-SAIL: Single Transformer For Pixel-Grounded UnderstandingCode5
Kimi-VL Technical ReportCode5
M-Prometheus: A Suite of Open Multilingual LLM JudgesCode5
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video SegmentationCode5
PaperBench: Evaluating AI's Ability to Replicate AI ResearchCode5
Show:102550
← PrevPage 72 of 65999Next →