SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 75267550 of 474278 papers

TitleStatusHype
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model EvaluationCode2
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysisCode2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained TasksCode2
Learning Generative Interactive Environments By Trained Agent ExplorationCode2
What is the Role of Small Models in the LLM Era: A SurveyCode2
Towards Generalizable Scene Change DetectionCode2
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank AdaptationCode2
TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification TasksCode2
GASP: Gaussian Splatting for Physic-Based SimulationsCode2
IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTSCode2
Assessing SPARQL capabilities of Large Language ModelsCode2
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank AdaptationsCode2
DiffusionPen: Towards Controlling the Style of Handwritten Text GenerationCode2
Revisiting the Solution of Meta KDD Cup 2024: CRAGCode2
PiEEG-16 to Measure 16 EEG Channels with Raspberry Pi for Brain-Computer Interfaces and EEG devicesCode2
The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing lossCode2
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven RefinementCode2
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMsCode2
A Survey on Diffusion Models for Recommender SystemsCode2
A Survey on Mixup Augmentations and BeyondCode2
Evaluating Neural Networks Architectures for Spring Reverb ModellingCode2
forester: A Tree-Based AutoML Tool in RCode2
FedModule: A Modular Federated Learning FrameworkCode2
A Comprehensive Survey on Evidential Deep Learning and Its ApplicationsCode2
GST: Precise 3D Human Body from a Single Image with Gaussian Splatting TransformersCode2
Show:102550
← PrevPage 302 of 18972Next →