SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 16761700 of 659983 papers

TitleStatusHype
Holistic Evaluation of Language ModelsCode4
Seed-Coder: Let the Code Model Curate Data for ItselfCode4
FullStack Bench: Evaluating LLMs as Full Stack CodersCode4
Motion Capture Dataset for Practical Use of AI-based Motion Editing and StylizationCode4
The Platonic Representation HypothesisCode4
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation ExpertsCode4
ArchiSound: Audio Generation with DiffusionCode4
Aligning benchmark datasets for table structure recognitionCode4
Instruction Tuning with GPT-4Code4
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning ResearchersCode4
TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality AssessmentCode4
ChatHaruhi: Reviving Anime Character in Reality via Large Language ModelCode4
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language ModelsCode4
FLASC: A Flare-Sensitive Clustering AlgorithmCode4
Video-LLaVA: Learning United Visual Representation by Alignment Before ProjectionCode4
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic KernelsCode4
Video Understanding with Large Language Models: A SurveyCode4
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern LanguagesCode4
InstructIR: High-Quality Image Restoration Following Human InstructionsCode4
Weighted-Reward Preference Optimization for Implicit Model FusionCode4
AlphaFold Meets Flow Matching for Generating Protein EnsemblesCode4
ScreenAgent: A Vision Language Model-driven Computer Control AgentCode4
2D Matryoshka Sentence EmbeddingsCode4
The largest EEG-based BCI reproducibility study for open science: the MOABB benchmarkCode4
3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion PriorsCode4
Show:102550
← PrevPage 68 of 26400Next →