SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1001-1025 of 659,983 papers

Title | Status | Hype
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Code | 5
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions | Code | 5
VideoMamba: State Space Model for Efficient Video Understanding | Code | 5
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head | Code | 5
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Code | 5
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Code | 5
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Code | 5
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document | Code | 5
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Code | 5
Controllable Generation with Text-to-Image Diffusion Models: A Survey | Code | 5
Common 7B Language Models Already Possess Strong Math Capabilities | Code | 5
3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations | Code | 5
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection | Code | 5
Rethinking LLM Language Adaptation: A Case Study on Chinese Mixtral | Code | 5
APISR: Anime Production Inspired Real-World Anime Super-Resolution | Code | 5
LAB: Large-Scale Alignment for ChatBots | Code | 5
Retrieval-Augmented Generation for AI-Generated Content: A Survey | Code | 5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Code | 5
Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Code | 5
Datasets for Large Language Models: A Comprehensive Survey | Code | 5
Information Flow Routes: Automatically Interpreting Language Models at Scale | Code | 5
Language Agents as Optimizable Graphs | Code | 5
Repetition Improves Language Model Embeddings | Code | 5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Code | 5
MambaIR: A Simple Baseline for Image Restoration with State-Space Model | Code | 5
Page 41 of 26400