SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 951-975 of 659,983 papers

Title | Status | Hype
UQLM: A Python Package for Uncertainty Quantification in Large Language Models | Code | 5
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Code | 5
ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Code | 5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Code | 5
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation | Code | 5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Code | 5
WizardCoder: Empowering Code Large Language Models with Evol-Instruct | Code | 5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Code | 5
Long-term Forecasting with TiDE: Time-series Dense Encoder | Code | 5
From System 1 to System 2: A Survey of Reasoning Large Language Models | Code | 5
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions | Code | 5
Wonder3D: Single Image to 3D using Cross-Domain Diffusion | Code | 5
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model | Code | 5
MV-Adapter: Multi-view Consistent Image Generation Made Easy | Code | 5
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows | Code | 5
DeepPhase: Periodic Autoencoders for Learning Motion Phase Manifolds | Code | 5
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling | Code | 5
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model | Code | 5
Understanding R1-Zero-Like Training: A Critical Perspective | Code | 5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Code | 5
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification | Code | 5
CogAgent: A Visual Language Model for GUI Agents | Code | 5
Transformer-Squared: Self-adaptive LLMs | Code | 5
CogVLM: Visual Expert for Pretrained Language Models | Code | 5
Aria: An Open Multimodal Native Mixture-of-Experts Model | Code | 5
Page 39 of 26,400