SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,216 code links4,818 tasks

Papers

Showing 151175 of 180343 papers

TitleStatusHype
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal UnderstandingCode9
LatentSync: Audio Conditioned Latent Diffusion Models for Lip SyncCode9
FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language ModelsCode9
MiniCPM4: Ultra-Efficient LLMs on End DevicesCode9
Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code UnderstandingCode9
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation ModelsCode9
OLMo: Accelerating the Science of Language ModelsCode9
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training StrategiesCode9
UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented GenerationCode9
Model Stock: All we need is just a few fine-tuned modelsCode9
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge FusionCode9
Large Action Models: From Inception to ImplementationCode9
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and ApplicationsCode9
2 OLMo 2 FuriousCode9
LTX-Video: Realtime Video Latent DiffusionCode9
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion ModelsCode9
s1: Simple test-time scalingCode9
FastVLM: Efficient Vision Encoding for Vision Language ModelsCode9
Depth Anything: Unleashing the Power of Large-Scale Unlabeled DataCode9
Arcee's MergeKit: A Toolkit for Merging Large Language ModelsCode9
SkyServe: Serving AI Models across Regions and Clouds with Spot InstancesCode9
PP-FormulaNet: Bridging Accuracy and Efficiency in Advanced Formula RecognitionCode9
When Do We Not Need Larger Vision Models?Code9
garak: A Framework for Security Probing Large Language ModelsCode9
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt CompressionCode9
Show:102550
← PrevPage 7 of 7214Next →