SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 23012325 of 661570 papers

TitleStatusHype
Safurai 001: New Qualitative Approach for Code LLM EvaluationCode4
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language ModelsCode4
RePaint: Inpainting using Denoising Diffusion Probabilistic ModelsCode4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQLCode4
MTEB: Massive Text Embedding BenchmarkCode4
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement LearningCode4
Identify Critical KV Cache in LLM Inference from an Output Perturbation PerspectiveCode4
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic DataCode4
FinBen: A Holistic Financial Benchmark for Large Language ModelsCode4
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion ModelsCode4
Region-Aware Text-to-Image Generation via Hard Binding and Soft RefinementCode4
Recognize Anything: A Strong Image Tagging ModelCode4
Replace Anyone in VideosCode4
Phased Consistency ModelsCode4
A Survey on Vision-Language-Action Models for Autonomous DrivingCode4
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingCode4
InternLM-Math: Open Math Large Language Models Toward Verifiable ReasoningCode4
Training-free Regional Prompting for Diffusion TransformersCode4
Your ViT is Secretly an Image Segmentation ModelCode4
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image SegmentationCode4
MedMamba: Vision Mamba for Medical Image ClassificationCode4
CLAIMED -- the open source framework for building coarse-grained operators for accelerated discovery in scienceCode4
SepLLM: Accelerate Large Language Models by Compressing One Segment into One SeparatorCode4
SVFR: A Unified Framework for Generalized Video Face RestorationCode4
Hidden Biases of End-to-End Driving DatasetsCode4
Show:102550
← PrevPage 93 of 26463Next →