SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3061-3070 of 474,278 papers

Title | Status | Hype
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies | Code | 3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Code | 3
GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation | Code | 3
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation | Code | 3
Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Code | 3
LoLCATs: On Low-Rank Linearizing of Large Language Models | Code | 3
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Code | 3
FlatQuant: Flatness Matters for LLM Quantization | Code | 3
MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Code | 3
C-Adapter: Adapting Deep Classifiers for Efficient Conformal Prediction Sets | Code | 3