SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 45264550 of 177340 papers

TitleStatusHype
N-LTP: An Open-source Neural Language Technology Platform for ChineseCode3
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM AgentsCode3
EfficientDet: Scalable and Efficient Object DetectionCode3
Auto-Sklearn 2.0: Hands-free AutoML via Meta-LearningCode3
LongRoPE2: Near-Lossless LLM Context Window ScalingCode3
A Novel Non-population-based Meta-heuristic Optimizer Inspired by the Philosophy of Yi JingCode3
CarDreamer: Open-Source Learning Platform for World Model based Autonomous DrivingCode3
Practical Video Object Detection via Feature Selection and AggregationCode3
LIMR: Less is More for RL ScalingCode3
WebCanvas: Benchmarking Web Agents in Online EnvironmentsCode3
ptwt - The PyTorch Wavelet ToolboxCode3
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond ScalingCode3
DNA Family: Boosting Weight-Sharing NAS with Block-Wise SupervisionsCode3
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free ResolutionCode3
MTVCrafter: 4D Motion Tokenization for Open-World Human Image AnimationCode3
mlpack 3: a fast, flexible machine learning libraryCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
Rethinking Histology Slide Digitization Workflows for Low-Resource SettingsCode3
Allo: A Programming Model for Composable Accelerator DesignCode3
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of ManipulationsCode3
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of contextCode3
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM InferenceCode3
PyTorch Metric LearningCode3
ReasonIR: Training Retrievers for Reasoning TasksCode3
OCR-free Document Understanding TransformerCode3
Show:102550
← PrevPage 182 of 7094Next →