SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 81768200 of 177340 papers

TitleStatusHype
Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing DetectionCode2
ShiftwiseConv: Small Convolutional Kernel with Large Kernel EffectCode2
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement LearningCode2
Institutional Books 1.0: A 242B token dataset from Harvard Library's collections, refined for accuracy and usabilityCode2
Do MIL Models Transfer?Code2
SDialog: A Python Toolkit for Synthetic Dialogue Generation and AnalysisCode2
Vision Transformers Don't Need Trained RegistersCode2
AutoMind: Adaptive Knowledgeable Agent for Automated Data ScienceCode2
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI AgentsCode2
IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic EnvironmentsCode2
VerIF: Verification Engineering for Reinforcement Learning in Instruction FollowingCode2
Solving the Job Shop Scheduling Problem with Graph Neural Networks: A Customizable Reinforcement Learning EnvironmentCode2
AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory ComputingCode2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics LearningCode2
Towards In-the-wild 3D Plane Reconstruction from a Single ImageCode2
Test3R: Learning to Reconstruct 3D at Test TimeCode2
Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and TrendsCode2
Flow-Anchored Consistency ModelsCode2
Feed-Forward SceneDINO for Unsupervised Semantic Scene CompletionCode2
EAMamba: Efficient All-Around Vision State Space Model for Image RestorationCode2
Open Source Planning & Control System with Language Agents for Autonomous Scientific DiscoveryCode2
CaRL: Learning Scalable Planning Policies with Simple RewardsCode2
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal ModelCode2
Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic ThresholdingCode2
The Best of Both Worlds: Combining Recent Advances in Neural Machine TranslationCode2
Show:102550
← PrevPage 328 of 7094Next →