SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 40514075 of 661570 papers

TitleStatusHype
Developing a Discrete-Event Simulator of School Shooter Behavior from VR Data0
Optimal rates for density and mode estimation with expand-and-sparsify representations0
Equivariant symmetry-aware head pose estimation for fetal MRICode0
Efficient and Scalable Monocular Human-Object Interaction Motion ReconstructionCode0
Multimodal Machine Learning for Soft High-k Elastomers under Data ScarcityCode0
EPOFusion: Exposure aware Progressive Optimization Method for Infrared and Visible Image FusionCode0
SSP-SAM: SAM with Semantic-Spatial Prompt for Referring Expression SegmentationCode0
Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference OptimizationCode0
Approximate Subgraph Matching with Neural Graph Representations and Reinforcement LearningCode0
ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement LearningCode0
Don't Pass@k: A Bayesian Framework for Large Language Model EvaluationCode0
Theory of Code Space: Do Code Agents Understand Software Architecture?Code0
GRAFITE: Generative Regression Analysis Framework for Issue Tracking and EvaluationCode0
AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and ReuseCode0
DREAM: A Benchmark Study for Deepfake photoREalism AssessMentCode0
MLLM-based Textual Explanations for Face ComparisonCode0
Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning AdaptersCode0
R2-Dreamer: Redundancy-Reduced World Models without Decoders or AugmentationCode0
Open-o3-Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence2
MOSS-TTS Technical Report4
LoST: Level of Semantics Tokenization for 3D Shapes2
OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning0
Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models0
Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech0
M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking0
Show:102550
← PrevPage 163 of 26463Next →