SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 50015025 of 661570 papers

TitleStatusHype
Masked BRep Autoencoder via Hierarchical Graph Transformer0
Analyzing Error Sources in Global Feature Effect Estimation0
Physics-Informed Neural Systems for the Simulation of EUV Electromagnetic Wave Diffraction from a Lithography Mask0
Tracking the Discriminative Axis: Dual Prototypes for Test-Time OOD Detection Under Covariate Shift0
SAGE: Multi-Agent Self-Evolution for LLM Reasoning0
Noisy Data is Destructive to Reinforcement Learning with Verifiable Rewards0
Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction0
Deep Adaptive Model-Based Design of Experiments0
Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism0
Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery0
3D tomography of exchange phase in a Si/SiGe quantum dot device0
POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs0
The Era of End-to-End Autonomy: Transitioning from Rule-Based Driving to Large Driving Models0
Volumetrically Consistent Implicit Atlas Learning via Neural Diffeomorphic Flow for Placenta MRI0
A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog0
Safe Distributionally Robust Feature Selection under Covariate Shift0
Diffusion Models for Joint Audio-Video Generation0
Reevaluating the Intra-Modal Misalignment Hypothesis in CLIP0
ViT-AdaLA: Adapting Vision Transformers with Linear Attention0
Adaptive regularization parameter selection for high-dimensional inverse problems: A Bayesian approach with Tucker low-rank constraints0
Structured prototype regularization for synthetic-to-real driving scene parsing0
Attribution Upsampling should Redistribute, Not Interpolate0
SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia0
ClaimFlow: Tracing the Evolution of Scientific Claims in NLP0
Interact3D: Compositional 3D Generation of Interactive Objects0
Show:102550
← PrevPage 201 of 26463Next →