SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 276300 of 659983 papers

TitleStatusHype
Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling0
POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan0
Anti-I2V: Safeguarding your photos from malicious image-to-video generation0
Object Search in Partially-Known Environments via LLM-informed Model-based Planning and Prompt Selection0
Deep Neural Regression Collapse0
Willful Disobedience: Automatically Detecting Failures in Agentic Traces0
Perturbation: A simple and efficient adversarial tracer for representation learning in language models0
How Vulnerable Are Edge LLMs?0
Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers0
Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective0
Beyond Consistency: Inference for the Relative risk functional in Deep Nonparametric Cox Models0
Learning-guided Prioritized Planning for Lifelong Multi-Agent Path Finding in Warehouse Automation0
Infrequent Child-Directed Speech Is Bursty and May Draw Infant Vocalizations0
Resolving gradient pathology in physics-informed epidemiological models0
VehicleMemBench: An Executable Benchmark for Multi-User Long-Term Memory in In-Vehicle Agents0
PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay0
Language Model Planners do not Scale, but do Formalizers?0
3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation0
KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao0
Few-Shot Generative Model Adaption via Identity Injection and Preservation0
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning0
Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth0
WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment0
Coordinate Encoding on Linear Grids for Physics-Informed Neural Networks0
TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation0
Show:102550
← PrevPage 12 of 26400Next →