SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 59766000 of 661570 papers

TitleStatusHype
FocusTrack: One-Stage Focus-and-Suppress Framework for 3D Point Cloud Object Tracking0
Multi-Condition Digital Twin Calibration for Axial Piston Pumps : Compound Fault Simulation0
A Comprehensive Evaluation of LLM Unlearning Robustness under Multi-Turn Interaction0
RMBench: Memory-Dependent Robotic Manipulation Benchmark with Insights into Policy Design0
On Google's SynthID-Text LLM Watermarking System: Theoretical Analysis and Empirical Validation0
A Hypertoroidal Covering for Perfect Color Equivariance0
ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation0
Frequency-Separable Hamiltonian Neural Network for Multi-Timescale Dynamics0
TrajPred: Trajectory-Conditioned Joint Embedding Prediction for Surgical Instrument-Tissue Interaction Recognition in Vision-Language Models0
Single Image Super-Resolution via Bivariate `A Trous Wavelet Diffusion0
Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice0
Proxy-Guided Measurement Calibration0
OrthoFormer: Instrumental Variable Estimation in Transformer Hidden States via Neural Control Functions0
Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness0
STRIDE: Structured Lagrangian and Stochastic Residual Dynamics via Flow Matching0
Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers0
LuxBorrow: From Pompier to Pompjee, Tracing Borrowing in Luxembourgish0
Kernel Tests of Equivalence0
Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI0
Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution0
KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation0
MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?0
Data-Driven Physics Embedded Dynamics with Predictive Control and Reinforcement Learning for Quadrupeds0
Beyond Means: Topological Causal Effects under Persistent-Homology Ignorability0
Balancing Multimodal Domain Generalization via Gradient Modulation and Projection0
Show:102550
← PrevPage 240 of 26463Next →