SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 31013150 of 659983 papers

TitleStatusHype
Deep Tabular Representation Corrector0
Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration0
Malicious Or Not: Adding Repository Context to Agent Skill Classification0
When and Why Does Unsupervised RL Succeed in Mathematical Reasoning? A Manifold Envelopment Perspective0
Runtime Governance for AI Agents: Policies on Paths0
BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization0
Domain Mixture Design via Log-Likelihood Differences for Aligning Language Models with a Target Model0
On the Transfer of Collinearity to Computer Vision0
Simplex-to-Euclidean Bijection for Conjugate and Calibrated Multiclass Gaussian Process0
FlowComposer: Composable Flows for Compositional Zero-Shot Learning0
Domain-Independent Dynamic Programming with Constraint Propagation0
Efficient Brood Cell Detection in Layer Trap Nests for Bees and Wasps: Balancing Labeling Effort and Species Coverage0
Machines acquire scientific taste from institutional traces0
Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings?0
Spectral Property-Driven Data Augmentation for Hyperspectral Single-Source Domain Generalization0
Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation0
Fast-WAM: Do World Action Models Need Test-time Future Imagination?0
Kinema4D: Kinematic 4D World Modeling for Spatiotemporal Embodied Simulation1
When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making0
HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture0
vAccSOL: Efficient and Transparent AI Vision Offloading for Mobile Robots0
CABTO: Context-Aware Behavior Tree Grounding for Robot Manipulation0
Grid-World Representations in Transformers Reflect Predictive Geometry0
Cost Trade-offs in Matrix Inversion Updates for Streaming Outlier Detection0
Learning Lineage-guided Geodesics with Finsler Geometry0
High-dimensional estimation with missing data: Statistical and computational limits0
GeMA: Learning Latent Manifold Frontiers for Benchmarking Complex Systems0
Emotion-Aware Classroom Quality Assessment Leveraging IoT-Based Real-Time Student Monitoring0
Understanding Quantization of Optimizer States in LLM Pre-training: Dynamics of State Staleness and Effectiveness of State Resets0
IQuest-Coder-V1 Technical Report0
Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure0
Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation0
Efficient Reasoning on the Edge0
MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning0
Data-driven forced response analysis with min-max representations of nonlinear restoring forces0
Finding Common Ground in a Sea of Alternatives0
pADAM: A Plug-and-Play All-in-One Diffusion Architecture for Multi-Physics Learning0
SuCor: Susceptibility Distortion Correction via Parameter-Free and Self-Regularized Optimal Transport0
SOMP: Scalable Gradient Inversion for Large Language Models via Subspace-Guided Orthogonal Matching Pursuit0
Anticipatory Planning for Multimodal AI Agents0
IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans0
High-Dimensional Gaussian Mean Estimation under Realizable Contamination0
ODIN-Based CPU-GPU Architecture with Replay-Driven Simulation and Emulation0
WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation0
SurgΣ: A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence0
Real-Time Decoding of Movement Onset and Offset for Brain-Controlled Rehabilitation Exoskeleton0
Prompt Programming for Cultural Bias and Alignment of Large Language Models0
Stochastic Resetting Accelerates Policy Convergence in Reinforcement Learning0
Internalizing Agency from Reflective Experience0
Dynamic Meta-Layer Aggregation for Byzantine-Robust Federated Learning0
Show:102550
← PrevPage 63 of 13200Next →