SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5,126-5,150 of 661,570 papers

Title | Status | Hype
MOSAIC: Composable Safety Alignment with Modular Control Tokens |  | 0
How to Utilize Complementary Vision-Text Information for 2D Structure Understanding |  | 0
Physics-integrated neural differentiable modeling for immersed boundary systems |  | 0
FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition |  | 0
Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits |  | 0
Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification |  | 0
On-Policy Self-Distillation for Reasoning Compression | Code | 0
Clinical Priors Guided Lung Disease Detection in 3D CT Scans |  | 0
Controllable Graph Generation with Diffusion Models via Inference-Time Tree Search Guidance |  | 0
Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes |  | 0
Muon Converges under Heavy-Tailed Noise: Nonconvex Hölder-Smooth Empirical Risk Minimization |  | 0
Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models |  | 0
Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery |  | 0
Alternating Gradient Flow Utility: A Unified Metric for Structural Pruning and Dynamic Routing in Deep Networks |  | 0
Content-Aware Mamba for Learned Image Compression | Code | 0
SARMAE: Masked Autoencoder for SAR Representation Learning | Code | 0
Urban Socio-Semantic Segmentation with Vision-Language Reasoning | Code | 0
Power Analysis for Prediction-Powered Inference | Code | 0
SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era | Code | 0
PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space | Code | 0
Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection | Code | 0
AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection | Code | 0
MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation | Code | 0
3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion | Code | 0
KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied Planning | Code | 0