SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 65016550 of 661570 papers

TitleStatusHype
SAS-Net: Cross-Domain Image Registration as Inverse Rendering via Structure-Appearance FactorizationCode0
KAN-FIF: Spline-Parameterized Lightweight Physics-based Tropical Cyclone Estimation on Meteorological SatelliteCode0
Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM TrainingCode0
EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning DistillationCode0
QTrack: Query-Driven Reasoning for Multi-modal MOTCode0
SPACE-CLIP: Spatial Perception via Adaptive CLIP Embeddings for Monocular Depth EstimationCode0
Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote SensingCode0
Iterative Semantic Reasoning from Individual to Group Interests for Generative Recommendation with LLMsCode0
Towards Efficient Medical Reasoning with Minimal Fine-Tuning DataCode0
Colon-X: Advancing Intelligent Colonoscopy toward Clinical ReasoningCode0
DSB: Dynamic Sliding Block Scheduling for Diffusion LLMsCode0
Node Role-Guided LLMs for Dynamic Graph ClusteringCode0
Step-CoT: Stepwise Visual Chain-of-Thought for Medical Visual Question AnsweringCode0
Multi-Modal Character Localization and Extraction for Chinese Text RecognitionCode0
ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic CoveringCode0
VID-AD: A Dataset for Image-Level Logical Anomaly Detection under Vision-Induced DistractionCode0
RSEdit: Text-Guided Image Editing for Remote SensingCode0
Routing Channel-Patch Dependencies in Time Series Forecasting with Graph Spectral DecompositionCode0
Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against DeepfakeCode0
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision EncodersCode0
TSDCRF: Balancing Privacy and Multi-Object Tracking via Time-Series CRF and Normalized Control PenaltyCode0
REAEDP: Entropy-Calibrated Differentially Private Data Release with Formal Guarantees and Attack-Based EvaluationCode0
VFM-Loc: Zero-Shot Cross-View Geo-Localization via Aligning Discriminative Visual HierarchiesCode0
sebis at ArchEHR-QA 2026: How Much Can You Do Locally? Evaluating Grounded EHR QA on a Single NotebookCode0
Multi-Grained Vision-Language Alignment for Domain Generalized Person Re-IdentificationCode0
Garments2Look: A Multi-Reference Dataset for High-Fidelity Outfit-Level Virtual Try-On with Clothing and Accessories1
HEARTS: Benchmarking LLM Reasoning on Health Time Series1
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation1
LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction2
LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume AssessmentsCode0
DualSwinFusionSeg: Multimodal Martian Landslide Segmentation via Dual Swin Transformer with Multi-Scale Fusion and UNet++0
ST-ResGAT: Explainable Spatio-Temporal Graph Neural Network for Road Condition Prediction and Priority-Driven Maintenance0
UniMMAD: Unified Multi-Modal and Multi-Class Anomaly Detection via MoE-Driven Feature DecompressionCode0
Early Rug Pull Warning for BSC Meme Tokens via Multi-Granularity Wash-Trading Pattern Profiling0
Diffusion Reinforcement Learning via Centered Reward Distillation0
Repetition Without Exclusivity: Scale Sensitivity of Referential Mechanisms in Child-Scale Language Models0
VAD4Space: Visual Anomaly Detection for Planetary Surface Imagery0
GroupGuard: A Framework for Modeling and Defending Collusive Attacks in Multi-Agent Systems0
Learning Energy-Efficient Air--Ground Actuation for Hybrid Robots on Stair-Like Terrain0
Response-Aware Risk-Constrained Control Barrier Function With Application to Vehicles0
A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications0
FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Human Context Recognition0
MuViS: Multimodal Virtual Sensing Benchmark0
Large Language Models and Scientific Discourse: Where's the Intelligence?0
Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis0
Deciphering Scientific Reasoning Steps from Outcome Data for Molecule Optimization0
MiSiSUn: Minimum Simplex Semisupervised Unmixing0
JCAS-MARL: Joint Communication and Sensing UAV Networks via Resource-Constrained Multi-Agent Reinforcement Learning0
PhyGile: Physics-Prefix Guided Motion Generation for Agile General Humanoid Motion Tracking0
VERDICT: Verifiable Evolving Reasoning with Directive-Informed Collegial Teams for Legal Judgment Prediction0
Show:102550
← PrevPage 131 of 13232Next →