SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11801–11850 of 661570 papers

Title | Status | Hype
WTHaar-Net: a Hybrid Quantum-Classical Approach | | 0
Biomechanically Accurate Gait Analysis: A 3d Human Reconstruction Framework for Markerless Estimation of Gait Parameters | | 0
SGMA: Semantic-Guided Modality-Aware Segmentation for Remote Sensing with Incomplete Multimodal Data | | 0
ITLC at SemEval-2026 Task 11: Normalization and Deterministic Parsing for Formal Reasoning in LLMs | | 0
Learning Object-Centric Spatial Reasoning for Sequential Manipulation in Cluttered Environments | | 0
ExpGuard: LLM Content Moderation in Specialized Domains | | 0
Beyond Anatomy: Explainable ASD Classification from rs-fMRI via Functional Parcellation and Graph Attention Networks | | 0
NeighborMAE: Exploiting Spatial Dependencies between Neighboring Earth Observation Images in Masked Autoencoders Pretraining | | 0
Delegation and Verification Under AI | | 0
The elbow statistic: Multiscale clustering statistical significance | | 0
LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model | | 0
Bridging Diffusion Guidance and Anderson Acceleration via Hopfield Dynamics | | 0
Same Error, Different Function: The Optimizer as an Implicit Prior in Financial Time Series | | 0
SUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving | | 0
ReCo-Diff: Residual-Conditioned Deterministic Sampling for Cold Diffusion in Sparse-View CT | | 0
Functional Properties of the Focal-Entropy | | 0
A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities | | 0
AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation | | 0
Uni-Skill: Building Self-Evolving Skill Repository for Generalizable Robotic Manipulation | | 0
See and Remember: A Multimodal Agent for Web Traversal | Code | 0
Improving Anomaly Detection with Foundation-Model Synthesis and Wavelet-Domain Attention | | 0
Post Hoc Extraction of Pareto Fronts for Continuous Control | | 0
Towards an Incremental Unified Multimodal Anomaly Detection: Augmenting Multimodal Denoising From an Information Bottleneck Perspective | | 0
MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks | | 0
StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning | | 0
The Vienna 4G/5G Drive-Test Dataset | | 0
Convex and Non-convex Federated Learning with Stale Stochastic Gradients: Diminishing Step Size is All You Need | | 0
OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets | | 0
SEP-YOLO: Fourier-Domain Feature Representation for Transparent Object Instance Segmentation | | 0
Evaluating Cross-Modal Reasoning Ability and Problem Characteristics with Multimodal Item Response Theory | | 0
HomeAdam: Adam and AdamW Algorithms Sometimes Go Home to Obtain Better Provable Generalization | | 0
Improving Diffusion Planners by Self-Supervised Action Gating with Energies | | 0
AlphaFree: Recommendation Free from Users, IDs, and GNNs | | 0
Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches | | 0
SorryDB: Can AI Provers Complete Real-World Lean Theorems? | | 0
LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization | | 0
VisionCreator: A Native Visual-Generation Agentic Model with Understanding, Thinking, Planning and Creation | | 0
HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse | | 0
Retrieval-Augmented Robots via Retrieve-Reason-Act | | 0
Addressing Missing and Noisy Modalities in One Solution: Unified Modality-Quality Framework for Low-quality Multimodal Data | | 0
ShareVerse: Multi-Agent Consistent Video Generation for Shared World Modeling | | 0
Neural quantum support vector data description for one-class classification | | 0
Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids | | 0
Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization | | 0
Sensory-Aware Sequential Recommendation via Review-Distilled Representations | | 0
Enhancing User Throughput in Multi-panel mmWave Radio Access Networks for Beam-based MU-MIMO Using a DRL Method | | 0
A Natural Language Agentic Approach to Study Affective Polarization | | 0
From "What" to "How": Constrained Reasoning for Autoregressive Image Generation | | 0
An Empirical Analysis of Calibration and Selective Prediction in Multimodal Clinical Condition Classification | | 0
TenExp: Mixture-of-Experts-Based Tensor Decomposition Structure Search Framework | | 0
Page 237 of 13232