SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 36013650 of 659983 papers

TitleStatusHype
RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge GuidanceCode0
Open-Source Reproduction and Explainability Analysis of Corrective Retrieval Augmented GenerationCode0
Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-trainingCode0
Parametric Social Identity Injection and Diversification in Public Opinion SimulationCode0
Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward ModelsCode0
Decoding the Critique Mechanism in Large Reasoning ModelsCode0
EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and RetrievalCode0
PolyGraph Discrepancy: a classifier-based metric for graph generationCode0
From Intuition to Calibrated Judgment: A Rubric-Based Expert-Panel Study of Human Detection of LLM-Generated Korean TextCode0
Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning1
SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding0
Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video UnderstandingCode0
AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis0
RecBundle: A Next-Generation Geometric Paradigm for Explainable Recommender Systems0
Towards Solving Polynomial-Objective Integer Programming with Hypergraph Neural Networks0
Analytically tractable model of synaptic crowding explains emergent small-world structure and network dynamics0
Exploring Novelty Differences between Industry and Academia: A Knowledge Entity-centric PerspectiveCode0
Co-Design of Memory-Storage Systems for Workload Awareness with Interpretable Models0
RARE disease detection from Capsule Endoscopic Videos based on Vision Transformers0
AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic Transfer0
This Is Taking Too Long -- Investigating Time as a Proxy for Energy Consumption of LLMs0
Entropy-Aware Task Offloading in Mobile Edge Computing0
Vision-Language Model Based Multi-Expert Fusion for CT Image Classification0
Hypothesis Class Determines Explanation: Why Accurate Models Disagree on Feature Attribution0
UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation for Assessing Hepatic Steatosis0
Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models0
Joint Optimization of Storage and Loading for High-Performance 3D Point Cloud Data Processing0
EmergeNav: Structured Embodied Inference for Zero-Shot Vision-and-Language Navigation in Continuous Environments0
Minimum-Action Learning: Energy-Constrained Symbolic Model Selection for Physical Law Identification from Noisy Data0
A Framework for Modeling Liquefaction-Induced Road Disruptions After Earthquakes: Implications for Emergency Response and Access in the Cascadia Region of North America0
KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition0
Automatic Termination Strategy of Inelastic Neutron-scattering Measurement Using Bayesian Optimization for Bin-width Selection0
Kriging via variably scaled kernels0
GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-ResolutionCode0
Lore: Repurposing Git Commit Messages as a Structured Knowledge Protocol for AI Coding Agents0
HAMLOCK: HArdware-Model LOgically Combined attacK0
Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework0
Targum - A Multilingual New Testament Translation Corpus0
Agentic workflow enables the recovery of critical materials from complex feedstocks via selective precipitation0
TI-DeepONet: Learnable Time Integration for Stable Long-Term Extrapolation0
AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems0
Evaluating Causal Discovery Algorithms for Path-Specific Fairness and Utility in Healthcare0
The Importance of Being Smoothly Calibrated0
Automated Counting of Stacked Objects in Industrial Inspection0
Unbiased and Biased Variance-Reduced Forward-Reflected-Backward Splitting Methods for Stochastic Composite Inclusions0
Lite Any Stereo: Efficient Zero-Shot Stereo Matching0
daVinci-Env: Open SWE Environment Synthesis at Scale0
Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents0
Beyond Benchmark Islands: Toward Representative Trustworthiness Evaluation for Agentic AICode0
Geometric framework for biological evolution0
Show:102550
← PrevPage 73 of 13200Next →