SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9451-9500 of 661,570 papers

Title | Status | Hype
TransUNet-GradCAM: A Hybrid Transformer-U-Net with Self-Attention and Explainable Visualizations for Foot Ulcer Segmentation | | 0
BoxMind: Closed-loop AI strategy optimization for elite boxing validated in the 2024 Olympics | Code | 0
Compose by Focus: Scene Graph-based Atomic Skills | | 0
Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR | | 0
Sparsity and Out-of-Distribution Generalization | | 0
Learning-free L2-Accented Speech Generation using Phonological Rules | | 0
Unsupervised Deep Generative Models for Anomaly Detection in Neuroimaging: A Systematic Scoping Review | | 0
Group Cross-Correlations with Faintly Constrained Filters | | 0
Image Generation Models: A Technical History | | 0
AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots | | 0
StyleBench: Evaluating Speech Language Models on Conversational Speaking Style Control | | 0
Integration of deep generative Anomaly Detection algorithm in high-speed industrial line | | 0
LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation | | 0
A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning | | 0
Synthetic data for ratemaking: imputation-based methods vs adversarial networks and autoencoders | | 0
ORN-CBF: Learning Observation-conditioned Residual Neural Control Barrier Functions via Hypernetworks | | 0
AEGIS: Authentic Edge Growth In Sparsity for Link Prediction in Edge-Sparse Bipartite Knowledge Graphs | | 0
FS-KAN: Permutation Equivariant Kolmogorov-Arnold Networks via Function Sharing | | 0
Membership Inference Attacks on Tokenizers of Large Language Models | | 0
Real-Time Motion-Controllable Autoregressive Video Diffusion | | 0
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs | | 0
Explainable Heterogeneous Anomaly Detection in Financial Networks via Adaptive Expert Routing | | 0
AnyPcc: Compressing Any Point Cloud with a Single Universal Model | | 0
FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels | | 0
SETUP: Sentence-level English-To-Uniform Meaning Representation Parser | | 0
Why Code, Why Now: Learnability, Computability, and the Real Limits of Machine Learning | | 0
Learning to Think Fast and Slow for Visual Language Models | | 0
ForamDeepSlice: A High-Accuracy Deep Learning Framework for Foraminifera Species Classification from 2D Micro-CT Slices | | 0
MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation | | 0
Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability | | 0
Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning | Code | 0
NC-Bench: An LLM Benchmark for Evaluating Conversational Competence | | 0
Multifaceted Scenario-Aware Hypergraph Learning for Next POI Recommendation | | 0
Improving X-Codec-2.0 for Multi-Lingual Speech: 25 Hz Latent Rate and 24 kHz Sampling | | 0
Retrieval Pivot Attacks in Hybrid RAG: Measuring and Mitigating Amplified Leakage from Vector Seeds to Graph Expansion | | 0
Mean Flow Policy with Instantaneous Velocity Constraint for One-step Action Generation | | 0
Listen to the Layers: Mitigating Hallucinations with Inter-Layer Disagreement | | 0
Discovering Semantic Latent Structures in Psychological Scales: A Response-Free Pathway to Efficient Simplification | | 0
RobustVisRAG: Causality-Aware Vision-Based Retrieval-Augmented Generation under Visual Degradations | | 0
Universal 3D Shape Matching via Coarse-to-Fine Language Guidance | | 0
Cycle-Consistent Tuning for Layered Image Decomposition | | 0
A Mathematical Theory of Agency and Intelligence | | 0
Decomposing Physician Disagreement in HealthBench | | 0
Annotation-Free Visual Reasoning for High-Resolution Large Multimodal Models via Reinforcement Learning | | 0
PEPA: a Persistently Autonomous Embodied Agent with Personalities | | 0
HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts | | 0
Embedding interpretable ℓ1-regression into neural networks for uncovering temporal structure in cell imaging | | 0
MEM: Multi-Scale Embodied Memory for Vision Language Action Models | | 0
Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector | | 0
ECG Classification on PTB-XL: A Data-Centric Approach with Simplified CNN-VAE | | 0
Page 190 of 13,232