SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 9761000 of 659983 papers

TitleStatusHype
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation0
SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification0
LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search0
Evolutionary Biparty Multiobjective UAV Path Planning: Problems and Empirical Comparisons0
What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators0
PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models0
From Part to Whole: 3D Generative World Model with an Adaptive Structural Hierarchy0
Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment0
Revisiting Weakly-Supervised Video Scene Graph Generation via Pair Affinity Learning0
Exploring Multimodal Prompts For Unsupervised Continuous Anomaly Detection0
Counterfactual Credit Policy Optimization for Multi-Agent Collaboration0
HACMatch Semi-Supervised Rotation Regression with Hardness-Aware Curriculum Pseudo Labeling0
SSAM: Singular Subspace Alignment for Merging Multimodal Large Language Models0
Feature Incremental Clustering with Generalization Bounds0
Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications0
DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers0
Rule-State Inference (RSI): A Bayesian Framework for Compliance Monitoring in Rule-Governed Domains0
SARe: Structure-Aware Large-Scale 3D Fragment Reassembly0
Towards Multimodal Time Series Anomaly Detection with Semantic Alignment and Condensed Interaction0
PGR-Net: Prior-Guided ROI Reasoning Network for Brain Tumor MRI Segmentation0
Dual-level Adaptation for Multi-Object Tracking: Building Test-Time Calibration from Experience and Intuition0
EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises0
Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks0
No Dense Tensors Needed: Fully Sparse Object Detection on Event-Camera Voxel Grids0
A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures0
Show:102550
← PrevPage 40 of 26400Next →