SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 35013550 of 659983 papers

TitleStatusHype
Learning Human-Object Interaction for 3D Human Pose Estimation from LiDAR Point Clouds0
Adaptive Theory of Mind for LLM-based Multi-Agent Coordination0
VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment0
Laya: A LeJEPA Approach to EEG via Latent Prediction over Reconstruction0
Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation0
Persistent Story World Simulation with Continuous Character Customization0
Surrogate-Assisted Genetic Programming with Rank-Based Phenotypic Characterisation for Dynamic Multi-Mode Project Scheduling0
PyPhonPlan: Simulating phonetic planning with dynamic neural fields and task dynamics0
Micro-AU CLIP: Fine-Grained Contrastive Learning from Local Independence to Global Dependency for Micro-Expression Action Unit Detection0
DriveFix: Spatio-Temporally Coherent Driving Scene Restoration0
NeSy-Route: A Neuro-Symbolic Benchmark for Constrained Route Planning in Remote Sensing0
SpikeCLR: Contrastive Self-Supervised Learning for Few-Shot Event-Based Vision using Spiking Neural Networks0
UIS-Digger: Towards Comprehensive Research Agent Systems for Real-world Unindexed Information Seeking0
DRL-Based Beam Positioning for LEO Satellite Constellations with Weighted Least Squares0
Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation0
PathGLS: Evaluating Pathology Vision-Language Models without Ground Truth through Multi-Dimensional ConsistencyCode0
Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition0
When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems0
The DeepLog Neurosymbolic Machine0
PKINet-v2: Towards Powerful and Efficient Poly-Kernel Remote Sensing Object Detection0
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale0
χ_0: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies3
A Novel Evolutionary Method for Automated Skull-Face Overlay in Computer-Aided Craniofacial Superimposition0
Answer Bubbles: Information Exposure in AI-Mediated Search0
Artificial intelligence-enabled single-lead ECG for non-invasive hyperkalemia detection: development, multicenter validation, and proof-of-concept deployment0
GNNVerifier: Graph-based Verifier for LLM Task PlanningCode0
Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing0
Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation0
HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization0
RepoReviewer: A Local-First Multi-Agent Architecture for Repository-Level Code Review0
Functional Stochastic Localization0
SineProject: Machine Unlearning for Stable Vision Language Alignment0
Traj2Action: A Co-Denoising Framework for Trajectory-Guided Human-to-Robot Skill Transfer0
When Silence Matters: The Impact of Irrelevant Audio on Text Reasoning in Large Audio-Language Models0
Exploring the Underwater World Segmentation without Extra Training0
Zero-Shot Time Series Foundation Models for Annual Institutional Forecasting Under Data Sparsity0
Structured Semantic Cloaking for Jailbreak Attacks on Large Language Models0
Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models0
ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control0
An Interpretable Machine Learning Framework for Non-Small Cell Lung Cancer Drug Response Analysis0
Robust Generative Audio Quality Assessment: Disentangling Quality from Spurious Correlations0
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching4
VIGIL: Towards Edge-Extended Agentic AI for Enterprise IT Support0
Coded Robust Aggregation for Distributed Learning under Byzantine Attacks0
BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape Completion0
Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition0
Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation0
Rethinking Reward Signals in Video GRPO: When Scores Become Targets0
Learning Topology-Driven Multi-Subspace Fusion for Grassmannian Deep Network0
FedSDWC: Federated Synergistic Dual-Representation Weak Causal Learning for OOD0
Show:102550
← PrevPage 71 of 13200Next →