SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 551575 of 659983 papers

TitleStatusHype
Reliable Classroom AI via Neuro-Symbolic Multimodal Reasoning0
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal0
PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding0
Span Modeling for Idiomaticity and Figurative Language Detection with Span Contrastive Loss0
Universal and efficient graph neural networks with dynamic attention for machine learning interatomic potentials0
Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration0
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts0
Focus, Don't Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding0
MultiCam: On-the-fly Multi-Camera Pose Estimation Using Spatiotemporal Overlaps of Known Objects0
URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection0
UAV-DETR: DETR for Anti-Drone Target Detection0
L-UNet: An LSTM Network for Remote Sensing Image Change Detection0
TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design0
The Coordinate System Problem in Persistent Structural Memory for Neural Architectures0
A Feature Shuffling and Restoration Strategy for Universal Unsupervised Anomaly Detection0
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration0
Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models0
Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic0
Confidence Calibration under Ambiguous Ground Truth0
TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration0
ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling0
From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture0
Multilingual KokoroChat: A Multi-LLM Ensemble Translation Method for Creating a Multilingual Counseling Dialogue Dataset0
When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse0
EVA: Efficient Reinforcement Learning for End-to-End Video Agent0
Show:102550
← PrevPage 23 of 26400Next →