SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 551-600 of 659,983 papers

Title | Status | Hype
Reliable Classroom AI via Neuro-Symbolic Multimodal Reasoning | | 0
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal | | 0
PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding | | 0
Span Modeling for Idiomaticity and Figurative Language Detection with Span Contrastive Loss | | 0
Universal and efficient graph neural networks with dynamic attention for machine learning interatomic potentials | | 0
Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration | | 0
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts | | 0
Focus, Don't Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding | | 0
MultiCam: On-the-fly Multi-Camera Pose Estimation Using Spatiotemporal Overlaps of Known Objects | | 0
URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection | | 0
UAV-DETR: DETR for Anti-Drone Target Detection | | 0
L-UNet: An LSTM Network for Remote Sensing Image Change Detection | | 0
TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design | | 0
The Coordinate System Problem in Persistent Structural Memory for Neural Architectures | | 0
A Feature Shuffling and Restoration Strategy for Universal Unsupervised Anomaly Detection | | 0
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration | | 0
Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models | | 0
Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic | | 0
Confidence Calibration under Ambiguous Ground Truth | | 0
TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration | | 0
ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling | | 0
From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture | | 0
Multilingual KokoroChat: A Multi-LLM Ensemble Translation Method for Creating a Multilingual Counseling Dialogue Dataset | | 0
When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse | | 0
EVA: Efficient Reinforcement Learning for End-to-End Video Agent | | 0
The EU AI Act and the Rights-based Approach to Technological Governance | | 0
Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion | | 0
ProGRank: Probe-Gradient Reranking to Defend Dense-Retriever RAG from Corpus Poisoning | | 0
Caption Generation for Dongba Paintings via Prompt Learning and Semantic Fusion | | 0
Weak-PDE-Net: Discovering Open-Form PDEs via Differentiable Symbolic Networks and Weak Formulation | | 0
Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining | | 0
Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report | | 0
Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees | | 0
Beyond Theoretical Bounds: Empirical Privacy Loss Calibration for Text Rewriting Under Local Differential Privacy | | 0
FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning | | 0
Where Experts Disagree, Models Fail: Detecting Implicit Legal Citations in French Court Decisions | | 0
DariMis: Harm-Aware Modeling for Dari Misinformation Detection on YouTube | | 0
JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees | | 0
Can Graph Foundation Models Generalize Over Architecture? | | 0
Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation | | 0
VQ-Jarvis: Retrieval-Augmented Video Restoration Agent with Sharp Vision and Fast Thought | | 0
PaperVoyager : Building Interactive Web with Visual Language Models | | 0
On the use of Aggregation Operators to improve Human Identification using Dental Records | | 0
Can Large Language Models Reason and Optimize Under Constraints? | | 0
AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents | | 0
Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation | | 0
MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates | | 0
DBAutoDoc: Automated Discovery and Documentation of Undocumented Database Schemas via Statistical Analysis and Iterative LLM Refinement | | 0
Post-Selection Distributional Model Evaluation | | 0
Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition | | 0