SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,292 code links4,818 tasks

Papers

Showing 18011825 of 661570 papers

TitleStatusHype
KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao0
Few-Shot Generative Model Adaption via Identity Injection and Preservation0
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning0
Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth0
WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment0
Coordinate Encoding on Linear Grids for Physics-Informed Neural Networks0
TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation0
Synthetic or Authentic? Building Mental Patient Simulators from Longitudinal Evidence0
Explanation Generation for Contradiction Reconciliation with LLMs0
Multitask-Informed Prior for In-Context Learning on Tabular Data: Application to Steel Property Prediction0
Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts0
CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models0
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought0
UniQueR: Unified Query-based Feedforward 3D Reconstruction0
Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction0
Agent Audit: A Security Analysis System for LLM Agent Applications0
Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer0
Agent-Sentry: Bounding LLM Agents via Execution Provenance0
Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories0
Designing to Forget: Deep Semi-parametric Models for Unlearning0
Dynamical Systems Theory Behind a Hierarchical Reasoning Model0
ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance0
Template-Based Feature Aggregation Network for Industrial Anomaly Detection0
VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents0
Off-Policy Evaluation and Learning for Survival Outcomes under Censoring0
Show:102550
← PrevPage 73 of 26463Next →