SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 12761300 of 659983 papers

TitleStatusHype
Adversarial Attacks on Locally Private Graph Neural Networks0
Modeling Epistemic Uncertainty in Social Perception via Rashomon Set Agents0
Smart Operation Theatre: An AI-based System for Surgical Gauze Counting0
Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness0
OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation0
Code-MIE: A Code-style Model for Multimodal Information Extraction with Scene Graph and Entity Attribute Knowledge Enhancement0
MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction0
ME-IQA: Memory-Enhanced Image Quality Assessment via Re-Ranking0
Neural Autoregressive Flows for Markov Boundary Learning0
The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing0
RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution0
Large Neighborhood Search meets Iterative Neural Constraint Heuristics0
Does Peer Observation Help? Vision-Sharing Collaboration for Vision-Language Navigation0
Compass: Optimizing Compound AI Workflows for Dynamic Adaptation0
Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP0
Simple Projection-Free Algorithm for Contextual Recommendation with Logarithmic Regret and Robustness0
EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis0
Beyond the Academic Monoculture: A Unified Framework and Industrial Perspective for Attributed Graph Clustering0
Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems0
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping0
TAFG-MAN: Timestep-Adaptive Frequency-Gated Latent Diffusion for Efficient and High-Quality Low-Dose CT Image Denoising0
ReLaMix: Residual Latency-Aware Mixing for Delay-Robust Financial Time-Series Forecasting0
Incentive-Aware Federated Averaging with Performance Guarantees under Strategic Participation0
RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation0
NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation0
Show:102550
← PrevPage 52 of 26400Next →