SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 12511300 of 659983 papers

TitleStatusHype
From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG0
Modernizing Amdahl's Law: How AI Scaling Laws Shape Computer Architecture0
Sinkhorn Based Associative Memory Retrieval Using Spherical Hellinger Kantorovich Dynamics0
Attention in Space: Functional Roles of VLM Heads for Spatial Reasoning0
REVERE: Reflective Evolving Research Engineer for Scientific Workflows0
ToFormer: Towards Large-scale Scenario Depth Completion for Lightweight ToF Camera0
Breaking the O(T) Cumulative Constraint Violation Barrier while Achieving O(T) Static Regret in Constrained Online Convex Optimization0
PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs0
AI-Driven Multi-Agent Simulation of Stratified Polyamory Systems: A Computational Framework for Optimizing Social Reproductive Efficiency0
Hierarchical Multiscale Structure-Function Coupling for Brain Connectome Integration0
IBCapsNet: Information Bottleneck Capsule Network for Noise-Robust Representation Learning0
Centrality-Based Pruning for Efficient Echo State Networks0
SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection0
Neuronal Self-Adaptation Enhances Capacity and Robustness of Representation in Spiking Neural Networks0
Artificial Intelligence in Experimental Approaches: Growth Hacking, Lean Startup, Design Thinking, and Agile0
MFSR: MeanFlow Distillation for One Step Real-World Image Super Resolution0
SWE-Next: Scalable Real-World Software Engineering Tasks for Agents0
Can I guess where you are from? Modeling dialectal morphosyntactic similarities in Brazilian Portuguese0
High-dimensional online learning via asynchronous decomposition: Non-divergent results, dynamic regularization, and beyond0
The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting0
Cross-modal Fuzzy Alignment Network for Text-Aerial Person Retrieval and A Large-scale Benchmark0
Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction0
Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation0
Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-attention0
CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration0
Adversarial Attacks on Locally Private Graph Neural Networks0
Modeling Epistemic Uncertainty in Social Perception via Rashomon Set Agents0
Smart Operation Theatre: An AI-based System for Surgical Gauze Counting0
Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness0
OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation0
Code-MIE: A Code-style Model for Multimodal Information Extraction with Scene Graph and Entity Attribute Knowledge Enhancement0
MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction0
ME-IQA: Memory-Enhanced Image Quality Assessment via Re-Ranking0
Neural Autoregressive Flows for Markov Boundary Learning0
The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing0
RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution0
Large Neighborhood Search meets Iterative Neural Constraint Heuristics0
Does Peer Observation Help? Vision-Sharing Collaboration for Vision-Language Navigation0
Compass: Optimizing Compound AI Workflows for Dynamic Adaptation0
Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP0
Simple Projection-Free Algorithm for Contextual Recommendation with Logarithmic Regret and Robustness0
EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis0
Beyond the Academic Monoculture: A Unified Framework and Industrial Perspective for Attributed Graph Clustering0
Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems0
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping0
TAFG-MAN: Timestep-Adaptive Frequency-Gated Latent Diffusion for Efficient and High-Quality Low-Dose CT Image Denoising0
ReLaMix: Residual Latency-Aware Mixing for Delay-Robust Financial Time-Series Forecasting0
Incentive-Aware Federated Averaging with Performance Guarantees under Strategic Participation0
RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation0
NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation0
Show:102550
← PrevPage 26 of 13200Next →