SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 11351–11400 of 661,570 papers

Title | Status | Hype
Tracing 3D Anatomy in 2D Strokes: A Multi-Stage Projection Driven Approach to Cervical Spine Fracture Identification | | 0
A Unified Revisit of Temperature in Classification-Based Knowledge Distillation | | 0
No More, No Less: Least-Privilege Language Models | | 0
NeuroPareto: Calibrated Acquisition for Costly Many-Goal Search in Vast Parameter Spaces | | 0
HealthMamba: An Uncertainty-aware Spatiotemporal Graph State Space Model for Effective and Reliable Healthcare Facility Visit Prediction | | 0
Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding | | 0
Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis? | | 0
Exploring Semantic Labeling Strategies for Third-Party Cybersecurity Risk Assessment Questionnaires | | 0
Chimera: Neuro-Symbolic Attention Primitives for Trustworthy Dataplane Intelligence | | 0
Meenz bleibt Meenz, but Large Language Models Do Not Speak Its Dialect | | 0
JPmHC Dynamical Isometry via Orthogonal Hyper-Connections | | 0
From Agent-Only Social Networks to Autonomous Scientific Research: Lessons from OpenClaw and Moltbook, and the Architecture of ClawdLab and Beach.Science | | 0
Learning Physical Principles from Interaction: Self-Evolving Planning via Test-Time Memory | | 0
Maximin Share Guarantees via Limited Cost-Sensitive Sharing | | 0
When Safety Collides: Resolving Multi-Category Harmful Conflicts in Text-to-Image Diffusion via Adaptive Safety Guidance | | 0
Automatic Map Density Selection for Locally-Performant Visual Place Recognition | | 0
AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications | | 0
Structured vs. Unstructured Pruning: An Exponential Gap | | 0
The Sentience Readiness Index: A Preliminary Framework for Measuring National Preparedness for the Possibility of Artificial Sentience | | 0
Agentic Code Reasoning | | 0
From Variance to Invariance: Qualitative Content Analysis for Narrative Graph Annotation | | 0
Rich Insights from Cheap Signals: Efficient Evaluations via Tensor Factorization | | 0
Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction | | 0
Federated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving | | 0
Can machines be uncertain? | | 0
Causal Learning Should Embrace the Wisdom of the Crowd | | 0
TRACE: Task-Adaptive Reasoning and Representation Learning for Universal Multimodal Retrieval | | 0
QFlowNet: Fast, Diverse, and Efficient Unitary Synthesis with Generative Flow Networks | | 0
How to Model AI Agents as Personas?: Applying the Persona Ecosystem Playground to 41,300 Posts on Moltbook for Behavioral Insights | | 0
Adaptive Sensing of Continuous Physical Systems for Machine Learning | | 0
A Stein Identity for q-Gaussians with Bounded Support | | 0
One-Step Face Restoration via Shortcut-Enhanced Coupling Flow | | 0
Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration | | 0
NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training | | 0
DM-CFO: A Diffusion Model for Compositional 3D Tooth Generation with Collision-Free Optimization | | 0
Detection and Identification of Penguins Using Appearance and Motion Features | | 0
Tracking Feral Horses in Aerial Video Using Oriented Bounding Boxes | | 0
Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data | | 0
Riemannian Optimization in Modular Systems | | 0
Parallax to Align Them All: An OmniParallax Attention Mechanism for Distributed Multi-View Image Compression | | 0
LeafInst - Unified Instance Segmentation Network for Fine-Grained Forestry Leaf Phenotype Analysis: A New UAV based Benchmark | | 0
CoRe-BT: A Multimodal Radiology-Pathology-Text Benchmark for Robust Brain Tumor Typing | | 0
A Neural Topic Method Using a Large-Language-Model-in-the-Loop for Business Research | | 0
Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study | | 0
Linguistically Informed Graph Model and Semantic Contrastive Learning for Korean Short Text Classification | | 0
Field imaging framework for morphological characterization of aggregates with computer vision: Algorithms and applications | | 0
InEdit-Bench: Benchmarking Intermediate Logical Pathways for Intelligent Image Editing Models | | 0
Graph Negative Feedback Bias Correction Framework for Adaptive Heterophily Modeling | | 0
Principled Learning-to-Communicate with Quasi-Classical Information Structures | | 0
Local Shapley: Model-Induced Locality and Optimal Reuse in Data Valuation | | 0