SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3001-3025 of 661,570 papers

Title | Status | Hype
Leveraging Natural Language Processing and Machine Learning for Evidence-Based Food Security Policy Decision-Making in Data-Scarce Making | | 0
WebNavigator: Global Web Navigation via Interaction Graph Retrieval | | 0
ALARA for Agents: Least-Privilege Context Engineering Through Portable Composable Multi-Agent Teams | | 0
Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP | | 0
NCSTR: Node-Centric Decoupled Spatio-Temporal Reasoning for Video-based Human Pose Estimation | | 0
When Agents Disagree: The Selection Bottleneck in Multi-Agent LLM Pipelines | | 0
DCG-Net: Dual Cross-Attention with Concept-Value Graph Reasoning for Interpretable Medical Diagnosis | | 0
Prompt-Free Lightweight SAM Adaptation for Histopathology Nuclei Segmentation with Strong Cross-Dataset Generalization | | 0
Probing the Latent World: Emergent Discrete Symbols and Physical Structure in Latent Representations | | 0
GEM: A Native Graph-based Index for Multi-Vector Retrieval | | 0
High-fidelity Multi-view Normal Integration with Scale-encoded Neural Surface Representation | | 0
Low-pass Personalized Subgraph Federated Recommendation | | 0
Graph-Aware Text-Only Backdoor Poisoning for Text-Attributed Graphs | | 0
G2DR: A Genotype-First Framework for Genetics-Informed Target Prioritization and Drug Repurposing | | 0
Toward a Multi-View Brain Network Foundation Model: Cross-View Consistency Learning Across Arbitrary Atlases | | 0
MANA: Towards Efficient Mobile Ad Detection via Multimodal Agentic UI Navigation | | 0
The Multiverse of Time Series Machine Learning: an Archive for Multivariate Time Series Classification | | 0
Scene Representation using 360° Saliency Graph and its Application in Vision-based Indoor Navigation | | 0
Leum-VL Technical Report | | 0
Memory poisoning and secure multi-agent systems | | 0
Operator Learning for Smoothing and Forecasting | | 0
Comprehensive Description of Uncertainty in Measurement for Representation and Propagation with Scalable Precision | | 0
Compression is all you need: Modeling Mathematics | | 0
KV Cache Optimization Strategies for Scalable and Efficient LLM Inference | | 0
FAAR: Efficient Frequency-Aware Multi-Task Fine-Tuning via Automatic Rank Selection | | 0
Page 121 of 26,463