SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 2726-2750 of 661,570 papers

Title | Status | Hype
Pretrained Video Models as Differentiable Physics Simulators for Urban Wind Flows | | 0
A Large-Scale Remote Sensing Dataset and VLM-based Algorithm for Fine-Grained Road Hierarchy Classification | | 0
Does AI Homogenize Student Thinking? A Multi-Dimensional Analysis of Structural Convergence in AI-Augmented Essays | | 0
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species | | 0
When Convenience Becomes Risk: A Semantic View of Under-Specification in Host-Acting Agents | | 0
QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression | | 0
DepthTCM: High Efficient Depth Compression via Physics-aware Transformer-CNN Mixed Architecture | | 0
Enhancing Brain Tumor Classification Using Vision Transformers with Colormap-Based Feature Representation on BRISC2025 Dataset | | 0
Domain Elastic Transform: Bayesian Function Registration for High-Dimensional Scientific Data | | 0
Does Mechanistic Interpretability Transfer Across Data Modalities? A Cross-Domain Causal Circuit Analysis of Variational Autoencoders | | 0
WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making | | 0
Fusing Memory and Attention: A study on LSTM, Transformer and Hybrid Architectures for Symbolic Music Generation | | 0
Sonny: Breaking the Compute Wall in Medium-Range Weather Forecasting | | 0
Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting | | 0
More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection | | 0
Identity-Consistent Video Generation under Large Facial-Angle Variations | | 0
The Average Relative Entropy and Transpilation Depth determines the noise robustness in Variational Quantum Classifiers | | 0
Privacy-Preserving Federated Action Recognition via Differentially Private Selective Tuning and Efficient Communication | | 0
Active Inference Agency Formalization, Metrics, and Convergence Assessments | | 0
Improving Coherence and Persistence in Agentic AI for System Optimization | | 0
Which Alert Removals are Beneficial? | | 0
B-jet Tagging Using a Hybrid Edge Convolution and Transformer Architecture | | 0
PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences | | 0
Semantic Shift: the Fundamental Challenge in Text Embedding and Retrieval | | 0
PROMPT2BOX: Uncovering Entailment Structure among LLM Prompts | | 0