SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 6051–6100 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Extending Minimal Pairs with Ordinal Surprisal Curves and Entropy Across Applied Domains | | 0 |
| G-ZAP: A Generalizable Zero-Shot Framework for Arbitrary-Scale Pansharpening | | 0 |
| ES-Merging: Biological MLLM Merging via Embedding Space Signals | | 0 |
| BiT-MCTS: A Theme-based Bidirectional MCTS Approach to Chinese Fiction Generation | | 0 |
| Questionnaire Responses Do not Capture the Safety of AI Agents | | 0 |
| Deep EM with Hierarchical Latent Label Modelling for Multi-Site Prostate Lesion Segmentation | | 0 |
| PARSA-Bench: A Comprehensive Persian Audio-Language Model Benchmark | | 0 |
| Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders | | 0 |
| Data Darwinism Part II: DataEvolve -- AI can Autonomously Evolve Pretraining Data Curation | | 0 |
| MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions | | 0 |
| GenState-AI: State-Aware Dataset for Text-to-Video Retrieval on AI-Generated Videos | | 0 |
| Creative Convergence or Imitation? Genre-Specific Homogeneity in LLM-Generated Chinese Literature | | 0 |
| Echoes Across Centuries: Phonetic Signatures of Persian Poets | | 0 |
| Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality | | 0 |
| Distilling Reasoning Without Knowledge: A Framework for Reliable LLMs | | 0 |
| Inclusive AI for Group Interactions: Predicting Gaze-Direction Behaviors in People with Intellectual and Developmental Disabilities | | 0 |
| STAG-CN: Spatio-Temporal Apiary Graph Convolutional Network for Disease Onset Prediction in Beehive Sensor Networks | | 0 |
| LongVidSearch: An Agentic Benchmark for Multi-hop Evidence Retrieval Planning in Long Videos | | 0 |
| On the (Generative) Linear Sketching Problem | | 0 |
| Geometric and Topological Deep Learning for Predicting Thermo-mechanical Performance in Cold Spray Deposition Process Modeling | | 0 |
| Unlearning-based sliding window for continual learning under concept drift | | 0 |
| Infinite Problem Generator: Verifiably Scaling Physics Reasoning Data with Agentic Workflows | | 0 |
| Refining 3D Medical Segmentation with Verbal Instruction | | 0 |
| Mapping Dark-Matter Clusters via Physics-Guided Diffusion Models | | 0 |
| Excited Pfaffians: Generalized Neural Wave Functions Across Structure and State | | 0 |
| VLA-Thinker: Boosting Vision-Language-Action Models through Thinking-with-Image Reasoning | | 0 |
| MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection | | 0 |
| LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion | | 0 |
| Expert Mind: A Retrieval-Augmented Architecture for Expert Knowledge Preservation in the Energy Sector | | 0 |
| Learning to Order: Task Sequencing as In-Context Optimization | | 0 |
| A comprehensive multimodal dataset and benchmark for ulcerative colitis scoring in endoscopy | | 0 |
| Covariance-Guided Resource Adaptive Learning for Efficient Edge Inference | | 0 |
| Power-Law Spectrum of the Random Feature Model | | 0 |
| Medical Image Spatial Grounding with Semantic Sampling | | 0 |
| Machine Learning-Driven Intelligent Memory System Design: From On-Chip Caches to Storage | | 0 |
| SuperLocalMemory V3: Information-Geometric Foundations for Zero-LLM Enterprise Agent Memory | | 0 |
| Adapting Critic Match Loss Landscape Visualization to Off-policy Reinforcement Learning | | 0 |
| FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference | | 0 |
| A Multi-Scale Graph Learning Framework with Temporal Consistency Constraints for Financial Fraud Detection in Transaction Networks under Non-Stationary Conditions | | 0 |
| A Loss Landscape Visualization Framework for Interpreting Reinforcement Learning: An ADHDP Case Study | | 0 |
| Proactive Routing to Interpretable Surrogates with Distribution-Free Safety Guarantees | | 0 |
| EcoFair-CH-MARL: Scalable Constrained Hierarchical Multi-Agent RL with Real-Time Emission Budgets and Fairness Guarantees | | 0 |
| Anterior's Approach to Fairness Evaluation of Automated Prior Authorization System | | 0 |
| Continual Few-shot Adaptation for Synthetic Fingerprint Detection | | 0 |
| Compute Allocation for Reasoning-Intensive Retrieval Agents | | 0 |
| Circuit Representations of Random Forests with Applications to XAI | | 0 |
| Spiking neurons as predictive controllers of linear systems | | 0 |
| Infinity and Beyond: Compositional Alignment in VAR and Diffusion T2I Models | | 0 |
| Delving into Spectral Clustering with Vision-Language Representations | | 0 |
| Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models | | 0 |
Page 122 of 13,232