SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 676-700 of 659,983 papers

Title | Status | Hype
BXRL: Behavior-Explainable Reinforcement Learning | | 0
Detection and Classification of (Pre)Cancerous Cells in Pap Smears: An Ensemble Strategy for the RIVA Cervical Cytology Challenge | | 0
Kronecker-Structured Nonparametric Spatiotemporal Point Processes | | 0
Manifold Generalization Provably Proceeds Memorization in Diffusion Models | | 0
Sparse Autoencoders for Interpretable Medical Image Representation Learning | | 0
Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of Lora, Prompt Tuning, and Full Fine-Tuning | | 0
Drop-In Perceptual Optimization for 3D Gaussian Splatting | | 0
CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training | | 0
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation | | 0
Mamba-VMR: Multimodal Query Augmentation via Generated Videos for Precise Temporal Grounding | | 0
OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation | | 0
More Isn't Always Better: Balancing Decision Accuracy and Conformity Pressures in Multi-AI Advice | | 0
dynActivation: A Trainable Activation Family for Adaptive Nonlinearity | | 0
RAMPAGE: RAndomized Mid-Point for debiAsed Gradient Extrapolation | | 0
Multimodal Survival Analysis with Locally Deployable Large Language Models | | 0
Data Curation for Machine Learning Interatomic Potentials by Determinantal Point Processes | | 0
DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation | | 0
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning | | 0
On the Failure of Topic-Matched Contrast Baselines in Multi-Directional Refusal Abliteration | | 0
PreferRec: Learning and Transferring Pareto Preferences for Multi-objective Re-ranking | | 0
MIHT: A Hoeffding Tree for Time Series Classification using Multiple Instance Learning | | 0
Autoregressive vs. Masked Diffusion Language Models: A Controlled Comparison | | 0
A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP | | 0
Multiperspectivity as a Resource for Narrative Similarity Prediction | | 0
Unveiling the Mechanism of Continuous Representation Full-Waveform Inversion: A Wave Based Neural Tangent Kernel Framework | | 0