SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 2751-2800 of 659,983 papers

Title | Status | Hype
A Creative Agent is Worth a 64-Token Template | - | 0
A Noise Sensitivity Exponent Controls Large Statistical-to-Computational Gaps in Single- and Multi-Index Models | - | 0
Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages | - | 0
Interpretable Traffic Responsibility from Dashcam Video via Legal Multi Agent Reasoning | - | 0
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing | - | 0
Unified Policy Value Decomposition for Rapid Adaptation | - | 0
VideoAtlas: Navigating Long-Form Video in Logarithmic Compute | - | 0
LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition | - | 0
Robust-ComBat: Mitigating Outlier Effects in Diffusion MRI Data Harmonization | - | 0
Specification-Aware Distribution Shaping for Robotics Foundation Models | - | 0
TechImage-Bench: Rubric-Based Evaluation for Technical Image Generation | - | 0
Revisiting foundation models for cell instance segmentation | - | 0
Automated Grammar-based Algebraic Multigrid Design With Evolutionary Algorithms | - | 0
Differential Attention-Augmented BiomedCLIP with Asymmetric Focal Optimization for Imbalanced Multi-Label Video Capsule Endoscopy Classification | - | 0
Omnilingual MT: Machine Translation for 1,600 Languages | - | 0
Efficient Exploration at Scale | - | 0
Thin Keys, Full Values: Reducing KV Cache via Low-Dimensional Attention Selection | - | 0
Bodhi VLM: Privacy-Alignment Modeling for Hierarchical Visual Representations in Vision Backbones and VLM Encoders via Bottom-Up and Top-Down Feature Search | Code | 0
Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation | Code | 0
TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling | Code | 0
EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection | - | 0
A practical artificial intelligence framework for legal age estimation using clavicle computed tomography scans | - | 0
MALLES: A Multi-agent LLMs-based Economic Sandbox with Consumer Preference Alignment | - | 0
Proactive Knowledge Inquiry in Doctor-Patient Dialogue: Stateful Extraction, Belief Updating, and Path-Aware Action Planning | - | 0
Large Language Models as a Semantic Interface and Ethical Mediator in Neuro-Digital Ecosystems: Conceptual Foundations and a Regulatory Imperative | - | 0
The Phasor Transformer: Resolving Attention Bottlenecks on the Unit Circle | - | 0
Predicting Trajectories of Long COVID in Adult Women: The Critical Role of Causal Disentanglement | - | 0
VISER: Visually-Informed System for Enhanced Robustness in Open-Set Iris Presentation Attack Detection | - | 0
JAWS: Enhancing Long-term Rollout of Neural PDE Solvers via Spatially-Adaptive Jacobian Regularization | Code | 0
A Unified Language Model for Large Scale Search, Recommendation, and Reasoning | - | 0
Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval | - | 0
Inhibitory normalization of error signals improves learning in neural circuits | - | 0
Noise-Aware Misclassification Attack Detection in Collaborative DNN Inference | - | 0
Disentangled Representation Learning through Unsupervised Symmetry Group Discovery | - | 0
An Introduction to Flow Matching and Diffusion Models | - | 0
Pathology-Aware Multi-View Contrastive Learning for Patient-Independent ECG Reconstruction | - | 0
Classifier Pooling for Modern Ordinal Classification | - | 0
One-Step Sampler for Boltzmann Distributions via Drifting | - | 0
Modeling Changing Scientific Concepts with Complex Networks: A Case Study on the Chemical Revolution | - | 0
Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization | - | 0
Multi-Source Evidence Fusion for Audio Question Answering | - | 0
Constraint Learning in Multi-Agent Dynamic Games from Demonstrations of Local Nash Interactions | - | 0
SpiderCam: Low-Power Snapshot Depth from Differential Defocus | - | 0
Efficient Policy Learning with Hybrid Evaluation-Based Genetic Programming for Uncertain Agile Earth Observation Satellite Scheduling | - | 0
LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation | - | 0
AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception | - | 0
Learning Adaptive Distribution Alignment with Neural Characteristic Function for Graph Domain Adaptation | - | 0
Efficient LLM Safety Evaluation through Multi-Agent Debate | - | 0
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review | - | 0
Role-Augmented Intent-Driven Generative Search Engine Optimization | - | 0