SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 20012050 of 659983 papers

TitleStatusHype
A Theoretical Comparison of No-U-Turn Sampler Variants: Necessary and Su?cient Convergence Conditions and Mixing Time Analysis under Gaussian Targets0
Click-to-Ask: An AI Live Streaming Assistant with Offline Copywriting and Online Interactive QA0
Cognitive Amplification vs Cognitive Delegation in Human-AI Systems: A Metric Framework0
Towards High-Quality Image Segmentation: Improving Topology Accuracy by Penalizing Neighbor Pixels0
MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation0
Revisiting Label Inference Attacks in Vertical Federated Learning: Why They Are Vulnerable and How to Defend0
HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning0
OCP: Orthogonal Constrained Projection for Sparse Scaling in Industrial Commodity Recommendation0
Off-Policy Learning with Limited Supply0
Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism0
Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures0
CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks0
Conflict-Based Search for Multi Agent Path Finding with Asynchronous Actions0
Quantitative Introspection in Language Models: Tracking Internal States Across Conversation0
EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation0
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models0
Are complicated loss functions necessary for teaching LLMs to reason?0
DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection0
ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation0
A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models0
Automatic Configuration of LLM Post-Training Pipelines0
Points-to-3D: Structure-Aware 3D Generation with Point Cloud Priors0
Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind0
Mi:dm K 2.5 Pro0
Rethinking Uncertainty Quantification and Entanglement in Image Segmentation0
Functional Subspace Watermarking for Large Language Models0
Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation0
VesselTok: Tokenizing Vessel-like 3D Biomedical Graph Representations for Reconstruction and Generation0
Signals of Success and Struggle: Early Prediction and Physiological Signatures of Human Performance across Task Complexity0
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models0
Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments0
Data-driven construction of machine-learning-based interatomic potentials for gas-surface scattering dynamics: the case of NO on graphite0
RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction0
Through the Looking-Glass: AI-Mediated Video Communication Reduces Interpersonal Trust and Confidence in Judgments0
MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model0
Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo0
Geography According to ChatGPT -- How Generative AI Represents and Reasons about Geography0
Revisiting Autoregressive Models for Generative Image Classification0
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation0
A conceptual framework for ideology beyond the left and right0
Authority-Level Priors: An Under-Specified Constraint in Hierarchical Predictive Processing0
Context Bootstrapped Reinforcement Learning0
Unsupervised Contrastive Learning for Efficient and Robust Spectral Shape Matching0
Neural Galerkin Normalizing Flow for Transition Probability Density Functions of Diffusion Models0
Secure Linear Alignment of Large Language Models0
Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections0
Agentic Business Process Management: A Research Manifesto0
Improving moment tensor solutions under Earth structure uncertainty with simulation-based inference0
An Optimised Greedy-Weighted Ensemble Framework for Financial Loan Default Prediction0
Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynamics in chain-of-thought0
Show:102550
← PrevPage 41 of 13200Next →