SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1065110700 of 661570 papers

TitleStatusHype
Memory as Ontology: A Constitutional Memory Architecture for Persistent Digital Citizens0
CONE: Embeddings for Complex Numerical Data Preserving Unit and Variable Semantics0
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval1
Visioning Human-Agentic AI Teaming: Continuity, Tension, and Future Research0
HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel0
KindSleep: Knowledge-Informed Diagnosis of Obstructive Sleep Apnea from Oximetry0
Stacked from One: Multi-Scale Self-Injection for Context Window Extension0
Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary0
ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation0
TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings0
Distributional Reinforcement Learning with Information Bottleneck for Uncertainty-Aware DRAM Equalization0
SinhaLegal: A Benchmark Corpus for Information Extraction and Analysis in Sinhala Legislative Texts0
DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction0
Differentially Private Multimodal In-Context Learning0
RMK RetinaNet: Rotated Multi-Kernel RetinaNet for Robust Oriented Object Detection in Remote Sensing Imagery0
LAW & ORDER: Adaptive Spatial Weighting for Medical Diffusion and Segmentation0
Privacy-Aware Camera 2.0 Technical Report0
Distributional Equivalence in Linear Non-Gaussian Latent-Variable Cyclic Causal Models: Characterization and Learning0
Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction0
Diffusion Policy through Conditional Proximal Policy Optimization0
Comparative Evaluation of Traditional Methods and Deep Learning for Brain Glioma Imaging. Review Paper0
Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm0
The Inductive Bias of Convolutional Neural Networks: Locality and Weight Sharing Reshape Implicit Regularization0
WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech0
Beyond the Context Window: A Cost-Performance Analysis of Fact-Based Memory vs. Long-Context LLMs for Persistent Agents0
EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue0
FC-VFI: Faithful and Consistent Video Frame Interpolation for High-FPS Slow Motion Video Generation0
On the Strengths and Weaknesses of Data for Open-set Embodied Assistance0
Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses0
VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment0
From Unfamiliar to Familiar: Detecting Pre-training Data via Gradient Deviations in Large Language Models0
SCoUT: Scalable Communication via Utility-Guided Temporal Grouping in Multi-Agent Reinforcement Learning0
An Approach to Simultaneous Acquisition of Real-Time MRI Video, EEG, and Surface EMG for Articulatory, Brain, and Muscle Activity During Speech Production0
GloSplat: Joint Pose-Appearance Optimization for Faster and More Accurate 3D Reconstruction0
When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger0
On Multi-Step Theorem Prediction via Non-Parametric Structural Priors0
Structure Observation Driven Image-Text Contrastive Learning for Computed Tomography Report Generation0
Scalable Injury-Risk Screening in Baseball Pitching From Broadcast Video0
Diffusion-Based sRGB Real Noise Generation via Prompt-Driven Noise Representation Learning0
DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization0
Bounded State in an Infinite Horizon: Proactive Hierarchical Memory for Ad-Hoc Recall over Streaming Dialogues0
Federated Modality-specific Encoders and Partially Personalized Fusion Decoder for Multimodal Brain Tumor Segmentation0
FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation0
Knowledge-informed Bidding with Dual-process Control for Online Advertising0
How Does the ReLU Activation Affect the Implicit Bias of Gradient Descent on High-dimensional Neural Network Regression?0
Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs0
U-Parking: Distributed UWB-Assisted Autonomous Parking System with Robust Localization and Intelligent Planning0
EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection0
AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows0
Deterministic Preprocessing and Interpretable Fuzzy Banding for Cost-per-Student Reporting from Extracted Records0
Show:102550
← PrevPage 214 of 13232Next →