SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 28512875 of 661570 papers

TitleStatusHype
User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction0
gUFO: A Gentle Foundational Ontology for Semantic Web Knowledge Graphs0
Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge0
GraPHFormer: A Multimodal Graph Persistent Homology Transformer for the Analysis of Neuroscience Morphologies0
DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles0
Detection of adversarial intent in Human-AI teams using LLMs0
MERIT: Multi-domain Efficient RAW Image Translation0
Dodgersort: Uncertainty-Aware VLM-Guided Human-in-the-Loop Pairwise Ranking0
A Knowledge-Informed Pretrained Model for Causal Discovery0
HiCI: Hierarchical Construction-Integration for Long-Context Attention0
GOLDMARK: Governed Outcome-Linked Diagnostic Model Assessment Reference Kit0
Glove2Hand: Synthesizing Natural Hand-Object Interaction from Multi-Modal Sensing Gloves0
Can ChatGPT Really Understand Modern Chinese Poetry?0
SozKZ: Training Efficient Small Language Models for Kazakh from Scratch0
Ensemble of Small Classifiers For Imbalanced White Blood Cell Classification0
RoboECC: Multi-Factor-Aware Edge-Cloud Collaborative Deployment for VLA Models0
PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching0
Achieving O(1/ε) Sample Complexity for Bilinear Systems Identification under Bounded Noises0
Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance0
RECLAIM: Cyclic Causal Discovery Amid Measurement Noise0
Neural collapse in the orthoplex regime0
RayMap3R: Inference-Time RayMap for Dynamic 3D Reconstruction0
Generating from Discrete Distributions Using Diffusions: Insights from Random Constraint Satisfaction Problems0
Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models0
Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs0
Show:102550
← PrevPage 115 of 26463Next →