SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 87268750 of 474278 papers

TitleStatusHype
Generative Universal Verifier as Multimodal Meta-Reasoner0
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models0
BoltzNCE: Learning Likelihoods for Boltzmann Generation with Stochastic Interpolants and Noise Contrastive EstimationCode0
Echoes of BERT: Do Modern Language Models Rediscover the Classical NLP Pipeline?Code0
Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms0
PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning0
Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human BrainCode0
Removing Cost Volumes from Optical Flow Estimators0
BitNet DistillationCode0
KG2QA: Knowledge Graph-enhanced Retrieval-augmented Generation for Communication Standards Question AnsweringCode0
ESG-Net: Event-Aware Semantic Guided Network for Dense Audio-Visual Event LocalizationCode0
QuaDreamer: Controllable Panoramic Video Generation for Quadruped RobotsCode0
MedDINOv3: How to adapt vision foundation models for medical image segmentation?Code0
Geo-R1: Improving Few-Shot Geospatial Referring Expression Understanding with Reinforcement Fine-TuningCode0
DelRec: learning delays in recurrent spiking neural networksCode0
EReLiFM: Evidential Reliability-Aware Residual Flow Meta-Learning for Open-Set Domain Generalization under Noisy LabelsCode0
Offline and Online KL-Regularized RLHF under Differential PrivacyCode0
Scaling Vision Transformers for Functional MRI with Flat MapsCode0
Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate DownscalingCode0
Multi-Scale High-Resolution Logarithmic Grapher Module for Efficient Vision GNNsCode0
Behavioral Embeddings of Programs: A Quasi-Dynamic Approach for Optimization PredictionCode0
IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MRCode0
ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning ModelsCode0
Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer's DiseaseCode0
Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language ModelsCode0
Show:102550
← PrevPage 350 of 18972Next →