SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 60016025 of 661570 papers

TitleStatusHype
Selective Fine-Tuning of GPT Architectures for Parameter-Efficient Clinical Text Classification0
Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models0
Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models0
Deep Learning From Routine Histology Improves Risk Stratification for Biochemical Recurrence in Prostate Cancer0
DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution0
ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control0
Memory as Asset: From Agent-centric to Human-centric Memory Management0
Interleaved Resampling and Refitting: Data and Compute-Efficient Evaluation of Black-Box Predictors0
FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection0
A Real-Time Neuro-Symbolic Ethical Governor for Safe Decision Control in Autonomous Robotic Manipulation0
BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification0
Membership Inference for Contrastive Pre-training Models with Text-only PII Queries0
Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys0
Not All Directions Matter: Toward Structured and Task-Aware Low-Rank Adaptation0
QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis0
FOCUS: Bridging Fine-Grained Recognition and Open-World Discovery across Domains0
CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control0
OAHuman: Occlusion-Aware 3D Human Reconstruction from Monocular Images0
Sampling Boltzmann distributions via normalizing flow approximation of transport maps0
Mitigating Overthinking in Large Reasoning Language Models via Reasoning Path Deviation Monitoring0
MedPriv-Bench: Benchmarking the Privacy-Utility Trade-off of Large Language Models in Medical Open-End Question Answering0
Learning in Function Spaces: An Unified Functional Analytic View of Supervised and Unsupervised Learning0
Controllable Accent Normalization via Discrete Diffusion0
All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation0
DC-ViT: Modulating Spatial and Channel Interactions for Multi-Channel Images0
Show:102550
← PrevPage 241 of 26463Next →