SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 70017025 of 474278 papers

TitleStatusHype
Conversational LLMs Simplify Secure Clinical Data Access, Understanding, and Analysis0
Health system learning achieves generalist neuroimaging models0
Assessing Historical Structural Oppression Worldwide via Rule-Guided Prompting of Large Language ModelsCode0
Kitty: Accurate and Efficient 2-bit KV Cache Quantization with Dynamic Channel-wise Precision BoostCode0
Xmodel-2.5: 1.3B Data-Efficient Reasoning SLMCode0
In Search of Goodness: Large Scale Benchmarking of Goodness Functions for the Forward-Forward AlgorithmCode0
Prompt Optimization as a State-Space Search ProblemCode0
An Analysis of Constraint-Based Multi-Agent Pathfinding AlgorithmsCode0
End-to-End Visual Autonomous Parking via Control-Aided AttentionCode0
Hyperspectral Variational Autoencoders for Joint Data Compression and Component ExtractionCode0
A Diffusion Model to Shrink Proteins While Maintaining Their Function0
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data0
General Agentic Memory Via Deep Research0
VPN: Visual Prompt NavigationCode0
DocPTBench: Benchmarking End-to-End Photographed Document Parsing and TranslationCode0
NAF: Zero-Shot Feature Upsampling via Neighborhood Attention FilteringCode0
ReCoGS: Real-time ReColoring for Gaussian Splatting scenesCode0
Towards Robust and Fair Next Visit Diagnosis Prediction under Noisy Clinical Notes with Large Language ModelsCode0
UPLME: Uncertainty-Aware Probabilistic Language Modelling for Robust Empathy RegressionCode0
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPOCode0
HEAL: Learning-Free Source Free Unsupervised Domain Adaptation for Cross-Modality Medical Image SegmentationCode0
AutoHFormer: Efficient Hierarchical Autoregressive Transformer for Time Series PredictionCode0
Matching-Based Few-Shot Semantic Segmentation Models Are Interpretable by DesignCode0
Fine-Grained GRPO for Precise Preference Alignment in Flow Models0
Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs0
Show:102550
← PrevPage 281 of 18972Next →