SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 69016925 of 474278 papers

TitleStatusHype
VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs0
The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models0
OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation0
DUO-TOK: Dual-Track Semantic Music Tokenizer for Vocal-Accompaniment Generation0
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment0
MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting0
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning0
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled PairsCode0
AdaCap: An Adaptive Contrastive Approach for Small-Data Neural NetworksCode0
Interpretable Reward Model via Sparse AutoencoderCode0
ConfTuner: Training Large Language Models to Express Their Confidence VerballyCode0
OmniLens++: Blind Lens Aberration Correction via Large LensLib Pre-Training and Latent PSF RepresentationCode0
HunyuanVideo 1.5 Technical ReportCode0
Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video GenerationCode0
ChessMamba: Structure-Aware Interleaving of State Spaces for Change Detection in Remote Sensing ImagesCode0
ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene AnalysisCode0
Prompting Lipschitz-constrained network for multiple-in-one sparse-view CT reconstructionCode0
A Physics-Informed Loss Function for Boundary-Consistent and Robust Artery Segmentation in DSA SequencesCode0
DINO-Tok: Adapting DINO for Visual TokenizersCode0
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging ModalitiesCode0
Sparse-to-Field Reconstruction via Stochastic Neural Dynamic Mode DecompositionCode0
OceanGym: A Benchmark Environment for Underwater Embodied AgentsCode0
From Forecasting to Planning: Policy World Model for Collaborative State-Action PredictionCode0
Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal ReasoningCode0
AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMsCode0
Show:102550
← PrevPage 277 of 18972Next →