SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7701–7725 of 474,278 papers

Title | Status | Hype
A Dual Perspective on Decision-Focused Learning: Scalable Training via Dual-Guided Surrogates | Code | 0
Role-SynthCLIP: A Role Play Driven Diverse Synthetic Data Approach | Code | 0
Diffusion-Based Electromagnetic Inverse Design of Scattering Structured Media | Code | 0
Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Code | 0
Sharing the Learned Knowledge-base to Estimate Convolutional Filter Parameters for Continual Image Restoration | Code | 0
TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning | Code | 0
Caption-Driven Explainability: Probing CNNs for Bias via CLIP | Code | 0
Cambrian-S: Towards Spatial Supersensing in Video | - | 0
MIDI-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation | - | 0
RealDPO: Real or Not Real, that is the Preference | - | 0
Optimized Minimal 3D Gaussian Splatting | - | 0
Hierarchical Retrieval with Evidence Curation for Open-Domain Financial Question Answering on Standardized Documents | Code | 0
BasicAVSR: Arbitrary-Scale Video Super-Resolution via Image Priors and Enhanced Motion Compensation | Code | 0
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms | - | 0
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm | - | 0
Text2VectorSQL: Towards a Unified Interface for Vector Search and SQL Queries | Code | 0
DORAEMON: A Unified Library for Visual Object Modeling and Representation Learning at Scale | Code | 0
FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models | Code | 0
Residual Diffusion Bridge Model for Image Restoration | Code | 0
PETRA: Pretrained Evolutionary Transformer for SARS-CoV-2 Mutation Prediction | Code | 0
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization | Code | 0
BAPPA: Benchmarking Agents, Plans, and Pipelines for Automated Text-to-SQL Generation | Code | 0
AStF: Motion Style Transfer via Adaptive Statistics Fusor | Code | 0
BoRe-Depth: Self-supervised Monocular Depth Estimation with Boundary Refinement for Embedded Systems | Code | 0
Linear Mode Connectivity under Data Shifts for Deep Ensembles of Image Classifiers | Code | 0