SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 90269050 of 474278 papers

TitleStatusHype
DeepPrune: Parallel Scaling without Inter-trace Redundancy0
InstructX: Towards Unified Visual Editing with MLLM Guidance0
ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation0
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning0
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints0
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation0
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models0
Reinforcing Diffusion Models by Direct Group Preference OptimizationCode0
IMAGHarmony: Controllable Image Editing with Consistent Object Quantity and LayoutCode0
Paper2Video: Automatic Video Generation from Scientific PapersCode0
AutoMLGen: Navigating Fine-Grained Optimization for Coding AgentsCode0
Discrete Compositional Generation via General Soft Operators and Robust Reinforcement LearningCode0
PhyDAE: Physics-Guided Degradation-Adaptive Experts for All-in-One Remote Sensing Image RestorationCode0
Graph Diffusion Transformers are In-Context Molecular DesignersCode0
On the Alignment Between Supervised and Self-Supervised Contrastive LearningCode0
D-CoDe: Scaling Image-Pretrained VLMs to Video via Dynamic Compression and Question DecompositionCode0
Long-Tailed Recognition via Information-Preservable Two-Stage LearningCode0
Accelerated Aggregated D-Optimal Designs for Estimating Main Effects in Black-Box ModelsCode0
A Survey of Reinforcement Learning for Large Reasoning ModelsCode0
How to Teach Large Multimodal Models New SkillsCode0
Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language NavigationCode0
Training a Foundation Model for Materials on a BudgetCode0
A^2Search: Ambiguity-Aware Question Answering with Reinforcement LearningCode0
SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 TweetsCode0
Multilingual Generative Retrieval via Cross-lingual Semantic Compression0
Show:102550
← PrevPage 362 of 18972Next →