SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 73767400 of 474278 papers

TitleStatusHype
Axis-Aligned Document DewarpingCode0
Visual Document Understanding and Reasoning: A Multi-Agent Collaboration Framework with Agent-Wise Adaptive Test-Time Scaling0
FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation0
Instella: Fully Open Language Models with Stellar Performance0
CardioEmbed: Domain-Specialized Text Embeddings for Clinical Cardiology0
Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D RetrievalCode0
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers0
Coordinative Learning with Ordinal and Relational Priors for Volumetric Medical Image SegmentationCode0
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism0
MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model0
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation0
Bridging Hidden States in Vision-Language ModelsCode0
Enhancing Meme Emotion Understanding with Multi-Level Modality Enhancement and Dual-Stage Modal FusionCode0
UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight ScenariosCode0
Building the Web for Agents: A Declarative Framework for Agent-Web Interaction0
Proactive Hearing Assistants that Isolate Egocentric Conversations0
Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn InteractionCode0
Generative AI in Map-Making: A Technical Exploration and Its Implications for CartographersCode0
Neuro-Spectral Architectures for Causal Physics-Informed NetworksCode0
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code GenerationCode0
LeJEPA: Provable and Scalable Self-Supervised Learning Without the HeuristicsCode0
Human-Corrected Labels Learning: Enhancing Labels Quality via Human Correction of VLMs DiscrepanciesCode0
STAGE: A Symbolic Tensor grAph GEnerator for distributed AI system co-designCode0
Language-Guided Graph Representation Learning for Video SummarizationCode0
MAFM^3: Modular Adaptation of Foundation Models for Multi-Modal Medical AICode0
Show:102550
← PrevPage 296 of 18972Next →