SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 48264850 of 661570 papers

TitleStatusHype
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion2
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs2
MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing2
From Word to World: Can Large Language Models be Implicit Text-based World Models?2
NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation2
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs2
Hyperspherical Latents Improve Continuous-Token Autoregressive Generation2
Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels2
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding2
Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator2
RealWonder: Real-Time Physical Action-Conditioned Video Generation2
EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding2
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model2
RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies2
CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video2
Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling2
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents2
Phi-4-reasoning-vision-15B Technical Report2
Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models2
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images2
SimRecon: SimReady Compositional Scene Reconstruction from Real Videos2
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization2
Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle2
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing2
MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning2
Show:102550
← PrevPage 194 of 26463Next →