SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5801–5825 of 474,278 papers

Title | Status | Hype
Concat-ID: Towards Universal Identity-Preserving Video Synthesis | Code | 2
LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models | Code | 2
PENCIL: Long Thoughts with Short Memory | Code | 2
Where do Large Vision-Language Models Look at when Answering Questions? | Code | 2
DAPO: An Open-Source LLM Reinforcement Learning System at Scale | Code | 2
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes | Code | 2
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model | Code | 2
ViSpeak: Visual Instruction Feedback in Streaming Videos | Code | 2
φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation | Code | 2
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models | Code | 2
Free-form language-based robotic reasoning and grasping | Code | 2
DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry | Code | 2
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation | Code | 2
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Code | 2
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation | Code | 2
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding | Code | 2
GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Code | 2
Open3DBench: Open-Source Benchmark for 3D-IC Backend Implementation and PPA Evaluation | Code | 2
Multi-modal Time Series Analysis: A Tutorial and Survey | Code | 2
Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing Process | Code | 2
TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM | Code | 2
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception | Code | 2
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars | Code | 2
All You Need to Know About Training Image Retrieval Models | Code | 2
MambaIC: State Space Models for High-Performance Learned Image Compression | Code | 2
Page 233 of 18,972