SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 801850 of 659983 papers

TitleStatusHype
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion TransformerCode5
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object ManipulationCode5
TS3-Codec: Transformer-Based Simple Streaming Single CodecCode5
ShowUI: One Vision-Language-Action Model for GUI Visual AgentCode5
StableAnimator: High-Quality Identity-Preserving Human Image AnimationCode5
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image DetectionCode5
OminiControl: Minimal and Universal Control for Diffusion TransformerCode5
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous DrivingCode5
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language ModelsCode5
MambaIRv2: Attentive State Space RestorationCode5
Multimodal Autoregressive Pre-training of Large Vision EncodersCode5
Marco-o1: Towards Open Reasoning Models for Open-Ended SolutionsCode5
DINO-X: A Unified Vision Model for Open-World Object Detection and UnderstandingCode5
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMsCode5
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative ModelsCode5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer UseCode5
That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip DesignCode5
Watermark Anything with Localized MessagesCode5
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation ModelsCode5
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by TencentCode5
Randomized Autoregressive Visual GenerationCode5
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale ScenesCode5
Neural Fields in Robotics: A SurveyCode5
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset CurationCode5
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical AgentsCode5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal ModelsCode5
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive AnalysisCode5
Allegro: Open the Black Box of Commercial-Level Video Generation ModelCode5
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-DictionaryCode5
DepthSplat: Connecting Gaussian Splatting and DepthCode5
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio GenerationCode5
Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex CapabilitiesCode5
KBLaM: Knowledge Base augmented Language ModelCode5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture ModificationCode5
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of ExpertsCode5
OpenR: An Open Source Framework for Advanced Reasoning with Large Language ModelsCode5
Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRICode5
Low Bitrate High-Quality RVQGAN-based Discrete Speech TokenizerCode5
RDT-1B: a Diffusion Foundation Model for Bimanual ManipulationCode5
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal RepresentationsCode5
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image GenerationCode5
Enabling Novel Mission Operations and Interactions with ROSA: The Robot Operating System AgentCode5
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You ThinkCode5
MLE-bench: Evaluating Machine Learning Agents on Machine Learning EngineeringCode5
Aria: An Open Multimodal Native Mixture-of-Experts ModelCode5
MonST3R: A Simple Approach for Estimating Geometry in the Presence of MotionCode5
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical ReasoningCode5
Loki: An Open-Source Tool for Fact VerificationCode5
Maia-2: A Unified Model for Human-AI Alignment in ChessCode5
Showing Many Labels in Multi-label Classification Models: An Empirical Study of Adversarial ExamplesCode5
Show:102550
← PrevPage 17 of 13200Next →