SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 651-700 of 177,339 papers

Title | Status | Hype
GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond | Code | 5
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | Code | 5
Latte: Latent Diffusion Transformer for Video Generation | Code | 5
StableAnimator: High-Quality Identity-Preserving Human Image Animation | Code | 5
ImageBind-LLM: Multi-modality Instruction Tuning | Code | 5
DanceGRPO: Unleashing GRPO on Visual Generation | Code | 5
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models | Code | 5
BERTopic: Neural topic modeling with a class-based TF-IDF procedure | Code | 5
Structure-Aware Sparse-View X-ray 3D Reconstruction | Code | 5
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model | Code | 5
Do "English" Named Entity Recognizers Work Well on Global Englishes? | Code | 5
Matching Anything by Segmenting Anything | Code | 5
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning | Code | 5
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks | Code | 5
Learning to (Learn at Test Time): RNNs with Expressive Hidden States | Code | 5
FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion | Code | 5
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval | Code | 5
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Code | 5
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models | Code | 5
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders | Code | 5
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | Code | 5
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding | Code | 5
Wings: Learning Multimodal LLMs without Text-only Forgetting | Code | 5
OminiControl: Minimal and Universal Control for Diffusion Transformer | Code | 5
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI | Code | 5
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding | Code | 5
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text | Code | 5
GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond | Code | 5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Code | 5
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Code | 5
TrustRAG: An Information Assistant with Retrieval Augmented Generation | Code | 5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving | Code | 5
Parrot: Multilingual Visual Instruction Tuning | Code | 5
Improved Differentially Private Regression via Gradient Boosting | Code | 5
AIDE: AI-Driven Exploration in the Space of Code | Code | 5
WizardLM: Empowering Large Language Models to Follow Complex Instructions | Code | 5
Ovis: Structural Embedding Alignment for Multimodal Large Language Model | Code | 5
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation | Code | 5
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond | Code | 5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning | Code | 5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Code | 5
Assessing Language Model Deployment with Risk Cards | Code | 5
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions | Code | 5
SantaCoder: don't reach for the stars! | Code | 5
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts | Code | 5
Evolutionary Optimization of Model Merging Recipes | Code | 5
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator | Code | 5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models | Code | 5
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients | Code | 5
GraphCast: Learning skillful medium-range global weather forecasting | Code | 5
Page 14 of 3547