SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8676-8700 of 474,278 papers

| Title | Status | Hype |
|---|---|---|
| Adapting Self-Supervised Representations as a Latent Space for Efficient Generation | | 0 |
| In-Context Learning with Unpaired Clips for Instruction-based Video Editing | | 0 |
| COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes | | 0 |
| Reasoning with Sampling: Your Base Model is Smarter Than You Think | | 0 |
| Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning | | 0 |
| Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents | | 0 |
| Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models | | 0 |
| Qwen3Guard Technical Report | | 0 |
| Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents | | 0 |
| WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving | | 0 |
| Geometric Moment Alignment for Domain Adaptation via Siegel Embeddings | Code | 0 |
| LOTA: Bit-Planes Guided AI-Generated Image Detection | Code | 0 |
| TED++: Submanifold-Aware Backdoor Detection via Layerwise Tubular-Neighbourhood Screening | Code | 0 |
| SUM-AgriVLN: Spatial Understanding Memory for Agricultural Vision-and-Language Navigation | Code | 0 |
| Spatial Preference Rewarding for MLLMs Spatial Understanding | Code | 0 |
| Deep Compositional Phase Diffusion for Long Motion Sequence Generation | Code | 0 |
| Talking Points: Describing and Localizing Pixels | Code | 0 |
| When Planners Meet Reality: How Learned, Reactive Traffic Agents Shift nuPlan Benchmarks | Code | 0 |
| Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation | Code | 0 |
| Stable Prediction of Adverse Events in Medical Time-Series Data | Code | 0 |
| Structured Universal Adversarial Attacks on Object Detection for Video Sequences | Code | 0 |
| ColorBench: Benchmarking Mobile Agents with Graph-Structured Framework for Complex Long-Horizon Tasks | Code | 0 |
| Nonparametric Data Attribution for Diffusion Models | Code | 0 |
| Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption | Code | 0 |
| PIA: Deepfake Detection Using Phoneme-Temporal and Identity-Dynamic Analysis | Code | 0 |
Page 348 of 18,972