SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 64766500 of 474278 papers

TitleStatusHype
Soft Decision Tree classifier: explainable and extendable PyTorch implementationCode0
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle0
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL0
CoDA: From Text-to-Image Diffusion Models to Training-Free Dataset DistillationCode0
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue0
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling0
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding0
PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation0
SkillFactory: Self-Distillation For Learning Cognitive Behaviors0
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design0
Thinking with Programming Vision: Towards a Unified View for Thinking with ImagesCode0
Look Around and Pay Attention: Multi-camera Point Tracking Reimagined with TransformersCode0
Addressing Logical Fallacies In Scientific Reasoning From Large Language Models: Towards a Dual-Inference Training FrameworkCode0
Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMsCode0
Principled RL for Diffusion LLMs Emerges from a Sequence-Level PerspectiveCode0
Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face PersonalizationCode0
DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature AlignmentCode0
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding PerturbationCode0
LoRA Patching: Exposing the Fragility of Proactive Defenses against DeepfakesCode0
Heatmap Pooling Network for Action Recognition from RGB VideosCode0
Score Distillation of Flow Matching Models0
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation0
OneThinker: All-in-one Reasoning Model for Image and Video0
CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map UnderstandingCode0
Different types of syntactic agreement recruit the same units within large language models0
Show:102550
← PrevPage 260 of 18972Next →