SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 88268850 of 474278 papers

TitleStatusHype
High-resolution Photo Enhancement in Real-time: A Laplacian Pyramid NetworkCode0
J-ORA: A Framework and Multimodal Dataset for Japanese Object Identification, Reference, Action Prediction in Robot Perception0
Learning to Watermark: A Selective Watermarking Framework for Large Language Models via Multi-Objective OptimizationCode0
ProofFlow: A Dependency Graph Approach to Faithful Proof AutoformalizationCode0
CoLoR-GAN: Continual Few-Shot Learning with Low-Rank Adaptation in Generative Adversarial NetworksCode0
Self-Training with Dynamic Weighting for Robust Gradual Domain AdaptationCode0
Neon: Negative Extrapolation From Self-Training Improves Image GenerationCode0
Robust Ego-Exo Correspondence with Long-Term MemoryCode0
Graph Neural Network-Based Multicast Routing for On-Demand Streaming Services in 6G NetworksCode0
Aligning Deep Implicit Preferences by Learning to Reason DefensivelyCode0
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding0
Judge Before Answer: Can MLLM Discern the False Premise in Question?0
Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across ScalesCode0
torchsom: The Reference PyTorch Library for Self-Organizing MapsCode0
VideoAds for Fast-Paced Video Understanding0
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models0
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer0
Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models0
From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood EstimationCode0
SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction0
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation0
Failure Prediction at Runtime for Generative Robot Policies0
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning0
EAGER: Entropy-Aware GEneRation for Adaptive Inference-Time Scaling0
A Survey on Agentic Multimodal Large Language ModelsCode0
Show:102550
← PrevPage 354 of 18972Next →