SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 59766000 of 177340 papers

TitleStatusHype
A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online AdaptationCode2
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image InterpretationCode2
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short DramaCode2
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual LocalizationCode2
TraDiffusion: Trajectory-Based Training-Free Image GenerationCode2
Mephisto: A Framework for Portable, Reproducible, and Iterative CrowdsourcingCode2
Attention-based Deep Multiple Instance LearningCode2
Interacting Attention Graph for Single Image Two-Hand ReconstructionCode2
Frequency-domain MLPs are More Effective Learners in Time Series ForecastingCode2
REALY: Rethinking the Evaluation of 3D Face ReconstructionCode2
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-TrainingCode2
Does Image Anonymization Impact Computer Vision Training?Code2
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving ScenarioCode2
In-Context Imitation Learning via Next-Token PredictionCode2
A Hybrid Transformer-Mamba Network for Single Image DerainingCode2
Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning IncentivizationCode2
LViT: Language meets Vision Transformer in Medical Image SegmentationCode2
gRNAde: Geometric Deep Learning for 3D RNA inverse designCode2
Uni3D: Exploring Unified 3D Representation at ScaleCode2
OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution FittingCode2
VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentationCode2
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for ReasoningCode2
Cross-Prediction-Powered InferenceCode2
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task AutomationCode2
FedCLIP: Fast Generalization and Personalization for CLIP in Federated LearningCode2
Show:102550
← PrevPage 240 of 7094Next →