SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 54265450 of 661570 papers

TitleStatusHype
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search AgentCode2
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMsCode2
GuidedQuant: Large Language Model Quantization via Exploiting End Loss GuidanceCode2
ReplayCAD: Generative Diffusion Replay for Continual Anomaly DetectionCode2
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model CapabilitiesCode2
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and SegmentationCode2
Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVACode2
Diffusion Model Quantization: A ReviewCode2
StabStitch++: Unsupervised Online Video Stitching with Spatiotemporal Bidirectional WarpsCode2
SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data AugmentationCode2
Foam-Agent: Towards Automated Intelligent CFD WorkflowsCode2
InstanceGen: Image Generation with Instance-level InstructionsCode2
Bring Reason to Vision: Understanding Perception and Reasoning through Model MergingCode2
Steerable Scene Generation with Post Training and Inference-Time SearchCode2
EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement LearningCode2
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan WorldCode2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D GenerationCode2
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data RestorationCode2
TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh OptimizationCode2
Non-stationary Diffusion For Probabilistic Time Series ForecastingCode2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionCode2
Rethinking Boundary Detection in Deep Learning-Based Medical Image SegmentationCode2
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative SynchronizationCode2
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language ModelsCode2
RM-R1: Reward Modeling as ReasoningCode2
Show:102550
← PrevPage 218 of 26463Next →