SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9101–9125 of 177340 papers

Title | Status | Hype
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation | Code | 2
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models | Code | 2
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Code | 2
SalM2: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention | Code | 2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Code | 2
A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations | Code | 2
Sanity Checking Causal Representation Learning on a Simple Real-World System | Code | 2
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation | Code | 2
A Training-free LLM-based Approach to General Chinese Character Error Correction | Code | 2
SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models | Code | 2
Neural Posterior Estimation for Cataloging Astronomical Images with Spatially Varying Backgrounds and Point Spread Functions | Code | 2
AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies | Code | 2
Patch-wise Structural Loss for Time Series Forecasting | Code | 2
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation | Code | 2
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments | Code | 2
PromptPex: Automatic Test Generation for Language Model Prompts | Code | 2
MPA: MultiPath++ Based Architecture for Motion Prediction | Code | 2
Mellow: a small audio language model for reasoning | Code | 2
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Code | 2
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection | Code | 2
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Code | 2
Rapid patient-specific neural networks for intraoperative X-ray to volume registration | Code | 2
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Code | 2
Tokenize Image as a Set | Code | 2
Hahaha | Code | 2