SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9801 to 9825 of 474,278 papers

Title | Status | Hype
Contextualized Diffusion Models for Text-Guided Image and Video Generation | Code | 2
UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images | Code | 2
SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation | Code | 2
An Integrated Data Processing Framework for Pretraining Foundation Models | Code | 2
Defending LLMs against Jailbreaking Attacks via Backtranslation | Code | 2
CARTE: Pretraining and Transfer for Tabular Learning | Code | 2
Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models | Code | 2
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision | Code | 2
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models | Code | 2
An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation | Code | 2
Feedback Efficient Online Fine-Tuning of Diffusion Models | Code | 2
Rethinking Negative Instances for Generative Named Entity Recognition | Code | 2
DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design | Code | 2
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models | Code | 2
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers | Code | 2
HiGPT: Heterogeneous Graph Language Model | Code | 2
Deep Homography Estimation for Visual Place Recognition | Code | 2
GraphWiz: An Instruction-Following Language Model for Graph Problems | Code | 2
VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Code | 2
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Code | 2
Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning | Code | 2
Reliable Conflictive Multi-View Learning | Code | 2
HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | Code | 2
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation | Code | 2
ToMBench: Benchmarking Theory of Mind in Large Language Models | Code | 2