SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1145111500 of 474278 papers

TitleStatusHype
Policy-Guided DiffusionCode2
HyperDiffusion: Generating Implicit Neural Fields with Weight-Space DiffusionCode2
Self-Calibrated CLIP for Training-Free Open-Vocabulary SegmentationCode2
Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph CompletionCode2
Proactive Agents for Multi-Turn Text-to-Image Generation Under UncertaintyCode2
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence EmbeddingsCode2
SuperSVG: Superpixel-based Scalable Vector Graphics SynthesisCode2
Empirical Sample Complexity of Neural Network Mixed State ReconstructionCode2
DayDreamer: World Models for Physical Robot LearningCode2
MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly DetectionCode2
Medical MLLM is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language ModelsCode2
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding DistillationCode2
Foundation Models for Video Understanding: A SurveyCode2
ODIN: A Single Model for 2D and 3D SegmentationCode2
Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-makingCode2
RelTR: Relation Transformer for Scene Graph GenerationCode2
Intrinsic Image Diffusion for Indoor Single-view Material EstimationCode2
V_kD: Improving Knowledge Distillation using Orthogonal ProjectionsCode2
Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in TencentCode2
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language ModelCode2
Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion AlgorithmsCode2
Robust Reflection Removal with Flash-only Cues in the WildCode2
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-TuningCode2
MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report GenerationCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
Verif.ai: Towards an Open-Source Scientific Generative Question-Answering System with Referenced and Verifiable AnswersCode2
Recurrent neural network wave functions for Rydberg atom arrays on kagome latticeCode2
RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan MapCode2
AceVFI: A Comprehensive Survey of Advances in Video Frame InterpolationCode2
Shikra: Unleashing Multimodal LLM's Referential Dialogue MagicCode2
Generative Multiplane Images: Making a 2D GAN 3D-AwareCode2
Wayformer: Motion Forecasting via Simple & Efficient Attention NetworksCode2
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair ClimbingCode2
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image RetrievalCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
Crystal-GFN: sampling crystals with desirable properties and constraintsCode2
GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-MeshCode2
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language ModelsCode2
MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly DetectionCode2
FaceID-6M: A Large-Scale, Open-Source FaceID Customization DatasetCode2
TAGLAS: An atlas of text-attributed graph datasets in the era of large graph and language modelsCode2
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual GroundingCode2
Universal Few-shot Learning of Dense Prediction Tasks with Visual Token MatchingCode2
HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detectionCode2
Generative Medical SegmentationCode2
Vision-Centric BEV Perception: A SurveyCode2
KNighter: Transforming Static Analysis with LLM-Synthesized CheckersCode2
STaR: Bootstrapping Reasoning With ReasoningCode2
VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale ScenesCode2
Show:102550
← PrevPage 230 of 9486Next →