SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1240112450 of 474278 papers

TitleStatusHype
DayDreamer: World Models for Physical Robot LearningCode2
MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly DetectionCode2
Medical MLLM is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language ModelsCode2
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding DistillationCode2
Foundation Models for Video Understanding: A SurveyCode2
ODIN: A Single Model for 2D and 3D SegmentationCode2
Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-makingCode2
RelTR: Relation Transformer for Scene Graph GenerationCode2
Intrinsic Image Diffusion for Indoor Single-view Material EstimationCode2
V_kD: Improving Knowledge Distillation using Orthogonal ProjectionsCode2
Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in TencentCode2
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language ModelCode2
Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion AlgorithmsCode2
Robust Reflection Removal with Flash-only Cues in the WildCode2
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-TuningCode2
MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report GenerationCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
Verif.ai: Towards an Open-Source Scientific Generative Question-Answering System with Referenced and Verifiable AnswersCode2
Recurrent neural network wave functions for Rydberg atom arrays on kagome latticeCode2
RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan MapCode2
AceVFI: A Comprehensive Survey of Advances in Video Frame InterpolationCode2
Shikra: Unleashing Multimodal LLM's Referential Dialogue MagicCode2
Generative Multiplane Images: Making a 2D GAN 3D-AwareCode2
Wayformer: Motion Forecasting via Simple & Efficient Attention NetworksCode2
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair ClimbingCode2
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image RetrievalCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
Crystal-GFN: sampling crystals with desirable properties and constraintsCode2
GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-MeshCode2
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language ModelsCode2
MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly DetectionCode2
FaceID-6M: A Large-Scale, Open-Source FaceID Customization DatasetCode2
TAGLAS: An atlas of text-attributed graph datasets in the era of large graph and language modelsCode2
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual GroundingCode2
Universal Few-shot Learning of Dense Prediction Tasks with Visual Token MatchingCode2
HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detectionCode2
Generative Medical SegmentationCode2
Vision-Centric BEV Perception: A SurveyCode2
KNighter: Transforming Static Analysis with LLM-Synthesized CheckersCode2
STaR: Bootstrapping Reasoning With ReasoningCode2
VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale ScenesCode2
TableQuery: Querying tabular data with natural languageCode2
CVSS Corpus and Massively Multilingual Speech-to-Speech TranslationCode2
CrossFuse: A Novel Cross Attention Mechanism based Infrared and Visible Image Fusion ApproachCode2
Low-Rank Quantization-Aware Training for LLMsCode2
EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG ModelCode2
MolFM: A Multimodal Molecular Foundation ModelCode2
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially FastCode2
Show:102550
← PrevPage 249 of 9486Next →