SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 93019325 of 474278 papers

TitleStatusHype
Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative PriorCode2
FairCLIP: Harnessing Fairness in Vision-Language LearningCode2
MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image SegmentationCode2
Video-Based Human Pose Regression via Decoupled Space-Time AggregationCode2
AgileFormer: Spatially Agile Transformer UNet for Medical Image SegmentationCode2
Efficient Modulation for Vision NetworksCode2
SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large ObjectsCode2
DiJiang: Efficient Large Language Models through Compact KernelizationCode2
Motion Inversion for Video CustomizationCode2
Fully Geometric Panoramic LocalizationCode2
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language ModelsCode2
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt TuningCode2
Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image InpaintingCode2
StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image TranslationCode2
MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task LearningCode2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image AnalysisCode2
FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose GenerationCode2
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor QueriesCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
TOD3Cap: Towards 3D Dense Captioning in Outdoor ScenesCode2
MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical NeedsCode2
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose EstimationCode2
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head GenerationCode2
Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and AnalysisCode2
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal InteractionCode2
Show:102550
← PrevPage 373 of 18972Next →