SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 93019350 of 661570 papers

TitleStatusHype
FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose GenerationCode2
MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image SegmentationCode2
Video-Based Human Pose Regression via Decoupled Space-Time AggregationCode2
FairCLIP: Harnessing Fairness in Vision-Language LearningCode2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image AnalysisCode2
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language ModelsCode2
AgileFormer: Spatially Agile Transformer UNet for Medical Image SegmentationCode2
SceneTracker: Long-term Scene Flow Estimation NetworkCode2
Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative PriorCode2
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor QueriesCode2
Efficient Modulation for Vision NetworksCode2
Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image InpaintingCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image TranslationCode2
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt TuningCode2
Fully Geometric Panoramic LocalizationCode2
MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task LearningCode2
Motion Inversion for Video CustomizationCode2
DiJiang: Efficient Large Language Models through Compact KernelizationCode2
Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and AnalysisCode2
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head GenerationCode2
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTsCode2
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal InteractionCode2
A Review of Graph Neural Networks in Epidemic ModelingCode2
GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAMCode2
RecDiffusion: Rectangling for Image Stitching with Diffusion ModelsCode2
Infrared Small Target Detection with Scale and Location SensitivityCode2
Disentangling Length from Quality in Direct Preference OptimizationCode2
Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLMCode2
TOD3Cap: Towards 3D Dense Captioning in Outdoor ScenesCode2
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose EstimationCode2
Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous DrivingCode2
GraphAD: Interaction Scene Graph for End-to-end Autonomous DrivingCode2
BAMM: Bidirectional Autoregressive Motion ModelCode2
SA-GS: Scale-Adaptive Gaussian Splatting for Training-Free Anti-AliasingCode2
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality PropagationCode2
MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical NeedsCode2
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstructionCode2
A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness ConstraintCode2
Efficient Heatmap-Guided 6-Dof Grasp Detection in Cluttered ScenesCode2
An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLMCode2
IDGenRec: LLM-RecSys Alignment with Textual ID LearningCode2
Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive DecodingCode2
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory PredictionCode2
Attention Calibration for Disentangled Text-to-Image PersonalizationCode2
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech SeparationCode2
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical DecodingCode2
LITA: Language Instructed Temporal-Localization AssistantCode2
Generative Medical SegmentationCode2
Garment3DGen: 3D Garment Stylization and Texture GenerationCode2
Show:102550
← PrevPage 187 of 13232Next →