SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 76-100 of 474,278 papers

Title | Status | Hype
Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models |  | 0
VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation |  | 0
RailVQA: A Benchmark and Framework for Efficient Interpretable Visual Cognition in Automatic Train Operation |  | 0
DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification |  | 0
DRUM: Diffusion-based Raydrop-aware Unpaired Mapping for Sim2Real LiDAR Segmentation |  | 0
FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants |  | 0
Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays |  | 0
Provably Contractive and High-Quality Denoisers for Convergent Restoration |  | 0
Consistency Beyond Contrast: Enhancing Open-Vocabulary Object Detection Robustness via Contextual Consistency Learning |  | 0
DUGAE: Unified Geometry and Attribute Enhancement via Spatiotemporal Correlations for G-PCC Compressed Dynamic Point Clouds |  | 0
Topology-Aware Graph Reinforcement Learning for Energy Storage Systems Optimal Dispatch in Distribution Networks |  | 0
Reflect to Inform: Boosting Multimodal Reasoning via Information-Gain-Driven Verification |  | 0
Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays |  | 0
Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones |  | 0
From Synthetic Data to Real Restorations: Diffusion Model for Patient-specific Dental Crown Completion |  | 0
Zero-Shot Depth from Defocus |  | 0
Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition |  | 0
VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection |  | 0
TTE-CAM: Built-in Class Activation Maps for Test-Time Explainability in Pretrained Black-Box CNNs |  | 0
A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models |  | 0
GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection |  | 0
TAPS: Task Aware Proposal Distributions for Speculative Sampling |  | 0
MOOZY: A Patient-First Foundation Model for Computational Pathology |  | 0
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT |  | 0
A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning |  | 0