SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 51100 of 474278 papers

TitleStatusHype
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences0
Benchmarking Multi-View BEV Object Detection with Mixed Pinhole and Fisheye Cameras0
Spatial Orthogonal Refinement for Robust RGB-Event Visual Object Tracking0
Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch0
BCMDA: Bidirectional Correlation Maps Domain Adaptation for Mixed Domain Semi-Supervised Medical Image Segmentation0
SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering0
SPROUT: A Scalable Diffusion Foundation Model for Agricultural Vision0
OmniColor: A Unified Framework for Multi-modal Lineart Colorization0
LongCat-Next: Lexicalizing Modalities as Discrete Tokens0
FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies0
Emergent Social Intelligence Risks in Generative Multi-Agent Systems0
Dual-Path Learning based on Frequency Structural Decoupling and Regional-Aware Fusion for Low-Light Image Super-Resolution0
EpochX: Building the Infrastructure for an Emergent Agent Civilization0
HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors0
Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression0
Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring0
Inference-Time Structural Reasoning for Compositional Vision-Language Understanding0
NimbusGS: Unified 3D Scene Reconstruction under Hybrid Weather0
TrackMAE: Video Representation Learning via Track Mask and Predict0
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models0
Text Data Integration0
Structural Graph Probing of Vision-Language Models0
LightCtrl: Training-free Controllable Video Relighting0
Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision0
Communicating about Space: Language-Mediated Spatial Integration Across Partial Views0
Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models0
VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation0
RailVQA: A Benchmark and Framework for Efficient Interpretable Visual Cognition in Automatic Train Operation0
DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification0
DRUM: Diffusion-based Raydrop-aware Unpaired Mapping for Sim2Real LiDAR Segmentation0
FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants0
Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays0
Provably Contractive and High-Quality Denoisers for Convergent Restoration0
Consistency Beyond Contrast: Enhancing Open-Vocabulary Object Detection Robustness via Contextual Consistency Learning0
DUGAE: Unified Geometry and Attribute Enhancement via Spatiotemporal Correlations for G-PCC Compressed Dynamic Point Clouds0
Topology-Aware Graph Reinforcement Learning for Energy Storage Systems Optimal Dispatch in Distribution Networks0
Reflect to Inform: Boosting Multimodal Reasoning via Information-Gain-Driven Verification0
Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays0
Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones0
From Synthetic Data to Real Restorations: Diffusion Model for Patient-specific Dental Crown Completion0
Zero-Shot Depth from Defocus0
Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition0
VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection0
TTE-CAM: Built-in Class Activation Maps for Test-Time Explainability in Pretrained Black-Box CNNs0
A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models0
GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection0
TAPS: Task Aware Proposal Distributions for Speculative Sampling0
MOOZY: A Patient-First Foundation Model for Computational Pathology0
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT0
A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning0
Show:102550
← PrevPage 2 of 9486Next →