SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1370113750 of 474278 papers

TitleStatusHype
Text2Cypher Across Languages: Evaluating Foundational Models Beyond English0
Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval0
OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs0
DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing0
From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging0
EVA: Mixture-of-Experts Semantic Variant Alignment for Compositional Zero-Shot Learning0
TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation0
VisionGuard: Synergistic Framework for Helmet Violation Detection0
Bridging Video Quality Scoring and Justification via Large Multimodal Models0
Multimodal Prompt Alignment for Facial Expression Recognition0
Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation0
CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization0
Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image0
GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding0
BitMark for Infinity: Watermarking Bitwise Autoregressive Image Generative Models0
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic0
HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation0
PanSt3R: Multi-view Consistent Panoptic Segmentation0
CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations0
CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection0
FastRef:Fast Prototype Refinement for Few-Shot Industrial Anomaly Detection0
Controllable 3D Placement of Objects with Scene-Aware Diffusion Models0
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation0
MADrive: Memory-Augmented Driving Scene Modeling0
SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark0
Personalized Federated Learning via Dual-Prompt Optimization and Cross Fusion0
Holistic Surgical Phase Recognition with Hierarchical Input Dependent State Space Models0
Inverse Scene Text RemovalCode0
Homogenization of Multi-agent Learning Dynamics in Finite-state Markov GamesCode0
Robust Deep Learning for Myocardial Scar Segmentation in Cardiac MRI with Noisy LabelsCode0
Task-Aware KV Compression For Cost-Effective Long Video UnderstandingCode0
G^2D: Boosting Multimodal Learning with Gradient-Guided DistillationCode0
Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?Code0
Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits CalibrationCode0
Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset EvaluationCode0
Adversarial Training: Enhancing Out-of-Distribution Generalization for Learning Wireless Resource Allocation0
Segment Anything in Pathology Images with Natural Language0
WAFT: Warping-Alone Field Transforms for Optical FlowCode2
Rethink Sparse Signals for Pose-guided Text-to-image GenerationCode0
LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object DetectionCode0
SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only PassesCode0
Learning to See in the Extremely DarkCode2
Evidence-based diagnostic reasoning with multi-agent copilot for human pathology0
Pushing Trade-Off Boundaries: Compact yet Effective Remote Sensing Change DetectionCode0
Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing0
SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification0
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models0
Temporal Rate Reduction Clustering for Human Motion Segmentation0
Co-Design of Sensing, Communications, and Control for Low-Altitude Wireless Networks0
Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon SimulationCode0
Show:102550
← PrevPage 275 of 9486Next →