SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1635116400 of 474278 papers

TitleStatusHype
Adding simple structure at inference improves Vision-Language CompositionalityCode0
PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI AssistantsCode0
How much is too much? Measuring divergence from Benford's Law with the Equivalent Contamination Proportion (ECP)Code0
CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal BrainCode0
Intent Factored Generation: Unleashing the Diversity in Your Language ModelCode0
On the Similarities of Embeddings in Contrastive LearningCode1
Guided Graph Compression for Quantum Graph Neural NetworksCode0
Patient-Specific Deep Reinforcement Learning for Automatic Replanning in Head-and-Neck Cancer Proton Therapy0
AI5GTest: AI-Driven Specification-Aware Automated Testing and Validation of 5G O-RAN Components0
Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems0
Regularizing Learnable Feature Extraction for Automatic Speech Recognition0
Retrieval of Surface Solar Radiation through Implicit Albedo Recovery from Temporal ContextCode0
Hearing Hands: Generating Sounds from Physical Interactions in 3D ScenesCode0
A Navigation Framework Utilizing Vision-Language ModelsCode0
Large Language Models for Toxic Language Detection in Low-Resource Balkan LanguagesCode0
Ming-Omni: A Unified Multimodal Model for Perception and GenerationCode4
MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic ConditionsCode0
California Crop Yield Benchmark: Combining Satellite Image, Climate, Evapotranspiration, and Soil Data Layers for County-Level Yield Forecasting of Over 70 CropsCode1
Outside Knowledge Conversational Video (OKCV) Dataset -- Dialoguing over VideosCode0
A Deep Generative Model for the Simulation of Discrete Karst Networks0
A Weighted Loss Approach to Robust Federated Learning under Data Heterogeneity0
RoCA: Robust Cross-Domain End-to-End Autonomous Driving0
HadaNorm: Diffusion Transformer Quantization through Mean-Centered Transformations0
UmbraTTS: Adapting Text-to-Speech to Environmental Contexts with Flow Matching0
Conditional diffusion models for guided anomaly detection in brain images using fluid-driven anomaly randomization0
ODG: Occupancy Prediction Using Dual Gaussians0
Effective Red-Teaming of Policy-Adherent Agents0
Adversarial Surrogate Risk Bounds for Binary Classification0
Accurate and efficient zero-shot 6D pose estimation with frozen foundation models0
Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints0
Class Similarity-Based Multimodal Classification under Heterogeneous Category Sets0
PlayerOne: Egocentric World Simulator0
Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments0
"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)0
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability0
Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy0
CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings0
When Is Diversity Rewarded in Cooperative Multi-Agent Learning?0
Towards Multi-modal Graph Large Language Model0
Measuring Communication Quality of Interest Rate AnnouncementsCode0
GLGENN: A Novel Parameter-Light Equivariant Neural Networks Architecture Based on Clifford Geometric AlgebrasCode1
A Manually Annotated Image-Caption Dataset for Detecting Children in the WildCode0
Structural-Spectral Graph Convolution with Evidential Edge Learning for Hyperspectral Image ClusteringCode0
A Hierarchical Probabilistic Framework for Incremental Knowledge Tracing in Classroom SettingsCode0
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and PlanningCode7
Inverting Black-Box Face Recognition Systems via Zero-Order Optimization in Eigenface SpaceCode0
EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule NetworksCode0
SAFE: Multitask Failure Detection for Vision-Language-Action Models0
MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed0
Causal Climate Emulation with Bayesian Filtering0
Show:102550
← PrevPage 328 of 9486Next →