SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 7251-7300 of 661,570 papers

Title | Status | Hype
Efficient Cross-View Localization in 6G Space-Air-Ground Integrated Network | | 0
Deployment-Time Reliability of Learned Robot Policies | | 0
Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary | | 0
Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects | | 0
MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models | | 0
Zero-Shot Cross-City Generalization in End-to-End Autonomous Driving: Self-Supervised versus Supervised Representations | | 0
Beyond Single-Sample: Reliable Multi-Sample Distillation for Video Understanding | | 0
A Stable Neural Statistical Dependence Estimator for Autoencoder Feature Analysis | | 0
Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing | | 0
Stay in your Lane: Role Specific Queries with Overlap Suppression Loss for Dense Video Captioning | | 0
Detect Anything in Real Time: From Single-Prompt Segmentation to Multi-Class Detection | Code | 0
LLM-Assisted Causal Structure Disambiguation and Factor Extraction for Legal Judgment Prediction | | 0
UniHetCO: A Unified Heterogeneous Representation for Multi-Problem Learning in Unsupervised Neural Combinatorial Optimization | | 0
HawkesRank: Event-Driven Centrality for Real-Time Importance Ranking | | 0
Deep Learning Network-Temporal Models For Traffic Prediction | | 0
Hypercomplex Widely Linear Processing: Fundamentals for Quaternion Machine Learning | | 0
ActiveFreq: Integrating Active Learning and Frequency Domain Analysis for Interactive Segmentation | | 0
Grammar of the Wave: Towards Explainable Multivariate Time Series Event Detection via Neuro-Symbolic VLM Agents | | 0
INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs | | 0
OrthoEraser: Coupled-Neuron Orthogonal Projection for Concept Erasure | | 0
LongFlow: Efficient KV Cache Compression for Reasoning M | | 0
Manifold-Optimal Guidance: A Unified Riemannian Control View of Diffusion Guidance | | 0
Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D | | 0
R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection | | 0
From Pen Strokes to Sleep States: Detecting Low-Recovery Days Using Sigma-Lognormal Handwriting Features | | 0
Can Small Language Models Use What They Retrieve? An Empirical Study of Retrieval Utilization Across Model Scale | | 0
Prediction of Grade, Gender, and Academic Performance of Children and Teenagers from Handwriting Using the Sigma-Lognormal Model | | 0
CAETC: Causal Autoencoding and Treatment Conditioning for Counterfactual Estimation over Time | | 0
UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution | | 0
FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval | | 0
Simultaneous estimation of multiple discrete unimodal distributions under stochastic order constraints | | 0
Risk-Controllable Multi-View Diffusion for Driving Scenario Generation | | 0
Enhancing Image Aesthetics with Dual-Conditioned Diffusion Models Guided by Multimodal Perception | | 0
Multi-Task Anti-Causal Learning for Reconstructing Urban Events from Residents' Reports | | 0
Shadowless Projection Mapping for Tabletop Workspaces with Synthetic Aperture Projector | | 0
Gender Bias in Generative AI-assisted Recruitment Processes | | 0
AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions | | 0
How Intelligence Emerges: A Minimal Theory of Dynamic Adaptive Coordination | | 0
Where Matters More Than What: Decoding-aligned KV Cache Compression via Position-aware Pseudo Queries | | 0
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing | | 0
Leveraging Large Language Models and Survival Analysis for Early Prediction of Chemotherapy Outcomes | | 0
Performance Evaluation of Open-Source Large Language Models for Assisting Pathology Report Writing in Japanese | | 0
AutoScout: Structured Optimization for Automating ML System Configuration | | 0
LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference | | 0
Articulat3D: Reconstructing Articulated Digital Twins From Monocular Videos with Geometric and Motion Constraints | | 0
DyWeight: Dynamic Gradient Weighting for Few-Step Diffusion Sampling | Code | 0
Fractional Rotation, Full Potential? Investigating Performance and Convergence of Partial RoPE | | 0
SemiTooth: a Generalizable Semi-supervised Framework for Multi-Source Tooth Segmentation | | 0
Noise-aware few-shot learning through bi-directional multi-view prompt alignment | | 0
MedPruner: Training-Free Hierarchical Token Pruning for Efficient 3D Medical Image Understanding in Vision-Language Models | | 0