SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,216 code links4,818 tasks

Papers

Showing 101125 of 658356 papers

TitleStatusHype
Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation0
Rethinking Ground Truth: A Case Study on Human Label Variation in MLLM Benchmarking0
PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement0
Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation0
Growing Networks with Autonomous Pruning0
PCSTracker: Long-Term Scene Flow Estimation for Point Cloud Sequences0
FREAK: A Fine-grained Hallucination Evaluation Benchmark for Advanced MLLMs0
FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision0
Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders0
Template-based Object Detection Using a Foundation Model0
Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach0
Embodied Science: Closing the Discovery Loop with Agentic Embodied AI0
Learning Hierarchical Orthogonal Prototypes for Generalized Few-Shot 3D Point Cloud Segmentation0
Decoupled Sensitivity-Consistency Learning for Weakly Supervised Video Anomaly DetectionCode0
From Plausibility to Verifiability: Risk-Controlled Generative OCR for Vision-Language Models0
Quantifying Gate Contribution in Quantum Feature Maps for Scalable Circuit Optimization0
Scalable Learning of Multivariate Distributions via Coresets0
Controllable Text-to-Motion Generation via Modular Body-Part Phase Control0
Offshore oil and gas platform dynamics in the North Sea, Gulf of Mexico, and Persian Gulf: Exploiting the Sentinel-1 archive0
Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training0
Eye Gaze-Informed and Context-Aware Pedestrian Trajectory Prediction in Shared Spaces with Automated Shuttles: A Virtual Reality Study0
GDEGAN: Gaussian Dynamic Equivariant Graph Attention Network for Ligand Binding Site Prediction0
HUGE-Bench: A Benchmark for High-Level UAV Vision-Language-Action Tasks0
FrameNet Semantic Role Classification by Analogy0
FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse and Prover-Effective Autoformalization0
Show:102550
← PrevPage 5 of 26335Next →