SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers · 258,216 code links · 4,818 tasks

Papers

Showing 101–150 of 658,356 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation | | 0 |
| Rethinking Ground Truth: A Case Study on Human Label Variation in MLLM Benchmarking | | 0 |
| PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement | | 0 |
| Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation | | 0 |
| Growing Networks with Autonomous Pruning | | 0 |
| PCSTracker: Long-Term Scene Flow Estimation for Point Cloud Sequences | | 0 |
| FREAK: A Fine-grained Hallucination Evaluation Benchmark for Advanced MLLMs | | 0 |
| FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision | | 0 |
| Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders | | 0 |
| Template-based Object Detection Using a Foundation Model | | 0 |
| Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach | | 0 |
| Embodied Science: Closing the Discovery Loop with Agentic Embodied AI | | 0 |
| Learning Hierarchical Orthogonal Prototypes for Generalized Few-Shot 3D Point Cloud Segmentation | | 0 |
| Decoupled Sensitivity-Consistency Learning for Weakly Supervised Video Anomaly Detection | Code | 0 |
| From Plausibility to Verifiability: Risk-Controlled Generative OCR for Vision-Language Models | | 0 |
| Quantifying Gate Contribution in Quantum Feature Maps for Scalable Circuit Optimization | | 0 |
| Scalable Learning of Multivariate Distributions via Coresets | | 0 |
| Controllable Text-to-Motion Generation via Modular Body-Part Phase Control | | 0 |
| Offshore oil and gas platform dynamics in the North Sea, Gulf of Mexico, and Persian Gulf: Exploiting the Sentinel-1 archive | | 0 |
| Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training | | 0 |
| Eye Gaze-Informed and Context-Aware Pedestrian Trajectory Prediction in Shared Spaces with Automated Shuttles: A Virtual Reality Study | | 0 |
| GDEGAN: Gaussian Dynamic Equivariant Graph Attention Network for Ligand Binding Site Prediction | | 0 |
| HUGE-Bench: A Benchmark for High-Level UAV Vision-Language-Action Tasks | | 0 |
| FrameNet Semantic Role Classification by Analogy | | 0 |
| FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse and Prover-Effective Autoformalization | | 0 |
| Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech? | | 0 |
| Fourier Splatting: Generalized Fourier encoded primitives for scalable radiance fields | | 0 |
| FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization | | 0 |
| Explainable cluster analysis: a bagging approach | | 0 |
| Modeling subgrid scale production rates on complex meshes using graph neural networks | | 0 |
| Overreliance on AI in Information-seeking from Video Content | | 0 |
| Hyper-Connections for Adaptive Multi-Modal MRI Brain Tumor Segmentation | | 0 |
| Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue | | 0 |
| Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them | | 0 |
| FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts | | 0 |
| MedQ-Engine: A Closed-Loop Data Engine for Evolving MLLMs in Medical Image Quality Assessment | | 0 |
| On the Dynamics & Transferability of Latent Generalization during Memorization | | 0 |
| SIMPLER: Efficient Foundation Model Adaptation via Similarity-Guided Layer Pruning for Earth Observation | | 0 |
| Minimax Generalized Cross-Entropy | | 0 |
| Discovery of Decision Synchronization Patterns from Event Logs | | 0 |
| Utility-Guided Agent Orchestration for Efficient LLM Tool Use | | 0 |
| Revealing Domain-Spatiality Patterns for Configuration Tuning: Domain Knowledge Meets Fitness Landscapes | | 0 |
| Infinite-dimensional spherical-radial decomposition for probabilistic functions, with application to constrained optimal control and Gaussian process regression | | 0 |
| PanORama: Multiview Consistent Panoptic Segmentation in Operating Rooms | | 0 |
| Span-Level Machine Translation Meta-Evaluation | | 0 |
| Translation from the Information Bottleneck Perspective: an Efficiency Analysis of Spatial Prepositions in Bitexts | | 0 |
| SegVGGT: Joint 3D Reconstruction and Instance Segmentation from Multi-View Images | | 0 |
| SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia | | 0 |
| Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents | | 0 |
| LIORNet: Self-Supervised LiDAR Snow Removal Framework for Autonomous Driving under Adverse Weather Conditions | | 0 |