SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 201250 of 474278 papers

TitleStatusHype
DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models0
From Feature Learning to Spectral Basis Learning: A Unifying and Flexible Framework for Efficient and Robust Shape Matching0
F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting0
DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation0
Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors0
The DeepXube Software Package for Solving Pathfinding Problems with Learned Heuristic Functions and Search0
EnvSocial-Diff: A Diffusion-Based Crowd Simulation Model with Environmental Conditioning and Individual-Group Interaction0
SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization0
Leave No Stone Unturned: Uncovering Holistic Audio-Visual Intrinsic Coherence for Deepfake Detection0
Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection0
Reservoir-Based Graph Convolutional Networks0
Spectral Scalpel: Amplifying Adjacent Action Discrepancy via Frequency-Selective Filtering for Skeleton-Based Action Segmentation0
HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer0
RVLM: Recursive Vision-Language Models with Adaptive Depth0
SpinGQE: A Generative Quantum Eigensolver for Spin Hamiltonians0
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing0
CoordLight: Learning Decentralized Coordination for Network-Wide Traffic Signal Control0
Unleashing Vision-Language Semantics for Deepfake Video Detection0
Towards Training-Free Scene Text Editing0
VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models0
MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination0
An Invariant Compiler for Neural ODEs in AI-Accelerated Scientific Simulation0
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?0
MuQ-Eval: An Open-Source Per-Sample Quality Metric for AI Music Generation Evaluation0
UniCA: Unified Covariate Adaptation for Time Series Foundation Model0
PiLoT: Neural Pixel-to-3D Registration for UAV-based Ego and Target Geo-localization0
LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction0
2Xplat: Two Experts Are Better Than One Generalist0
Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models0
Reconstruction-Guided Slot Curriculum: Addressing Object Over-Fragmentation in Video Object-Centric Learning0
Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases0
Viewport-based Neural 360° Image Compression0
Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models0
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal0
MultiCam: On-the-fly Multi-Camera Pose Estimation Using Spatiotemporal Overlaps of Known Objects0
UAV-DETR: DETR for Anti-Drone Target Detection0
A Feature Shuffling and Restoration Strategy for Universal Unsupervised Anomaly Detection0
Multilingual KokoroChat: A Multi-LLM Ensemble Translation Method for Creating a Multilingual Counseling Dialogue Dataset0
EVA: Efficient Reinforcement Learning for End-to-End Video Agent0
HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature0
FDIF: Formula-Driven supervised Learning with Implicit Functions for 3D Medical Image Segmentation0
Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation0
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG0
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models0
OccAny: Generalized Unconstrained Urban 3D Occupancy0
MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis0
Prototype Fusion: A Training-Free Multi-Layer Approach to OOD Detection0
Learning What Can Be Picked: Active Reachability Estimation for Efficient Robotic Fruit Harvesting0
Sparse Autoencoders for Interpretable Medical Image Representation Learning0
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning0
Show:102550
← PrevPage 5 of 9486Next →