SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 65516600 of 661570 papers

TitleStatusHype
Exploring Subnetwork Interactions in Heterogeneous Brain Network via Prior-Informed Graph Learning0
MemReward: Graph-Based Experience Memory for LLM Reward Prediction with Limited Labels0
PrefPO: Pairwise Preference Prompt Optimization0
GT-Space: Enhancing Heterogeneous Collaborative Perception with Ground Truth Feature SpaceCode0
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels3
Leveraging Large Vision Model for Multi-UAV Co-perception in Low-Altitude Wireless Networks0
Music Source Restoration with Ensemble Separation and Targeted ReconstructionCode0
The causal structure of galactic astrophysics0
UE5-Forest: A Photorealistic Synthetic Stereo Dataset for UAV Forestry Depth Estimation0
DRCY: Agentic Hardware Design Reviews0
MedArena: Comparing LLMs for Medicine-in-the-Wild Clinician Preferences0
Automated Self-Testing as a Quality Gate: Evidence-Driven Release Management for LLM Applications0
From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research0
EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection0
Association-Aware GNN for Precoder Learning in Cell-Free Systems0
Privacy-Preserving Federated Fraud Detection in Payment Transactions with NVIDIA FLARE0
Resource Rational Contractualism Should Guide AI Alignment0
AMES: Approximate Multi-modal Enterprise Search via Late Interaction Retrieval0
NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks0
Investigating Nonlinear Quenching Effects on Polar Field Buildup in the Sun Using Physics-Informed Neural Networks0
Neuromorphic Computing: A Theoretical Framework for Time, Space, and Energy Scaling0
Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference0
CCMamba: Topologically-Informed Selective State-Space Networks on Combinatorial Complexes for Higher-Order Graph Learning0
MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts0
CORE: Context-Robust Remasking for Diffusion Language Models0
LongStream: Long-Sequence Streaming Autoregressive Visual Geometry0
TIRAuxCloud: A Thermal Infrared Dataset for Day and Night Cloud Detection0
Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer0
H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code0
ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models0
Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation0
Convergence Rate of a Functional Learning Method for Contextual Stochastic Optimization0
Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach0
Mitigating Memorization in Text-to-Image Diffusion via Region-Aware Prompt Augmentation and Multimodal Copy Detection0
Competition-Aware CPC Forecasting with Near-Market Coverage0
L2GTX: From Local to Global Time Series Explanations0
Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems0
Breaking the Tuning Barrier: Zero-Hyperparameters Yield Multi-Corner Analysis Via Learned Priors0
Influence Malleability in Linearized Attention: Dual Implications of Non-Convergent NTK Dynamics0
Evaluating VLMs' Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences0
BenDFM: A taxonomy and synthetic CAD dataset for manufacturability assessment in sheet metal bending0
Clustering Astronomical Orbital Synthetic Data Using Advanced Feature Extraction and Dimensionality Reduction Techniques0
LingoMotion: An Interpretable and Unambiguous Symbolic Representation for Human Motion0
BoSS: A Best-of-Strategies Selector as an Oracle for Deep Active Learning0
Geometry-Guided Camera Motion Understanding in VideoLLMs0
FDeID-Toolbox: Face De-Identification Toolbox0
When Right Meets Wrong: Bilateral Context Conditioning with Reward-Confidence Correction for GRPOCode0
ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation0
Towards Faithful Multimodal Concept Bottleneck Models0
Perceive What Matters: Relevance-Driven Scheduling for Multimodal Streaming Perception0
Show:102550
← PrevPage 132 of 13232Next →