SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 31513200 of 659983 papers

TitleStatusHype
GIST: Gauge-Invariant Spectral Transformers for Scalable Graph Neural Operators0
Online Experiential Learning for Language Models0
Long-Horizon Traffic Forecasting via Incident-Aware Conformal Spatio-Temporal Transformers0
ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K0
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation2
Mechanistic Interpretability of Diffusion Models: Circuit-Level Analysis and Causal Validation0
Improving Epidemic Analyses with Privacy-Preserving Integration of Sensitive Data0
Stable Forgetting: Bounded Parameter-Efficient Unlearning in Foundation Models0
Volumetric Ergodic Control0
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning0
Continual Low-Rank Adapters for LLM-based Generative Recommender Systems0
SoilX: Calibration-Free Comprehensive Soil Sensing through Contrastive Cross-Component Learning0
Genomic Next-Token Predictors are In-Context Learners0
Empirical Recipes for Efficient and Compact Vision-Language Models0
Evaluating Feature Dependent Noise in Preference-based Reinforcement Learning0
Representation-Aware Unlearning via Activation Signatures: From Suppression to Knowledge-Signature Erasure0
Dynamic Black-hole Emission Tomography with Physics-informed Neural Fields0
Failing on Bias Mitigation: A Case Study on the Challenges of Fairness in Government Data0
Unsupervised Decomposition and Recombination with Discriminator-Driven Diffusion Models0
Distribution-Free Sequential Prediction with Abstentions0
Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning0
Dual Space Preconditioning for Gradient Descent in the Overparameterized Regime0
Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning0
Solving physics-constrained inverse problems with conditional flow matching0
InstantHDR: Single-forward Gaussian Splatting for High Dynamic Range 3D Reconstruction0
ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference0
Online Learning for Supervisory Switching Control0
Mechanistic Foundations of Goal-Directed Control0
Machine intelligence supports the full chain of 2D dendrite synthesis0
Adversarial attacks against Modern Vision-Language Models0
Topology-Guided Biomechanical Profiling: A White-Box Framework for Opportunistic Screening of Spinal Instability on Routine CT0
Behavior-Centric Extraction of Scenarios from Highway Traffic Data and their Domain-Knowledge-Guided Clustering using CVQ-VAE0
CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization0
Continual Multimodal Egocentric Activity Recognition via Modality-Aware Novel Detection0
The State of Generative AI in Software Development: Insights from Literature and a Developer Survey0
Rewarding DINO: Predicting Dense Rewards with Vision Foundation Models0
Interpretable AI-Assisted Early Reliability Prediction for a Two-Parameter Parallel Root-Finding Scheme0
Transformers Can Learn Rules They've Never Seen: Proof of Computation Beyond Interpolation0
Shared Representation Learning for Reference-Guided Targeted Sound Detection0
Dependence Fidelity and Downstream Inference Stability in Generative Models0
OpenQlaw: An Agentic AI Assistant for Analysis of 2D Quantum Materials0
SCE-LITE-HQ: Smooth visual counterfactual explanations with generative foundation models0
Attractor-Keyed Memory0
PaAgent: Portrait-Aware Image Restoration Agent via Subjective-Objective Reinforcement Learning0
Optimization-Embedded Active Multi-Fidelity Surrogate Learning for Multi-Condition Airfoil Shape Optimization0
Transformers are Bayesian Networks0
TrackDeform3D: Markerless and Autonomous 3D Keypoint Tracking and Dataset Collection for Deformable Objects0
Large Reasoning Models Struggle to Transfer Parametric Knowledge Across Scripts0
PRISM: Demystifying Retention and Interaction in Mid-Training0
Evaluating LLM-Simulated Conversations in Modeling Inconsistent and Uncollaborative Behaviors in Human Social Interaction0
Show:102550
← PrevPage 64 of 13200Next →