SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 68016850 of 661570 papers

TitleStatusHype
Local Mechanisms of Compositional Generalization in Conditional Diffusion0
The Economics of AI Supply Chain Regulation0
CALF: Communication-Aware Learning Framework for Distributed Reinforcement Learning0
Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents0
Evaluation Faking: Unveiling Observer Effects in Safety Evaluation of Frontier AI Systems0
Lyapunov Stable Graph Neural Flow0
Fisher-Rao Gradient Flow: Geodesic Convexity and Functional Inequalities0
Towards Interactive Intelligence for Digital Humans0
Quantum-Informed Machine Learning for Predicting Spatiotemporal Chaos with Practical Quantum Advantage0
Invariant Graph Transformer for Out-of-Distribution Generalization0
Distilling the Past: Information-Dense and Style-Aware Replay for Lifelong Person Re-Identification0
Generative Bid Shading in Real-Time Bidding Advertising0
MoVieDrive: Urban Scene Synthesis with Multi-Modal Multi-View Video Diffusion Transformer0
CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks0
The GPT-4o Shock Emotional Attachment to AI Models and Its Impact on Regulatory Acceptance: A Cross-Cultural Analysis of the Immediate Transition from GPT-4o to GPT-50
Neurodynamics-Driven Coupled Neural P Systems for Multi-Focus Image FusionCode0
Extended Low-Rank Approximation Accelerates Learning of Elastic Response in Heterogeneous Materials0
Robust Fine-Tuning from Non-Robust Pretrained Models: Mitigating Suboptimal Transfer With Epsilon-Scheduling0
Building Benchmarks from the Ground Up: Community-Centered Evaluation of LLMs in Healthcare Chatbot Settings0
SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation2
Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs0
Precise Dynamics of Diagonal Linear Networks: A Unifying Analysis by Dynamical Mean-Field Theory0
Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis0
When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling0
Transferable Graph Learning for Transmission Congestion Management via Busbar Splitting0
Retrofitters, pragmatists and activists: Public interest litigation for accountable automated decision-making0
Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression0
One Supervisor, Many Modalities: Adaptive Tool Orchestration for Autonomous Queries0
FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration0
SuperQuadricOcc: Real-Time Self-Supervised Semantic Occupancy Estimation with Superquadric Volume Rendering0
NI-Tex: Non-isometric Image-based Garment Texture Generation0
AVFakeBench: A Comprehensive Audio-Video Forgery Detection Benchmark for AV-LMMs0
TrianguLang: Geometry-Aware Semantic Consensus for Pose-Free 3D Localization0
Stochastic Dominance Constrained Optimization with S-shaped Utilities: Poor-Performance-Region Algorithm and Neural Network0
NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction0
MIND-V: Hierarchical World Model for Long-Horizon Robotic Manipulation with RL-based Physical Alignment1
EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy0
Uni-Parser Technical Report0
FCMBench: The First Large-scale Financial Credit Multimodal Benchmark for Real-world Applications0
A Wachspress-based transfinite formulation for exactly enforcing Dirichlet boundary conditions on convex polygonal domains in physics-informed neural networks0
Development of Ontological Knowledge Bases by Leveraging Large Language Models0
Prediction of Cellular Malignancy Using Electrical Impedance Signatures and Supervised Machine Learning0
Key-Value Pair-Free Continual Learner via Task-Specific Prompt-Prototype0
Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students0
TreeDGS: Aerial Gaussian Splatting for Distant DBH Measurement0
Seeing through Light and Darkness: Sensor-Physics Grounded Deblurring HDR NeRF from Single-Exposure Images and Events0
FARM: Few-shot Adaptive Malware Family Classification under Concept Drift0
Cross Pseudo Labeling For Weakly Supervised Video Anomaly Detection0
Learnable Koopman-Enhanced Transformer-Based Time Series Forecasting with Spectral Control0
VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos0
Show:102550
← PrevPage 137 of 13232Next →