SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 37263750 of 661570 papers

TitleStatusHype
Neuron-Guided Interpretation of Code LLMs: Where, Why, and How?0
Image2Garment: Simulation-ready Garment Generation from a Single Image0
HyperAlign: Hyperbolic Entailment Cones for Adaptive Text-to-Image Alignment Assessment0
GeoMotionGPT: Geometry-Aligned Motion Understanding with Large Language Models0
The Coordination Gap: Multi-Agent Alternation Metrics for Temporal Fairness in Repeated Games0
Towards Efficient and Stable Ocean State Forecasting: A Continuous-Time Koopman Approach0
How to Take a Memorable Picture? Empowering Users with Actionable Feedback1
LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-Resolution0
Neural Networks as Local-to-Global Computations0
SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment0
Flow Matching Policy with Entropy Regularization0
Rigorous Error Certification for Neural PDE Solvers: From Empirical Residuals to Solution Guarantees0
The Impact of Corporate AI Washing on Farmers' Digital Financial Behavior Response -- An Analysis from the Perspective of Digital Financial Exclusion0
MLOW: Interpretable Low-Rank Frequency Magnitude Decomposition of Multiple Effects for Time Series Forecasting0
Recovering Sparse Neural Connectivity from Partial Measurements: A Covariance-Based Approach with Granger-Causality Refinement0
When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making0
Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds0
Balancing the Reasoning Load: Difficulty-Differentiated Policy Optimization with Length Redistribution for Efficient and Robust Reinforcement LearningCode0
SCISSR: Scribble-Conditioned Interactive Surgical Segmentation and Refinement0
Learning Decision-Sufficient Representations for Linear Optimization0
HiMu: Hierarchical Multimodal Frame Selection for Long Video Question Answering0
UEPS: Robust and Efficient MRI Reconstruction0
Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation0
Cross-Modal Rationale Transfer for Explainable Humanitarian Classification on Social Media0
ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs0
Show:102550
← PrevPage 150 of 26463Next →