SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1185111900 of 661570 papers

TitleStatusHype
The power of small initialization in noisy low-tubal-rank tensor recovery0
Practical FP4 Training for Large-Scale MoE Models on Hopper GPUs0
Rethinking Time Series Domain Generalization via Structure-Stratified Calibration0
Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration0
EvoSkill: Automated Skill Discovery for Multi-Agent Systems0
From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench0
Scores Know Bobs Voice: Speaker Impersonation Attack0
HiLoRA: Hierarchical Low-Rank Adaptation for Personalized Federated Learning0
Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification0
R3GW: Relightable 3D Gaussians for Outdoor Scenes in the Wild0
Structure-Aware Text Recognition for Ancient Greek Critical Editions0
The Price of Robustness: Stable Classifiers Need Overparameterization0
Faster, Cheaper, More Accurate: Specialised Knowledge Tracing Models Outperform LLMs0
LLM-based Argument Mining meets Argumentation and Description Logics: a Unified Framework for Reasoning about Debates0
Adapting Time Series Foundation Models through Data Mixtures0
Learning Memory-Enhanced Improvement Heuristics for Flexible Job Shop Scheduling0
CoFL: Continuous Flow Fields for Language-Conditioned Navigation0
The Distribution of Phoneme Frequencies across the World's Languages: Macroscopic and Microscopic Information-Theoretic Models0
Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models0
Multimodal-Prior-Guided Importance Sampling for Hierarchical Gaussian Splatting in Sparse-View Novel View Synthesis0
Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures0
Eval4Sim: An Evaluation Framework for Persona Simulation0
SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion0
StegaFFD: Privacy-Preserving Face Forgery Detection via Fine-Grained Steganographic Domain Lifting0
LLandMark: A Multi-Agent Framework for Landmark-Aware Multimodal Interactive Video Retrieval0
Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting0
ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization0
Distributed Dynamic Invariant Causal Prediction in Environmental Time Series0
Towards Accurate and Interpretable Time-series Forecasting: A Polynomial Learning Approach0
Harmonic Beltrami Signature Network: a Shape Prior Module in Deep Learning Framework0
SEALing the Gap: A Reference Framework for LLM Inference Carbon Estimation via Multi-Benchmark Driven Embodiment0
ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization0
Articulation in Motion: Prior-free Part Mobility Analysis for Articulated Objects By Dynamic-Static Disentanglement0
Eliciting Numerical Predictive Distributions of LLMs Without Autoregression0
GloPath: An Entity-Centric Foundation Model for Glomerular Lesion Assessment and Clinicopathological Insights0
On the Structural Limitations of Weight-Based Neural Adaptation and the Role of Reversible Behavioral Learning0
Bias and Fairness in Self-Supervised Acoustic Representations for Cognitive Impairment Detection0
The Geometry of Learning Under AI Delegation0
Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models0
ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation0
Architecting Trust in Artificial Epistemic Agents0
Enhancing Physics-Informed Neural Networks with Domain-aware Fourier Features: Towards Improved Performance and Interpretable Results0
Sparse autoencoders reveal organized biological knowledge but minimal regulatory logic in single-cell foundation models: a comparative atlas of Geneformer and scGPT0
Leveraging Label Proportion Prior for Class-Imbalanced Semi-Supervised Learning0
Layer-wise QUBO-Based Training of CNN Classifiers for Quantum Annealing0
Semi-Supervised Few-Shot Adaptation of Vision-Language Models0
Integrating Homomorphic Encryption and Synthetic Data in FL for Privacy and Learning Quality0
LAGO: A Local-Global Optimization Framework Combining Trust Region Methods and Bayesian Optimization0
The Dresden Dataset for 4D Reconstruction of Non-Rigid Abdominal Surgical Scenes0
On the Expressive Power of Transformers for Maxout Networks and Continuous Piecewise Linear Functions0
Show:102550
← PrevPage 238 of 13232Next →