SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 951975 of 659983 papers

TitleStatusHype
Data-Free Layer-Adaptive Merging via Fisher Information for Long-to-Short Reasoning LLMs0
Climate Prompting: Generating the Madden-Julian Oscillation using Video Diffusion and Low-Dimensional Conditioning0
EpiMask: Leveraging Epipolar Distance Based Masks in Cross-Attention for Satellite Image Matching0
Generalized Incremental Learning under Concept Drift across Evolving Data Streams0
BOxCrete: A Bayesian Optimization Open-Source AI Model for Concrete Strength Forecasting and Mix Optimization0
Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations0
Cross-Context Verification: Hierarchical Detection of Benchmark Contamination through Session-Isolated Analysis0
Compressive single-pixel imaging via a wavelength-multiplexed spatially incoherent diffractive optical processor0
When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models0
DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment0
Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems0
TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild0
ALADIN:Attribute-Language Distillation Network for Person Re-Identification0
Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models0
Quotient Geometry, Effective Curvature, and Implicit Bias in Simple Shallow Neural Networks0
Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification0
Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences0
Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability0
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection0
Triangulating Temporal Dynamics in Multilingual Swiss Online News0
Generalizable Self-Evolving Memory for Automatic Prompt Optimization0
Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation0
SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems0
CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs0
VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake Detection0
Show:102550
← PrevPage 39 of 26400Next →