SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 951-1000 of 659,983 papers

Title | Status | Hype
Data-Free Layer-Adaptive Merging via Fisher Information for Long-to-Short Reasoning LLMs | | 0
Climate Prompting: Generating the Madden-Julian Oscillation using Video Diffusion and Low-Dimensional Conditioning | | 0
EpiMask: Leveraging Epipolar Distance Based Masks in Cross-Attention for Satellite Image Matching | | 0
Generalized Incremental Learning under Concept Drift across Evolving Data Streams | | 0
BOxCrete: A Bayesian Optimization Open-Source AI Model for Concrete Strength Forecasting and Mix Optimization | | 0
Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations | | 0
Cross-Context Verification: Hierarchical Detection of Benchmark Contamination through Session-Isolated Analysis | | 0
Compressive single-pixel imaging via a wavelength-multiplexed spatially incoherent diffractive optical processor | | 0
When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models | | 0
DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment | | 0
Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems | | 0
TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild | | 0
ALADIN: Attribute-Language Distillation Network for Person Re-Identification | | 0
Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models | | 0
Quotient Geometry, Effective Curvature, and Implicit Bias in Simple Shallow Neural Networks | | 0
Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification | | 0
Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences | | 0
Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability | | 0
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection | | 0
Triangulating Temporal Dynamics in Multilingual Swiss Online News | | 0
Generalizable Self-Evolving Memory for Automatic Prompt Optimization | | 0
Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation | | 0
SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems | | 0
CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs | | 0
VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake Detection | | 0
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation | | 0
SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification | | 0
LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search | | 0
Evolutionary Biparty Multiobjective UAV Path Planning: Problems and Empirical Comparisons | | 0
What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators | | 0
PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models | | 0
From Part to Whole: 3D Generative World Model with an Adaptive Structural Hierarchy | | 0
Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment | | 0
Revisiting Weakly-Supervised Video Scene Graph Generation via Pair Affinity Learning | | 0
Exploring Multimodal Prompts For Unsupervised Continuous Anomaly Detection | | 0
Counterfactual Credit Policy Optimization for Multi-Agent Collaboration | | 0
HACMatch Semi-Supervised Rotation Regression with Hardness-Aware Curriculum Pseudo Labeling | | 0
SSAM: Singular Subspace Alignment for Merging Multimodal Large Language Models | | 0
Feature Incremental Clustering with Generalization Bounds | | 0
Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications | | 0
DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers | | 0
Rule-State Inference (RSI): A Bayesian Framework for Compliance Monitoring in Rule-Governed Domains | | 0
SARe: Structure-Aware Large-Scale 3D Fragment Reassembly | | 0
Towards Multimodal Time Series Anomaly Detection with Semantic Alignment and Condensed Interaction | | 0
PGR-Net: Prior-Guided ROI Reasoning Network for Brain Tumor MRI Segmentation | | 0
Dual-level Adaptation for Multi-Object Tracking: Building Test-Time Calibration from Experience and Intuition | | 0
EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises | | 0
Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks | | 0
No Dense Tensors Needed: Fully Sparse Object Detection on Event-Camera Voxel Grids | | 0
A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures | | 0
Page 20 of 13,200