SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 46264650 of 661570 papers

TitleStatusHype
Encoding Predictability and Legibility for Style-Conditioned Diffusion Policy0
FederatedFactory: Generative One-Shot Learning for Extremely Non-IID Distributed Scenarios0
Prior-Informed Neural Network Initialization: A Spectral Approach for Function Parameterizing Architectures0
DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification0
PlotTwist: A Creative Plot Generation Framework with Small Language Models0
RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery0
Trained Persistent Memory for Frozen Encoder--Decoder LLMs: Six Architectural Methods0
IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time0
Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences0
SF-Mamba: Rethinking State Space Model for Vision0
An approximate graph elicits detonation lattice0
3D Fourier-based Global Feature Extraction for Hyperspectral Image Classification0
IRIS: A Real-World Benchmark for Inverse Recovery and Identification of Physical Dynamic Systems from Monocular Video0
Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models0
Visual Distraction Undermines Moral Reasoning in Vision-Language Models0
TinyGLASS: Real-Time Self-Supervised In-Sensor Anomaly Detection0
RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Stability of LLM Agents in Realistic Retail Environments0
Evo-Retriever: LLM-Guided Curriculum Evolution with Viewpoint-Pathway Collaboration for Multimodal Document Retrieval0
DynHD: Hallucination Detection for Diffusion Large Language Models via Denoising Dynamics Deviation Learning0
GAP-MLLM: Geometry-Aligned Pre-training for Activating 3D Spatial Perception in Multimodal Large Language Models0
DST-Net: A Dual-Stream Transformer with Illumination-Independent Feature Guidance and Multi-Scale Spatial Convolution for Low-Light Image Enhancement0
AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents0
Bridging the High-Frequency Data Gap: A Millisecond-Resolution Network Dataset for Advancing Time Series Foundation Models0
FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data0
Exploring different approaches to customize language models for domain-specific text-to-code generation0
Show:102550
← PrevPage 186 of 26463Next →