SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 91519175 of 474278 papers

TitleStatusHype
Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-FunctionsCode0
A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and MaskingCode0
CNN-TFT explained by SHAP with multi-head attention weights for time series forecastingCode0
Lung Infection Severity Prediction Using Transformers with Conditional TransMix Augmentation and Cross-AttentionCode0
M3Retrieve: Benchmarking Multimodal Retrieval for MedicineCode0
Unified Molecule Pre-training with Flexible 2D and 3D Modalities: Single and Paired Modality IntegrationCode0
U-Bench: A Comprehensive Understanding of U-Net through 100-Variant BenchmarkingCode0
Search-R3: Unifying Reasoning and Embedding Generation in Large Language ModelsCode0
Accelerating Diffusion LLM Inference via Local Determinism PropagationCode0
GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image GenerationCode0
How much speech data is necessary for ASR in African languages? An evaluation of data scaling in Kinyarwanda and KikuyuCode0
SpecGuard: Spectral Projection-based Advanced Invisible WatermarkingCode0
SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation ModelsCode0
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint ProgrammingCode0
When LLMs Can't Help: Real-World Evaluation of LLMs in NutritionCode0
Low-Rank Tensor Recovery via Variational Schatten-p Quasi-Norm and Jacobian RegularizationCode0
BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks0
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?0
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep ResearchCode0
Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences0
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation0
Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context0
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding0
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels0
Multimodal Feature Prototype Learning for Interpretable and Discriminative Cancer Survival PredictionCode0
Show:102550
← PrevPage 367 of 18972Next →