SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4376-4400 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Attention-guided Evidence Grounding for Spoken Question Answering | | 0 |
| Explanations Go Linear: Post-hoc Explainability for Tabular Data with Interpretable Meta-Encoding | | 0 |
| Hebbian Physics Networks: A Self-Organizing Computational Architecture Based on Local Physical Laws | | 0 |
| ReviewScore: Misinformed Peer Review Detection with Large Language Models | | 0 |
| On the identifiability of causal graphs with multiple environments | | 0 |
| Provably Safe Model Updates | | 0 |
| Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering | | 0 |
| The Moralization Corpus: Frame-Based Annotation and Analysis of Moralizing Speech Acts across Diverse Text Genres | | 0 |
| Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning | | 0 |
| Global Optimization By Gradient From Hierarchical Score-Matching Spaces | | 0 |
| Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning | | 0 |
| CogGen: Cognitive-Load-Informed Fully Unsupervised Deep Generative Modeling for Compressively Sampled MRI Reconstruction | | 0 |
| LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis | | 0 |
| Event-Driven Video Generation | | 0 |
| Next-Frame Decoding for Ultra-Low-Bitrate Image Compression with Video Diffusion Priors | | 0 |
| NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation | | 0 |
| EngGPT2: Sovereign, Efficient and Open Intelligence | | 0 |
| TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas | | 0 |
| HGP-Mamba: Integrating Histology and Generated Protein Features for Mamba-based Multimodal Survival Risk Prediction | Code | 0 |
| Draft-and-Prune: Improving the Reliability of Auto-formalization for Logical Reasoning | | 0 |
| ConfusionBench: An Expert-Validated Benchmark for Confusion Recognition and Localization in Educational Videos | | 0 |
| Directing the Narrative: A Finetuning Method for Controlling Coherence and Style in Story Generation | | 0 |
| Embedding World Knowledge into Tabular Models: Towards Best Practices for Embedding Pipeline Design | | 0 |
| Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing | | 0 |
| Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity | | 0 |