SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 8,751 to 8,775 of 474,278 papers

Title | Status | Hype
Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs | Code | 0
Reciprocal Space Attention for Learning Long-Range Interactions | Code | 0
L_2-Regularized Empirical Risk Minimization Guarantees Small Smooth Calibration Error | Code | 0
UniVector: Unified Vector Extraction via Instance-Geometry Interaction | Code | 0
Universal Image Restoration Pre-training via Masked Degradation Classification | Code | 0
Taming the Fragility of KV Cache Eviction in LLM Inference | Code | 0
Rectify and Align GPS Points to Parking Spots via Rank-1 Constraint | Code | 0
How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study | Code | 0
UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations | Code | 0
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy | Code | 0
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation | Code | 0
TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness | Code | 0
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA | Code | 0
Variational Reasoning for Language Models | Code | 0
StrikeWatch: Wrist-worn Gait Recognition with Compact Time-series Models on Low-power FPGAs | Code | 0
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning | | 0
Modular Embedding Recomposition for Incremental Learning | | 0
EduDial: Constructing a Large-scale Multi-turn Teacher-Student Dialogue Corpus | Code | 0
CSI-4CAST: A Hybrid Deep Learning Model for CSI Prediction with Comprehensive Robustness and Generalization Testing | Code | 0
FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution | | 0
Dr.LLM: Dynamic Layer Routing in LLMs | | 0
What If : Understanding Motion Through Sparse Interactions | | 0
SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models | | 0
Detect Anything via Next Point Prediction | | 0
StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis | Code | 0
Page 351 of 18972