SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 19011925 of 177339 papers

TitleStatusHype
Aria Everyday Activities DatasetCode4
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement LearningCode4
Distilling Tiny and Ultra-fast Deep Neural Networks for Autonomous Navigation on Nano-UAVsCode4
A-MEM: Agentic Memory for LLM AgentsCode4
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language AnnotationsCode4
FILM: Frame Interpolation for Large MotionCode4
WorldVLA: Towards Autoregressive Action World ModelCode4
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh RenderingCode4
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal InteractionCode4
AnyGPT: Unified Multimodal LLM with Discrete Sequence ModelingCode4
Open Problems in Applied Deep LearningCode4
ReAct: Synergizing Reasoning and Acting in Language ModelsCode4
A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future DirectionsCode4
Diffusion Models for Medical Image Analysis: A Comprehensive SurveyCode4
LLM Maybe LongLM: Self-Extend LLM Context Window Without TuningCode4
Kolmogorov-Arnold Convolutions: Design Principles and Empirical StudiesCode4
ChatGPT for Robotics: Design Principles and Model AbilitiesCode4
An Entropy-based Text Watermarking Detection MethodCode4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation GenerationCode4
MINIMA: Modality Invariant Image MatchingCode4
SparseDrive: End-to-End Autonomous Driving via Sparse Scene RepresentationCode4
Tower: An Open Multilingual Large Language Model for Translation-Related TasksCode4
TrustLLM: Trustworthiness in Large Language ModelsCode4
Null-text Inversion for Editing Real Images using Guided Diffusion ModelsCode4
GriTS: Grid table similarity metric for table structure recognitionCode4
Show:102550
← PrevPage 77 of 7094Next →