SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 67766800 of 474278 papers

TitleStatusHype
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning0
Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization0
Asking like Socrates: Socrates helps VLMs understand remote sensing images0
Adversarial Flow Models0
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration0
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning0
Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization0
Geometrically-Constrained Agent for Spatial Reasoning0
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection0
Beyond MSE: Ordinal Cross-Entropy for Probabilistic Time Series ForecastingCode0
VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement LearningCode0
A Fast and Flat Federated Learning Method via Weighted Momentum and Sharpness-Aware MinimizationCode0
Unlabeled Data Improves Fine-Grained Image Zero-shot Classification with Multimodal LLMsCode0
RouterArena: An Open Platform for Comprehensive Comparison of LLM RoutersCode0
Memo: Training Memory-Efficient Embodied Agents with Reinforcement LearningCode0
Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency DifferencingCode0
Toward Data-Driven Surrogates of the Solar Wind with Spherical Fourier Neural OperatorCode0
SemOD: Semantic Enabled Object Detection Network under Various Weather ConditionsCode0
C^2DLM: Causal Concept-Guided Diffusion Large Language ModelsCode0
Controllable 3D Object Generation with Single Image PromptCode0
FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated LearningCode0
AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned FlowsCode0
RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene UnderstandingCode0
AnoRefiner: Anomaly-Aware Group-Wise Refinement for Zero-Shot Industrial Anomaly DetectionCode0
ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion ModelsCode0
Show:102550
← PrevPage 272 of 18972Next →