SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 39013925 of 177340 papers

TitleStatusHype
APOLLO: SGD-like Memory, AdamW-level PerformanceCode3
MUSt3R: Multi-view Network for Stereo 3D ReconstructionCode3
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model InferenceCode3
Inversion-Free Image Editing with Language-Guided Diffusion ModelsCode3
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDBCode3
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
Scikit-fingerprints: easy and efficient computation of molecular fingerprints in PythonCode3
NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance FieldsCode3
CLIMB: Class-imbalanced Learning Benchmark on Tabular DataCode3
Around the World in 80 Timesteps: A Generative Approach to Global Visual GeolocationCode3
Take the aTrain. Introducing an Interface for the Accessible Transcription of InterviewsCode3
Meta-Transformer: A Unified Framework for Multimodal LearningCode3
GroundingGPT:Language Enhanced Multi-modal Grounding ModelCode3
Evaluating Large Language Models with fmevalCode3
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A SurveyCode3
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at ScaleCode3
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model PromptsCode3
Rethinking Early Stopping: Refine, Then CalibrateCode3
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent BenchmarkCode3
Better by Default: Strong Pre-Tuned MLPs and Boosted Trees on Tabular DataCode3
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision MakingCode3
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-ThoughtCode3
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State SpacesCode3
MVGS: Multi-view-regulated Gaussian Splatting for Novel View SynthesisCode3
Classification Done Right for Vision-Language Pre-TrainingCode3
Show:102550
← PrevPage 157 of 7094Next →