SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1401-1425 of 659,983 papers

Title | Status | Hype
Predicting Subjective Features of Questions of QA Websites using BERT | Code | 4
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard | Code | 4
FuseChat: Knowledge Fusion of Chat Models | Code | 4
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | Code | 4
Vidur: A Large-Scale Simulation Framework For LLM Inference | Code | 4
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Code | 4
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models | Code | 4
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Code | 4
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe | Code | 4
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark | Code | 4
GLIPv2: Unifying Localization and Vision-Language Understanding | Code | 4
Cube: A Roblox View of 3D Intelligence | Code | 4
Open-Set Image Tagging with Multi-Grained Text Supervision | Code | 4
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation | Code | 4
VILA: On Pre-training for Visual Language Models | Code | 4
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing | Code | 4
Streaming 4D Visual Geometry Transformer | Code | 4
Skywork Open Reasoner 1 Technical Report | Code | 4
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Code | 4
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning | Code | 4
Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation | Code | 4
MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Code | 4
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text | Code | 4
XGBoost: Scalable GPU Accelerated Learning | Code | 4
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Code | 4
Page 57 of 26,400