SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2251-2275 of 661,570 papers

Title | Status | Hype
Vidur: A Large-Scale Simulation Framework For LLM Inference | Code | 4
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Code | 4
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models | Code | 4
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Code | 4
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe | Code | 4
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark | Code | 4
GLIPv2: Unifying Localization and Vision-Language Understanding | Code | 4
Cube: A Roblox View of 3D Intelligence | Code | 4
Open-Set Image Tagging with Multi-Grained Text Supervision | Code | 4
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation | Code | 4
VILA: On Pre-training for Visual Language Models | Code | 4
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing | Code | 4
Streaming 4D Visual Geometry Transformer | Code | 4
Skywork Open Reasoner 1 Technical Report | Code | 4
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Code | 4
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning | Code | 4
Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation | Code | 4
MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Code | 4
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text | Code | 4
XGBoost: Scalable GPU Accelerated Learning | Code | 4
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Code | 4
Galactica: A Large Language Model for Science | Code | 4
RTMDet: An Empirical Study of Designing Real-Time Object Detectors | Code | 4
3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers | Code | 4
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing | Code | 4
Page 91 of 26,463