SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,984 papers248,105 code links4,818 tasks

Papers

Showing 16761700 of 177340 papers

TitleStatusHype
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at ScaleCode4
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall SpacesCode4
CodeI/O: Condensing Reasoning Patterns via Code Input-Output PredictionCode4
Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLMCode4
Highly Accurate Dichotomous Image SegmentationCode4
Distill Any Depth: Distillation Creates a Stronger Monocular Depth EstimatorCode4
MIMIC-IT: Multi-Modal In-Context Instruction TuningCode4
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text GenerationCode4
Stop Overthinking: A Survey on Efficient Reasoning for Large Language ModelsCode4
OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action ModelCode4
LSKNet: A Foundation Lightweight Backbone for Remote SensingCode4
Reflexion: Language Agents with Verbal Reinforcement LearningCode4
EmbodiedSAM: Online Segment Any 3D Thing in Real TimeCode4
Ming-Omni: A Unified Multimodal Model for Perception and GenerationCode4
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language ModelsCode4
Enhance-A-Video: Better Generated Video for FreeCode4
OnPrem.LLM: A Privacy-Conscious Document Intelligence ToolkitCode4
Token Merging for Fast Stable DiffusionCode4
Agile But Safe: Learning Collision-Free High-Speed Legged LocomotionCode4
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPTCode4
PufferLib: Making Reinforcement Learning Libraries and Environments Play NiceCode4
Latent Swap Joint Diffusion for 2D Long-Form Latent GenerationCode4
Elucidating the Design Space of Diffusion-Based Generative ModelsCode4
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language modelsCode4
BitNet a4.8: 4-bit Activations for 1-bit LLMsCode4
Show:102550
← PrevPage 68 of 7094Next →