SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 24762500 of 177340 papers

TitleStatusHype
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language ModelsCode3
SlimPajama-DC: Understanding Data Combinations for LLM TrainingCode3
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMsCode3
Splatter Image: Ultra-Fast Single-View 3D ReconstructionCode3
MatterGen: a generative model for inorganic materials designCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
EfficientNet: Rethinking Model Scaling for Convolutional Neural NetworksCode3
Beat this! Accurate beat tracking without DBN postprocessingCode3
BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object DetectionCode3
Parametric Retrieval Augmented GenerationCode3
LLMs Get Lost In Multi-Turn ConversationCode3
ORLM: A Customizable Framework in Training Large Models for Automated Optimization ModelingCode3
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible PipelineCode3
ViNT: A Foundation Model for Visual NavigationCode3
The Prusti project: Formal verification for RustCode3
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic SegmentationCode3
RAKG:Document-level Retrieval Augmented Knowledge Graph ConstructionCode3
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernelsCode3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API CallsCode3
RecurrentGPT: Interactive Generation of (Arbitrarily) Long TextCode3
Punica: Multi-Tenant LoRA ServingCode3
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image GenerationCode3
RepViT-SAM: Towards Real-Time Segmenting AnythingCode3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
Show:102550
← PrevPage 100 of 7094Next →