SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1876-1900 of 659,983 papers

Title | Status | Hype
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling | Code | 4
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations | Code | 4
AutoWebGLM: A Large Language Model-based Web Navigating Agent | Code | 4
Prompt-to-Prompt Image Editing with Cross Attention Control | Code | 4
QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks | Code | 4
Differential Privacy: What is all the noise about? | Code | 4
Gated Delta Networks: Improving Mamba2 with Delta Rule | Code | 4
Diffusion Models: A Comprehensive Survey of Methods and Applications | Code | 4
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video | Code | 4
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | Code | 4
LettuceDetect: A Hallucination Detection Framework for RAG Applications | Code | 4
InternVideo: General Video Foundation Models via Generative and Discriminative Learning | Code | 4
Optimizing Prompts for Text-to-Image Generation | Code | 4
DAMO-YOLO : A Report on Real-Time Object Detection Design | Code | 4
ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers | Code | 4
DeepInverse: A Python package for solving imaging inverse problems with deep learning | Code | 4
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution | Code | 4
Holistic Evaluation of Language Models | Code | 4
Seed-Coder: Let the Code Model Curate Data for Itself | Code | 4
FullStack Bench: Evaluating LLMs as Full Stack Coders | Code | 4
Motion Capture Dataset for Practical Use of AI-based Motion Editing and Stylization | Code | 4
The Platonic Representation Hypothesis | Code | 4
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts | Code | 4
ArchiSound: Audio Generation with Diffusion | Code | 4
Aligning benchmark datasets for table structure recognition | Code | 4